Data Engineer II

Data Engineer II

About BlueLabs

BlueLabs is a leading provider of analytics services and technology for a variety of industry clients: including government, business, and political campaigns. We help our clients optimize their engagements with individual customers, supporters, and stakeholders to achieve their goals. Simply put: we help our partners do the most good by getting the most from their data.


Today, our team of data analysts, scientists, engineers, and strategists come together from diverse backgrounds to share a passion for using data to solve the world’s greatest social and analytical challenges. We’ve served more than 400 organizations ranging from government agencies, advocacy groups, unions, political campaigns, international groups, and companies. Along the way, we’ve developed some of the most innovative tools available in analytics, media optimization, reporting, and influencer outreach-- serving a diverse set of industries, including the automotive, travel, consumer packaged goods, entertainment, healthcare, media, telecom, and more.

About the team

The BlueLabs Civic Tech practice revolutionizes the way government agencies use data to reduce the friction between residents and the essential services they use. Our team develops deep expertise within the areas our clients care about most, then builds data programs that create impact at scale. We work closely with internal government innovation groups, and build on the analytics methodology pioneered in e-commerce, advocacy, politics, and consumer finance.


About the role

This role supports creation of an online system for consumers who received surprise medical bills after receiving care they thought was covered by their insurance to report them to Centers for Medicare & Medicaid Services for investigation and follow up. The engineer will specifically develop integrations between multiple CRM REST APIs to ensure that investigators receive timely reports of surprise bills and consumers receive updates about the status of their complaints.


As a Data Engineer II, you will lead some of our project teams’ efforts which require complex and nuanced data pipelines for our client engagements. The Data Engineer II is responsible for the continuous development, review, and documentation of our clients’ data wrangling and pipeline solutions - including quality control of: solutions to data quality issues and downstream data delivery. You should have experience working in a rapid development team in the past in which you were responsible for significant contributions to client data pipeline solutions.

This role will report to an Analytics Lead internally and a client lead externally.


In this position you will

  • Establish and maintain complex, nuanced data pipelines that will contribute to client-facing tools, including writing and reviewing API endpoints for the tools
  • Work closely with data analysts to identify and effectively respond to their specific needs, especially related to client deliverables
  • Support data analysts and scientists on specific projects with computationally intensive requirements (i.e. geospatial analysis or realtime model scoring).
  • You may contribute to web-based data-oriented internal tools using frames like R/Shiny or Python/Dash.
  • Coordinate with the client, their stakeholders, and other contracting teams as needed in order to stand-up or respond to issues with data pipelines 
  • Support your teammates, including analysts with less technical experience, in implementing efficient and resilient data processes, with improvements such as optimization, good error handling, and incoming data checks.
  • Partner with DevOps engineers to deploy and troubleshoot Pipeline related tools, automations, and integrations.


What we are seeking:

  • Garnered 3+ years of experience as a contributor to technical projects, such as working with complex data pipelines or software applications
  • Experience processing data using scripting languages like python or R, or compiled languages like Java, C++, or Scala
  • Experience designing data models that keep in mind the upstream and downstream system dependencies
  • Experience working in modern data processing stacks using tools like Apache Airflow, AWS Glue, and dbt
  • Comfortability with distributed data processing tools (such as Spark, Hadoop, others)
  • Experience with an MPP database such as Amazon Redshift, Vertica, or Snowflake and/or experience writing complex analytics queries in a general purpose database such as Oracle or Postgresql
  • Advanced understanding of how to manipulate data using SQL [or Python]
  • Experience delivering on client priorities that operate on a regular deployment schedule
  • Ability to manage your individual priorities and comfortably context-switch between active development, client discussion, and issue response
  • Effective communication skills when working with team members of varied backgrounds, roles, and functions
  • Passion in applying your skills to our social mission to problem-solve and collaborate within a cross-functional, client-facing team environment
  • The ability to successfully attain and maintain a Federal Public Trust background investigation that our government clients require; this includes a requirement that the individual has U.S. Citizenship or U.S. residency for three of the past five years


You may have

  • Experience developing software in a compiled language such as Java, C++, or Scala
  • Experience in a consultancy
  • Experience working in marketing analytics and/or infrastructure analytics industries


What recruitment looks like

We expect to hire this position in August 2022. To get there, we anticipate the successful candidate will complete three interviews (HR 15 minutes, panel interview 60 minutes, and client interview 30 minutes), all virtually. During the interview process, you will be asked questions to describe your background and experience relevant to the position. This may include providing examples of projects you worked on, tools or applications you've used, and knowledge you have applied.  We often look for explanations of "how or why" so it's helpful to have details ready.


What we offer

BlueLabs offers a friendly work environment and competitive benefits package including:

  • Premier health insurance plan
  • 401K matching
  • Unlimited vacation leave
  • Paid sick, personal, and volunteer leave
  • 13 paid holidays
  • 15 weeks paid parental leave
  • Professional development stipend & tuition reimbursement
  • Macbook Pro laptop & tech accessories
  • Bring Your Own Device (BYOD) stipend for mobile device
  • Employee Assistance Program (EAP)
  • Supportive & collaborative culture 
  • Flexible working hours
  • Remote friendly (within the U.S.)
  • Pre-tax transportation options for commuting to our office in Washington, DC
  • Lunches and snacks
  • And more! 


The salary for this position is $95,000+ annually.


While we have an office in Washington, DC, we are open to considering candidates from within the U.S.


At BlueLabs, we celebrate, support and thrive on differences. Not only do they benefit our services, products, and community, but most importantly, they are to the benefit of our team. Qualified people of all races, ethnicities, ages, sex, genders, sexual orientations, national origins, gender identities, marital status, religions, veterans statuses, disabilities and any other protected classes are strongly encouraged to apply. As an equal opportunity workplace and an affirmative action employer, BlueLabs is committed to creating an inclusive environment for all employees. BlueLabs endeavors to make reasonable accommodations to the known physical or mental limitations of qualified applicants with a disability unless the accommodation would impose an undue hardship on the operation of our business. If an applicant believes they require such assistance to complete the application or to participate in an interview, or has any questions or concerns, they should contact the Director, People Operations.  BlueLabs participates in E-verify. EEO is the Law (Link to external DOL site)

Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.