Data Engineer

Data Engineer

This job is no longer open

Data Scientist

About Rhythm

Rhythm is a renewable energy and technology company empowering you to take control of your budget and your footprint.  We combine energy market expertise with technology, design, and data science to create best-in-class products and services that are simple, delightful, and seamlessly integrated with the rhythm of your life. Our mission is to upend the energy status quo by setting a new standard of service excellence and customer partnership. We are a mission-driven, results-oriented group of engineers, businesspeople, designers and artists who love solving tough problems, all while making a positive impact on our communities.

About the Data Engineer

The ideal candidate loves being hands-on with data. You love building and maintaining the systems that see data orchestrated throughout its entire lifecycle, from source system to integrated warehouse, to data science models, and finally to a business intelligence environment where it is surfaced. You have extensive experience building batch and streaming data pipelines to ingest data, managing complex ETL dependencies using tools such as Airflow, and building out scalable ML pipelines. You like sitting at the intersection of data science and engineering, and are willing to jump on whatever task needs doing, whether it’s troubleshooting a task queue or helping a business user understand the data they’re seeing in a dashboard. At the end of the day, you’re a team player, and are motivated by our mission to make saving money with clean, renewable energy simple for our customers.

Responsibilities

  • Work with our Engineering team to maintain the pipelines through which we ingest data into our Snowflake data warehouse, from both internal and 3rd-party platforms 
  • Own the ETL pipeline through which that data is transformed, managing complex data and timing dependencies and a mixture of SQL and python-based transformations
  • Own our data science application, which hosts our data and analytics services, including its containerized deployment, data access, monitoring, and overall uptime 
  • Integrate a scalable ML pipeline within our ETL, providing our BI users and customer platform just-in-time access to our analytics
  • Be an expert in not only the orchestration but the nature of our data, understanding how it is integrated from disparate systems and built up into the business objects that our team and our customers depend on
  • Build out a streaming pipeline for customer events, enabling real-time intelligence within our platform and situational awareness for our marketing and operations teams
  • Be the data science team’s go-to expert on our infrastructure, including database, container, and application management, responsible for monitoring, maintenance, and troubleshooting
  • Contribute to our BI environment, Looker, and support our business users in their use of it
  • On occasion, contribute to our data applications, writing data-driven python-based services
  • Be a champion for Rhythm’s mission and values, both at work and in your community

Qualifications

  • 5+ years of technical experience working in modern, cloud-based environments (8-10 years preferred)
  • 3+ years of data engineering experience specifically
  • Bachelors degree in CS or a quantitative field (M.S. preferred)
  • Expertise developing, testing, deploying and maintaining production Python-based applications
  • Expertise with web and data access in Python, in particular via SQLAlchemy and Flask
  • Expertise in the AWS ecosystem, including ECS, S3, Aurora, SNS, and SQS
  • Expertise with ETL and data pipelines, using tools such as Airflow, Kinesis, DBT, and Celery
  • Expertise architecting and managing data warehouses (we use Snowflake and Timescale)
  • Expertise building, monitoring, and maintaining scalable ML pipelines, preferably on AWS (SageMaker a plus)
  • Expertise with Docker, Docker Compose, and containerized testing and deployment
  • Experience supporting data warehouse BI tools (we use Looker)
  • Experience with basic devops, including managing local dev environments and data access
  • Excellent written and verbal communication skills
  • Excellent organizational skills, and a willingness to work hard and jump on whatever the team needs

Geography

  • Rhythm is headquartered in Houston, Texas and is building out a New York City office
  • Remote candidates will be actively considered, but preference will be given to candidates in the NYC or Houston area
  • Travel required <5%

Compensation and Benefits

  • Competitive base salary and bonus
  • Complete benefits package, including medical, 401k, HSA/FSA, and unlimited PTO.

 

This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.