Sr. Data Engineer

Sr. Data Engineer

This job is no longer open
The Role
If being part of a small dynamic, agile software engineering team practicing TDD with an emphasis on software quality, with a tremendous opportunity to make a big impact, this is the job for you!  This role will provide you with an opportunity to make a huge impact.  You will help drive and maintain a high-level of operational excellence in data engineering.

What You’ll Do

  • Develop integrations to move data into the raw zone of the data lake
  • Build ETL/ELT pipelines to transform data in the raw zone, and load it to the structured and consumer zones of the data lake, and to serving tiers
  • Define and lead the best practices in security, data privacy, quality, and data governance
  • Help lead and collaboratively define Lark's next generation data platform
  • Help build online data validation to ensure the assumptions we’ve tested for in our code remain true; outlier and aberration detection, change-detection, etc.
  • Collaborate with teams across the company to help develop data products that drive company success
  • Evaluate, integrate and build tools and infrastructure to accelerate Data Engineering, Data Science, Business Intelligence, Reporting and Analytics as needed
  • Drive data literacy across business functions

What You’ll Need

  • Knowledge of the “Testing Pyramid”, and have helped other engineers apply it correctly
  • You know S.O.L.I.D. principles and practice them intuitively and appropriately
  • Expertise in Scala, Python, and Java
  • Demonstrated expertise in Object Oriented (OO) and Functional programming (FP) including an expert knowledge of common design patterns, idioms, best practices, dependency injection/inversion frameworks and techniques, testing frameworks, Monad-Transformer-Libraries (MTL), Tagless-Final encoding (and when it’s appropriate), etc.
  • Fluency in data structures, algorithms, distributed computing, storage systems, and multiple consistency models
  • In-depth knowledge of AWS (including EMR, DMS, Athena, RDS, Aurora, Lambda, Redshift, etc.)
  • Expertise in stream data processing (e.g., DMS, Flink, Spark, Kinesis, Kafka)
  • Advanced SQL skills
  • Deep Knowledge of multiple database technologies, their tradeoffs, and how to make the best use of each
  • Willingness to learn and mentor in a collaborative team environment
  • Humility with an intrinsic positive drive
  • Passion for developing a world-class engineering culture
  • Value, respect, and an enthusiasm for diversity, inclusion, and alternative perspectives
  • Goal-oriented, with a desire to create an environment of psychological safety
  • Ability to thrive in an environment promoting and enabling collaboration
  • Solid understanding and hands-on experience in computer network

Education and Experience

  • MS, or PhD in Computer Science, Mathematics, Computer Engineering, etc., or equivalent experience
  • 8+ years hands-on software engineering experience with a focus on quality; 4+ years in data engineering
  • Expertise with Apache Spark, DataFrame & Dataset API, spark internals and optimization
  • Data warehouse modernization, building data-marts, star/snowflake schema designs, ETL/ELT pipelines
  • Building production-grade data backup/restore strategies, and disaster recovery solutions experience

Technologies we use in data engineering:

  • Scala, sbt, Python, Pytest, tox, Java, Maven, Github, Code Artifact, Apache Spark (on EMR and Databricks), Airflow, AWS (DynamoDB, RDS, Kinesis, SQS, SNS, MWAA, S3, Lambda, Event Bridge, MSK, EKS, ECR, Kubernetes, Kafka, Delta Lake, SparkML, GraphX, Snowflake and Periscope (Sisense)

JOIN US
Lark is a companion that lives on the user’s phone and provides daily coaching, guidance, and health insights to users. Developed with the expertise of world-leading health professionals, we are on a mission to make the world a healthier, happier place. Come join our team!

About Lark
Lark is the world's largest A.I.-enabled healthcare provider, delivering virtual care and connected health devices that help people stay in control of conditions such as diabetes and heart disease. We’re on a mission to improve the health and happiness of the 1 billion people globally who are living with or at risk for a chronic health condition. Our A.I. enabled programs were the first to achieve clinical outcomes matching those of in-person healthcare professionals, showing the ability to deliver care at a fraction of the cost and at enormous scale. 

Working at Lark 
Lark offers the option to work remotely when both the employee and the job are suited to such an arrangement and when the employee and their manager are in alignment with the work location and working hours. This option is only applicable to U.S. employees. To accommodate all employees, we have identified the “core” period of the day during which all employees are required to work for scheduled meetings, syncs, scrums between 10 a.m. and 3 p.m. Pacific time. The company is headquartered in Mountain View, CA.

Lark is an Equal Opportunity Employer.

 

#LI-PH1
#BI-Remote

This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.