Lead Data Engineer

Lead Data Engineer

Join phData, a dynamic and innovative leader in the modern data stack. We partner with major cloud data platforms like Snowflake, AWS, Azure, GCP, Fivetran, and dbt to deliver cutting-edge services and solutions. We're committed to helping global enterprises overcome their toughest data challenges. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.

  • 4x Snowflake Partner of the Year (2020, 2021, 2022, 2023) 
  • #1 Partner in Snowflake Advanced Certifications
  • 600+ Expert Cloud Certifications (Fivetran, dbt, Sigma Award Winners)
  • 7x Best Places to Work 
  • Inc 5000 Fastest Growing US Companies (2020-2023)

Required Experience:

  • 8+ years as a hands-on Data Engineer designing and implementing data solutions
  • Team lead, and/or mentorship of other engineers
  • Ability to develop end-to-end technical solutions into production — and to help ensure performance, security, scalability, and robust data integration.
  • Programming expertise in Java, Python and/or Scala 
  • Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP
  • SQL and the ability to write, debug, and optimize SQL queries
  • Client-facing written and verbal communication skills and experience
  • Create and deliver detailed presentations 
  • Detailed solution documentation (e.g. including POCS and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.)
  • 4-year Bachelor's degree in Computer Science or a related field

Prefer any of the following: 

  • Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks
  • Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra or other NoSQL storage systems
  • Data integration technologies: Spark, Kafka, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc or other data integration technologies
  • Multiple data sources (e.g. queues, relational databases, files, search, API)
  • Complete software development lifecycle experience including design, documentation, implementation, testing, and deployment
  • Automated data transformation and data curation: dbt, Spark, Spark streaming, automated pipelines
  • Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi

Why phData? We offer:

  • Remote-First Workplace
  • Medical Insurance for Self & Family
  • Medical Insurance for Parents
  • Term Life & Personal Accident
  • Wellness Allowance
  • Broadband Reimbursement
  • 2-4 week bootcamp and provide continuous learning opportunities to enhance your skills and expertise
  • Other perks include paid certifications, professional development allowance and additional compensation for creating company-approved content (dashboards, blogs, videos, whitepapers, etc.)

#LI-DNI

Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.