Sr. Lead Data Engineer

This job is no longer open

Amherst is revolutionizing the way U.S. real estate is priced, managed and financed in order to unlock opportunities for all market participants. Driven by data, analytics, and technology, Amherst has a 20-year history of anticipating where the next risks and opportunities are likely to emerge and designing actionable strategies for investors to capitalize on opportunities across residential real estate, commercial real estate and public securities. Amherst, along with its affiliates and subsidiaries, has more than 900 employees, $5 billion under management and approximately $15 billion under advisement and oversight. www.amherst.com.

Lead engineer and manager for a small group of data engineers (2-4).
Own data processes, help architect and plan the data engineering environment, oversee the workload of 2-4 direct reports, and coordinate data projects with external vendors, internal business stakeholders, and product managers.

Responsibilities:

  • Lead a small group of 2-4 data engineers
  • Develop logical data models and processes to transform, cleanse, and normalize raw data into high-quality datasets aligned with our analytical requirements.
  • Develop and maintain comprehensive controls to ensure data quality and completeness.
  • Manage data movement through our infrastructure. Streamline existing data workflows to create a flexible, reliable, and faster process.
  • Develop data transformations and data validations in ETL pipelines.
  • Identify and onboard new data sources. Collaborate with data vendors and internal stakeholders to define requirements and build interfaces.
  • Troubleshoot and resolve issues with data feeds.
  • Design, develop, and implement data infrastructure and pipelines that collect, connect, centralize, and curate data from various internal and external data sources
  • Participate in data architecture discussions to understand target data structures, required data transformations and deliver data pipelines/ETL loading processes that meet requirements.
  • Perform detailed exploration of new internal and external source data to perform source-to-target mapping to inform the development of new data pipelines.
  • Investigate the root cause of data-related issues and implement viable, sustainable solutions to correct issues.
  • Act as a lead to a small engineering group focused on the above responsibilities and duties

Requirements:

  • Passion for data organization, quality, and reliability
  • Ability to lead a small team of 2-4 engineers including organizing and prioritizing work
  • Apache Spark / Hadoop / Hive / HDFS, Presto/Trino
  • Python / PySpark
  • MS SQL Server (preferred), Postgres, MySQL, etc.
  • Experience developing and tuning efficient Apache Spark jobs, SQL queries, ETL pipelines
  • Experience designing, building and maintaining data warehouse fact and dimension tables
  • Experience with at least one language: Python (preferred), C#, Scala, Java
  • Source Control (GitHub)
  • Experience working with large datasets
  • Proactive, hardworking team player with excellent communication skills

Bonus Skills:

  • Snowflake Cloud Data Warehouse
  • Matillion ETL for Snowflake
  • Unit Testing
  • DevOps Deployment Automation
  • C#, Object Oriented Programming
  • NoSQL database technologies like Cassandra, HBase, MongoDB
  • Strong knowledge of statistics, including hands-on experience with Python, R, SAS, Matlab, Machine Learning, AI.
  • Big Data platforms like Cloudera, Databricks, Amazon AWS, Azure
  • Tableau/BI Reporting Tools

Our full-time employee benefits include:

  • A competitive compensation package, annual bonus, 401k match
  • Flexible PTO including 7 paid holidays, 1 floating holiday, and 1 volunteer day
  • Employer-paid benefits (medical, dental, vision, health savings account)
  • Professional career development and reimbursement
  • Up to 16 weeks paid maternity leave; up to 4 weeks of paid parental leave
  • Backup childcare offered through Bright Horizons
  • Relaxed casual environment with virtual office events

Amherst is proud to be an Equal Opportunity Employer and committed to creating an inclusive environment for all employees. We do not discriminate on the basis of race, color, religion, national origin, gender, pregnancy, sexual orientation, gender identity, age, physical or mental disability, genetic information or veteran status, and encourage all applicants to apply.

