Software Engineer II, Copilot Data

Software Engineer II, Copilot Data

This job is no longer open

About GitHub

As the global home for all developers, GitHub is the complete AI-powered developer platform to build, scale, and deliver secure software. Over 100 million people, including developers from 90 of the Fortune 100 companies, use GitHub to build amazing things together across 330+ million repositories. With all the collaborative features of GitHub, it has never been easier for individuals and teams to write faster, better code.

Locations

In this role you can work from Remote, United States

Overview

The Copilot Metrics team is a unique group composed of software engineers, data engineers, and data analysts. We build a service to support our customers' metrics needs and help them understand product usage patterns. The analysts will assist with data modeling, support product managers and leadership in understanding how customers are using our product, and identify pain points. This valuable analysis will also be used to help the product team develop better features.

As a Data Software Engineer on the Copilot Metrics team, you will be responsible for designing, developing, and maintaining efficient and reliable data pipelines. You will work closely with stakeholders across the company to gather business requirements, build data models, and ensure data quality and accessibility. Your expertise in Python, SQL, Airflow, and Spark will be crucial in optimizing our data infrastructure and enabling data-driven decision-making.


Responsibilities

  • Data Pipeline Development: Design, build, and maintain scalable data pipelines using Python, SQL, Airflow, and Spark.

  • Business Requirements Gathering: Collaborate with stakeholders to understand and translate business requirements into technical specifications.

  • Data Modeling:Develop and implement data models that support analytics and reporting needs, ensuring alignment with business goals.

  • Data Quality and Governance: Ensure data accuracy, consistency, and reliability by implementing robust data validation and quality checks.

  • Stakeholder Collaboration: Work with cross-functional teams, including data analysts, data scientists, and business leaders, to deliver high-quality data solutions.

  • Performance Optimization: Continuously monitor and optimize data pipelines for performance, scalability, and cost-efficiency.

  • Monitoring and Observability: Build and implement monitoring and observability metrics to ensure data quality and detect anomalies in data pipelines.

  • Documentation and Communication: Maintain clear and comprehensive documentation of data processes and effectively communicate technical concepts to non-technical stakeholders.


Qualifications

Required/Minimum Qualifications:

  • 2+ years experience in Software Engineering, Computer Science, or related technical discipline with proven experience maintaining production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python 

    • o OR Associate’s Degree in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related field AND 1+ year(s) experience 

    • o OR Bachelor's Degree in Computer Science or related field o OR equivalent experience

  • 2+ years of experience in data engineering or analytics engineering roles, with strong proficiency in Python, SQL, Airflow, and Spark, and extensive expertise in building and maintaining robust data pipelines and ETL processes

  • Experience gathering business requirements and translating them into effective data models that support comprehensive data analysis and reporting

Preferred Qualifications:

  • Familiarity with Go and Ruby
  • Experience with cloud platforms such as AWS, GCP, or Azure

  • Familiarity with data warehousing solutions (e.g., Snowflake, Redshift, BigQuery)

  • Knowledge of data governance and data security best practices

  • Communication: Excellent verbal and written communication skills, with the ability to convey technical information to non-technical audiences

  • Collaboration: Proven ability to work effectively in a collaborative, cross-functional environment


Compensation Range

The base salary range for this job is USD $75,000.00 - USD $198,900.00 /Yr.

These pay ranges are intended to cover roles based across the United States. An individual's base pay depends on various factors including geographical location and review of experience, knowledge, skills, abilities of the applicant. At GitHub certain roles are eligible for benefits and additional rewards, including annual bonus and stock. These rewards are allocated based on individual impact in role. In addition, certain roles also have the opportunity to earn sales incentives based on revenue or utilization, depending on the terms of the plan and the employee's role.

GitHub values

  • Customer-obsessed
  • Ship to learn
  • Growth mindset
  • Own the outcome
  • Better together
  • Diverse and inclusive

Manager fundamentals

  • Model
  • Coach
  • Care

Leadership principles

  • Create clarity
  • Generate energy
  • Deliver success


Who We Are

GitHub is the world’s leading AI-powered developer platform with 100 million developers and counting. We’re also home to the biggest open-source community on earth (and 99% of the world’s software has open-source code in its DNA). Many of the apps and programs you use every day are built on GitHub.
Our teams are dreamers, doers, and pioneers, leading the way in AI, driving humanitarian efforts around the globe, and even sending open source to Mars (and beyond!). At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are—because we know that people flourish when they can work on their own terms.
Join us, and let’s change the world, together.

EEO Statement

GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don't discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there's any way we can make the interview process better for you; we're happy to accommodate!

This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.