Data Engineer

Data Engineer

Basic information

Job Name:

Data Engineer

Location:

Remote

Line of Business:

Data Science

Job Function:

Investor Services

Date:

Thursday, July 31, 2025

Position Summary

We are seeking a highly skilled Data Engineer to join our dynamic team. The ideal candidate will be responsible for creating robust data pipelines from various data vendors to gold tables, primarily for our Machine Learning (ML) team, utilizing Snowflake and Databricks platforms. The role demands expertise in Python, deep familiarity with financial data sources, and the ability to deploy complex data pipelines efficiently. This position requires a proactive approach to analyzing, aggregating, and enriching financial data from both private and public companies.

Responsibilities

  • Build, scale, and maintain robust data solutions to support the firm's objectives.
  • Implement and optimize high-performance data pipelines -- extraction, loading, transformation, and orchestration – that are designed for scalability, reliability, maintainability, and speed.
  • Lead software development projects end to end involving large language models (LLMs), retrieval-augmented generation (RAG) frameworks, and other AI technologies.
  • Champion modern software engineering practices as CI/CD, infrastructure-as-code, containerization, and cloud-native deployments
  • Collaborate closely with business stakeholders to transform use cases into production-ready services and solutions, owning the system from concept to production.
  • Implement rigorous testing and monitoring practices to maintain superior data quality and integrity.
  • Mentor and develop junior team members, fostering a culture of excellence and continuous learning within the team.
  • Be willing to travel up to 20% of the time to collaborate with distributed team members across locations. 

Qualifications

Education & Certificates

  • A bachelor's degree, required
  • Masters or other higher degree in a STEM field, preferred
  • Concentration in Computer Science, Math, Physics or other engineering related field, preferred

Professional Experience

  • 3-5 years of experience in data engineering or a related discipline, with a proven track record of success.
  • Experience in the financial services or private equity industry, preferred 

Competencies & Attributes

  • Expertise in Python and SQL, with a strong foundation in data manipulation and analysis.
  • Proficient with Databricks/PySpark and dbt for data warehousing and data transformation tasks.
  • Experience with workflow orchestration tools e.g. Airflow, Temporal
  • Experience working with large language models (LLMs) especially prompt engineering, retrieval-augmented generation (RAG)s, and/or vector databases.
  • Knowledge of fundamental principles of machine learning, feature engineering, and knowledge graphs are pluses.
  • Demonstrated experience in designing and implementing complex data systems from the ground up.
  • Proficient in handling large-scale data projects, including data cleaning, ETL, and information retrieval.
  • Previous experience in a product development or financial services environment is highly desirable.
  • Excellent communication skills required, both verbal and written.

Benefits/Compensation

The compensation range for this role is specific to Washington, DC and takes into account a wide range of factors including but not limited to the skill sets required/preferred; prior experience and training; licenses and/or certifications.

The anticipated base salary range for this role is $130,000 to $150,000.

In addition to the base salary, the hired professional will enjoy a comprehensive benefits package spanning retirement benefits, health insurance, life insurance and disability, paid time off, paid holidays, family planning benefits and various wellness programs.  Additionally, the hired professional may also be eligible to participate in an annual discretionary incentive program, the award of which will be dependent on various factors, including, without limitation, individual and organizational performance.

Due to the high volume of candidates, please be advised that only candidates selected to interview will be contacted by Carlyle.

Company Information

The Carlyle Group (NASDAQ: CG) is a global investment firm with $453 billion of assets under management and more than half of the AUM managed by women, across 641 investment vehicles as of March 31, 2025. Founded in 1987 in Washington, DC, Carlyle has grown into one of the world's largest and most successful investment firms, with more than 2,300 professionals operating in 29 offices in North America, Europe, the Middle East, Asia and Australia. Carlyle places an emphasis on development, retention and inclusion as supported by our internal processes and seven Employee Resource Groups (ERGs). Carlyle's purpose is to invest wisely and create value on behalf of its investors, which range from public and private pension funds to wealthy individuals and families to sovereign wealth funds, unions and corporations. Carlyle invests across three segments - Global Private Equity, Global Credit and Carlyle AlpInvest - and has expertise in various industries, including: aerospace, defense & government services, consumer & retail, energy, financial services, healthcare, industrial, real estate, technology & business services, telecommunications & media and transportation.

At Carlyle, we believe that a wide spectrum of experiences and viewpoints drives performance and success. Our CEO, Harvey Schwartz, has stated that, "To build better businesses and create value for all of our stakeholders, we are focused on assembling leadership teams with the strongest insights from a range of perspectives." We strive to foster an environment where ideas are openly shared and valued. By bringing together teams with varied expertise and approaches, we enjoy a competitive advantage and create a stronger foundation for long-term success.

Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.