Data Engineer | GenAI

This job is no longer open

Apply now for a career that puts wellbeing first!

GET TO KNOW US

Wellhub (formerly Gympass*) is a corporate wellness platform that connects employees to the best partners for fitness, mindfulness, therapy, nutrition, and sleep, all included in one subscription designed to cost less than each individual partner. Founded in 2012 and headquartered in NYC, we have a growing global team in 11 countries. At Wellhub, you have the opportunity to build a career in a high-growth tech company that places wellbeing at the foundation of its culture, and contribute to making every company a wellness company. 

*Big news: Gympass is now Wellhub! 
We are thrilled to announce our rebranding as Wellhub, marking a significant milestone in our journey. This transformation reflects our evolution from a “pass for gyms” to a comprehensive employee wellbeing solution. With our refreshed identity, we are poised to embark on an exciting new chapter of growth and expansion. We are elevating our offerings, including a completely new app experience and an expanded network of wellbeing partners.

THE OPPORTUNITY

We are hiring a Data Engineer for our GenAI team in Brazil!

Our GenAI team leverages advanced AI technologies to personalize user experiences, streamline operations, and optimize resources, acting as a hub for innovation while enabling Wellhub to harness the full potential of generative AI across multiple domains.

We are seeking a meticulous and dedicated Data Engineer to join our team, focusing on data cleanup and preparation for generative AI systems. This role is crucial for ensuring the quality and reliability of the data that powers our advanced AI models. Talented data engineers from any area who want to shift their careers toward AI are welcome to apply if they match some of the required and preferred qualifications listed below. Such candidates must be avid learners who are prepared to learn fast.

YOUR IMPACT

Data Cleaning and Preprocessing:

  • Develop and implement data cleaning procedures to ensure high-quality data for AI model training.
  • Identify and rectify data inconsistencies, errors, and anomalies.
  • Perform data normalization, transformation, and augmentation as needed.

Data Pipeline Development:

  • Design and build scalable data pipelines to automate data collection, cleaning, and preprocessing tasks.
  • Collaborate with other data engineers to integrate data cleaning processes into the overall data pipeline.
  • Ensure data pipelines are efficient, reliable, and maintainable.

Data Quality Assurance:

  • Establish and enforce data quality standards and best practices.
  • Develop and maintain data validation and verification routines.
  • Monitor and report on data quality metrics to ensure continuous improvement.

Collaboration and Communication:

  • Work closely with data scientists, AI researchers, and other engineers to understand data requirements and challenges.
  • Participate in cross-functional team meetings to provide updates on data cleaning efforts and collaborate on solutions.
  • Communicate data quality issues and solutions effectively to stakeholders.

Tooling and Automation:

  • Develop and maintain tools and scripts for data cleaning and preprocessing.
  • Automate repetitive data cleaning tasks to increase efficiency.
  • Stay updated with the latest tools and techniques in data cleaning and preprocessing.

Live the mission:

  • Inspire and empower others by genuinely caring for your own wellbeing and that of your colleagues. Bring wellbeing to the forefront of work, and create a supportive environment where everyone feels comfortable taking care of themselves, taking time off, and finding work-life balance.

WHO YOU ARE

  • Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related field.
  • Proven experience in software development with a focus on data processing and data integration.
  • Strong programming skills in languages such as Python, SQL, or Java.
  • Experience with ETL and data processing frameworks and libraries (e.g., Flink, Spark).
  • Familiarity with machine learning and AI concepts, particularly related to generative AI.
  • Knowledge of data pipeline tools and technologies (e.g., Airflow, EMR, Kafka).
  • Excellent problem-solving skills and attention to detail.
  • Strong communication skills in both English and Portuguese, and strong teamwork skills.

The following are preferred qualifications:

  • Experience with cloud platforms (e.g., AWS, Google Cloud) and their data services.
  • Experience with processing, analyzing, and extracting insights from unstructured data, including text, images, and audio.
  • Knowledge of data governance and data privacy best practices.
  • Knowledge of ML frameworks and techniques (e.g., Scikit-Learn, Keras).

We recognize that individuals approach job applications differently. We strongly encourage all aspiring applicants to go for it, even if they don't match the job description 100%. We welcome your application and will be delighted to explore whether you could be a great fit for our team. For this specific role, please note that the following are mandatory requirements: programming experience in Python, SQL, or Java; familiarity with machine learning and AI concepts; and experience with ETL and data processing frameworks and libraries.

WHAT WE OFFER YOU 

We're a wellness company that is committed to the health and wellbeing of our employees. Our flexible program allows you to customize your benefits according to your needs!

Our benefits include:

WELLNESS: Health, dental, and life insurance.

FLEXIBLE WORK: At Wellhub, flexibility fosters a happier, healthier, and more productive work environment for everyone. As a Flexible First company, we offer two work model options: flexible hybrid and full remote, and make the office a place for collaboration, community, and team building. The model for this role can be discussed with your recruiter and hiring manager. We offer all employees a home office stipend and a monthly flexible work allowance to help cover the costs of working from home.

FLEXIBLE SCHEDULE: Wellhubbers and their leaders can make the best decisions for their scope. This includes flexibility to adjust their working hours based on their personal schedule, time zone, and business needs.

WELLHUB: We believe in our mission and encourage our employees and their families to take care of their wellbeing too. Access onsite gyms and fitness studios, digital fitness programs, and online wellness resources for meditation, nutrition, mental health support, and more. You will receive the Gold plan at no cost, and other premium plans will be significantly discounted.

PAID TIME OFF: We know how important it is that our employees take time away from work to recharge. 

Vacation after 6 months, plus 3 days off per year, 1 additional day off for each year of tenure (up to 5 additional days), and an extra day off for your birthday.

PAID PARENTAL LEAVE: Welcoming a new child is one of the most special moments in your life and we want our employees to take the time to be present and enjoy their growing family. We offer 100% paid parental leave to all new parents and extended maternity leave.

CAREER GROWTH: Outstanding opportunities for personal and career growth. That means we maintain a growth mindset in everything we do and invest deeply in employee development.  

CULTURE: An exciting and supportive atmosphere with ambitious people from around the world! You’ll partner with global colleagues and share in the success of a high-growth technology company disrupting the health and wellness space. Our value-based culture of trust, flexibility, and integrity makes this possible every day. Find more info on our careers page.

To get a glimpse of life at Wellhub, follow us on Instagram (@lifeatwellhub) and LinkedIn!

Diversity, Equity, and Belonging at Wellhub

We aim to create a collaborative, supportive, and inclusive space where everyone knows they belong.

Wellhub is committed to creating a diverse work environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, sex, gender identity or expression, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.

Questions about how we treat your personal data? See our Candidate Privacy Notice (Aviso de Privacidade para Candidatos).


