Data Delivery Engineer

This job is no longer open
How you will help
You will assist the data delivery team with optimizing the data extraction and delivery process to ensure accurate and on-time data deliveries that meet client expectations. To achieve this, you will dive in to fix issues, optimize processes, and automate what you and the data delivery team do more than once. You will use the best tools for the job, whether modern and revolutionary or time-tested and proven, to deliver elegant, scalable solutions that meet business and technical needs.

What you will do
•Work with internal stakeholders to understand business needs for data deliveries
•Troubleshoot and resolve issues relating to data extraction and delivery
•Help establish procedures and best practices for extracting and delivering data
•Work with some of the most exciting open-source tools, such as Spark, Hadoop, Airflow, and Zeppelin
•Leverage distributed computing and serverless architecture such as AWS EMR
•Enjoy the peace that comes with working in a mature software development environment 
•Research and implement new technologies with a team of developers to execute strategies and implement solutions
•Produce quality, peer-reviewed routines
•Solve complex problems related to the real-time discovery of large data
•Continue to develop broader and deeper knowledge of data assets and analytic methodologies every day

About You
You are...
• Experienced in writing scalable applications on distributed architectures
• Data-driven, testing and measuring as much as you can
• Eager to both review peer code and have your code reviewed
• Confident in SQL: you know it, you write smart queries, and it’s no big deal
• Passionate about data and optimizing processes around it
• Excited about building and creating production processes that run on time, efficiently, and correctly
• A self-starter who enjoys working in a small, rapidly changing, fast-paced environment
• Extremely comfortable working with large data sets 

Required skills and experience
• 5+ years of work experience
• 3+ years of experience with SQL
• 3+ years of experience with Python
• 3+ years of experience with Spark (writing, testing, and debugging Spark routines)
• 1+ years of experience with AWS EMR and AWS S3
• Comfortable using the *nix command line (shell scripting, awk, sed)
• Comfortable working in remote environments
• Able to gather requirements, test strategies, and design deliverables
• Proven analytical, evaluative, and problem-solving abilities
• Extensive experience working in a team-oriented, collaborative environment

Desired experience
• Experience with Apache Airflow
• Experience with Apache Zeppelin
• Knowledge of healthcare industry data utilized by manufacturers, payers, clearing houses, etc.

About HealthVerity
Pharmaceutical manufacturers, payers and government organizations have partnered with HealthVerity to solve some of their most complicated use cases through transformative technologies and real-world data infrastructure. The HealthVerity IPGE platform, based on the foundational elements of Identity, Privacy, Governance and Exchange, enables the discovery of RWD across the broadest healthcare data ecosystem, the building of more complete and accurate patient journeys and the ability to power best-in-class analytics and applications with flexibility and ease. Together with our partners, HealthVerity has built the modern way to data for the health insights economy. To learn more about the HealthVerity IPGE platform, visit www.healthverity.com.

Our company challenges
• Empowering clients with highly rewarding data discovery and licensing tools
• Ingesting and managing billions of healthcare records from a wide variety of partners
• Standardizing on common data models across data types
• Orchestrating an industry-leading HIPAA privacy layer
• Innovating our proprietary de-identification and data science algorithms
• Building a culture that supports rapid iteration and new possibilities

We have big plans
The infrastructure and culture we are building will provide an environment that cultivates innovation. We want to move fast knowing we can fix anything we break along the way. If a new need arises, we want to turn around a solution quickly. We want to solve our challenges in ways that create even more possibilities. We’ve created a platform that will scale to support an ever-growing array of data providers and innovative products and services. You must be able to think big while still delivering on near-term requirements.

We pride ourselves on ensuring that each team member at HealthVerity feels connected, validated and heard. From Philadelphia to Manhattan Beach, our success is driven by recognizing that a team is made up of individuals. We offer a robust set of benefits and perks to everyone. View details on our careers page.

HealthVerity is an equal opportunity employer devoted to inclusion in the workplace. We believe incorporating different ideas, perspectives and backgrounds makes us stronger and encourages an environment where ageism, racism, sexism, ableism, homophobia, transphobia or any other form of discrimination is not tolerated. At HealthVerity, we’re working towards an innovative and connected future for healthcare data and believe the future is better together. We can only do that if everyone has a seat at the table. Read our Equity Inclusion and Diversity Statement.

If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to careers@healthverity.com.

HealthVerity offers in-office and remote options, so you can work from anywhere within the US!