Data Integration Engineer

*Due to COVID-19, we are working 100% remotely, including throughout the hiring process. When it is safe to do so, we will return to a hybrid of onsite and remote work for some positions.

Why This Role Is Important To Arcadia

In this position, you will work with a talented engineering group to design, develop, install, and test data pipelines in Scala and Spark SQL that ingest and analyze over a billion records every night. Leveraging technologies like Mesos, DCOS, Apache Nifi, Spark, ElasticSearch, and ELK, you will implement data integration projects on cloud platforms such as AWS. This role involves building the data pipelines that feed the cutting edge of population health and analytics within the healthcare industry and solving complex problems to ensure the best data for our clients. The ideal candidate is team-first, detail-oriented, and a self-starter who takes ownership of each project they’re on.
What Success Looks Like
In 3 months
- Train in the different areas of the data connector life cycle
- Work on initial data integration and egress
- Work on data quality and analytics around connectors
- Work on enhancement and issue triage
In 6 months
- Start to develop clinical and claims data connectors
- Work on higher level enhancement requests and defects
- Deliver Data Quality Reviews to clients
In 12 months
- Develop a range of data connectors of varying complexity
- Work on teams with Product, Engineering, or Implementation to build out tools for better data integration
- Pick an SME (Subject Matter Expert) path for what excites you the most
- Work on standardized data connector development
- Work with Product to build out new data types for new requirements

What Will You Be Doing

    • Build standard and custom software to integrate large clinical and claims data sets into the Arcadia Analytics infrastructure
    • Use Nifi, Scala, Apache Spark, or other tools/languages to cleanse and transform incoming data into normalized formats
    • Design and implement software components
    • Perform code reviews and unit and integration testing
    • Deploy software components
    • Manage code repositories & enforce software versioning
    • Establish and maintain efficient local development environments
    • Provide feedback and recommendations to improve software development processes

What You’ll Bring

    • 2 to 5+ years of related work experience
    • Expert-level SQL skills
    • Experience with SQL or NoSQL databases such as MySQL, Postgres, Cassandra, MS SQL Server, or Oracle
    • Proficiency in at least one of the following languages: Scala, Java, Python, R (expert level if no SQL experience)
    • Experience working with complex data sets
    • Healthcare data experience
    • Experience with Business Intelligence software or advanced reporting queries/frameworks

Would Love for You to Have

    • Apache Nifi, Talend, IBM InfoSphere, TIBCO, Pentaho, or Informatica
    • ELK (ElasticSearch/Logstash/Kibana)
    • Distributed Hadoop-like technologies such as Spark, Storm and/or Kafka
    • Tableau, QlikView, Apache Zeppelin, IPython or Jupyter
    • GitHub
    • HL7, CCD, CCLF file formats/designs

What You'll Get

    • Learn a TON about healthcare and the bleeding edge of healthcare analytics and medical economics
    • Learn the Apache tech stack and distributed computing
    • Become an expert in clinical and claims healthcare data
    • Chance to be surrounded by a team of extremely talented and dedicated individuals driven to succeed
    • Be a part of a mission driven company that is transforming the healthcare industry by changing the way patients receive care
    • A flexible, remote friendly company with personality and heart
    • Employee driven programs and initiatives for personal and professional development
    • Be a member of the Arcadian and Barkadian Community

About Arcadia

Arcadia helps innovative healthcare systems and health plans around the country transform healthcare to reduce cost while improving patient health. We do this by aggregating massive amounts of clinical and claims data, applying algorithms to identify opportunities to provide better patient care, and making those opportunities actionable by physicians at the point of care in near-real time. We are passionate about helping our customers drive meaningful outcomes. We are growing fast and have emerged as the market leader in the highly competitive population health management software and value-based care services markets, and we have been recognized by industry analysts KLAS, IDC, Forrester and Chilmark for our leadership. For a better sense of our brand and products, please explore our website, our online resources, and our interactive Data Gallery.

This position is responsible for following all security policies and procedures in order to protect all PHI under Arcadia's custodianship, as well as Arcadia's intellectual property. For any security-specific roles, the responsibilities will be further defined by the hiring manager.
