Cloud Data Integration Engineer

Cloud Data Integration Engineer

VetsEZ is seeking a Cloud Data Integration Engineer to support a project with the Department of Veterans Affairs. The position will support the Data Migration & Syndication (DMS) effort within the Electronic Health Record Modernization Integration Office (EHRM-IO) project. The ideal candidate will work with EHRM Architects and stakeholders to design, enhance, and continue implementation of a cloud strategy to integrate the VistA data from the VX130 system and Oracle Health Millennium data, provide advanced analytics, AI/ML, research, reporting and application needs. Part of the cloud strategy and design efforts will be an analysis and evaluation of which cloud platform to use between Microsoft Azure, Google and AWS.

The candidate must reside within the continental US.

Responsibilities:

  • Utilize a diverse set of tools and systems (AITC, Microsoft Azure Data Lake Storage, AWS S3 and others) to support easy access to the data.
  • Handle incremental updates to databases with near-real time data Management of the data through parquet files or other file formats.
  • Create and manage ETL and Extract, Load, transform (ELT) scripts to populate EHRM data model tables with data.
  • Test and Validate data pipelines and data quality improvement.
  • Utilize data standards, terminologies and regulations in data modeling.
  • Generate Database entity diagrams and data dictionaries using erwin data modeler or similar tool.
  • Access, query, read, write and transform data to and from multiple data sources and varying database applications.
  • Create and update databases comprised of various data types, formats, constraints and storage options over multiple platforms (such as Azure Cloud, etc.).
  • Communicate complex technical concepts to non-technical stakeholders.
  • Monitor and optimize Databricks jobs and clusters to ensure efficient and scalable performance.
  • Troubleshoot and resolve issues related to data integration using Databricks.
  • Build and optimize ‘big data’ data pipelines, architectures and data sets.
  • Update documentation/Wiki pages to document work and updates as directed by the Government Project Manager.
  • Update the Data Lake/Analytics Database Design Document with the enhancements and changes.

Requirements:

  • Bachelor’s degree in Computer Science, Electronics Engineering, or a related technical field, plus 5+ years of experience.
  • Data integration activities could include, the use of the following tools and languages: SSIS, T-SQL, P-SQL, BIML Studio, Visual Studio, PowerBI, Python, Scala, YAML scripting for data pipelines, Azure Data Factory, C#, Talend, AWS Glue, PowerShell, Databricks, DeltaLake, and/or Microsoft Synapse.
  • Knowledge of how to secure the data lake using role-based access controls (RBACs) and Access Control Lists (ACLs).
  • Experience with Agile Frameworks, DevSecOps, and CI/CD Pipelines.
  • Ability to work in a fast paced and agile development environment.
  • Takes ownership of tasks and assignments to completion with the ability to delegate amongst a team effectively.
  • Knowledge of database architecture, administration, and security for on-premise and cloud-hosted database systems.
  • Communicates and leads effectively in detailed technical discussions with the customer and among cross functional stakeholders.

Additional Qualifications:

  • Experience in the VA or other federal organizations desired.
  • Experience with Data Migration and Data Syndication within the VA desired.
  • Ability to obtain a Government clearance.

Benefits:

  • Medical/Dental/Vision
  • 401k with Employer Match 
  • PTO + Federal Holidays  
  • Corporate Laptop 
  • Training opportunities  
  • Remote Opportunity

Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status.

Sorry, we are unable to offer sponsorship at this time.

Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.