As a Data Engineer, you will support our Lake (AWS S3), Warehouse (PostgreSQL), Unified Data Analytics Platform (UDAP), and Message Bus (Kafka with Debezium).
While ingesting data, you'll handle data format conversion, tagging, parsing, monitoring, and alerting.
The end goal is to deliver value from data (making healthcare world class for patients, nurses, and providers) to all stakeholders of the organization, especially Data Analysts and Data Scientists.
This role will work closely with DevOps, Security, and Data Architects.
- 7yr+ Software Engineering with a focus on Data
- 3yr+ Shell AND Python
  - https://projects.apache.org/projects.html?language#Python (22)
- 3yr+ AWS (at least 3 of these)
  - Athena, EMR, Redshift, Kinesis, Kafka, QuickSight, Glue, Lake Formation, Data Pipeline, S3, IAM, EKS, ECS
- 1yr+ Kafka
- 6mo+ Kubernetes/Docker
- 1yr+ Data and Model Pipeline CI/CD experience
- Proven ability to define AWS data analytics services and understand how they integrate with each other
- Proven ability to explain how AWS data analytics services fit into the data lifecycle of collection, storage, processing, and visualization
- SQL ANSI 2015+
- Big Data:
  - https://projects.apache.org/projects.html?category#big-data (50)
  - Presto
- Storage/API:
  - gRPC, protobuf
  - Arrow, Parquet, CSV
Only at Carerev
- Leadership has had successful exits before
- 1 of only 1,800 AWS Community Builders worldwide
- A publicly recognized AWS Exam Author and Trainer
- A past Vice President of the Apache Software Foundation Infrastructure
- An author of multiple published O'Reilly technical books on open source