Data Engineer (Flink)

Data Engineer (Flink)

This job is no longer open

About us

Who we are? We are Big Data experts , working with international clients, creating and leading innovative projects related to the Big Data environment. We offer tailor-made solutions. It does not matter whether we are talking about building a Data Lake , conducting training in the field of data management, or performing detailed Big Data analysis. We don’t just focus on one technology, instead we specialize in a whole range of open-source and public cloud tools.Our team brings together over 130 specialists in their fields. We have participated in dozens of conferences, written countless amounts of code, we are the organizers of Big Data Tech Summit Warsaw, the largest Polish conference related to Big Data topics. We run webinars, share knowledge on blogs, creating whitepapers and more. Why? Because we believe that Big Data is an indispensable future of business.
Thanks to that, we always select the most optimal Big Data solutions.

Customer

We are working on the project for one of the modern banks. We implement an innovative Event Stream Processing Platform based on Apache Flink to solve multiple large scale real-time automation cases. 
Project
The solution encompasses also building analytics workbench to data analysts self-serviced, observability platform and whole infrastructure to make the solution robust, scalable, fault tolerant and high quality, according to the best DevOps and engineering practices. The platform is created from scratch, the project is just starting.

Responsibilities

  • Design and build a platform for large scale data pipelines on Apache Flink and Apache Kafka

  • Implement SQL and Python based Complex Event Processing pipelines

  • Build platform components with use of Python, Java

  • Troubleshoot application across Java code, Flink framework, JVM, Kubernetes and even OS

  • Deploy and maintain stateful applications on Kubernetes

  • Use best engineering practices: DevOps, continuous integration and delivery, infrastructure as a code

  • Work collaboratively within a team of cross-functional engineers

  • Work with data streams, structured, semi-structure and unstructured data

Technologies used:

  • Flink
  • Python
  • Java
  • JupyterLab
  • Kafka
  • Docker
  • Kubernetes
This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.