About us
Who we are? We are Big Data experts , working with international clients, creating and leading innovative projects related to the Big Data environment. We offer tailor-made solutions. It does not matter whether we are talking about building a Data Lake , conducting training in the field of data management, or performing detailed Big Data analysis. We don’t just focus on one technology, instead we specialize in a whole range of open-source and public cloud tools.Our team brings together over 130 specialists in their fields. We have participated in dozens of conferences, written countless amounts of code, we are the organizers of Big Data Tech Summit Warsaw, the largest Polish conference related to Big Data topics. We run webinars, share knowledge on blogs, creating whitepapers and more. Why? Because we believe that Big Data is an indispensable future of business.
Thanks to that, we always select the most optimal Big Data solutions.
Customer
We are working on the project for one of the modern banks. We implement an innovative Event Stream Processing Platform based on Apache Flink to solve multiple large scale real-time automation cases.
Project
The solution encompasses also building analytics workbench to data analysts self-serviced, observability platform and whole infrastructure to make the solution robust, scalable, fault tolerant and high quality, according to the best DevOps and engineering practices. The platform is created from scratch, the project is just starting.
Responsibilities
Design and build a platform for large scale data pipelines on Apache Flink and Apache Kafka
Implement SQL and Python based Complex Event Processing pipelines
Build platform components with use of Python, Java
Troubleshoot application across Java code, Flink framework, JVM, Kubernetes and even OS
Deploy and maintain stateful applications on Kubernetes
Use best engineering practices: DevOps, continuous integration and delivery, infrastructure as a code
Work collaboratively within a team of cross-functional engineers
Work with data streams, structured, semi-structure and unstructured data
Technologies used:
- Flink
- Python
- Java
- JupyterLab
- Kafka
- Docker
- Kubernetes