Senior Data Engineer (GCP)

Senior Data Engineer (GCP)

This job is no longer open

About us

Who we are? We are Big Data experts , working with international clients, creating and leading innovative projects related to the Big Data environment. We offer tailor-made solutions. It does not matter whether we are talking about building a Data Lake , conducting training in the field of data management, or performing detailed Big Data analysis. We don’t just focus on one technology, instead we specialize in a whole range of open-source and public cloud tools.Our team brings together over 130 specialists in their fields. We have participated in dozens of conferences, written countless amounts of code, we are the organizers of Big Data Tech Summit Warsaw, the largest Polish conference related to Big Data topics. We run webinars, share knowledge on blogs, creating whitepapers and more. Why? Because we believe that Big Data is an indispensable future of business.

Thanks to that, we always select the most optimal Big Data solutions.

Customer

We are working with one of the Polish eCommerce leaders who plans to fully migrate to GCP from on-premise Hadoop till the end of 2022. This initiative is delivered in phases, and currently the need is to migrate the data from the existing Oracle system to a Google Cloud Platform, without the mediation of Hadoop.


Project

Data migration from Oracle system to Google Cloud Platform (BigQuery, DataProc, Cloud Composer), without the mediation of Hadoop.

Responsibilities
The activities include:

  • Spark Job adaptation during migration to GCP
  • Performance optimization for large data volumes (over 1kkk rows)
  • Refactoring of the business logic within the queries per needs
  • Cost optimization

Technologies used:

  • GCP (BigQuery, DataProc, Cloud Composer)
  • Apache Spark
  • Apache Airflow
  • Scala
  • Python
  • Terraform
This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.