About The Role
Rippling is the system of record for employee data: a complete Employee Management System. To solve this broad problem, a variety of applications and datasets need to come together as a graph connected through the employee record at its center. This graph needs to be stored in a data lake, in forms that allow a variety of user requests on that data to be processed efficiently.
We need a data platform that makes it easy to add all forms of data to this data lake, perform various transformations, and query the data efficiently for a variety of online and offline use cases. You will be building this distributed data platform: defining key APIs, designing for scale and high availability, and handling both streaming and batch scenarios.
At Rippling, we want users to configure and customize our features and products without complexity. To that end, we’ve created a new language called Rippling Query Language (RQL), which is used across the product. You will work on scaling RQL on top of the data platform using a distributed query processing engine (Presto) and a distributed datastore (Apache Pinot).
What You'll Do: