Data Engineer

Data Engineer

This job is no longer open
At Scribd (pronounced “scribbed”), we believe reading is more important than ever. Join our cast of characters as we work to change the way the world reads by building the world’s largest and most fascinating digital library: giving subscribers access to a growing collection of ebooks, audiobooks, magazines, documents, Scribd Originals, and more. In addition to works from major publishers and top authors, our community includes over 1.5M subscribers in nearly every country worldwide.

We’re defining the workforce of the future with Scribd Flex, a program that embraces multiple perspectives while leaning into our belief that no matter where each team member is, we trust them to accomplish our shared business goals. This program lets employees, in partnership with their manager, choose where they work while creating intentional in-person meetings with co-workers that build culture and connect us personally.

Remote employees must have their primary residence in: Arizona, California, Colorado, Delaware, DC, Florida, Hawaii, Iowa, Massachusetts, Michigan, Missouri, Nevada, New Jersey, New York, Ohio, Oregon, Tennessee, Texas, Utah, Vermont, Washington, Ontario (Canada), and British Columbia (Canada).
*This list may not be complete or accurate, and candidates should speak with their recruiter about their specific location for remote work.


What you'll do

Data quality and integrity are two areas of focus for your work in our existing, organically-grown data infrastructure. You will assist with building tools and technology to ensure that downstream customers can have faith in the data they're consuming. Based on the project, this might involve cross-functional work with the Data Science or Content Engineering teams to troubleshoot, process, or optimize our business-critical pipelines, or working with Core Platform to implement better processing jobs for scaling our consumption of streaming data sets. Almost everything you would be working on would be to increase the "customer satisfaction" for internal customers of Scribd data.

Required Skills

    • Strong written and verbal communication skills (we're remote!).
    • You have at least 3 years of experience in data engineering creating or managing end-to-end data pipelines on large complex datasets.
    • You have engineered scalable software using big data technologies (e.g. Hadoop, Spark, Hive, Flink, Samza, Storm, Elasticsearch, Druid, Cassandra, etc).
    • Fluency with at least one dialect of SQL.
    • Expertise in Scala, Java, or Python.

Desired Skills

    • You have worked on and have knowledge of Streaming platforms, typically based around Kafka.
    • Strong grasp of AWS data platform services and their strengths/weaknesses.
    • Strong experience using  Jira, Slack, JetBrains IDEs, Git, GitLab, GitHub, Docker, Jenkins, Terraform. 
    • Experience using DataBricks.
Benefits, Perks, and Wellbeing at Scribd

• Healthcare Insurance Coverage: Scribd pays for employee’s Medical, Vision, and Dental premiums and a portion of dependent premiums
• 401k/RSP plans provided, plus company matching with no vesting period
• Professional development: generous annual budget for our employees to attend conferences, classes, and other events
• Quarterly Wellness, Connectivity & Comfort Benefit
• Concern mental health digital platform
• Free subscription to Scribd + gift memberships for friends & family
• Leaves: 12 weeks paid parental leave, company paid short-term/long-term disability plans and milestone Sabbaticals
• Generous Paid Time Off: Paid Holidays, Flexible Sick Time, Volunteer Day + office closure between Christmas Eve and New Years Day
• Company-wide Diversity, Equity, & Inclusion programs which include learning & development opportunities, employee resource groups, and hiring best practices.

Want to learn more? Check out our office and meet some of the team at www.linkedin.com/company/scribd/life

Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

We encourage people of all backgrounds to apply. We believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.

 #LI-Remote
This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.