Principal Data Engineer / Architect
Company
Scribd
Location
USA
Type
Full Time
Job Description
At Scribd (pronounced “scribbed”) our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge democratize the exchange of ideas and information and empower collective expertise through our three products: Everand Scribd and Slideshare.
We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.
Our flexible work benefit - Scribd Flex - enables employees in partnership with their manager to choose the daily work-style that best suits their individual needs. As an organization we prioritize collaboration and intentional in-person moments to build culture and connection. For this reason occasional in-person attendance is required for all Scribd employees regardless of their location.
What You'll Do:
As a pivotal member of the team you will lead the design and development of a robust data architecture that guides data modeling integration processing and delivery standards enabling modern data product development at Scribd.
You will also serve as a data and analytics solution architect leading architecture initiatives encompassing data warehousing data pipeline development data integrations and data modeling. You will shape Scribd’s data strategy guiding stakeholders in how they consume and act on data.
We’re looking for someone with proven proficiency in architecting designing and development experience with batch and real time streaming infrastructure and workloads. Your expertise will help establish standards for data modeling integration processing and delivery and also help translate business requirements into technical specifications.
At Scribd we leverage deep data insights to inform every aspect of our business from product development experimentation to understanding our subscriber engagement and tracking key performance indicators. You'll join a data engineering team tackling complex challenges within a rich domain encompassing three distinct brands – Scribd Everand and Slideshare – all serving a massive user base with over 200 million monthly visitors and 2 million paying subscribers. You'll have the opportunity to make a real impact as we are heavily investing in improving our core data layer and this exciting new role puts you right at the forefront of this initiative.
Based on the project this might involve cross-functional work with the Data Science Analytics and other Engineering and Business teams to design cohesive data models database schemas and data storage solutions consumption strategies and patterns. Almost everything you will be working on will be to increase the 'customer satisfaction' for internal customers of Scribd data.
Required Skills:
• 7+ years of experience in data engineering with a strong background in data architecture data modeling and data management building and scaling robust data systems for complex business domains.
• Expertise in Scala or Python with a deep understanding and hands-on experience in Spark for designing optimizing and scaling large-scale data processing pipelines and proficiency in at least one SQL dialect.
• Experience with data lake technologies (e.g. Databricks Delta Lake) data storage formats (Parquet Avro) query engines (such as Photon Spark SQL) and both real-time streaming and batch processing or equivalent technologies and frameworks.
Desired Skills:
• Experience and working knowledge of streaming platforms typically based around Kafka.
• Strong grasp of AWS data platform services and their strengths/weaknesses.
• Hands on experience in implementing data pipelines for data ingestion and transformation to support analytics and ML pipelines
• Strong experience communicating asynchronously using collaboration tools like Jira Slack etc.
• Experience using automation and CI/CD tooling like Git GitHubDockerJenkins Terraform etc.
• Experience developing standards for database design and implementation of various strategic data architecture initiatives around data quality data management policies/standards data governance privacy and metadata management
• Working experience integrating with BI frameworks like Qlik ThoughtSpot Looker Tableau etc.
At Scribd your base pay is one part of your total compensation package and is determined within a range. Our pay ranges are based on the local cost of labor benchmarks for each specific role level and geographic location. San Francisco is our highest geographic market in the United States.
In the state of California the reasonably expected salary range is between $191500 [minimum salary in our lowest geographic market within California] to $259500 [maximum salary in our highest geographic market within California].
In the United States outside of California the reasonably expected salary range is between $158000 [minimum salary in our lowest US geographic market outside of California] to $247000 [maximum salary in our highest US geographic market outside of California].
In Canada the reasonably expected salary range is between $198500 CAD[minimum salary in our lowest geographic market] to $246000 CAD[maximum salary in our highest geographic market].
We carefully consider a wide range of factors when determining compensation including but not limited to experience; job-related skill sets; relevant education or training; and other business and organizational needs. The salary range listed is for the level at which this job has been scoped. In the event that you are considered for a different level a higher or lower pay range would apply. This position is also eligible for a competitive equity ownership and a comprehensive and generous benefits package.
Benefits Perks and Wellbeing at Scribd
*Benefits/perks listed may vary depending on the nature of your employment with Scribd and the geographical location where you work.
• Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
• 12 weeks paid parental leave
• Short-term/long-term disability plans
• 401k/RSP matching
• Tuition Reimbursement
• Learning & Development programs
• Quarterly stipend for Wellness Connectivity & Comfort
• Mental Health support & resources
• Free subscription to Scribd + gift memberships for friends & family
• Referral Bonuses
• Book Benefit
• Sabbaticals
• Company wide events
• Team engagement budgets
• Vacation & Personal Days
• Paid Holidays (+ winter break)
• Flexible Sick Time
• Volunteer Day
• Company-wide Diversity Equity & Inclusion programs
Want to learn more about life at Scribd? www.linkedin.com/company/scribd/life
---------------------------------------------------------------------------------------------------------------------------
We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing accommodations [@] scribd.com about the need for adjustments at any point in the interview process.
Scribd is committed to equal employment opportunity regardless of race color religion national origin gender sexual orientation age marital status veteran status disability status or any other characteristic protected by law. We encourage people of all backgrounds to apply and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.
---------------------------------------------------------------------------------------------------------------------------
Remote employees must have their primary residence in:Â Arizona California Colorado Connecticut Delaware DC Florida Georgia Hawaii Massachusetts Michigan Minnesota Missouri Nevada New Jersey New York Ohio Oregon Tennessee Texas Utah Washington Ontario (Canada) British Columbia (Canada) or Mexico.
#LI-Remote
Date Posted
01/27/2025
Views
0
Similar Jobs
Staff Salesforce Engineer - CRM Systems - GitLab
Views in the last 30 days - 0
This job description outlines a Staff Salesforce Developer role focusing on designing building and scaling enterprisegrade solutions across Salesforce...
View DetailsSolutions Architect - phData
Views in the last 30 days - 0
This job posting seeks a Solutions Architect to join phDatas Elastic Platform Operations team focusing on cloudnative data platforms like Snowflake AW...
View DetailsSoftware Engineer III | Platform - ExtraHop
Views in the last 30 days - 0
This job posting seeks a Software Engineer III to develop features lead junior team members and contribute to secure cloud and appliance solutions The...
View DetailsDevOps Engineer - Guidehouse
Views in the last 30 days - 0
This job posting seeks a skilled DevOps Engineer to support development QA and operations across applications emphasizing automation cloudnative infra...
View DetailsSoftware Solutions Architect - Unqork
Views in the last 30 days - 0
Unqork empowers enterprises with AIpowered applications emphasizing innovation security and growth The job posting highlights benefits like remote wor...
View DetailsData Scientist - Capstone Integrated Solutions
Views in the last 30 days - 0
Capstone Integrated Solutions promotes itself as a customerfocused provider offering comprehensive software services and seeks a Data Scientist with e...
View Details