Distinguished Data Systems Architect - Data Engineering
Company
GitLab
Location
USA
Type
Full Time
Job Description
An overview of this role
As a Distinguished Data Systems Architect you will define and drive the evolution of GitLab’s strategic data platforms across both SaaS and self-managed deployments. You’ll architect scalable distributed data systems that balance OLTP and OLAP performance cost and resiliency while enabling secure compliant data use in complex regulated environments. You’ll bring together ingestion orchestration transformation and metadata into a cohesive event-driven architecture using tools like Argo Airflow Kubernetes Trino Postgres and graph-based metadata systems and you’ll establish opinionated design principles governance frameworks and semantic models that power analytics AI and monetizable data products. Partnering closely with product and engineering leadership you’ll transform ambiguous business needs into strategic technical roadmaps embed AI-driven patterns and standards into our data infrastructure and create a model-driven architecture that clearly separates logical data design from physical implementations to reduce technical debt and unlock new capabilities for GitLab customers and internal teams.
Some examples of our projects:
-
Creating a unified architectural blueprint for GitLab’s data ecosystem that aligns SaaS and self-managed platforms on shared patterns and standards
-
Designing monetizable data services and APIs with strong governance and observability to support new product offerings and revenue streams
What you’ll do
-
Lead the architectural vision for scalable distributed data systems across GitLab’s SaaS and self-managed deployments balancing OLTP and OLAP performance scalability and cost-efficiency
-
Design and evolve enterprise data governance frameworks including lineage data quality controls versioning and compliance practices that align with global regulatory needs
-
Architect monetizable data services and APIs with clear semantic models that support internal analytics and external product offerings while meeting security and performance service-level expectations
-
Create and maintain a cohesive architectural blueprint for GitLab’s data ecosystem identifying gaps against modern platforms and defining opinionated design principles grounded in cloud-native patterns
-
Design event-driven and end-to-end data lifecycle architectures covering ingestion orchestration with tools like Argo Airflow and Kubernetes transformation workflows and unified metadata management with strong observability
-
Partner with product engineering and security leadership to embed AI-driven patterns into data infrastructure align senior engineering leaders on shared design tenets and drive platform standards adoption
-
Translate ambiguous business and product challenges into strategic technical roadmaps leading high-impact architectural engagements where data platforms provide measurable competitive advantage
-
Design and implement a Model Driven Architecture framework that cleanly separates conceptual and logical data models from physical implementations improving agility and reducing technical debt across enterprise data systems
What you’ll bring
-
Proven experience architecting large-scale distributed data systems in complex regulated environments spanning SaaS and self-managed deployments
-
Background designing multi-modal data services and APIs with strong developer experience principles including monetization governance and data product lifecycle management
-
Hands-on proficiency with modern data platforms and tools such as Python Docker Airflow Trino Postgres distributed query engines and graph-based metadata systems
-
Advanced understanding of data architecture concepts including logical and conceptual modeling Model Driven Architecture schema management and data processing paradigms across synchronous and asynchronous patterns
-
Experience building unified data platforms that integrate cloud-native compute orchestration and semantic modeling bridging cloud and on-premises environments with automation and developer self-service in mind
-
Familiarity with modern data and telemetry standards such as OpenTelemetry OpenMetadata and OpenLineage and the ability to apply them in real-world platform designs
-
Practical experience with AI-driven architectures and emerging technologies including model orchestration agentic patterns and standards like Model Context Protocol (MCP)
-
Ability to lead through influence mentor senior technical partners and collaborate across product engineering and business teams with openness to candidates who bring transferable experience from adjacent large-scale data or platform roles
About the team
Data Engineering and Monetization is a function within the Engineering organization with a mission to build a responsible scalable data foundation that powers GitLab’s SaaS and self-managed offerings. The team brings together data engineers architects and platform specialists working asynchronously across regions to design operate and evolve unified data platforms governance and monetization capabilities for GitLab’s AI-powered DevSecOps platform. You’ll partner closely with product security infrastructure and finance teams to tackle challenges such as harmonizing data across deployments enabling compliant usage in regulated markets and creating reliable monetizable data services that unlock new product and revenue opportunities.
The base salary range for this role’s listed level is currently for residents of the United States only. This range is intended to reflect the role's base salary rate in locations throughout the US. Grade level and salary ranges are determined through interviews and a review of education experience knowledge skills abilities of the applicant equity with other team members alignment with market data and geographic location. The base salary range does not include any bonuses equity or benefits. See more information on our benefits and equity . Sales roles are also eligible for incentive pay targeted at up to 100% of the offered base salary.
United States Salary Range
$219100—$328700 USD
Date Posted
12/20/2025
Views
0
Similar Jobs
Data Scientist - KoBold Metals
Views in the last 30 days - 0
KoBold leverages AI and advanced sensors for mineral exploration achieving significant discoveries with less capital Theyre hiring a Data Scientist to...
View DetailsBackend Product Software Engineer - Financial Data - Dropbox
Views in the last 30 days - 0
This role involves modernizing Dropboxs financial infrastructure collaborating across teams and driving innovation in scalable systems The position em...
View DetailsHealthcare Solution Architect - GetWellNetwork, Inc.
Views in the last 30 days - 0
Get Well seeks a Healthcare Solution Architect to design secure scalable solutions collaborate with stakeholders and align with business processes The...
View DetailsSenior Software Engineer - Android Test Engineering - Reddit
Views in the last 30 days - 0
A Senior Software Engineer role in Quality Engineering at Reddit requires 5 years in Android development and experience with test frameworks Responsib...
View DetailsAWS Solutions Architect - Guidehouse
Views in the last 30 days - 0
This job posting highlights a senior AWS Solution Architect role with significant influence on enterprise solutions and innovation The position offers...
View DetailsAssociate Director - IT Project Manager - Guidehouse
Views in the last 30 days - 0
This job posting describes a senior ServiceNow consultant role with responsibilities including leading IT projects technical expertise in ServiceNow m...
View Details