Distinguished Data Systems Architect - Data Engineering

GitLab • USA

Company

GitLab

Location

USA

Type

Full Time

Job Description

An overview of this role

As a Distinguished Data Systems Architect you will define and drive the evolution of GitLab’s strategic data platforms across both SaaS and self-managed deployments. You’ll architect scalable distributed data systems that balance OLTP and OLAP performance cost and resiliency while enabling secure compliant data use in complex regulated environments. You’ll bring together ingestion orchestration transformation and metadata into a cohesive event-driven architecture using tools like Argo Airflow Kubernetes Trino Postgres and graph-based metadata systems and you’ll establish opinionated design principles governance frameworks and semantic models that power analytics AI and monetizable data products. Partnering closely with product and engineering leadership you’ll transform ambiguous business needs into strategic technical roadmaps embed AI-driven patterns and standards into our data infrastructure and create a model-driven architecture that clearly separates logical data design from physical implementations to reduce technical debt and unlock new capabilities for GitLab customers and internal teams.

Some examples of our projects:

  • Creating a unified architectural blueprint for GitLab’s data ecosystem that aligns SaaS and self-managed platforms on shared patterns and standards

  • Designing monetizable data services and APIs with strong governance and observability to support new product offerings and revenue streams

What you’ll do

  • Lead the architectural vision for scalable distributed data systems across GitLab’s SaaS and self-managed deployments balancing OLTP and OLAP performance scalability and cost-efficiency

  • Design and evolve enterprise data governance frameworks including lineage data quality controls versioning and compliance practices that align with global regulatory needs

  • Architect monetizable data services and APIs with clear semantic models that support internal analytics and external product offerings while meeting security and performance service-level expectations

  • Create and maintain a cohesive architectural blueprint for GitLab’s data ecosystem identifying gaps against modern platforms and defining opinionated design principles grounded in cloud-native patterns

  • Design event-driven and end-to-end data lifecycle architectures covering ingestion orchestration with tools like Argo Airflow and Kubernetes transformation workflows and unified metadata management with strong observability

  • Partner with product engineering and security leadership to embed AI-driven patterns into data infrastructure align senior engineering leaders on shared design tenets and drive platform standards adoption

  • Translate ambiguous business and product challenges into strategic technical roadmaps leading high-impact architectural engagements where data platforms provide measurable competitive advantage

  • Design and implement a Model Driven Architecture framework that cleanly separates conceptual and logical data models from physical implementations improving agility and reducing technical debt across enterprise data systems

What you’ll bring

  • Proven experience architecting large-scale distributed data systems in complex regulated environments spanning SaaS and self-managed deployments

  • Background designing multi-modal data services and APIs with strong developer experience principles including monetization governance and data product lifecycle management

  • Hands-on proficiency with modern data platforms and tools such as Python Docker Airflow Trino Postgres distributed query engines and graph-based metadata systems

  • Advanced understanding of data architecture concepts including logical and conceptual modeling Model Driven Architecture schema management and data processing paradigms across synchronous and asynchronous patterns

  • Experience building unified data platforms that integrate cloud-native compute orchestration and semantic modeling bridging cloud and on-premises environments with automation and developer self-service in mind

  • Familiarity with modern data and telemetry standards such as OpenTelemetry OpenMetadata and OpenLineage and the ability to apply them in real-world platform designs

  • Practical experience with AI-driven architectures and emerging technologies including model orchestration agentic patterns and standards like Model Context Protocol (MCP)

  • Ability to lead through influence mentor senior technical partners and collaborate across product engineering and business teams with openness to candidates who bring transferable experience from adjacent large-scale data or platform roles

About the team

Data Engineering and Monetization is a function within the Engineering organization with a mission to build a responsible scalable data foundation that powers GitLab’s SaaS and self-managed offerings. The team brings together data engineers architects and platform specialists working asynchronously across regions to design operate and evolve unified data platforms governance and monetization capabilities for GitLab’s AI-powered DevSecOps platform. You’ll partner closely with product security infrastructure and finance teams to tackle challenges such as harmonizing data across deployments enabling compliant usage in regulated markets and creating reliable monetizable data services that unlock new product and revenue opportunities.

The base salary range for this role’s listed level is currently for residents of the United States only. This range is intended to reflect the role's base salary rate in locations throughout the US. Grade level and salary ranges are determined through interviews and a review of education experience knowledge skills abilities of the applicant equity with other team members alignment with market data and geographic location. The base salary range does not include any bonuses equity or benefits. See more information on our benefits and equity . Sales roles are also eligible for incentive pay targeted at up to 100% of the offered base salary.

United States Salary Range

$219100—$328700 USD

Apply Now

Date Posted

12/20/2025

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Positive
Subjectivity Score: 0.2

Similar Jobs

Engineering Manager - Software Supply Chain Security: Auth Infrastructure - GitLab

Views in the last 30 days - 0

This job description highlights a leadership role in developing secure scalable authentication infrastructure for GitLab It emphasizes technical exper...

View Details

Staff Salesforce Engineer - CRM Systems - GitLab

Views in the last 30 days - 0

This job description outlines a Staff Salesforce Developer role focusing on designing building and scaling enterprisegrade solutions across Salesforce...

View Details

Solutions Architect - phData

Views in the last 30 days - 0

This job posting seeks a Solutions Architect to join phDatas Elastic Platform Operations team focusing on cloudnative data platforms like Snowflake AW...

View Details

Software Solutions Architect - Unqork

Views in the last 30 days - 0

Unqork empowers enterprises with AIpowered applications emphasizing innovation security and growth The job posting highlights benefits like remote wor...

View Details

Data Scientist - Capstone Integrated Solutions

Views in the last 30 days - 0

Capstone Integrated Solutions promotes itself as a customerfocused provider offering comprehensive software services and seeks a Data Scientist with e...

View Details

Growth Product Lead - Loyalty - Trafilea

Views in the last 30 days - 0

Trafilea promotes itself as a transformative consumer tech platform with AIdriven growth solutions highlighting achievements like 1B revenue and globa...

View Details