Data Engineer - Arlington, VA

BigBear.ai · Washington DC

Company

BigBear.ai

Location

Washington DC

Type

Full Time

Job Description

Overview

BigBear.ai is seeking a Data Engineer to support a program at the Pentagon (onsite 5 days per week). As a Data Engineer, you will play a crucial role in designing, developing, and maintaining our Advana data infrastructure and systems. Your expertise in ETL, Databricks, Python, Spark, Scala, JavaScript/JSON, SQL, and Jupyter Notebooks will be essential in ensuring efficient data processing and analysis.

This is an ideal opportunity to be part of one of the fastest-growing AI/ML companies in the industry. At BigBear.ai, we're in this business together. We own it, we make it thrive, and we enjoy the challenges of our work. We know that our employees play the largest role in our continual success. That is why we foster an environment of growth and development, with an emphasis on opportunity, recognition, and work-life balance. We give the same high level of commitment to our employees that we give to our clients. If BigBear.ai sounds like the place where you want to be, we'd enjoy speaking with you.

This position requires an active TS/SCI Clearance.

What you will do

Design, develop, and implement end-to-end data pipelines, utilizing ETL processes and technologies such as Databricks, Python, Spark, Scala, JavaScript/JSON, SQL, and Jupyter Notebooks.
Create and optimize data pipelines from scratch, ensuring scalability, reliability, and high-performance processing.
Perform data cleansing, data integration, and data quality assurance activities to maintain the accuracy and integrity of large datasets.
Leverage big data technologies to efficiently process and analyze large datasets, particularly those encountered in a federal agency.
Troubleshoot data-related problems and provide innovative solutions to address complex data challenges.
Implement and enforce data governance policies and procedures, ensuring compliance with regulatory requirements and industry best practices.
Work closely with cross-functional teams to understand data requirements and design optimal data models and architectures.
Collaborate with data scientists, analysts, and stakeholders to provide timely and accurate data insights and support decision-making processes.
Maintain documentation for software applications, workflows, and processes.
Stay updated with emerging trends and advancements in data engineering and recommend suitable tools and technologies for continuous improvement.

What you need to have

TS/SCI clearance is required.
Minimum of 5 years of experience as a Data Engineer, with demonstrated experience creating data pipelines from scratch.
High level of proficiency in ETL processes and demonstrated, hands-on experience with technologies such as Databricks, Python, Spark, Scala, JavaScript/JSON, SQL, and Jupyter Notebooks.
Strong problem-solving skills and ability to solve complex data-related issues.
Demonstrated experience working with large datasets and leveraging big data technologies to process and analyze data efficiently.
Understanding of data modeling/visualization, database design principles, and data governance practices.
Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.
Detail-oriented mindset with a commitment to delivering high-quality results.
Must be in the DC Metro area and available to work onsite in Alexandria, VA.
DoD or IC-related experience.

What we'd like you to have

Knowledge of Qlik/Qlik Sense, QVD/QlikView, and Qlik Production Application Standards (QPAS) is a significant plus.
Recent DoD or IC-related experience.
Previous experience with Advana.

About BigBear.ai

BigBear.ai delivers AI-powered analytics and cyber engineering solutions to support mission-critical operations and decision-making in complex, real-world environments. BigBear.ai's customers, which include the US Intelligence Community, Department of Defense, the US Federal Government, as well as customers in manufacturing, healthcare, commercial space, and other sectors, rely on BigBear.ai's solutions to see and shape their world through reliable, predictive insights and goal-oriented advice. Headquartered in Columbia, Maryland, BigBear.ai is a global, public company traded on the NYSE under the symbol BBAI. For more information, please visit: http://bigbear.ai/ and follow BigBear.ai on Twitter: @BigBearai.

Date Posted

03/08/2024
