Data Engineer- Arlington, VA
Job Description
Overview
BigBear.ai is seeking a Data Engineer to support a program in Pentagon (Onsite 5 days per week). As a Data Engineer, you will play a crucial role in designing, developing, and maintaining our Advana data infrastructure and systems. Your expertise in ETL, Databricks, Python, Spark, Scala, JavaScript/JSON, SQL, and Jupyter Notebooks will be essential in ensuring efficient data processing and analysis.
This is an ideal opportunity to be part of one of the fastest growing AI/ML companies in the industry. At BigBear.ai, we're in this business together. We own it, we make it thrive, and we enjoy the challenges of our work. We know that our employees play the largest role in our continual success. That is why we foster an environment of growth and development, with an emphasis on opportunity, recognition, and work-life balance. We give the same high level of commitment to our employees that we give to our clients. If BigBear.ai sounds like the place where you want to be, we'd enjoy speaking with you.
This position requires an active TS/SCI Clearance.
What you will do
- Design, develop, and implement end-to-end data pipelines, utilizing ETL processes and technologies such as Databricks, Python, Spark, Scala, JavaScript/JSON, SQL, and Jupyter Notebooks.
- Create and optimize data pipelines from scratch, ensuring scalability, reliability, and high-performance processing.
- Perform data cleansing, data integration, and data quality assurance activities to maintain the accuracy and integrity of large datasets.
- Leverage big data technologies to efficiently process and analyze large datasets, particularly those encountered in a federal agency.
- Troubleshoot data-related problems and provide innovative solutions to address complex data challenges.
- Implement and enforce data governance policies and procedures, ensuring compliance with regulatory requirements and industry best practices.
- Work closely with cross-functional teams to understand data requirements and design optimal data models and architectures.
- Collaborate with data scientists, analysts, and stakeholders to provide timely and accurate data insights and support decision-making processes.
- Maintain documentation for software applications, workflows, and processes.
- Stay updated with emerging trends and advancements in data engineering and recommend suitable tools and technologies for continuous improvement.
What you need to have
- TS/SCI clearance is required
- Minimum of 5+ years of experience as a Data Engineer, with demonstrated experience creating data pipelines from scratch.
- High level of proficiency in ETL processes and demonstrated, hands-on experience with technologies such as Databricks, Python, Spark, Scala, JavaScript/JSON, SQL, and Jupyter Notebooks.
- Strong problem-solving skills and ability to solve complex data-related issues.
- Demonstrated experience working with large datasets and leveraging big data technologies to process and analyze data efficiently.
- Understanding of data modeling/visualization, database design principles, and data governance practices.
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.
- Detail-oriented mindset with a commitment to delivering high-quality results.
- Must be in the DC Metro area and available to work onsite in Alexandria, VA.
- DOD or IC-related experience.
What we'd like you to have
- Knowledge of Qlik/Qlik Sense, QVD/QlikView, and Qlik Production Application Standards (QPAS) is a significant plus.
- Recent DoD or IC-related experience.
- Previous experience with Advana
About BigBear.ai
BigBear.ai delivers AI-powered analytics and cyber engineering solutions to support mission-critical operations and decision-making in complex, real-world environments. BigBear.ai's customers, which include the US Intelligence Community, Department of Defense, the US Federal Government, as well as customers in manufacturing, healthcare, commercial space, and other sectors, rely on BigBear.ai's solutions to see and shape their world through reliable, predictive insights and goal-oriented advice. Headquartered in Columbia, Maryland, BigBear.ai is a global, public company traded on the NYSE under the symbol BBAI. For more information, please visit: http://bigbear.ai/ and follow BigBear.ai on Twitter: @BigBearai.
What you will do
- Design, develop, and implement end-to-end data pipelines, utilizing ETL processes and technologies such as Databricks, Python, Spark, Scala, JavaScript/JSON, SQL, and Jupyter Notebooks.
- Create and optimize data pipelines from scratch, ensuring scalability, reliability, and high-performance processing.
- Perform data cleansing, data integration, and data quality assurance activities to maintain the accuracy and integrity of large datasets.
- Leverage big data technologies to efficiently process and analyze large datasets, particularly those encountered in a federal agency.
- Troubleshoot data-related problems and provide innovative solutions to address complex data challenges.
- Implement and enforce data governance policies and procedures, ensuring compliance with regulatory requirements and industry best practices.
- Work closely with cross-functional teams to understand data requirements and design optimal data models and architectures.
- Collaborate with data scientists, analysts, and stakeholders to provide timely and accurate data insights and support decision-making processes.
- Maintain documentation for software applications, workflows, and processes.
- Stay updated with emerging trends and advancements in data engineering and recommend suitable tools and technologies for continuous improvement.
What you need to have
- TS/SCI clearance is required
- Minimum of 5+ years of experience as a Data Engineer, with demonstrated experience creating data pipelines from scratch.
- High level of proficiency in ETL processes and demonstrated, hands-on experience with technologies such as Databricks, Python, Spark, Scala, JavaScript/JSON, SQL, and Jupyter Notebooks.
- Strong problem-solving skills and ability to solve complex data-related issues.
- Demonstrated experience working with large datasets and leveraging big data technologies to process and analyze data efficiently.
- Understanding of data modeling/visualization, database design principles, and data governance practices.
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.
- Detail-oriented mindset with a commitment to delivering high-quality results.
- Must be in the DC Metro area and available to work onsite in Alexandria, VA.
- DOD or IC-related experience.
Date Posted
03/08/2024
Views
0
Similar Jobs
2025 Sensor Modeling and Simulation Analysis Engineer - The Aerospace Corporation
Views in the last 30 days - 0
The Aerospace Corporation is a trusted partner to the nations space programs providing technical expertise and innovative solutions across satellite l...
View DetailsSenior Associate, Data Science - People Analytics - Capital One
Views in the last 30 days - 0
Capital One is seeking a Senior Associate Data Science specialist for their People Strategy Analytics team The role involves applying data science an...
View DetailsSenior Associate, Data Scientist - Customer Management - Capital One
Views in the last 30 days - 0
Capital One is seeking a Senior Associate Data Scientist for the Mainstreet Customer Management Data Science team The role involves partnering with cr...
View DetailsInformation Security Consultant - Application Security Engineer - MassMutual
Views in the last 30 days - 0
MassMutual is seeking an experienced Application Security Engineer to join their dedicated team The role involves driving security best practices cond...
View DetailsFraud Technologist - Data and Analytics - Sr Associate - PwC
Views in the last 30 days - 0
PwCs Financial Crimes Data and Analytics team focuses on leveraging data to drive insights and make informed business decisions They utilize advanced ...
View Details