Data Engineer (Quantexa, Spark ,Scala, Elastic Search)
Company
Unison Consulting
Location
Other US Location
Type
Full Time
Job Description
We are seeking a talented and experienced Data Engineer (Quantexa)with expertise in Hadoop, Scala, Spark, Elastic, Open Shift Container Platform (OCP) and DevOps practices. Elasticsearch to join our team. As a Data Engineer, you will play a crucial role in designing, developing, and optimizing big data solutions using Apache Spark, Scala, and Elasticsearch. You will collaborate with cross-functional teams to build scalable and efficient data processing pipelines and search applications. Knowledge and experience in the Compliance / AML domain will be a plus. Working experience with Quantexa tool is a must.
Responsibilities:
ยท Implement data transformation, aggregation, and enrichment processes to support various data analytics and machine learning initiatives
ยท Collaborate with cross-functional teams to understand data requirements and translate them into effective data engineering solutions
ยท Design, develop, and implement Spark Scala applications and data processing pipelines to process large volumes of structured and unstructured data
ยท Integrate Elasticsearch with Spark to enable efficient indexing, querying, and retrieval of data
ยท Optimize and tune Spark jobs for performance and scalability, ensuring efficient data processing and indexing in Elasticsearch
ยท Implement data transformations, aggregations, and computations using Spark RDDs, DataFrames, and Datasets, and integrate them with Elasticsearch
ยท Develop and maintain scalable and fault-tolerant Spark applications, adhering to industry best practices and coding standards
ยท Troubleshoot and resolve issues related to data processing, performance, and data quality in the Spark-Elasticsearch integration
ยท Monitor and analyze job performance metrics, identify bottlenecks, and propose optimizations in both Spark and Elasticsearch components
ยท Ensure data quality and integrity throughout the data processing lifecycle
ยท Design and deploy data engineering solutions on OpenShift Container Platform (OCP) using containerization and orchestration techniques
ยท Optimize data engineering workflows for containerized deployment and efficient resource utilization
ยท Collaborate with DevOps teams to streamline deployment processes, implement CI/CD pipelines, and ensure platform stability
ยท Implement data governance practices, data lineage, and metadata management to ensure data accuracy, traceability, and compliance
ยท Monitor and optimize data pipeline performance, troubleshoot issues, and implement necessary enhancements
ยท Implement monitoring and logging mechanisms to ensure the health, availability, and performance of the data infrastructure
ยท Document data engineering processes, workflows, and infrastructure configurations for knowledge sharing and reference
- More than 5 years of experience as a Data Engineer
- ยท Bachelor's or Master's degree in Computer Science, Software Engineering, or a related discipline
- ยท Possession of Quantexa certification as a Data Engineer or Data Architect, with proficiency in the tool
- ยท Demonstrated experience as a Data Engineer, utilizing Hadoop, Spark, and data processing technologies in large-scale environments
- ยท Expertise in the Scala programming language and familiarity with functional programming principles
- ยท Prior experience with the Quantexa tool is highly desirable
- ยท Comprehensive understanding of Apache Spark architecture, including RDDs, DataFrames, and Spark SQL
- ยท Advanced proficiency in designing and developing data infrastructure utilizing Hadoop, Spark, and associated tools (HDFS, Hive, Pig, etc.)
- ยท Experience with containerization platforms such as OpenShift Container Platform (OCP) and container orchestration via Kubernetes
- ยท Proficiency in programming languages commonly employed in data engineering, including Spark, Python, Scala, or Java
- ยท Knowledge of DevOps methodologies, CI/CD pipelines, and infrastructure automation tools (e.g., Docker, Jenkins, Ansible, BitBucket)
- ยท Experience with Graphana, Prometheus, and Splunk will be considered an added advantage
- ยท Background in integrating and utilizing Elasticsearch for data indexing and search applications
- ยท Solid understanding of Elasticsearch data modeling, indexing strategies, and query optimization techniques
- ยท Experience with distributed computing, parallel processing, and handling large datasets
- ยท Proficient in performance tuning and optimization methods for Spark applications and Elasticsearch queries
- ยท Strong problem-solving and analytical capabilities with the capacity to debug and resolve intricate issues
- ยท Familiarity with version control systems (e.g., Git) and collaborative development environments
Date Posted
10/07/2024
Views
0
Similar Jobs
Senior Engineering Manager, Micros Foundations - Atlassian
Views in the last 30 days - 0
Atlassian is seeking a Senior Engineering Manager to lead a team of Backend Software Engineers The role involves guiding technical decisions prioritiz...
View DetailsSenior Frontend Engineer - Simply Business
Views in the last 30 days - 0
Simply Business is seeking a Senior Frontend Engineer to join their Front End Tooling team The role involves developing products using best practices ...
View DetailsDevelopment Underwriter - Simply Business
Views in the last 30 days - 0
Simply Business is seeking a Development Underwriter with an Underwriting background to support their new MGA brand Nupro which aims to disrupt the sm...
View DetailsE2E Solution Architect - Ahold Delhaize USA
Views in the last 30 days - 0
Ahold Delhaize USA is seeking a Solution Architect with extensive experience in IT architecture BigData Analytics and various software designs and dev...
View DetailsE2E Solution Architect - Ahold Delhaize USA
Views in the last 30 days - 0
Ahold Delhaize USA is seeking a Solution Architect with extensive experience in IT architecture BigData Analytics and various software designs and dev...
View DetailsE2E Solution Architect - Ahold Delhaize USA
Views in the last 30 days - 0
Ahold Delhaize USA a division of a global food retailer is seeking a Solution Architect for its US operations The role involves translating business r...
View Details