Data Engineer (Quantexa, Spark ,Scala, Elastic Search)

Unison Consulting · Other US Location

Company

Unison Consulting

Location

Other US Location

Type

Full Time

Job Description

Description

We are seeking a talented and experienced Data Engineer (Quantexa)with expertise in Hadoop, Scala, Spark, Elastic, Open Shift Container Platform (OCP) and DevOps practices. Elasticsearch to join our team. As a Data Engineer, you will play a crucial role in designing, developing, and optimizing big data solutions using Apache Spark, Scala, and Elasticsearch. You will collaborate with cross-functional teams to build scalable and efficient data processing pipelines and search applications. Knowledge and experience in the Compliance / AML domain will be a plus. Working experience with Quantexa tool is a must.

Responsibilities:

ยท Implement data transformation, aggregation, and enrichment processes to support various data analytics and machine learning initiatives

ยท Collaborate with cross-functional teams to understand data requirements and translate them into effective data engineering solutions

ยท Design, develop, and implement Spark Scala applications and data processing pipelines to process large volumes of structured and unstructured data

ยท Integrate Elasticsearch with Spark to enable efficient indexing, querying, and retrieval of data

ยท Optimize and tune Spark jobs for performance and scalability, ensuring efficient data processing and indexing in Elasticsearch

ยท Implement data transformations, aggregations, and computations using Spark RDDs, DataFrames, and Datasets, and integrate them with Elasticsearch

ยท Develop and maintain scalable and fault-tolerant Spark applications, adhering to industry best practices and coding standards

ยท Troubleshoot and resolve issues related to data processing, performance, and data quality in the Spark-Elasticsearch integration

ยท Monitor and analyze job performance metrics, identify bottlenecks, and propose optimizations in both Spark and Elasticsearch components

ยท Ensure data quality and integrity throughout the data processing lifecycle

ยท Design and deploy data engineering solutions on OpenShift Container Platform (OCP) using containerization and orchestration techniques

ยท Optimize data engineering workflows for containerized deployment and efficient resource utilization

ยท Collaborate with DevOps teams to streamline deployment processes, implement CI/CD pipelines, and ensure platform stability

ยท Implement data governance practices, data lineage, and metadata management to ensure data accuracy, traceability, and compliance

ยท Monitor and optimize data pipeline performance, troubleshoot issues, and implement necessary enhancements

ยท Implement monitoring and logging mechanisms to ensure the health, availability, and performance of the data infrastructure

ยท Document data engineering processes, workflows, and infrastructure configurations for knowledge sharing and reference

Requirements
  1. More than 5 years of experience as a Data Engineer
  2. ยท Bachelor's or Master's degree in Computer Science, Software Engineering, or a related discipline
  3. ยท Possession of Quantexa certification as a Data Engineer or Data Architect, with proficiency in the tool
  4. ยท Demonstrated experience as a Data Engineer, utilizing Hadoop, Spark, and data processing technologies in large-scale environments
  5. ยท Expertise in the Scala programming language and familiarity with functional programming principles
  6. ยท Prior experience with the Quantexa tool is highly desirable
  7. ยท Comprehensive understanding of Apache Spark architecture, including RDDs, DataFrames, and Spark SQL
  8. ยท Advanced proficiency in designing and developing data infrastructure utilizing Hadoop, Spark, and associated tools (HDFS, Hive, Pig, etc.)
  9. ยท Experience with containerization platforms such as OpenShift Container Platform (OCP) and container orchestration via Kubernetes
  10. ยท Proficiency in programming languages commonly employed in data engineering, including Spark, Python, Scala, or Java
  11. ยท Knowledge of DevOps methodologies, CI/CD pipelines, and infrastructure automation tools (e.g., Docker, Jenkins, Ansible, BitBucket)
  12. ยท Experience with Graphana, Prometheus, and Splunk will be considered an added advantage
  13. ยท Background in integrating and utilizing Elasticsearch for data indexing and search applications
  14. ยท Solid understanding of Elasticsearch data modeling, indexing strategies, and query optimization techniques
  15. ยท Experience with distributed computing, parallel processing, and handling large datasets
  16. ยท Proficient in performance tuning and optimization methods for Spark applications and Elasticsearch queries
  17. ยท Strong problem-solving and analytical capabilities with the capacity to debug and resolve intricate issues
  18. ยท Familiarity with version control systems (e.g., Git) and collaborative development environments

Apply Now

Date Posted

10/07/2024

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Senior Software Engineer (Scala/Java) - HERE Technologies

Views in the last 30 days - 0

HERE Technologies is seeking an experienced backend engineer with strong Java or Scala skills to join the Map Processing Pipelines team The role invol...

View Details

Software Architecture Engineering and Cloud Computing Engineer - The Aerospace Corporation

Views in the last 30 days - 0

The Aerospace Corporation is seeking a Senior Project Engineer with expertise in software architecture engineering and cloud computing The role involv...

View Details

Senior Data Analyst - Customer Experience - WISE

Views in the last 30 days - 0

Wise is a global technology company aiming to revolutionize international money transfers by offering minimal fees maximum ease and full speed They ar...

View Details

Lead Data Analyst - Mitigation - WISE

Views in the last 30 days - 0

Wise is a global technology company seeking an Operations Analyst with 4 years of experience in analytics particularly in operational team analytics T...

View Details

Lead Technical Support Engineer - HERE Technologies

Views in the last 30 days - 0

This role Senior Technical Support Engineer at HERE Technologies involves supporting a diverse portfolio of products and services acting as a technica...

View Details

Principal / Lead Software Engineer- RUST (Algorithmic and Mathematics) - m/w/d - HERE Technologies

Views in the last 30 days - 0

HERE Technologies is seeking a Principal Software Engineer to lead the development of extended services for their VRP solver Tour Planning The role in...

View Details