Sr. Data Engineer- Python, SQL, Spark, AWS
Job Description
Duration: 10 Months
Location: Hillsboro, OR (Remote)
Key Skills: Python, SQL, Spark, AWS, CI/CD, APIs, Apache,
Job Duties
- Engineer data solutions in support of Sustainability reporting and analytics initiatives
- Engage with product owner, analysts, visualization developers, and business partners to understand capability requirements, and to develop and support data solutions based on product backlog priorities
Skills:
- Preference will be shown to candidates who can provide a link to their open source code portfolio (a link to your profile on github.com, bitbucket.com, gitlab.com, or another public VCS is sufficient).
Required Technical Competencies:
Candidates should have demonstrated, in a professional capacity, all of the competencies listed for each of the following three subject areas:
General Purpose Python Programming:
- Python has been your primary coding language (daily use) for at least 3 years
- You have authored distributable Python packages (packages which can be built, installed, and distributed using setuptools, pip, and twine)
- You have a solid understanding of how pip dependency resolution works
- You are proficient in authoring and automating unit and integration tests for python packages using (minimally) unittests, pytest, and tox
- You are meticulous about code quality, including readability, know your PEP8 guidelines inside and out, and are capable of authoring code which will pass validation by commonly used static analysis tools including mypy and flake8
- You are proficient in authoring readable, well-structured, SQL SELECT statements using ISO/ANSI-standard SQL
- You have hands-on professional experience in data warehouse design and modeling, including authoring DDL statements.
- You have experience with trunk-based development (feature branching) using git for version control, with fully automated deployments (CI/CD).
Desired Technical Competencies
General Purpose Python Programming: • You have a deep understanding of python's standard library and python internals. You understand python memory management, how CPython implements built-in data structures, and which data structures are best suited for different scenarios • You understand and can compare/contrast CPython's built-in concurrency models, when to use each, and what obstacles might prevent the use of each mechanism Database Design, SQL, and Object Relational Models • You are adept at performance-tuning SQL queries for both OLAP and OLTP databases • You understand and are prepared to discuss how and when/where to utilize more esoteric and/or modern SQL features such as window functions and common table expressions. • You understand and are prepared to discuss the performance implications of columnar vs relational databases. • You have hands-on experience in managing database schema migrations (ideally using SQLAlchemy's ORM + Alembic). Version Control and CI/CD • You have experience with trunk-based development (feature branching) using git for version control, with fully automated deployments (CI/CD).
Cloud Infrastructure and Amazon Web Services • You have hands-on experience using boto3 to interact with Amazon Web Services' resource APIs, particularly Amazon S3 (Simple Storage Service). • You have hands-on experience authoring unit and integration tests utilizing localstack to emulate AWS resources. • You have hands-on experience using HashiCorp Terraform to manage cloud infrastructure. • You have hands-on experience developing serverless ASGI applications using AWS lambda and AWS API Gateway
Web API Server and Client Development • You have experience planning and executing the design and development of web APIs using a modern python ASGI framework (preferably FastAPI). • You have authored, validated, and maintained OpenAPI documents describing your web APIs accurately. • You have experience developing and testing python web API client libraries based on an OpenAPI document.
Distributed Computing and Apache Spark • You have experience using Apache Spark for ingestion and manipulation of data sets which are too large to process efficiently in-memory. • You have hands-on experience translating algorithms and procedures designed by topical subject matter experts, having varying levels of engineering experience, into well-designed data pipelines. • You have experience configuring and tuning Spark clusters to optimize use of computing resources for varying workloads. • You understand, and can discuss: when and why to use distributed computing frameworks, such as Apache Spark, versus alternate concurrency models such as asyncio or multiprocessing.
Database Design, SQL, and Object Relational Models • You have experience modeling databases using SQLAlchemy's ORM framework. • You have experience managing database versions and schema migrations using SQLAlchemy with Alembic.
Required Soft Skills
Candidates should have demonstrated, in a professional capacity, all or most of the following
skills: • You are proficient in communicating effectively and efficiently within a hybrid remote/in-person team structure: • You are meticulous about managing your calendar to accurately reflect your free/busy hours • You respect and seek to learn digital communications etiquette-including region-specific, industry-specific, and organization-specific etiquette • You proactively initiate constructive discussions while curating and targeting your communications with respect for your colleagues' time and schedules • You are adept at discovering and navigating the complex bureaucratic resources of a
large organization.
Email your resume to [email protected]
To learn more about Mavensoft visit us online at http://www.mavensoft.com/
Explore More
Date Posted
10/07/2022
Views
7
Similar Jobs
Software Engineer - DAT
Views in the last 30 days - 1
DAT is looking for a Software Engineer to join their team in Beaverton OR or Denver CO The role involves working in a full stack TypeScript ecosystem ...
View DetailsIT Engineer - DAT
Views in the last 30 days - 0
DAT is seeking a strong IT Engineer to join their growing technical team and drive the evolution of their technology infrastructure and end user exper...
View DetailsSenior Structural Engineer - HDR
Views in the last 30 days - 5
HDR is a company that specializes in engineering architecture and construction services They believe in diversity and collaboration and offer employee...
View DetailsSenior Frontend Engineer - ICIS
Views in the last 30 days - 0
Cirium is a company that provides data and aviation analytics solutions to various industries They are looking for a senior frontend software engineer...
View DetailsSoftware Engineer (Mid-level) - Act-On Software
Views in the last 30 days - 7
ActOn is a leading marketing automation company looking for a Software Engineer to join their team The company offers a supportive and fun culture com...
View DetailsAssociate Actuary - Unum
Views in the last 30 days - 6
Unum offers a comprehensive employee benefits package including competitive benefits generous PTO and opportunities for career advancement The company...
View Details