Job Description
Team: IT
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Software Engineer in India.
This role sits within a data-driven engineering environment focused on building scalable systems that power large-scale web data collection and processing. You will contribute to the design and development of backend services and data pipelines that transform diverse global data sources into structured, high-quality datasets used for critical decision-making in the healthcare space. Working closely with senior engineers and cross-functional teams, you will help improve data ingestion systems, optimize pipeline performance, and ensure reliability at scale. The environment is highly collaborative, fast-evolving, and mission-driven, with a strong emphasis on engineering excellence, data quality, and continuous improvement. This is an opportunity to work on complex distributed systems that directly contribute to improving access to healthcare information worldwide.
Accountabilities:
- Contribute to the development and enhancement of systems that collect and process large-scale web data from structured and unstructured sources.
- Design and implement data extraction components using APIs, scraping frameworks, and parsing techniques.
- Build and maintain ETL/ELT pipelines using Python, Apache Spark, and cloud-based infrastructure (primarily AWS).
- Develop clean, efficient, and scalable Python code to support data ingestion and transformation workflows.
- Support the monitoring, debugging, and optimization of data pipelines to improve reliability and performance.
- Collaborate with senior engineers to enhance data quality, normalization processes, and pipeline scalability.
- Troubleshoot issues related to data inconsistencies, pipeline failures, and changes in external data sources.
- Work closely with product, data, and engineering teams to ensure data usability and alignment with business needs.
- Contribute to documentation, code reviews, and engineering best practices within the team.
- 3–5 years of professional experience in backend engineering or data engineering roles.
- Strong proficiency in Python for production-level development.
- Experience building and maintaining data pipelines or backend data systems.
- Familiarity with web data extraction techniques, including APIs, web scraping (e.g., Selenium, Playwright), and handling structured/unstructured data.
- Strong SQL skills with experience using PostgreSQL or similar relational databases.
- Hands-on experience with Apache Spark or comparable data processing frameworks.
- Experience working in AWS or similar cloud environments.
- Familiarity with Docker or containerized development workflows.
- Strong foundation in data structures, backend systems, and system design principles.
- Good problem-solving skills, adaptability, and willingness to work with complex real-world datasets.
- Strong collaboration skills and openness to feedback in a team-oriented environment.
- Comprehensive health insurance coverage and generous paid time off.
- Wellness-focused company-wide holidays and health initiatives.
- Retirement planning options and financial well-being support.
- Health and charitable donation stipends.
- Flexible work hours with the opportunity to work from anywhere.
- Exposure to cutting-edge data engineering challenges in a mission-driven environment.
- Opportunity to contribute to impactful work in healthcare and life sciences innovation.
Requirements:
Benefits:
Explore More
Date Posted
05/06/2026
Views
0
Similar Jobs
Customer Support Engineer, Tier 1 (MacOS and MDM) - India - JumpCloud
Views in the last 30 days - 0
View DetailsTechnology and Innovation - Finance Analytics Engineer - Jobgether
Views in the last 30 days - 0
View Details