Job Description
Team: IT
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Web Scraping Specialist (Python) in United States.
This role is focused on building and maintaining advanced web data extraction systems within a hybrid AI-assisted environment. You will work on complex scraping workflows that involve dynamic, large-scale, and frequently changing web sources, ensuring high-quality structured data output. Acting as a technical expert, you will combine Python engineering skills with AI-powered tools to design efficient and resilient data pipelines. The position emphasizes problem-solving, autonomy, and strong attention to data accuracy and validation. You will collaborate with AI agents that handle repetitive tasks while you focus on architecture, logic, and quality control. This is a highly technical, remote, and flexible opportunity ideal for engineers passionate about scalable data extraction and automation.
Accountabilities:
- Design, develop, and maintain end-to-end web scraping pipelines for complex and dynamic websites.
- Extract, structure, and validate large-scale datasets ensuring accuracy, completeness, and consistency across sources.
- Work with tools such as Apify and OpenRouter alongside custom Python-based solutions to optimize data collection workflows.
- Handle JavaScript-rendered content, APIs, and evolving site structures using robust scraping strategies.
- Implement data quality checks, normalization processes, and validation frameworks before delivery.
- Scale scraping systems using batching, parallelization, and performance optimization techniques.
- Monitor, debug, and adapt scraping workflows to mitigate anti-bot measures and structural website changes.
- Collaborate within an AI-assisted environment, guiding automated agents while ensuring output reliability and precision.
- 5+ years of experience in web scraping, data engineering, automation, or software development.
- Strong proficiency in Python and scraping frameworks such as BeautifulSoup, Selenium, or similar tools.
- Experience extracting data from dynamic websites (AJAX, JavaScript-rendered pages, infinite scroll).
- Solid knowledge of data cleaning, transformation, normalization, and structured dataset delivery (CSV, JSON, spreadsheets).
- Proven ability to bypass or handle anti-bot systems and complex site architectures.
- Familiarity with cloud platforms (AWS or equivalent) and containerization tools such as Docker.
- Experience using LLM frameworks (e.g., LangChain, OpenRouter) in automation or data workflows.
- Strong analytical mindset with high attention to detail and data accuracy.
- Ability to work independently, troubleshoot effectively, and manage technical complexity.
- Upper-intermediate (B2+) English proficiency required; GitHub portfolio is a plus.
- Competitive compensation up to $45/hour equivalent, depending on experience and contribution level
- Flexible part-time workload (approximately 10–20 hours per week during active project phases)
- Fully remote collaboration within a global AI-driven technical environment
- Opportunity to work on advanced AI + data engineering hybrid workflows
- Exposure to cutting-edge tools such as LLM frameworks and automation systems
- Project-based engagement with potential access to multiple future opportunities
- Independent work structure with high autonomy and technical ownership
Requirements:
Benefits:
Explore More
Date Posted
05/29/2026
Views
0
Similar Jobs
Senior Software Engineer, Developer Experience - Jobgether
Views in the last 30 days - 0
View Details