Data Extraction Engineer - Part-time - LATAM/EMEA
Description
About the role
We are seeking a highly motivated and experienced Data Extraction Engineer to significantly level up our web scraping capabilities. Deep, hands-on expertise with Octoparse is required for current operations, but we are specifically looking for a candidate with a stronger technical foundation who can transition to Python (e.g., Scrapy) and build robust solutions there for more complex, scalable, and custom extraction needs.
This role requires a candidate who can not only execute but also architect solutions in an enterprise-scale e-commerce environment, with an understanding of the nuances of large-scale data collection.
This position requires availability during Eastern Standard Time (EST, GMT-5) working hours for team collaboration.
Key Responsibilities:
- Advanced Scraping Architecture - Develop robust, high-volume web scraping solutions using Python/Scrapy for complex sites where Octoparse is insufficient.
- Octoparse Management - Build, manage, and optimize complex Octoparse tasks (handling dynamic content, proxies, anti-bot measures, etc.).
- Enterprise Data Integrity - Apply e-commerce knowledge to rigorously normalize, validate, and structure raw data for enterprise use.
- Troubleshooting - Monitor and troubleshoot runs (Octoparse and custom code) to ensure high data uptime and quality.
The Ideal Candidate:
- Proven ability to move beyond visual tools into custom coding (Python/Scrapy).
- Strong background in data extraction for enterprise-level e-commerce.
- Highly motivated, proactive, and solution-oriented.
- Must be available and responsive during US EST (GMT-5) working hours.
Skills
Python