This listing has expired

Remote Web Scraping Specialist

Wynd Labs, New York City, New York, United States

  • Demonstrated ability to extract data from complex websites with minimal supervision, with a portfolio or examples of past projects.
  • Proficiency in languages such as Python or JavaScript, with strong skills in libraries and frameworks like BeautifulSoup, Scrapy, or Selenium.
  • Knowledge of asynchronous programming, multithreading, and distributed scraping.
  • In-depth knowledge of HTML, CSS, JavaScript, and the Document Object Model (DOM).
  • Experience with NoSQL databases (MongoDB, Cassandra), capable of designing efficient storage solutions and managing data integrity.
  • The ability to apply machine learning algorithms for data cleaning, categorization, or predictive analysis is a significant plus.
  • Experience with cloud services (AWS, Google Cloud, Azure) for deploying and managing scraping jobs at scale.
  • Active participation in open-source projects related to web scraping, data processing, or similar fields.
    What You'll Be Doing.

    • Write, test, and refine code that extracts data from various online sources, ensuring reliability and efficiency.
    • Perform data retrieval tasks, handling complexities such as pagination and dynamic content loaded with AJAX.
    • Clean and format extracted data, ensuring it meets quality standards for further analysis or processing.
    • Database management: Store and manage the scraped data in appropriate databases, optimizing for access speed and data integrity.
    • Regularly monitor the scraping processes, and identify and resolve any issues to maintain continuous data flow.

    Why Work With Us.

    • Opportunity. We are at the forefront of developing a web-scale crawler and knowledge graph that allows ordinary people to participate in the process, and share in the benefits, of AI development.
    • Culture. We’re a lean team working together to achieve a very ambitious goal of improving access to public web data and distributing the value of AI to the people. We prioritize low ego and high output.
    • Compensation. You’ll receive a competitive salary and equity package.
    • Resources and growth. We’re well-capitalized, with backing from leading venture funds like Polychain, Tribe, NLH, Hack, BH Digital, and more. We keep a lean team, and this is a rare opportunity to join. You’ll learn a lot and grow as our company scales.
