Demonstrated ability to extract data from complex websites with minimal supervision, supported by a portfolio or examples of past projects.
- Proficiency in languages such as Python or JavaScript, with strong skills in libraries and frameworks like BeautifulSoup, Scrapy, or Selenium.
- Knowledge of asynchronous programming, multithreading, and distributed scraping.
- In-depth knowledge of HTML, CSS, JavaScript, and the Document Object Model (DOM).
- Experience with NoSQL databases (MongoDB, Cassandra), including designing efficient storage solutions and managing data integrity.
- Ability to apply machine learning algorithms for data cleaning, categorization, or predictive analysis adds significant value.
- Experience with cloud services (AWS, Google Cloud, Azure) for deploying and managing scraping jobs at scale.
- Active participation in open-source projects related to web scraping, data processing, or similar fields.

What You'll Be Doing
- Write, test, and refine code that extracts data from various online sources, ensuring reliability and efficiency.
- Perform data retrieval tasks, handling complexities such as pagination and dynamic content loaded with AJAX.
- Clean and format extracted data, ensuring it meets quality standards for further analysis or processing.
- Store and manage scraped data in appropriate databases, optimizing for access speed and data integrity.
- Regularly monitor scraping processes, identifying and resolving issues to maintain a continuous data flow.
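The pagination and data-cleaning responsibilities above can be sketched in miniature. This is a minimal, self-contained illustration, not a production scraper: the `fetch_page` stub and the `scrape_all` / `ItemParser` names are hypothetical, canned HTML stands in for live HTTP responses, and the stdlib `html.parser` plays the role that BeautifulSoup or Scrapy would in real work.

```python
from html.parser import HTMLParser

# Canned pages standing in for live HTTP responses (no network needed).
PAGES = {
    "/items?page=1": '<ul><li>alpha</li><li>beta</li></ul>'
                     '<a class="next" href="/items?page=2">next</a>',
    "/items?page=2": '<ul><li>gamma</li></ul>',
}

class ItemParser(HTMLParser):
    """Collect <li> text and the href of any <a class="next"> pagination link."""
    def __init__(self):
        super().__init__()
        self.items, self.next_url = [], None
        self._in_li = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "li":
            self._in_li = True
        elif tag == "a" and attrs.get("class") == "next":
            self.next_url = attrs.get("href")

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_li = False

    def handle_data(self, data):
        if self._in_li:
            self.items.append(data.strip())

def fetch_page(url):
    # Stub: a real scraper would issue an HTTP GET with retries and backoff.
    return PAGES[url]

def scrape_all(start_url):
    """Follow 'next' links until no page remains, accumulating cleaned items."""
    url, results = start_url, []
    while url:
        parser = ItemParser()
        parser.feed(fetch_page(url))
        results.extend(item for item in parser.items if item)  # drop blanks
        url = parser.next_url
    return results

print(scrape_all("/items?page=1"))  # → ['alpha', 'beta', 'gamma']
```

The same loop shape (fetch, parse, clean, follow the next link) carries over directly when the stub is replaced by a real HTTP client and the parser by a DOM library.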