
Web Scraping Specialist - EU Remote CET

Remote: Full Remote
Experience: Mid-level (2-5 years)

Offer summary

Qualifications:

  • 2-4 years of web scraping experience
  • Proficient in Python and web scraping libraries
  • Experience with third-party proxy providers
  • Knowledge of client-faking techniques
  • Fluent in English

Key responsibilities:

  • Develop and maintain a scraping pipeline
  • Implement web scraping using tools like Selenium and BeautifulSoup
  • Troubleshoot and optimize scraping workflows
  • Collaborate on data consolidation and integration
  • Stay updated on web scraping developments
SEON SME https://seon.io/
201 - 500 Employees

Job description

SEON is the leading fraud prevention system of record, catching fraud before it happens at any point across the customer journey. Trusted by over 5,000 global companies, we combine your company’s data with our proprietary real-time signals to deliver actionable fraud insights tailored to your business outcomes. We deliver the fastest time to value in the market through a single API call, enabling quick and seamless onboarding and integration. By analyzing billions of transactions, we’ve prevented $200 billion in fraudulent activities, showcasing why the world’s most innovative companies choose SEON.

SEON seeks a skilled Web Scraping Specialist to join our team in building cutting-edge anti-money laundering (AML) solutions. The data you extract from open-source web platforms will improve SEON’s fraud prevention and risk detection tools. By gathering and analyzing key information from public websites, you will strengthen our ability to detect and prevent illicit activities in financial transactions.

Our AML team specializes in the development of an anti-money laundering product suite. Our primary objectives revolve around enhancing the efficiency of collecting and managing data about individuals subject to diverse AML regulations, including sanctions, prominent public figures, financial supervision penalties, and warrant lists. Our diligent efforts involve continuously extracting data from over 300 sources and approximately 4,000 websites. In rendering this data searchable, we employ sophisticated techniques to address the intricacies of various languages and transcription nuances. Our focus on meticulous handling aims to minimize false positive results with the support of advanced Natural Language Processing (NLP) tools.

This is a remote role, and the ideal candidate will be based in the European Union, working in the CET time zone.

WHAT YOU’LL DO:

  • Develop and maintain a scalable, in-house scraping pipeline using Python.
  • Implement web scraping solutions using tools like Selenium, BeautifulSoup, or similar libraries.
  • Troubleshoot, optimize and enhance existing scraping workflows and tools.
  • Cooperate with data scientists and colleagues to develop in-house data consolidation tools that clean and organize scraped data so it is accurate, reliable, and ready for analysis.
  • Manage and utilize third-party proxy services to ensure effective data extraction, bypassing anti-scraping mechanisms.
  • Apply advanced client-faking techniques (e.g., user-agent rotation, CAPTCHA solving, IP masking) to avoid detection.
  • Collaborate with data engineers and other team members to integrate data into pipelines or systems.
  • Stay updated on the latest developments in web scraping, proxies, and anti-scraping techniques.

WHAT YOU’LL BRING:

  • 2-4 years of experience in web scraping, with a strong focus on data extraction from complex, dynamic websites and unstructured resources.
  • Proficient in Python and libraries such as Selenium, BeautifulSoup, Scrapy, or equivalent frameworks.
  • Experience working with third-party proxy providers and rotating proxies to handle scraping challenges.
  • Knowledge of client faking techniques (e.g., user-agent manipulation, cookie management, header spoofing).
  • Familiarity with handling common web scraping challenges like CAPTCHAs, rate limiting, and bot detection.
  • Experience with API interaction and extracting data from both public and private APIs.
  • Strong problem-solving skills, attention to detail, and the ability to handle large-scale scraping projects.
  • Familiarity with data cleaning and processing best practices.
  • Fluency in English.

NICE TO HAVE:

  • Experience with cloud services like AWS, Google Cloud, or Azure.
  • Knowledge of database systems and handling large datasets (SQL/NoSQL).
  • Understanding of ethical data scraping practices and legal considerations (e.g., complying with website terms of service).
  • Experience with containerization (e.g., Docker, Kubernetes) and workflow automation.

WHAT WE OFFER:

  • Employee stock ownership plan (ESOP)
  • Flexible hours
  • Generous holiday allowance
  • Access to significant opportunities for learning and development
  • Private health insurance including dependants (inc. employee assistance & mental health support)
  • Complimentary weekly language courses
  • Enhanced parental leave

WHAT'S NEXT:

Does that sound good? Great, we can’t wait to hear from you! Would you like to learn more about what it’s like to work at SEON first?
👉 Here you go

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s): English

Other Skills

  • Collaboration
  • Detail Oriented
  • Problem Solving
  • Adaptability
