Role overview

Qualifications

Strong experience with web scraping and data extraction.
Practical programming experience using Python or similar scripting languages.
Experience working with HTML parsing, APIs, HTTP requests, FTP sources, and structured or unstructured data.
Strong analytical and problem-solving skills.

Responsibilities

Research and identify public and government data sources.
Extract and normalize data from websites, APIs, feeds, and online repositories.
Build reusable, maintainable, and re-runnable scripts and scraping workflows.
Document data sources, extraction methodologies, challenges encountered, and re-run procedures.

Key facts

Remote from: Anywhere
Full time
English

Hard skills

Python (Programming Language) Text Parsing Http Protocols Relational Databases Playwright (Software Testing) Selenium (Software) Puppeteer (Software) Internal Documentation

Other skills

Problem Solving
Analytical Skills
Detail Oriented

About the company

Softgic

We are a young and growing company, with operations in Medellin and Bogota, focused on the generation of technological solutions in synergy with our customers and our team so that these solutions add value within their organizations and their business processes.

Company details

Company size51 - 200

Links

Website LinkedIn See all jobs

Your match analysis

See how your profile stacks up against this role.

We compared the job requirements to your profile to show where you're strong and where you fall short.

Job description

This is a remote position.

We are seeking a Data Scraping to help collect, organize, and normalize data from public and government sources into a consistent, structured format. This role focuses on solving complex data acquisition challenges, researching unfamiliar sources, extracting information from websites and feeds, and transforming it into predefined formats that can be consumed by downstream systems. The ideal candidate enjoys working with messy datasets, investigating how websites and data sources are structured, and creating reusable solutions that can be executed repeatedly with consistent results. This position requires strong problem-solving skills, attention to detail, and the ability to work independently while documenting findings and processes clearly.

Schedule: Monday to Friday - 12:00 PM – 8:00 PM CST

Responsibilities:
Research and identify public and government data sources. Extract and normalize data from websites, APIs, feeds, and online repositories. Build reusable, maintainable, and re-runnable scripts and scraping workflows. Deliver structured outputs in predefined formats. Provide sample outputs for review before processing larger datasets. Document data sources, extraction methodologies, challenges encountered, and re-run procedures. Capture and report any relevant information discovered during extraction, including inconsistencies, amendments, effective dates, repeal notes, or related metadata. Troubleshoot data acquisition issues and propose alternative approaches when needed. Collaborate with stakeholders through regular check-ins and written communication. Maintain version-controlled code repositories and follow standard development practices.

Requisitos

Strong experience with web scraping and data extraction. Practical programming experience using Python or similar scripting languages. Experience working with HTML parsing, APIs, HTTP requests, FTP sources, and structured or unstructured data. Ability to evaluate, debug, and improve scraping solutions. Strong analytical and problem-solving skills. Experience building reusable automation workflows rather than one-off scripts. Familiarity with relational databases (PostgreSQL preferred) and a normal Git workflow. Strong documentation and communication skills. Ability to work independently and take ownership of technical challenges. High attention to detail and commitment to data accuracy. Nice to Have: Experience working with government, regulatory, compliance, or public-sector datasets. Experience with Playwright, Selenium, Puppeteer, Scrapy, or similar scraping frameworks. Experience with data versioning, change detection, or document lineage. Familiarity with AI-assisted development tools and workflows.

Apply once. Then go straight to the hiring manager.

After you apply, unlock the direct contact details of the people who actually make the call. A quick follow-up makes you 5x more likely to land an interview.

Marcus Rivera

Chief Revenue Officer

m.rivera@company.com

linkedin.com/in/marcusrivera

Unlocked after you apply

Related jobs

Worldwide

Strategic Account Executive - Healthcare

30+ days ago

Omnissa

Full time

Enterprise SalesStrategic ManagementCross-SellingCustomer RetentionConsultative Selling

Senior Medical Only Claims Representative - Workers Compensation

4 days ago

Great American Insurance Company

Full time

Medical Insurance ClaimsCode CoverageWorkers' Compensation ClaimsOrganizational Structure

Technical Support Representative (Remote)

30+ days ago

NEXIS Builds

Full time

Dispatch CoordinationSystem Level TroubleshootingHardware SupportHelp Desk SupportService Management

Technical Support Engineer III

30+ days ago

Forcepoint

Full time

Application DevelopmentInformation Systems SecurityTransport Layer Security (TLS)SMTP (Simple Mail Transfer Protocol)Packet Analyzer

Sr Corporate Paralegal

11 days ago

GHX

Full time

Corporate GovernanceCorporate AccountingLegal Document ManagementDue DiligenceMergers And Acquisitions

Other jobs at Softgic

1747 Cloud Security Specialist (AWS)

13 days ago

Softgic

Full time

Amazon Web ServicesAmazon Web ServicesIncident ResponseIdentity And Access ManagementAmazon Elastic Compute Cloud

Full-Stack Developer (Product-Oriented) 1654

30+ days ago

Softgic

Full time
Mid-level (2-5 years)

JavaScript LibrariesTypeScriptJavaScript LibrariesJavaScript LibrariesDomain Driven Design

1748 Product Lead / Product Owner

4 days ago

Softgic

Full time
Senior (5-10 years)

Product ManagementBacklogsUser StoryLarge Language Modeling

Data Scraping 1743

Role overview

Qualifications

Responsibilities

Key facts

Hard skills

Other skills

About the company

Company details

Links

Your match analysis

Job description

Requisitos

Apply once. Then go straight to the hiring manager.

Related jobs

Strategic Account Executive - Healthcare

Senior Medical Only Claims Representative - Workers Compensation

Technical Support Representative (Remote)

Technical Support Engineer III

Sr Corporate Paralegal

Other jobs at Softgic

1747 Cloud Security Specialist (AWS)

Full-Stack Developer (Product-Oriented) 1654

1748 Product Lead / Product Owner

Reach out to the hiring manager directly.