
Senior Data Engineer


Job description

Why Socure?

Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts. The mission is big, the problems are complex, and the impact is felt by businesses, governments, and millions of people every day.

We hire people who want that level of responsibility. People who move fast, think critically, act like owners, and care deeply about solving customer problems with precision. If you want predictability or narrow scope, this won’t be your place. If you want to help build the future of identity with a team that holds a high bar for itself — keep reading.

About the Role

We are looking for a Senior Data Engineer to join our Data Automation team. You will play a critical role in designing and building scalable data platforms and pipelines that power Socure’s identity verification products and analytics. This role is ideal for someone who has a strong passion for solving real business problems with data, and combines deep hands-on data engineering expertise with strong ownership.

What You'll Do

• Design and build batch and streaming data pipelines to support automated data ingestion, ML feature engineering, and analytics across multiple product domains.

• Own end-to-end delivery of complex, ambiguous data initiatives, including architecture, implementation, testing, deployment, monitoring, and documentation.

• Develop and evolve the data platform to support large-scale data processing using modern cloud-native technologies.

• Automate data operations (validation, quality checks, alerting, backfills, and recovery workflows) to reduce manual effort and improve consistency.

• Optimize cost, performance, and reliability of data workloads.

• Partner closely with cross-functional teams (Data Science, Product, Engineering) to understand requirements and translate them into technical solutions.

• Evaluate and adopt new technologies (new processing engines, storage formats, orchestration tools, GenAI-assisted ingestion) to keep the platform modern and efficient.

What You Bring

• 5+ years of hands-on data engineering experience, building and maintaining production-grade data platforms and pipelines.

• Strong programming skills in a general-purpose language (such as Python or Scala) for data processing, and SQL for data analytics.

• Deep experience with distributed data processing frameworks, such as Apache Spark, including performance tuning and optimization.

• Proven experience building data solutions using services on AWS (EMR, Lambda, S3, etc.).

• Strong understanding of data modeling and data warehousing concepts, including partitioning and schema design for large-scale datasets.

• Experience operating and supporting production pipelines, including monitoring, alerting, incident response, and improving reliability over time.

• Solid foundation in software engineering practices, including version control, CI/CD, testing strategies, and code review.

• Strong communication and collaboration skills, with the ability to work effectively with both technical and non-technical stakeholders.

Preferred Qualifications

• Experience with streaming or near-real-time data processing (Kafka, Kinesis, etc.).

• Hands-on experience with data orchestration tools (Airflow, Step Functions, etc.).

• Familiarity with modern data platform patterns such as Data Lakehouse, Data Mesh, and large-scale data sharing across teams.

• Experience with prompt engineering using modern GenAI and large language models (LLMs).

• Experience mentoring other engineers and contributing to engineering-wide standards and best practices.

Please note: Socure cannot provide sponsorship now or in the future for this role.

Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
If you need an accommodation during any stage of the application or hiring process—including interview or onboarding support—please reach out to your Socure recruiting partner directly.


