About Spokeo:
Spokeo is a people intelligence platform that helps over 18 million monthly visitors reconnect with friends, reunite with families, and build trust in new relationships. Thousands of companies also trust Spokeo’s 60 billion public records to improve customer research, help verify information, and prevent fraud.
Founded in 2006, Spokeo has built a dedicated, remote-first team with an average tenure of 6.9 years. It has earned recognition from Comparably as a Best Company for Compensation, Employee Happiness, Perks and Benefits, Support for Women, Work-Life Balance, and CEO Leadership.
About this Opportunity
As a Senior Data Engineer at Spokeo, you will develop, optimize, and improve our data systems such as ETL data, pipeline, storage, and entity resolution. This involves working with infrastructure built in AWS, including Airflow, PySpark, EMR, S3, DynamoDB, and more. This role will help build and improve data products, automation platform features, analytical software packages, and data pipeline orchestration tools.
What You’ll Do:
Build infrastructure and data automation pipelines for the ingestion, processing, and loading of data from various sources. Automate and integrate new components into the data pipeline.
Collaborate with stakeholders and data science teams to develop data products, including entity resolution and best selection, to efficiently execute product vision and strategy in alignment with organizational goals and priorities.
Collaborated with Data Scientists and ML Engineers to design, build, and maintain scalable data pipelines and infrastructure to support end-to-end machine learning workflows (development, training, deployment, and monitoring).
Create unit and stress test components to monitor technical performance and ensure identified issues are resolved.
Develop data analysis tools to provide data insights and capture key metrics.
Research solutions and maintain technical documentation.
Follow best practices for data governance, quality, cleansing, and other ETL-related activities.
Who You Are:
7+ years of development experience in data engineering within a production environment (internships and academic settings excluded).
Proven experience working with large datasets exceeding 100M+ records or multiple terabytes.
2+ years of development experience in highly scalable, distributed systems and cluster architectures using AWS.
5+ years of hands-on programming experience with Python.
5+ years of professional experience working in big data ecosystems, Spark is required; PySpark is preferable.
3+ years of experience with SQL, schema design, and dimensional data modeling.
2+ years of professional experience working with dataflow orchestration tools, such as Airflow.
2+ years of experience with non-relational databases (e.g., DynamoDB, Elasticsearch, etc.).
A bachelor’s degree in Computer Science, Information Systems, Mathematics, or a related field is required.
Working at Spokeo
Our mission is to advance transparency, and to achieve that goal, we rally around six core values: listening with empathy, understanding the why, clarifying with data, innovating to learn, collaborating to achieve, and insisting on quality.
As a remote-first company, we are able to hire team members residing in the following US states: AZ, CA, CO, FL, GA, KY, MD, MI, MO, NJ, NV, NC, PA, SC, SD, TX, VA, WA, or WY.
In addition to a highly competitive base salary, our generous benefits include:
annual bonus program
stock options
401K matching
100% medical/dental/vision coverage
unlimited PTO
mental health resources
paid home office equipment
fitness reimbursements
support paying for courses
and more
We extend written offers to candidates who successfully complete their selection process. Offers will depend on several factors, including, but not limited to, marketplace competition, job leveling, experience, and skills.
Privacy Notice for Candidates: https://www.spokeo.com/recruiting-policy
Spokeo is an equal opportunity employer. Applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status. Spokeo fosters a business culture where ideas and decisions from all people help us grow, innovate, create the best products, and be relevant in a rapidly changing world.
Recruiters or staffing agencies: Spokeo is not obligated to compensate any external recruiter or search firm who presents a candidate or their resume or profile to a Spokeo employee without 1) a current, fully executed agreement on file, and 2) being assigned to the open position (as a search) via our applicant tracking solution.
#LI-Remote

TrueML Products

NEORIS

Jack & Jill

EVT

Veeva Systems

Spokeo

Spokeo

Spokeo