Match score not available

Remote Machine Learning Data Analyst | WFH

Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

Bachelor's degree in Data Science or related field, 2+ years experience in Data Analyst role, Proficient in Python, Pandas, and NumPy, Experience with web analysis tools, Knowledge of natural language processing techniques.

Key responsabilities:

  • Manage complete data lifecycle for machine learning applications
  • Extract and structure data from DOM environments
  • Collaborate with engineers on feature engineering experiments
  • Generate synthetic datasets using large language models
  • Develop validation and data quality systems
Get It Recruit - Information Technology logo
Get It Recruit - Information Technology Human Resources, Staffing & Recruiting TPE https://www.get.it/
2 - 10 Employees
See more Get It Recruit - Information Technology offers

Job description

Job Overview

We are seeking a highly skilled and driven Machine Learning Data Analyst to become an integral part of our innovative AI & Threat Analytics team. In this role, you will play a vital role in enhancing our autofill classification models through meticulous management, optimization, and analysis of intricate datasets. This position offers the flexibility of 100% remote work, with the option for a hybrid schedule for candidates based in the El Dorado Hills, CA, or Chicago, IL areas.

Key Responsibilities

  • Take full ownership of the complete data lifecycle, including collection, cleaning, and preprocessing of HTML-based datasets for machine learning applications.
  • Utilize advanced web analysis tools to extract and structure data from DOM environments, ensuring robust model training and validation.
  • Collaborate closely with engineers to support and execute feature engineering experiments while producing high-quality training datasets.
  • Generate and enhance synthetic datasets utilizing large language models (LLMs) to improve data balance and availability for model training.
  • Employ dimensionality reduction techniques to analyze data, explore feature strengths, and elevate dataset quality.
  • Streamline data processing workflows through automation, enhancing efficiency and accuracy in data manipulation and transformation.
  • Maintain comprehensive documentation for data workflows, methodologies, and processes to ensure lineage, reproducibility, and scalability.
  • Establish robust validation and data quality systems to guarantee consistency and integrity across all datasets.

Required Skills

  • 2+ years of professional experience in a Data Analyst role, ideally within a cybersecurity or machine learning context.
  • Proficiency in Python for data manipulation and analysis, utilizing libraries such as Pandas and NumPy for workflow automation.
  • Extensive experience with web analysis tools (e.g., Selenium, BeautifulSoup) and a solid grasp of HTML and DOM structures for effective data extraction.
  • Knowledge of natural language processing (NLP) techniques including tokenization, stop word removal, and lemmatization for text data preparation.
  • Experience in generating synthetic datasets and leveraging LLMs to enhance machine learning data.
  • Strong collaboration skills to work effectively with machine learning engineers and other technical teams.
  • Exceptional problem-solving ability with a meticulous focus on data quality and governance.
  • Familiarity with cloud platforms (AWS, GCP, Azure) for data storage and processing.

Qualifications

  • A Bachelor's degree in Data Science, Statistics, Computer Science, or a related field, or equivalent experience.
  • Due to the role's involvement in GovCloud, all applicants must be identified as a US Person.

Career Growth Opportunities

Joining our team presents numerous opportunities for professional advancement as you work alongside experienced machine learning engineers and gain hands-on exposure to cutting-edge data analysis techniques and tools.

Company Culture and Values

We pride ourselves on fostering a collaborative and innovative environment that values diversity and encourages sharing ideas and leveraging collective expertise.

Compensation And Benefits

  • Medical, dental, and vision insurance (including coverage for domestic partnerships).
  • Employer-paid life insurance and employee/spouse/child supplemental life insurance.
  • Voluntary short/long-term disability insurance.
  • 401(k) plan with both Roth and traditional options available.
  • Generous paid time off (PTO) plan that acknowledges your commitment and seniority, including paid bereavement and jury duty leave.
  • Competitive annual bonuses.

We are dedicated to creating an inclusive environment for all employees.

Employment Type: Full-Time

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
Human Resources, Staffing & Recruiting
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Problem Solving
  • Collaboration

Data Analyst Related jobs