
Software Engineer (US, remote work)

Remote: Full Remote
Salary: 29 - 29K yearly
Experience: Mid-level (2-5 years)

Offer summary

Qualifications:

Experience in RLHF for LLMs, prior experience in data annotation, familiarity with models such as GPT-3/4 and BERT, and a strong command of programming languages (Python, SQL). Turing.com work experience preferred.

Key responsibilities:

  • Annotate and curate data for AI models
  • Implement RLHF techniques with human feedback
Outlier Large https://outlier.ai/
1001 - 5000 Employees

Job description

**ONLY FOR CANDIDATES WHO HAVE WORKED AT TURING.COM**

**Other candidates should apply to a different job listing posted by me**

**RLHF for LLMs**


Type: Part-Time, Remote

Perks: US organisation; compensation competitive with Turing's

Compensation: Starting at $15/hour (~Rs. 1200+ per hour)

  • If you work an average of 3 hours a day, that could be upwards of Rs 80K per month
  • If you work an average of 8 hours a day, that could be upwards of Rs 2L per month


Minimum Commitment: 10 hours/week

Signing Bonus: $300 for qualified candidates who onboard within the next week and stay for a month


About Us:

We are at the forefront of AI and machine learning, and we’re looking for motivated individuals to contribute to the next generation of intelligent models. The ideal candidate will have experience working with Turing.com and a strong background in data annotation, prompt engineering, and model fine-tuning. You will play a critical role in refining AI systems, providing essential human feedback, and enhancing overall model performance.


πŸŒπŸ’‘Key Qualifications:

✅ Experience in RLHF:

  • Deep understanding of Reinforcement Learning (especially RLHF) for LLMs and how it applies to improving AI models.
  • Hands-on experience in fine-tuning LLMs through iterative human feedback.

✅ Data Expertise:

  • Prior experience in annotating datasets for AI/ML models with a focus on quality control.
  • Experience with annotation tools and platforms like Labelbox, Prodigy, or Turing’s proprietary tools.

✅ Technical Proficiency:

  • Familiarity with LLMs such as GPT-3/4 and BERT, and with other advanced NLP models.
  • Strong command of Python, SQL, or related programming languages for handling data processing tasks.
  • Understanding of prompt engineering; experience with platforms like Hugging Face or LangChain is a plus.

✅ Turing.com Experience:

  • Prior work experience at Turing.com (or similar remote work platforms), with a focus on AI, data annotation, or similar roles.
  • Understanding of the remote work dynamic and experience collaborating with distributed teams.

✅ Preferred Qualifications:

  • Experience in model fine-tuning, prompt engineering, and human-in-the-loop systems.
  • Familiarity with cloud platforms (AWS, Azure) and MLOps best practices.
  • Previous work on reinforcement learning pipelines in large-scale AI projects.


πŸŒπŸ’‘What You’ll Do:

You will play a key role in annotating and curating data for the training and fine-tuning of large language models (LLMs), ensuring annotations are accurate, consistent, and project-aligned. You’ll implement Reinforcement Learning from Human Feedback (RLHF) techniques, providing structured human feedback to guide model outputs and continuously fine-tune models to improve performance.


πŸŒπŸ’‘Why You Should Apply:

✅ Flexible opportunity without any restrictions – work whenever it fits your schedule!

✅ Remote – work from anywhere in India!

✅ Competitive pay – starting from $15/hour based on experience and performance


How to Apply:

✅ Fill out the Google Form

✅ Wait for the shortlisting email

✅ Receive the offer letter

✅ Take onboarding seriously


Follow for more AI Jobs + Entrepreneurship

Ayyush Sharma (Chhotapreneur)

Growth, Strategy & Revenue Operations | A+ track record in scaling startups.

Growth @ Outlier AI

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s): English

Other Skills

  • Collaboration
  • Quality Control
