LLM Engineer (Reinforcement Learning)

• 30+ days ago

Key Facts

Remote From:

Category: Field Engineer (Solutions)

Full time

Mid-level (2-5 years)

English

Hard Skills

Natural Language Processing (NLP) Reinforcement Learning Python (Programming Language) Deep Learning Application Programming Interface (API) PyTorch (Machine Learning Library) Graphics Processing Unit (GPU) Kubernetes Docker (Software)

Other Skills

•
Troubleshooting (Problem Solving)
•
Problem Solving

Roles & Responsibilities

3+ years of experience in Deep Learning or NLP, with a master's degree acceptable for entry-level applicants.
Proficiency in Python programming and experience with PyTorch for model design, training, and evaluation.
Experience with GPU-based LLM training and troubleshooting skills.
Knowledge of distributed training frameworks such as Slurm, DDP, or Horovod.

Requirements:

Improve the efficiency of LLM training processes using algorithms like PPO, GRPO, and DPO.
Enhance the accuracy and stability of generated outputs while preventing reward hacking.
Develop foundational models capable of external knowledge and API integration, including selecting necessary external tools based on instructions.

Job description

We are looking for the best
LLM Engineer(Reinforcement Learning)는 LLM학습 파이프라인을 설계하여 실서비스에서 활용 가능한 생성형 언어모델을 학습합니다. 지속적인 품질 향상을 위하여 끊임없이 새로운 방법론을 시도하여, 실사용자에게 꼭 필요한 서비스를 출시하고, LLM 스스로 품질을 개선할 수 있도록 가다듬는 일에 기여합니다.
Responsibilities
LLM학습 과정의 효율 향상
PLM 또는 Finetuned LLM의 Direct Alignment Algorithm PPO, GRPO, DPO 등을 이용한 학습 과정의 전반적인 효율 향상
생성 결과의 전반적인 정확성과 안정성 향상
생성 결과의 품질 향상을 위하여 Reward Hacking을 방지하고, SelfRefine이 가능한 학습 구조 설계
외부 지식 및 API와 연동 가능한 기초 모델 개발
지시의 종류에 따라 스스로 필요한 외부 연동 Tool을 선택하는 LLM 학습
Qualifications
Deep Learning 또는 NLP 관련 경력 3년 이상 (석사 신입 지원 가능)
숙련된 프로그래밍 (Python & pytorch) 능력
PyTorch를 활용한 모델 설계, 학습, 평가 및 최적화 경험
GPU를 활용한 LLM 학습 및 Trouble shooting 능력
분산 학습 프레임워크(Slurm, DDP, Horovod 등) 사용 경험
동료와의 원활한 협업 능력
Preferred Qualifications
Deep LearningNLP 관련 논문 제출 또는 석박사 학위 소지자
주요 학술 대회(ACL, EMNLP, NeurIPS 등) 논문 발표 경험
Docker 및 Kubernetes에 대한 경험
GPU 클러스터를 활용한 학습 파이프라인 설계 및 관리 경험
GPU를 활용한 학습 및 서비스 개발 경험
GPU 기반의 Training 또는 Inference 시스템 구축 경험
LLM의 Posttraining 관련 경험
Supervised FineTuning 및 Parameter Efficient FineTuning 활용 경험
Interview Process
서류전형 코딩테스트 화상면접 (1시간 내외) 대면 혹은 화상면접 (3시간 내외) 최종합격
전형절차는 직무별로 다르게 운영될 수 있으며, 일정 및 상황에 따라 변동될 수 있습니다.
전형일정 및 결과는 지원서에 등록하신 이메일로 개별 안내드립니다.
Additional Information
이력서 제출 시 주민등록번호, 가족관계, 혼인 여부, 연봉, 사진, 신체조건, 출신 지역 등 채용절차법상 요구 금지된 정보는 제외 부탁드립니다.
모든 제출 파일은 30MB 이하의 PDF 양식으로 업로드를 부탁드립니다. (이력서 업로드 중 문제가 발생한다면 이력서와 함께 지원하시고자 하는 포지션의 URL을 recruit@42dot.ai으로 전송 부탁드립니다.)
인터뷰 프로세스 종료 후 지원자의 동의하에 평판조회가 진행될 수 있습니다.
국가보훈대상자 및 취업보호 대상자는 관계법령에 따라 우대합니다.
장애인 고용 촉진 및 직업재활법에 따라 장애인 등록증 소지자를 우대합니다.
42dot은 의뢰하지 않은 서치펌의 이력서를 받지 않으며, 요청하지 않은 이력서에 대해 수수료를 지불하지 않습니다.
※ 지원 전 아래 내용을 꼭 확인해 주세요.
42dot이 일하는 방식, 42dot Way 보러가기 →
42dot만의 업무몰입 프로그램, Employee Engagement Program 보러가기 →

Ready to apply?

Share ·

Field Engineer (Solutions) Related jobs

South Korea Field Engineer (Solutions)

Field Supervisor 2 (Midwest)

Field Supervisor 2 (Midwest)

Field Supervisor 2 (Midwest)

1 day ago

Principal Financial Group

Full time

Securities ResearchSecurities (Finance)Customer Experience Improvement

Roku Engineer

Roku Engineer

Just Now

TribalScale

Full time

Java API For RESTful Web ServicesGoogle TvApple TV

Field Clinical Engineer - US

Field Clinical Engineer - US

Field Clinical Engineer - US

1 day ago

T45 Labs

Full time

Cath LabDevice CompatibilityClinical Data AnalysisGood Clinical Practices (GCP)Medical Imaging

Field Engineer - Emergency Call Handling - Federal Team

Field Engineer - Emergency Call Handling - Federal Team

Field Engineer - Emergency Call Handling - Federal Team

1 day ago

Noggin

Full time

System ConfigurationCloud ComputingPublic Safety TelecommunicationNetwork TroubleshootingCloud Applications

Petroleum Engineer

Petroleum Engineer

Petroleum Engineer

1 day ago

BP

Full time

Artificial LiftSystems ModelingProduction PlanningGeomechanicsFlow Assurance

Other jobs at 42dot

Infra Security Engineer (엔드포인트 보안솔루션 구축/운영)

Infra Security Engineer (엔드포인트 보안솔루션 구축/운영)

Infra Security Engineer (엔드포인트 보안솔루션 구축/운영)

30+ days ago

42dot

Full time

Azure SecurityThreat DetectionEndpoint SecuritySoftware SecurityTest Automation

Senior MCU System Engineer

Senior MCU System Engineer

Senior MCU System Engineer

30+ days ago

42dot

Full time
Senior (5-10 years)

Abstraction LayersAUTomotive Open System Architecture (AUTOSAR)Microprocessor ArchitectureAbstraction LayersMicrocontrollers

Sr. Test Automation Engineer

Sr. Test Automation Engineer

Sr. Test Automation Engineer

30+ days ago

42dot

Full time
Senior (5-10 years)

Automated Testing FrameworkPython (Programming Language)Systems ArchitectureRequirements TraceabilityMultiprocessing

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.