MidSenior LLM Engineer (Remote Worldwide)

Work set-up: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

5+ years of experience in building production-grade Python codebases., Deep understanding of transformer architectures and training dynamics., Expertise in inference optimization techniques like quantization and distillation., Hands-on experience with multi-GPU/multi-node fine-tuning using tools like FSDP, DeepSpeed, or accelerate..

Key responsibilities:

  • Fine-tune and optimize large language models for conversational AI.
  • Collaborate with stakeholders to develop and improve AI features.
  • Manage datasets and adapt models for multilingual support.
  • Assess and select technological approaches for model training and deployment.

EverAI logo
EverAI https://www.everai.ai/
51 - 200 Employees
See all jobs

Job description

Our Vision & Products

🚀 EverAI — Building the Future of AI Companionship

One of the Top 15 Largest & FastestGrowing AI Companies in the World

30+ Million Users in under 2 years — Help Us Reach 100M first, 500M next

At EverAI, we’re shaping what it means to connect with AI. With 30+ million users and counting, were not just building products — were creating entirely new categories.

Our flagship product is the worlds largest AI girlfriendboyfriend platform, redefining relationships for millions. And we’re only just getting started.

Up next? We’re scaling our second product to revolutionize the creator economy. Think bestinclass AI content engines for video and image generation — designed to put worldclass tools in every creator’s pocket.

All of this is governed by our proprietary moderation system, EverGuard — an internal AI designed to ensure everything we build is safe, ethical, and humanfirst.

Our Team

We are an enthusiastic, passionate and hardworking team of 55+ people. Our founding team has strong entrepreneurial experience building and scaling web products from 0 to IPO.

Alexis Soulopoulos [CEO]

• 10+ years in Tech Executive Leadership

• CoFounder Mad Paws Holdings (from 0 to IPO)

• Forbes 30 under 30 + Deloitte TechFast50 ’22 & ‘23

Michael Monin [Cofounder & CTO]

• 10+ years as CTO COO (web2web3), 1+ year in AILLM

• Serialentrepreneur: MTK Digital (exited 0>$20m revenue) and Zipchat (AI Chatbot for Ecommerce brands)

Thomas Lacroix [Cofounder & CMO]

• 8+ years in Customer Acquisition & Ecommerce Growth

• Serialentrepreneur: Curatible (sold to Blackstone) and MTK Digital (exited 0>$20m revenue)

Maruša Fasano [CFOLegal]

• 25+ years in Finance, Strategy, M&A

• ExCFOM&A @Curatible (exited to Blackstone)

• ExPresident of the Board @SotremoSA (exited)

• CofounderCFO @SoftOne (exited)

Your Role

🚀 Architect the Future of AI Relationships

As our LLM Engineer, youll finetune and optimize large language models that power conversations for over 30 million users, processing more than 5 million messages daily. Youll be at the forefront of developing AI companionship technology that scales globally while maintaining personalized and meaningful interactions.

Key Responsibilities
  • Interact with stakeholders (Cofounders, Web Engineers, DevOps Engineers) to bring your project to life.

  • Oversee the creation and optimization of algorithms for LLM behavior adjustments based on user interactions, focusing on finetuning and prompt engineering.

  • Develop features to improve the richness of the product (multicharacter chats, gamification, etc)

  • In addition to chat, interacting with modalities managed by other team members (audio, image, video), and collaborating with them

  • Adaptation and finetuning of base models for multilingual support

  • Manage the creation and maintenance of diverse datasets critical for training and improving the performance of LLMs.

  • Assess and determine the best technological approaches, selecting between classifiers, finetuning, and other methods based on the specific projects needs.

    • Your Qualifications

      MustHaves
      • Python Mastery: 5+ years building production‑grade, modular, maintainable codebases

      • LLM Architecture Expertise: Deep understanding of transformers and their training dynamics (attention, positional encodings, samplers, tokenizers, posttraining, reasoning LM)

      • Inference Optimization at Scale: Expert with vLLM TensorRT‑LLM (or similar); proven record of reducing latency and memory via quantization andor distillation

      • Distributed Training: Hands‑on multi‑GPU multi‑node fine‑tuning using FSDP, DeepSpeed, or accelerate; comfortable with mixed‑precision, gradient checkpointing, and memory‑aware scheduling

      • Performance Profiling & Optimization: Skilled at identifying and resolving compute or memory bottlenecks across CPUGPU pipelines with industry‑standard profiling workflows

        • Nice‑to‑Haves
          • Concurrency & Runtime Engineering: Strong with asyncio, multiprocessing, or equivalent backendbatch‑scheduling patterns

          • Low‑level Systems: Practical CUDA Triton experience; able to write or debug custom kernels

          • Open‑Source Impact: Contributor to core LLM tooling (vLLM, HF Transformers, Triton, etc.)

          • Real‑time Deployments: Built or maintained latency‑critical, multi‑user LLM services (RAG, streaming, agents, chatbots)

          • Specialized Generation Use Cases: Exposure to erotic role playing, multi‑turn instruction tuning, or non‑English quality alignment

            • Soft Skills

              🗣 Strong communication & collaborative skills (perfectly fluent in English)

              🎯 Goaloriented, ownership and commitment

              ⚡️ Doer mindset we are moving fast and we need people who can find the right balance between executing, planning and strategy

              🧢 Humble willing to learn, open to feedback

              🍭 #NSFW you are comfortable building products that are based on uncensored models and content

              Why EverAI?

              📈 Exponential Growth: From 30M+ users in 18 months, to 100M next — and 500M beyond

              🚀 Track Record of CategoryCreating Innovation: We consistently launch worldfirst AI applications — setting the pace, not following it

              🌍 Global Impact: Toptier user growth, realworld adoption, and cultural relevance

              🧠 Proven Leadership: A senior team that’s launched, scaled, and exited & IPO’d multiple scale ups — now fully focused on reshaping AI companionship

              👥 Elite Remote Team: 100% remote and built to win — worldclass talent from Tier 1 tech companies, with a culture of ownership, velocity, and radical creativity

              🛡️ Ethical Core: Our AI ecosystem is governed by EverGuard, our proprietary AI moderation technology, ensuring responsible development at scale

              What We Offer

              ✍️ We prefer a B2B contract but we can be flexible, as long as you’re in it for the long haul

              📍 Fullremote (you work from the place that suits you best)

              🏝️ 4 weeks PTO

              👨‍👩‍👧‍👦 Annual gathering to get to know each other better

              💆‍♀️ Wellbeing budget up to 200$

              📚 Learning budget

              💻 Company laptop

              ⚡️ GPT4, Mistral and Hugging Face Pro plan

              🎯 Top Tier Talent Is Our Multiplier

              We’re a fully remote group of Aplayers from Tier 1 tech, led by an exec team who’ve launched, scaled, and exited multiple companies. We move fast, and care deeply about what we build — and who we build it with.

              We’re looking for exceptional talent ready to ship & distribute worldfirst AI products at scale, fast, and cocreate with us this categorydefining business.

              If that’s you — reach out and apply!

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication
  • Goal-Oriented
  • Willingness To Learn

Field Engineer (Solutions) Related jobs