This is a remote position.
We're building agentic AI for Fortune 500 enterprises — the kind that survives contact with messy real-world data.
We're hiring a Forward Deployed Engineer with deep production RAG expertise to embed with our largest enterprise customers, architect their retrieval and grounding systems end-to-end, and lead a distributed team shipping AI that runs on real traffic.
This is not a "build a POC with LangChain in two weeks" role. This is "your code is in the customer's production environment on Monday, and you own the faithfulness, latency, and citation numbers."
What you'll do
You should have
- Chunking strategies and the ablations you ran to choose one
- Vector DB trade-offs from real production use — Pinecone, Qdrant, Weaviate, pgvector, Vespa, OpenSearch (pick your 2–3 and have opinions)
- Hybrid search, reranking (Cohere, cross-encoders, ColBERT), HyDE, query rewriting, context compression
- Retrieval and generation evaluation — you can quote real numbers from real systems you've owned
- The boring-but-critical stuff: access control, citation enforcement, freshness, multi-tenancy, cost-at-scale
Nice to have
Who you are
Why this role

NTT DATA

Contech Systems Inc.

Life360

Roofr

NewRocket

Embrace Software Inc

Embrace Software Inc

Embrace Software Inc