7+ years in software, data, or ML engineering with 3+ years building production search systems
Experience with e-commerce search patterns: faceting, merchandising rules, query understanding
Strong knowledge of embedding models, approximate nearest neighbor search, and reranking architectures
Hands-on experience with vector databases and similarity search at scale (Pinecone, Milvus, Weaviate, FAISS or similar)
Requirements:
Design hybrid retrieval systems combining keyword search, vector similarity, and cross-encoder reranking at scale
Build intelligent query routing with cascading classification strategies
Architect multi-model inference pipelines optimized for latency-sensitive workloads
Define relevance metrics, run A/B experiments, and drive measurable business outcomes
Job description
We are seeking a ML Architect for Search where they will design the retrieval and ranking systems that power product discovery for millions of users—balancing cutting-edge ML with real-time performance constraints. This is modern, ML-first search architecture: embedding models, vector similarity, cross-encoder reranking, and multi-model orchestration under strict latency budgets. Your work directly impacts conversion, revenue, and customer experience.
Responsibilities: · Design hybrid retrieval systems combining keyword search, vector similarity, and cross-encoder reranking at scale. · Build intelligent query routing with cascading classification strategies · Architect multi-model inference pipelines optimized for latency-sensitive workloads · Define relevance metrics, run A/B experiments, and drive measurable business outcomes · Support the driving MLOps standards for model deployment, monitoring, and continuous improvement · Partner with Product, Merchandising, and Engineering to translate business requirements into ML solutions · Mentor engineers and define search and ML architectural standards
Requirements: · 7+ years in software, data, or ML engineering with 3+ years building production search systems · Experience with e-commerce search patterns: faceting, merchandising rules, query understanding · Strong knowledge of embedding models, approximate nearest neighbor search, and reranking architectures · Hands-on experience with vector databases and similarity search at scale (Pinecone, Milvus, Weaviate, FAISS or similar) · MLOps expertise: model deployment pipelines, monitoring, versioning, and retraining workflows · Production experience with transformer-based models for classification and ranking · Track record balancing latency, cost, and relevance tradeoffs in real-time systems · Experience designing controlled experiments and defining ML success metrics
Preferred Requirements: · Experience with enterprise search platforms (Algolia, OpenSearch, Elastic or similar) · Background in Learning-to-Rank and multi-stage retrieval architectures · Cloud ML platform experience (AWS SageMaker, GCP Vertex AI, or Azure ML)
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.