Hands-on experience building end-to-end ML/GenAI pipelines (data → model → deployment)
Experience with document AI, embeddings, and vector search
Experience with vector databases and MongoDB; production-grade ML engineering practices
Requirements:
Design and develop scalable ML and Generative AI solutions
Build end-to-end ML pipelines (data → model → deployment) and document AI processing workflows (OCR/extraction, parsing, normalization, text chunking)
Develop embeddings for semantic search and implement vector similarity search using vector databases
Integrate ML models with vector databases and MongoDB, delivering production-grade, maintainable, deployment-ready code
Job description
Role: Data Science Engineer Location: Remote Job type: Full time Salary Range: $110,000-$130,000 a year
Job Description
Advanced Python development for ML/AI workloads.
End‐to‐end ML lifecycle: model training, evaluation, fine‐tuning, and labeling/tagging workflows.
Generative AI systems design, including LLM-based application development.
Prompt engineering optimization for large language models.
Document AI pipelines: OCR/extraction, parsing, normalization, and text chunking for structured & unstructured data Embedding generation pipelines for semantic search and retrieval Vector similarity search implementation using vector databases
ML model integration with Vector DBs and MongoDB Production‐grade ML engineering: scalable, maintainable, and deployment‐ready code Python, Large Language Models (LLMs) (via LLM‐based applications), Vector Databases, MongoDB. Roles & Responsibilities
We are seeking a highly skilled Data Science Engineer to design and develop scalable ML and Generative AI solutions. The ideal candidate will have deep expertise in Python, hands-on experience in model training, document processing pipelines, and strong knowledge of vector databases and modern ML/GenAI frameworks. Strong fit if the candidate:
Has expert level Python skills
Has hands on experience building ML/GenAI systems, not just theoretical knowledge
Has worked on end to end ML pipelines (data → model → deployment)
Has experience with document AI, embeddings, and vector search
Thinks like an engineer (scalable, maintainable, production ready code)
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.