Strong expertise in Large Language Models (LLMs) and NLP pipelines., Proven experience in building and deploying Generative AI applications., Hands-on knowledge of frameworks like Python, LangChain, and LlamaIndex., Experience with cloud platforms (AWS, GCP, Azure) and microservice architecture..
Key responsibilities:
Develop and prototype Generative AI solutions and POCs.
Collaborate with cross-functional teams to integrate AI features into products.
Build production-ready systems from validated prototypes.
Implement retrieval-augmented generation (RAG) pipelines and optimize AI models.
Report this Job
Help us maintain the quality of our job listings. If you find any issues
with this job post, please let us know. Select the reason you're reporting
this job:
With over 25 years of expertise, CloudRay is a trailblazer in IT Consulting, IT Staffing, and Industrial Staffing solutions. Our founder's profound industry knowledge forms the cornerstone of our success across these three domains.
In IT development, we excel in Software Application Design, Architecture and Integration, Software Sourcing, and Project Management, setting a global standard for consultant-provided software solutions.
Complementing our top-tier services, CloudRay proudly offers Professional consulting in Infrastructure Management, SAP Data Management, Information Management Systems, and Custom Application Development, Support, and Maintenance.
Specializing in versatile Industrial Staffing, we cater to roles ranging from assembly line workers to warehouse associates, prioritizing excellence in the dynamic industrial landscape.
Serving several clients and industry leaders in Healthcare, Oil & Gas, Hospitals, Pharmaceuticals, our onshore/offshore global service model ensures unparalleled service accessibility, adapting to clients' individual needs with exceptional availability matching the quality of our service.
Job Summary: We are looking for a Senior Generative AI Engineer to lead the development of Proofs of Concept (POCs) and transition them into robust, scalable productiongrade solutions. The ideal candidate has strong expertise in LLMs, prompt engineering, RAG, and deploying GenAIpowered applications. Youll collaborate across product, data, and engineering teams to rapidly prototype ideas and deliver AIfirst features that create business impact.
Key Responsibilities: • Drive endtoend development of POCs using Generative AI models (OpenAI, Claude, Gemini, Mistral, opensource LLMs). • Translate business problems into AIpowered use cases and prototypes with clear outcomes. • Architect and build productionready systems from validated POCs. • Implement RetrievalAugmented Generation (RAG) pipelines, vector databases (e.g., Pinecone, FAISS, Weaviate), and embeddingbased search. • Optimize prompts, model selection, finetuning, and response pipelines for reliability and costefficiency. • Build API services, microservices, or SDKs for GenAI functionalities and expose them to frontend or enterprise systems. • Evaluate opensource and proprietary models and recommend fitforpurpose solutions. • Ensure secure, ethical, and responsible AI use in compliance with organizational and regulatory guidelines. • Collaborate closely with product managers and software engineers to integrate GenAI into realworld applications.
Required Skills & Qualifications: • Strong experience with LLMs, transformerbased architectures, and NLP pipelines. • Proven track record building and deploying GenAIpowered POCs or applications. • Handson experience with OpenAI, Anthropic, Google Gemini, Hugging Face, Llama, etc. • Experience in Python, LangChain, LlamaIndex, or similar orchestration frameworks. • Working knowledge of vector databases, embedding models, and RAG architecture. • Cloud experience (AWSGCPAzure) including AIML services, serverless architecture, and containerization. • Familiarity with API design, backend development, and microservice architecture. • Strong understanding of model safety, cost optimization, prompt chaining, token limits, and response streaming.
Preferred Skills: • Experience with finetuning opensource models (e.g., LLaMA, Mistral, Falcon). • Familiarity with agentic workflows (e.g., AutoGPT, CrewAI, LangGraph). • Exposure to MLOps tools (MLflow, Kubeflow, SageMaker Pipelines). • Ability to handle unstructured data (PDFs, audio, images, structured logs) and convert into usable GenAI formats.
Required profile
Experience
Level of experience:Senior (5-10 years)
Industry :
Human Resources, Staffing & Recruiting
Spoken language(s):
English
Check out the description to know which languages are mandatory.