Our client, ShipShape.vc, is a UK-based tech company that helps startups discover and secure funding opportunities - from venture capital to alternative sources - tailored to their specific needs. The funding landscape is notoriously opaque for most startups, which ShipShape.vc addresses by using AI to scrape and analyse relevant funding data, then recommending the most suitable options.
They are seeking a highly skilled and motivated AI / ML Engineer to join their team. The ideal candidate is proficient in Python and has experience working with modern AI (NLP) tools and workflows.
This role requires expertise in building and deploying scalable vector search solutions, installing and working with large language models (LLMs) like Llama 3 in cloud solutions, and developing pipelines for summarization and embedding generation.
Apply if you enjoy:
What you will be doing:
You will be the main ML / AI engineer driving the co-ideation and creation of end-to-end solutions. These include namely improving ShipShape.vc's recommendation engine for startups, and also processing structured startup data for end-users.
LLM Integration:
o Install, fine-tune e.g. Llama 3 and other large language models in cloud solutions e.g. Sagemaker, Vertex. Note: using ChatGPT will not suffice.
o Develop custom workflows to generate outcomes using LLM e.g. text summarization, embeddings, classifications.
o Complete PEFT to fine-tune models.
o Conversant with tools for e.g. NER.
o Using latest enablers from e.g. HuggingFace.
Model Training & Fine-tuning:
o Train and adapt LLMs to specific business requirements using domain-specific datasets.
o Evaluate and improve model accuracy, scalability, and efficiency.
Data Engineering & Programming:
o Develop and maintain robust ETL pipelines using Python.
o Leverage Docker for containerized development and deployment workflows.
o Implement and manage CI/CD pipelines for seamless integration and delivery.
Vector Search Solutions:
o Design and implement vector-based search systems using Pinecone or similar technologies.
o Optimise search performance for large-scale datasets to support real-time and batch querying.
Documentation:
o Document workflows, codebases, and best practices to ensure maintainability and scalability.
Requirements
Problem-Solving:
o Strong analytical and troubleshooting skills.
o Proven ability to design scalable and efficient solutions for complex AI/ML problems.
AI/ML Expertise:
o Strong proficiency in NLP techniques
o Experience working with vector databases, particularly: Pinecone.
o Understanding of LLM architectures and installation/configuration of models like Llama 3.
o Ability to fine-tune models for tasks such as summarisation, text generation, and embedding creation.
o Deep understanding of RAG pipelines, including integration with LLMs and retrieval systems
o Practical experience with PEFT methods for customising LLMs
Other technical Skills:
o Strong proficiency in Python with experience in libraries such as Pandas, NumPy, and PyTorch.
o Hands-on experience with Docker for development and deployment.
o Familiarity with CI/CD tools (e.g., Jenkins, GitHub Actions, GitLab CI).
o Familiarity with cloud platforms (AWS, GCP, or Azure) for deploying ML workflows
Benefits
Letuelezioni
Jobs via Dice
Egis en France
Galileo Learning
julhiet sterwen