Offer summary

Qualifications:

Proficient in Python with experience in libraries such as Pandas, NumPy, and PyTorch., Experience with modern AI (NLP) tools and workflows, particularly with large language models like Llama 3., Strong understanding of vector databases, especially Pinecone, and LLM architectures., Familiarity with cloud platforms (AWS, GCP, or Azure) and CI/CD tools for deploying ML workflows..

Key responsibilities:

Drive the co-ideation and creation of end-to-end AI/ML solutions for startup funding recommendations.

Install and fine-tune large language models in cloud solutions, developing custom workflows for text summarization and embeddings.

Design and implement vector-based search systems, optimizing performance for large-scale datasets.

Document workflows and best practices to ensure maintainability and scalability of ML solutions.

Job description

Our client, ShipShape.vc, is a UK-based tech company that helps startups discover and secure funding opportunities - from venture capital to alternative sources - tailored to their specific needs. The funding landscape is notoriously opaque for most startups, which ShipShape.vc addresses by using AI to scrape and analyse relevant funding data, then recommending the most suitable options.

They are seeking a highly skilled and motivated AI / ML Engineer to join their team. The ideal candidate is proficient in Python and has experience working with modern AI (NLP) tools and workflows.

This role requires expertise in building and deploying scalable vector search solutions, installing and working with large language models (LLMs) like Llama 3 in cloud solutions, and developing pipelines for summarization and embedding generation.

Apply if you enjoy:

A fast-paced startup environment
Building AI / ML products and staying ahead of AI developments
Working in small, lean teams, and happy to roll your sleeves up to get the work done
Co-ideating entire solutions to problems, rather than coding small tickets

What you will be doing:

You will be the main ML / AI engineer driving the co-ideation and creation of end-to-end solutions. These include namely improving ShipShape.vc's recommendation engine for startups, and also processing structured startup data for end-users.

LLM Integration:

o Install, fine-tune e.g. Llama 3 and other large language models in cloud solutions e.g. Sagemaker, Vertex. Note: using ChatGPT will not suffice.

o Develop custom workflows to generate outcomes using LLM e.g. text summarization, embeddings, classifications.

o Complete PEFT to fine-tune models.

o Conversant with tools for e.g. NER.

o Using latest enablers from e.g. HuggingFace.

Model Training & Fine-tuning:

o Train and adapt LLMs to specific business requirements using domain-specific datasets.

o Evaluate and improve model accuracy, scalability, and efficiency.

Data Engineering & Programming:

o Develop and maintain robust ETL pipelines using Python.

o Leverage Docker for containerized development and deployment workflows.

o Implement and manage CI/CD pipelines for seamless integration and delivery.

Vector Search Solutions:

o Design and implement vector-based search systems using Pinecone or similar technologies.

o Optimise search performance for large-scale datasets to support real-time and batch querying.

Documentation:

o Document workflows, codebases, and best practices to ensure maintainability and scalability.

Requirements

Problem-Solving:

o Strong analytical and troubleshooting skills.

o Proven ability to design scalable and efficient solutions for complex AI/ML problems.

AI/ML Expertise:

o Strong proficiency in NLP techniques

o Experience working with vector databases, particularly: Pinecone.

o Understanding of LLM architectures and installation/configuration of models like Llama 3.

o Ability to fine-tune models for tasks such as summarisation, text generation, and embedding creation.

o Deep understanding of RAG pipelines, including integration with LLMs and retrieval systems

o Practical experience with PEFT methods for customising LLMs

Other technical Skills:

o Strong proficiency in Python with experience in libraries such as Pandas, NumPy, and PyTorch.

o Hands-on experience with Docker for development and deployment.

o Familiarity with CI/CD tools (e.g., Jenkins, GitHub Actions, GitLab CI).

o Familiarity with cloud platforms (AWS, GCP, or Azure) for deploying ML workflows

Benefits

Competitive compensation structure.
Build a sector-defining product to meaningfully drive funding into the global startup ecosystem.
Flexible remote work setup.
Room for professional growth and skill development.