Senior AI & Data Engineer

Work set-up: Full Remote
Experience: Mid-level (2-5 years)

Offer summary

Qualifications:

  • Proficiency in Python and AI integrations.
  • Experience with web scraping, OCR, embeddings, and vector databases.
  • Background in building AI-driven products and workflows.
  • Knowledge of data processing, model training, and automation tools.

Key responsibilities:

  • Develop AI agents and agentic workflows using LangChain, OpenAI, and Gemini.
  • Create data pipelines for structured and unstructured data sources.
  • Implement OCR, vector search, and RAG systems for data extraction and retrieval.
  • Automate workflows and develop APIs to integrate AI solutions with the platform.

Bolsterup

Job description

Bolsterup is transforming the construction industry with AI-powered intelligence. We’re looking for an AI Engineer passionate about building agentic workflows, LLM-driven solutions, and smart automation.

This role suits someone with experience at the intersection of AI, data engineering, and automation, ideally from AI SaaS, data-heavy platforms, or applied AI startups.

What you’ll do:

Build AI agents with OpenAI, Gemini, and LangChain.

Create data pipelines for structured & unstructured data (web scraping, PDFs, Excel).

Implement OCR, vector search (Pinecone), and RAG systems.

Automate workflows using n8n & Python.

What we need:

✅ Expert in Python and AI integrations.

✅ Skilled in web scraping, OCR, embeddings, and vector DBs.

✅ Experience with custom model training & agent orchestration.

If you love building AI-driven products, designing intelligent workflows, and working with cutting-edge tech, we want to talk to you!

Requirements

Key Responsibilities
AI & LLM Development
  • Build agentic workflows using LangChain, OpenAI, Gemini, and custom orchestration.
  • Design context-aware RAG systems for accurate retrieval and response.
  • Fine-tune models for domain-specific tasks using LoRA, PEFT, and RLHF.

Data Processing & Extraction
  • Build robust web scrapers for structured and unstructured sources.
  • Implement OCR solutions for extracting data from PDFs, images, and scanned documents.
  • Parse Excel sheets, PDFs, and semi-structured data, extracting and matching entities across datasets.
  • Normalize and structure raw scraped and document data for downstream AI workflows.

Vectorization & Retrieval Systems
  • Implement and optimize data vectorization pipelines for semantic search.
  • Use Pinecone, FAISS, or Weaviate for vector storage and similarity search.
  • Apply dimension reduction techniques (PCA, UMAP) for efficiency.

Workflow Orchestration & Automation
  • Use n8n and similar tools for rapid prototyping and automation.
  • Build modular pipelines for continuous data ingestion and transformation.

Infrastructure & Integrations
  • Develop APIs and connectors to integrate AI-driven insights with Bolsterup’s core platform.
  • Deploy solutions using Docker, serverless architectures, and cloud platforms (GCP/AWS).
  • Implement monitoring for AI pipelines, including token usage and latency tracking.

Required Skills & Experience
  • Python Expert – Advanced proficiency in async programming, data processing (pandas, NumPy), and automation.
  • Web Scraping Expertise – Experience with Playwright, Puppeteer, Scrapy, and anti-bot evasion techniques.
  • Document Parsing & OCR – Skilled in Tesseract, AWS Textract, Google Document AI, or similar.
  • LLM Development – Hands-on with OpenAI, Gemini, LangChain, and building custom agents.
  • Vector Database Knowledge – Experience with Pinecone, FAISS, and embedding optimization.
  • Data Structuring & Entity Matching – Experience with data normalization, deduplication, and fuzzy matching.
  • Workflow Automation – Proficient in n8n, Zapier, or other orchestration platforms.
  • Cloud & Deployment – Familiar with Docker, serverless functions, and GCP/AWS.

Nice-to-Have Skills
  • Experience with Vertex AI and AI model deployment on cloud.
  • Familiarity with multimodal AI (text, image, tabular).
  • Knowledge of data governance and privacy best practices.
  • Prior experience with Stream Chat, Cloudflare Workers, and CDN-based deployments.
  • Experience building backend services with either Django or NestJS.

Benefits
  • Opportunity to build the future of AI in Contech.
  • Fully remote role.
  • Competitive compensation and equity.
  • Employee stock options.
  • Cutting-edge AI infrastructure and a fast-paced, innovation-driven culture.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s): English