Senior Solutions Architect (AIML) Contractual

fully flexible
Work set-up: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

8+ years of experience in software engineering or architecture., 4+ years leading cross-functional GenAI/ML teams., Hands-on experience with distributed AI systems in production., Expertise in enterprise-scale AI architecture and advanced AI concepts..

Key responsibilities:

  • Design and deploy multi-agent AI systems using modern frameworks.
  • Lead the development of high-performance ML infrastructure and deployment pipelines.
  • Drive innovation and establish best practices in AI/ML architecture.
  • Oversee production AI systems, including deployment, monitoring, and optimization.

Proximity Works logo
Proximity Works Scaleup https://www.proximity.tech/
51 - 200 Employees
See all jobs

Job description

We are looking for a Senior Solutions Architect to design, develop, and scale innovative AIMLdriven solutions. You will be responsible for architecting highly scalable, lowlatency distributed systems optimized for AIML workloads. As a key technical leader, you will solve complex challenges, influence nextgeneration AIML infrastructures, and guide crossfunctional teams to deliver stateoftheart solutions for fastgrowing startups and enterprise companies.

Be at the forefront of shaping nextgeneration AIML infrastructures, driving solutions for highimpact products across diverse industries. Youll have the opportunity to influence key architectural decisions and enable realworld applications that scale globally, ensuring innovation and efficiency at every step.

Requirements

Youll be responsible for —
Driving endtoend GenAI architecture and implementation:
  • Design and deploy multiagent systems using modern frameworks (LangGraph, CrewAI, AutoGen)
  • Architect RAG solutions with advanced vector store integration
  • Implement efficient finetuning strategies for foundation models
  • Develop synthetic data generation pipelines for training and testing
    • Leading ML infrastructure and deployment:
      • Design highperformance model serving architectures
      • Implement distributed training and inference systems
      • Establish MLOps practices and pipelines
      • Optimize cloud resource utilization and costs
      • Set up monitoring and observability solutions
        • Drivng technical excellence and innovation:
          • Define architectural standards and best practices
          • Lead technical decisionmaking for AIML initiatives
          • Ensure scalability and reliability of AI systems
          • Implement AI governance and security measures
          • Guide teams on advanced AI concepts and implementations
            • Overseeing production AI systems:
              • Manage model deployment and versioning
              • Implement AB testing frameworks
              • Monitor system performance and model drift
              • Optimize inference latency and throughput
              • Ensure high availability and fault tolerance
                • Fostering collaboration and growth:
                  • Mentor engineering teams on AI architecture
                  • Collaborate with stakeholders on technical strategy
                  • Drive innovation in AIML solutions
                  • Share knowledge through documentation and training
                  • Lead technical reviews and architecture discussions
                    • You need —
                      8+ years exoerience in software engineering or architecture, including:
                      • 4+ years leading crossfunctional GenAIML teams
                      • Production experience with distributed AI systems
                      • Enterprisescale AI architecture implementation
                        • To lead and architect enterprisescale GenAIML solutions, focusing on:
                          • Multiagent orchestration using LangGraph, CrewAI, and AutoGen
                          • Workflow automation with LlamaIndex, LangChain, and LangFlow
                          • Agent coordination using LETTA framework
                          • Integration of specialized agents for reasoning, planning, and execution
                            • To design and implement sophisticated AI architectures incorporating:
                              Advanced RAG systems using:
                              • Vector databases (Chroma, Weaviate, Pinecone, Milvus)
                              • Hybrid search with BM25 and semantic embeddings
                              • Selfquerying and recursive retrieval patterns
                                • Finetuning strategies for foundation models:
                                  • PEFT methods (LoRA, QLoRA, Adaptertuning)
                                  • Parameterefficient training approaches
                                  • Instruction finetuning and RLHF
                                    • Multiagent frameworks integrating:
                                      • Tooluse and reasoning chains
                                      • Memory systems (shortterm and longterm)
                                      • Metaprompting and reflection mechanisms
                                      • Agent communication protocols
                                        • Expertise in advance data generation and synthesis:
                                          • Synthetic data generation using Arigilla and PersonaHub
                                          • Privacypreserving data synthesis
                                          • Domainspecific data augmentation
                                          • Quality assessment of synthetic data
                                          • Data balancing and bias mitigation
                                            • To architect highperformance ML serving infrastructure focusing on:
                                              • Model serving platforms (BentoML, Ray Serve, Triton)
                                              • Realtime processing with Ray, Kafka, and Spark Streaming
                                              • Distributed training using Horovod, DeepSpeed, and FSDP
                                              • vLLM and TGI for efficient inference
                                              • Integration patterns for hybrid cloudedge deployments
                                                • To drive cloud architecture decisions across:
                                                  • Kubernetes orchestration with Kubeflow and KServe
                                                  • Serverless ML with AWS Lambda, Azure Functions, Cloud Run
                                                  • Autoscaling using HPA, KEDA, and custom metrics
                                                  • Resource optimization with Nvidia Triton and TensorRT
                                                  • MLOps platforms (MLflow, Weights & Biases, DVC)
                                                    • Bonus points for —
                                                      • Research publications in AIML
                                                      • Opensource project maintenance
                                                      • Technical blog posts on AI architecture
                                                      • Conference presentations
                                                      • AI community leadership
                                                        • Benefits

                                                          What you get —
                                                          • Best in class salary: We hire only the best, and we pay accordingly.
                                                          • Proximity Talks: Meet other designers, engineers, and product geeks — and learn from experts in the field.
                                                          • Keep on learning with a worldclass team: Work with the best in the field, challenge yourself constantly, and learn something new every day.
                                                            • About us —

                                                              We are Proximity — a global team of coders, designers, product managers, geeks, and experts. We solve complex problems and build cuttingedge tech at scale. Heres a quick guide to getting to know us better:

                                                              • Watch our CEO, Hardik Jagda, tell you all about Proximity.
                                                              • Read about Proximitys values and meet some of our Proxonauts here.
                                                              • Explore our website, blog, and the design wing — Studio Proximity.
                                                              • Get behind the scenes with us on Instagram! Follow @ProxWrks and @H.Jagda

    Required profile

    Experience

    Level of experience: Mid-level (2-5 years)
    Spoken language(s):
    English
    Check out the description to know which languages are mandatory.

    Other Skills

    • Community Leadership
    • Presentations
    • Mentorship
    • Decision Making
    • Collaboration

    Solutions Architect Related jobs