Senior Engineer – Multimodal AI Model Development Research

Work set-up: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Strong background in deep learning frameworks like TensorFlow or PyTorch., Experience in developing and deploying multimodal AI models involving text, vision, or audio., Knowledge of model optimization techniques such as quantization, pruning, and distillation., Advanced degree in Computer Science, Machine Learning, or related fields, with 5+ years of relevant experience..

Key responsibilities:

  • Design, develop, and optimize multimodal AI models for real-time inference.
  • Collaborate with cross-functional teams to integrate AI models into the platform.
  • Optimize models for memory efficiency, low latency, and high throughput.
  • Stay updated with the latest research and implement innovative techniques in generative AI.

Axelera AI logo
Axelera AI Scaleup https://axelera.ai/
51 - 200 Employees
See all jobs

Job description

Company Overview
Axelera is a European, highgrowth Series B startup revolutionizing the AI landscape with our inmemory computing platform. We specialize in creating AI hardware and software optimized for highperformance inference, catering to cuttingedge use cases across highend edge computing, embodied AI, and serverside AI deployments. We are looking for passionate, innovative research engineers to join our team and help drive the future of AI.

Role Overview
We are seeking an AI Research Engineer with expertise in developing and optimizing multimodal AI models. The role will be central to advancing our platform’s capabilities in inference for Generative AI, working on stateoftheart models that integrate multiple data modalities (e.g., text, vision, and audio) for a broad range of applications.

This is an exciting opportunity to work at the intersection of advanced machine learning, inmemory computing, and highperformance AI inference on cuttingedge hardware architectures.

Responsibilities:

  • Model Development: Design, develop, and optimize multimodal AI models for realtime, highefficiency inference across a variety of deployment environments (edge, serverside, and embodied AI).

  • Collaboration: Work closely with crossfunctional teams, including AI researchers, hardware engineers, and software engineers to integrate AI models into the broader platform.

  • Scalability and Optimization: Focus on optimizing models for memory efficiency, lowlatency inference, and high throughput.

  • Innovation: Stay uptodate with the latest research in multimodal AI, proposing and implementing new techniques to push the boundaries of whats possible in generative AI.

  • Deployment & Testing: Implement best practices for model testing, deployment, and continuous improvement to ensure models scale effectively in production environments.

    • Requirements:

      • Experience: Proven experience (for all levels) in developing and deploying multimodal models, including text, image, andor audio data.

      • Technical Skills:

        • Strong background in deep learning frameworks (e.g., TensorFlow, PyTorch, JAX).

        • Proficiency in natural language processing (NLP), computer vision (CV), and speech processing techniques.

        • Experience with model optimization techniques (e.g., quantization, pruning, distillation).

        • Familiarity with distributed computing, inmemory computing platforms, or highperformance computing.

          • Knowledge: A strong understanding of the latest advancements in AIML research, particularly in generative models (e.g. transformers and diffusion models).

          • Collaboration & Communication: Ability to work in a highly collaborative, fastpaced startup environment and communicate complex technical concepts clearly.

            • Preferred Qualifications:

              • PhD or advanced degree in Computer Science, Machine Learning, AI, or related fields.

              • 5+ years of postgraduation relevant work experience.

              • Experience in deploying models on edge devices or inmemory computing systems.

              • Familiarity with model deployment frameworks like TensorRT, ONNX, or similar.

              • A passion for solving realworld challenges with AI in dynamic, highperformance environments.

                • Location

                  This position is based in Italy & we support relocation to Bologna, Florence or Milan for talent based abroad and interested in this role.

                  Why Join Us?

                  • Impact: Work on groundbreaking technology that will power the next wave of AI applications, from edge computing to embodied AI systems.

                  • Culture: Join a diverse, driven team that values innovation, collaboration, and continuous learning.

                  • Growth: As a Series B startup, you’ll have significant growth opportunities, including the chance to shape the direction of the product and AI strategy.

                  • Compensation: Competitive salary, equity options, and benefits package.

                    • At Axelera AI, we wholeheartedly embrace equal opportunity and hold diversity in the highest regard. Our steadfast commitment is to cultivate a warm and inclusive environment that empowers and celebrates every member of our team. We welcome applicants from all backgrounds to join us in shaping the future of AI.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication

AI Operations (AI Ops) Engineer Related jobs