Staff Software Engineer

Unlimited holidays
Remote: Full Remote
Contract:
Salary: $143K - $250K yearly
Experience: Senior (5-10 years)
Work from: California (USA), United States

Offer summary

Qualifications:

  • Proven experience with NVIDIA Triton Inference Server
  • Strong understanding of LLM techniques
  • Familiarity with Retrieval-Augmented Generation frameworks
  • Proficient in Python and ML frameworks (e.g., PyTorch, TensorFlow)
  • Strong problem-solving skills

Key responsibilities:

  • Develop and maintain APIs using NVIDIA Triton for LLMs
  • Implement and optimize processing pipelines for LLMs
  • Work with RAG frameworks for model enhancement
  • Collaborate to deploy ML solutions into production
  • Troubleshoot issues related to model performance and scalability
ServiceNow Information Technology & Services Large https://www.servicenow.com/
10001 Employees
HQ: Santa Clara

Job description

Company Description

It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone.

Job Description

Key Responsibilities:

  • Develop and maintain APIs using NVIDIA Triton Inference Server for scalable deployment of Large Language Models (LLMs).
  • Implement and optimize pre-processing and post-processing pipelines tailored for LLMs to improve accuracy and efficiency.
  • Work with Retrieval-Augmented Generation (RAG) frameworks to enhance the model’s response generation capabilities.
  • Collaborate with data scientists, software engineers, and product teams to integrate and deploy ML solutions into production.
  • Troubleshoot and resolve issues related to model inference, performance, and scalability.

Qualifications

  • Proven experience with NVIDIA Triton Inference Server and its APIs.
  • Strong understanding of pre-processing and post-processing techniques for LLMs.
  • Familiarity with Retrieval-Augmented Generation (RAG) and other LLM mechanisms.
  • Proficient in Python and relevant ML frameworks (e.g., PyTorch, TensorFlow).
  • Strong problem-solving skills and ability to work in a fast-paced environment.

 

 

For positions in California (outside of the Bay Area), we offer a base pay of $142,700 - $249,800, plus equity (when applicable), variable/incentive compensation and benefits. Sales positions generally offer a competitive On Target Earnings (OTE) incentive compensation structure. Please note that the base pay shown is a guideline, and individual total compensation will vary based on factors such as qualifications, skill level, competencies and work location. We also offer health plans, including flexible spending accounts, a 401(k) Plan with company match, ESPP, matching donations, a flexible time away plan and family leave programs (subject to eligibility requirements). Compensation is based on the geographic location in which the role is located, and is subject to change based on work location. For individuals who will be working in the Bay Area, there is a pay enhancement for positions located in that geographical area; please contact your recruiter for additional information.

Not sure if you meet every qualification? We still encourage you to apply! We value inclusivity, welcoming candidates from diverse backgrounds, including non-traditional paths. Unique experiences enrich our team, and the willingness to dream big makes you an exceptional candidate!

Additional Information

Work Personas 

We lead with flexibility and trust in our distributed world of work. Work personas (flexible, remote, or required in office) are categories that are assigned to ServiceNow employees depending on the nature of their work.

Equal Opportunity Employer 

ServiceNow is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status, or any other category protected by law. In addition, all qualified applicants with arrest or conviction records will be considered for employment in accordance with legal requirements. 

Accommodations 

We strive to create an accessible and inclusive experience for all candidates. If you require a reasonable accommodation to complete any part of the application process, or are unable to use this online application and need an alternative method to apply, please contact [email protected] for assistance. 

Export Control Regulations 

For positions requiring access to controlled technology subject to export control regulations, including the U.S. Export Administration Regulations (EAR), ServiceNow may be required to obtain export control approval from government authorities for certain individuals. All employment is contingent upon ServiceNow obtaining any export license or other approval that may be required by relevant export control authorities. 

From Fortune. ©2024 Fortune Media IP Limited. All rights reserved. Used under license. 

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry: Information Technology & Services
Spoken language(s): English

Other Skills

  • Adaptability
  • Problem Solving
  • Collaboration
