Senior Developer - Data Center Server Management

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

5+ years of experience in server firmware and platform software development., Bachelor's, Master's, or PhD in Electrical Engineering, Computer Science, or a related field., Strong proficiency in C/C++ and Python, with excellent programming and debugging skills., Knowledge of server architecture, manageability, and hardware management interfaces..

Key responsibilities:

  • Develop and optimize server management software for GPU and Grace solutions.
  • Implement firmware and software components based on performance requirements.
  • Collaborate with data center architects and cross-functional teams to ensure timely implementation.
  • Support cluster validation and resolve technical issues efficiently.

NVIDIA logo
NVIDIA XLarge http://www.nvidia.com
10001 Employees
See all jobs

Job description

NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the most thoughtful people in the world.NVIDIA GH200 superchip provides performance and productivity required for strong scaling for HPC and generative AI workload. Scale out is inherent to design of this massive superchip. We are looking for skilled software engineers to help implement firmware and software components for next generation AI supercomputing platforms.We are looking for a strong senior developer to implement manageability components for these products in data centers. You will collaborate with various teams, understand customer requirements, and develop robust solutions to drive our products to market.

What you'll be doing:

  • Develop and optimize server management software for GPU and Grace solutions in large clusters

  • Implement firmware and software components based on performance requirements and architecture specifications

  • Collaborate with data center architects to understand requirements and ensure timely implementation

  • Work with cross-functional teams to align implementation with design requirements

  • Optimize firmware components for reliability in data center environments

  • Support cluster validation and resolve technical issues efficiently

  • Contribute to quality, reliability and telemetry performance of firmware delivered to data centers

What we need to see:

  • 5+ years of relevant experience working on server firmware (BMC) and platform software development with BS, MS, or PhD in EE/CS or related field

  • Experience with data center health management implementation

  • Track record of delivering server firmware components

  • Knowledge of server architecture and manageability in data centers

  • Understanding of hardware management interfaces (USB, SMBus/I2C, PCIe) and familiarity with modern management protocols including Redfish, MCTP, and PLDM

  • Strong proficiency in C/C++ and Python

  • Strong programming and debugging skills for server platforms

  • Experience with SCM (e.g. Git, Perforce) and project management tools like Jira

  • Excellent written and oral communication skills, good work ethics, team-oriented mentality, and dedication to quality work

  • Self-starter who can solve sophisticated technical problems with effective coding solutions

Ways to stand out from the crowd:

  • Familiarity with x86 or ARM system architecture

  • Experience collaborating effectively within large engineering teams

  • Background with performance optimization in firmware components

  • Experience with RTOS and bare metal programming

  • Linux kernel and user space development experience

NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology. We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Teamwork
  • Strong Work Ethic
  • Communication
  • Problem Solving

Related jobs