Match score not available

Systems Engineer - Infrastructure

extra holidays - extra parental leave
Remote: 
Full Remote
Contract: 

Offer summary

Key responsabilities:

  • Architect, operate, and debug infrastructure
  • Qualify hardware readiness & monitor systems
  • Troubleshoot performance & scalability issues
  • Automate operational tasks & debug low-latency frameworks
  • Set up monitoring, logging, and alerting systems
DDN Storage  logo
DDN Storage Information Technology & Services Scaleup
501 - 1000 Employees
See more DDN Storage offers

Job description

Overview:

DDN Storage is seeking great candidates to join our dynamic team of passionate customer-enabling technologists!

 

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DDN Storage is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing.

 

"DDN's A3I solutions are transforming the landscape of AI infrastructure." – IDC

 

“The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments” - ~ Marc Hamilton VP, Solutions Architecture & Engineering | NVIDIA

 

DDN Storage is the global leader in AI and multi-cloud data management at scale. Our cutting-edge storage and data management solutions are designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN Storage empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence.

 

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management.

 

Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage.

Job Description:

Key Responsibilities:

  • Architect, operate, and debug high-performance compute, network, and storage infrastructure.
  • Qualify the readiness of new compute hardware and build health monitoring systems.
  • Troubleshoot performance and scalability issues.
  • Automate routine operational tasks.
  • Debug issues in low-latency communication frameworks.
  • Set up monitoring, logging, and alerting systems for comprehensive observability.

 

Tech Stack:

  • Linux
  • Kubernetes
  • Infiniband and RDMA
  • MPI and NCCL

Ideal Candidate Profile:

  • Extensive experience in large-scale systems administration and configuration management.
  • Strong background in systems engineering with an emphasis on low-level system performance in an HPC environment.
  • Proficient in operating and troubleshooting complex software and hardware systems.
  • Skilled in debugging across software, hardware, and network boundaries.
  • Knowledge and practical experience in designing and deploying systems in cloud or on-premises environments.
  • Open to generalists and specialists, including systems programmers or engineers focused on performance, security, or data center operations.

 

Preferred Qualifications:

  • Hands-on experience with high-performance computing infrastructure.
  • Proficiency in Kubernetes for container orchestration.
  • Expertise in setting up and managing low-latency communication frameworks.
  • Strong understanding of modern monitoring and observability practices.
DDN:

Our team is highly motivated and focused on engineering excellence.

We look for individuals who appreciate challenging themselves and thrive on curiosity.

Engineers are encouraged to work across multiple areas of the company.

We operate with a flat organizational structure.

All employees are expected to be hands-on and to contribute directly to the company’s mission.

Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills.

They should be able to concisely and accurately share knowledge with their teammates.

 

Interview Process:

 

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 30-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:

 

  • Coding assessment in a language of your choice.
  • Systems design: Translate high-level requirements into a scalable, fault-tolerant service.
  • Systems hands-on: Demonstrate practical skills in a live problem-solving session.
  • Project deep-dive: Present your past exceptional work to a small audience.
  • Meet and greet with the wider team.
  • Our goal is to finish the main process within one week.
  • We don’t rely on recruiters for assessments.
  • Every application is reviewed by a member of our technical team.

 

DataDirect Networks, Inc. is an Equal Opportunity/Affirmative Action employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity, gender expression, transgender, sex stereotyping, sexual orientation, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.

Required profile

Experience

Industry :
Information Technology & Services
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Verbal Communication Skills
  • Open Mindset
  • Prioritization
  • Troubleshooting (Problem Solving)

Infrastructure Engineer Related jobs