Match score not available

Principal Engineer

extra holidays - extra parental leave
Remote: 
Full Remote
Contract: 
Experience: 
Expert & Leadership (>10 years)
Work from: 

Offer summary

Qualifications:

BS/MS in Computer Science or related field, 15+ years of experience with HPC storage systems, Familiarity with HPC and AI benchmarking tools, Knowledge of Linux internals and administration, Experience with Parallel File Systems, particularly Lustre.

Key responsabilities:

  • Develop strategies to mitigate performance issues
  • Automate performance regressions flagging during releases
  • Collaborate on future development designs in Lustre
  • Assist with performance tuning for specific environments
  • Create a diagnostic playbook for common issues
DDN Storage  logo
DDN Storage Information Technology & Services Scaleup
501 - 1000 Employees
See more DDN Storage offers

Job description

Overview:

DDN Storage is seeking great candidates to join our dynamic team of passionate customer-enabling technologists!

 

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DDN Storage is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing.

 

"DDN's A3I solutions are transforming the landscape of AI infrastructure." – IDC

 

“The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments” - ~ Marc Hamilton VP, Solutions Architecture & Engineering | NVIDIA

 

DDN Storage is the global leader in AI and multi-cloud data management at scale. Our cutting-edge data intelligence platform is designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN Storage empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence. 

  

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management. 

  

Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage. 

Job Description:

We are looking for an Principal Engineer for our team, which focuses on creating storage solutions for the most data-intensive workloads. The ideal candidate will have extensive experience working with scale deployments of Lustre and be familiar in how to isolate and identify performance/stability issues. This role within the core Lustre development team will mean working closely with the most senior Lustre developers and interacting with technical contacts at the largest AI/HPC deployments in the world.


Responsibilities for this role include but are not limited to:

  • Developing strategies to identify and mitigate situations where stability/performance is below expectations for key accounts
  • Providing input into automating the flagging of performance regressions during the release cycle.
  • Work with the Engineering manager and a geographically distributed team to provide input into designs for future developments in Lustre.
  • Assist with performance tuning of features for specific environments and use-cases.
  • Creating a diagnostic playbook to help the DDN field/support deal more effectively with common issues.

 

Qualifications:

  • BS/MS in Computer Science, Computer Engineering or equivalent degree/experience.
  • 15+ years of experience working with enterprise-class or HPC storage systems and/or distributed systems.
  • Familiarity with common HPC and AI benchmarking tools essential.
  • Strong team player with good communication skills and should be self-starter.
  • Familiarity with networking technologies, including TCP, Infiniband, RDMA preferred.
  • Good grasp of multi-threaded programming and parallel computing an asset.
  • Knowledge of Linux internals and administration is desired
  • Excellent time management skills, with the ability to prioritize, multitask, and work under deadlines in a fast-paced environment.
  • Knowledge of Parallel File Systems, in particular Lustre, is highly preferred.
  • Experience with Git, JIRA, Jenkins, Gerrit, and Github useful.
DDN:

Our team is highly motivated and focused on engineering excellence.

We look for individuals who appreciate challenging themselves and thrive on curiosity.

Engineers are encouraged to work across multiple areas of the company.

We operate with a flat organizational structure.

All employees are expected to be hands-on and to contribute directly to the company’s mission.

Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills.

They should be able to concisely and accurately share knowledge with their teammates.

 

Interview Process: After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 30-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:

  • Coding assessment in a language of your choice.
  • Systems design: Translate high-level requirements into a scalable, fault-tolerant service.
  • Systems hands-on: Demonstrate practical skills in a live problem-solving session.
  • Project deep-dive: Present your past exceptional work to a small audience.
  • Meet and greet with the wider team.
  • Our goal is to finish the main process within one week.
  • We don’t rely on recruiters for assessments.
  • Every application is reviewed by a member of our technical team.

 

DataDirect Networks, Inc. is an Equal Opportunity/Affirmative Action employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity, gender expression, transgender, sex stereotyping, sexual orientation, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.

 

#LI-Remote

Required profile

Experience

Level of experience: Expert & Leadership (>10 years)
Industry :
Information Technology & Services
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Multitasking
  • Teamwork
  • Problem Solving
  • Time Management
  • Prioritization
  • Verbal Communication Skills

Field Engineer (Solutions) Related jobs