Logo for TransUnion

Lead Engineer – Software Defined Storage (Storage & Backup)

Roles & Responsibilities

  • Strong hands-on experience administering and troubleshooting Commvault environments (CommServe, MediaAgents, policies, schedules, retention, reporting) in an enterprise setting
  • Experience with IBM Spectrum Scale (GPFS) / ESS operations, health monitoring, capacity planning, performance tuning, and incident triage
  • Strong Linux administration skills with proficiency in scripting/automation (Bash and at least one higher-level language such as Python)
  • Proven ability to perform root-cause analysis, coordinate across global teams, and document operational standards clearly

Requirements:

  • Administer and optimize Commvault backup, recovery, and DR; monitor operations, troubleshoot failures, drive performance improvements, and implement governance with standard runbooks, KPIs, alerting, and change alignment
  • Operate and maintain GPFS/ESS environments (health monitoring, capacity planning, patching, incident triage) and develop procedures for scaling, upgrades, and resiliency
  • Build automation for repeatable Linux operations (provisioning, validation checks, reporting, backups) and develop scripts/workflows (Bash/Python) integrated with enterprise tooling; produce runbooks and knowledge articles
  • Provide engineering support for block storage platforms (Pure Storage, Dell EMC Unity, Dell PowerStore), including capacity and performance management, incident response, and cross-team coordination; participate in on-call rotations and governance

Job description

TransUnion's Job Applicant Privacy Notice

What We'll Bring:

We are seeking a Lead Engineer to design, operate, and continuously improve enterprise storage and data protection platforms across a hybrid environment. The role will focus on Commvault backup & recovery, IBM Spectrum Scale (GPFS) / ESS, Linux engineering with scripting/automation, and operational support for block storage platforms (Pure Storage, Dell EMC Unity, Dell PowerStore).
This position partners closely with Infrastructure, Cloud, Security, and Application teams to deliver reliable, scalable, secure, and well-governed storage and backup services.

What You'll Bring:

Key Responsibilities

1) Commvault – Backup, Recovery, and DR Engineering

  • Administer and support enterprise Commvault environments (e.g., CommServe, MediaAgents, policies, schedules, retention, reporting).
  • Monitor backup operations, troubleshoot failures, and drive performance/throughput improvements.
  • Engineer and enhance backup/restore strategies to meet RPO/RTO, compliance, and cyber resilience requirements, including routine restore validations and DR test support.
  • Implement operational governance (standard runbooks, KPIs, alerting, incident/problem/change alignment).

2) IBM Spectrum Scale – GPFS / ESS Operations & Engineering

  • Support and maintain GPFS / ESS environments: health monitoring, capacity planning, performance tuning, patching support, and incident triage.
  • Execute and improve operational procedures for scaling, upgrades, and resiliency across Spectrum Scale clusters.
  • Collaborate with vendors/partners as needed to resolve complex issues and reduce operational risk through proactive management.

3) Linux Engineering + Scripting/Automation (Toil Reduction)

  • Build automation for repeatable operations (provisioning, validation checks, reporting, backups, lifecycle tasks).
  • Develop and maintain scripts and workflows (e.g., Bash/Python) and integrate with enterprise tooling for observability and reliability.
  • Produce high-quality documentation, runbooks, and knowledge articles to enable consistent operations and cross-team execution.

4) Block Storage Operations (Recommended Skill Area) – Pure / Unity / PowerStore

  • Provide engineering support and operational oversight for block storage platforms including Pure Storage, Dell EMC Unity, and Dell PowerStore (capacity, performance, incident response, lifecycle coordination).
  • Support storage connectivity concepts (e.g., multipathing, iSCSI/FC fundamentals) and coordinate with network/compute teams for end-to-end issue resolution.

5) Service Delivery, Reliability, and Collaboration

  • Participate in on-call rotations as required and drive improvements that reduce repeat incidents.
  • Partner with application/platform teams to onboard workloads, define protection policies, and ensure recoverability.
  • Ensure changes follow governance and operational readiness standards (validation plans, rollback, post-change verification).

Impact You'll Make:

Required Qualifications

  • Strong hands-on experience with Commvault administration and troubleshooting in an enterprise environment.
  • Working experience with IBM Spectrum Scale (GPFS) / ESS operations and support.
  • Strong Linux administration skills and proficiency in scripting/automation (e.g., Bash and at least one higher-level scripting language).
  • Proven ability to troubleshoot complex infrastructure issues, perform root-cause analysis, and implement durable fixes.
  • Strong communication skills with ability to coordinate across global teams and document operational standards clearly.

Preferred / Nice-to-Have

  • Experience supporting block storage platforms: Pure Storage, Dell EMC Unity, Dell PowerStore. (For Costa Rica, this skill is recommended vs. mandatory.)
  • Experience with ITSM processes (incident/problem/change) and operational governance for infrastructure platforms.
  • Exposure to hybrid/cloud-integrated backup or storage designs.

Working Model / On-Call

  • Participate in an on-call rotation and support planned maintenance windows as needed.

#LI-PV1

This is a remote position which may require occasional in-person attendance at work-related events at the discretion of management.

TransUnion Job Title

Lead Engineer, Infrastructure Engineering and Provisioning

Related jobs

Other jobs at TransUnion

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.