Match score not available

Sr. Director, IT Reliability Automations

extra holidays - extra parental leave
Remote: 
Full Remote
Experience: 
Senior (5-10 years)
Work from: 
Texas (USA), United States

Offer summary

Qualifications:

Bachelor’s degree in Computer Science or related field., Extensive experience with AI and ML., Strong programming skills in Python, Go, or Java., Proven leadership and team management skills..

Key responsabilities:

  • Lead and mentor a team of engineers.
  • Develop AI/ML solutions for operational activities.
  • Oversee automation tools design and maintenance.
  • Monitor system performance and reliability.
RealPage, Inc. logo
RealPage, Inc. Large https://www.realpage.com/
5001 - 10000 Employees
See more RealPage, Inc. offers

Job description

We are looking for an experienced IT Reliability Automation Leader to oversee and enhance the reliability and performance of our IT systems through strategic Artificial Intelligence and Machine learning initiatives. This role involves leading a team of engineers, collaborating with cross-functional teams, and implementing best practices to ensure system resilience and efficiency.

Primary Responsibilities

  • Lead and mentor a team of IT reliability and automation engineers.
  • Develop an AI/ML solution for Operational Center activities
  • Develop and implement strategies for automating repetitive tasks and improving system reliability.
  • Oversee the design, development, and maintenance of automation tools and scripts.
  • Collaborate with development, operations, and product teams to ensure seamless integration and deployment of new systems and features.
  • Monitor system performance and reliability, proactively identifying and addressing potential issues.
  • Establish and enforce best practices for system monitoring, incident response, and disaster recovery.
  • Analyze system failures and develop comprehensive solutions to prevent recurrence.
  • Maintain detailed documentation of system configurations, processes, and procedures.



Required Knowledge/Skills/Abilities

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Extensive experience in solving problems with AI & ML
  • Proven leadership and team management skills.
  • Strong programming skills in languages such as Python, Go, or Java.
  • Experience with automation tools like Ansible, Puppet, or Chef.
  • Familiarity with monitoring tools such as Prometheus, Grafana, or Nagios.
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills.



Preferred Knowledge/Skills/Abilities

  • Experience with containerization technologies like Docker and Kubernetes.
  • Knowledge of cloud platforms such as AWS, Azure, or Google Cloud.
  • Understanding of CI/CD pipelines and tools like Jenkins or GitLab CI.

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Computer Software / SaaS
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Team Management
  • Detail Oriented

Related jobs