Match score not available

Site Reliability Consultant

Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

Solid understanding of microservices architecture, Experience with major cloud providers like Google, AWS, or Azure, Proficient in scripting and automation using Bash, Python, or Ruby, Systems hardware and network troubleshooting skills, Knowledge of Elasticsearch architecture and DevOps tools.

Key responsabilities:

  • Operate, maintain, and administer technology solutions
  • Provide Root Cause Analysis for incidents
  • Participate in on-call rotation for customer escalations
  • Mentor teammates and improve technical excellence
  • Collaborate to enhance infrastructure resilience
Pythian logo
Pythian Information Technology & Services SME https://www.pythian.com/
201 - 500 Employees
See more Pythian offers

Job description

Site Reliability Consultant
Remote | India | #LI-remote

Why you?

Do you thrive on solving tough problems—even under pressure? Are you motivated by fast-paced environments with continuous learning opportunities? Do you enjoy collaborating with a team of peers who push you to constantly up your game? At Pythian, we are building a next-generation Site Reliability Engineering team. We need motivated and talented individuals on our teams, and we want you! You’ll act as a technology leader, advisor for our clients, and mentor for other team members.  Projects would include infrastructure architecture, automation, and intelligent monitoring systems from design through implementation. If you Love Your Data and want to Love Your Career, this could be the job for you!

What will you be doing?
  • Operate, maintain and administer open technology solutions that contribute to the operational efficiency, availability and visibility of customer infrastructure.
  • Plan maintenance activity, design documentation and standard procedures.
  • Provide Root Cause Analysis reports for outages/incidents (ITIL - Problem Management)Participate in on-call rotation for customer escalations.
  • Observe and provide feedback on the current state of the customer’s infrastructure, and identify opportunities to improve resiliency, reduce the occurrence of incidents and automate repetitive administrative and operational tasks.
  • Contribute to, improve and maintain team documentation about customer systems and infrastructure, procedures, policies and schedules.
  • Gather and document information about customer environments through audit activities, and analyze the information to identify opportunities for improvement and application of best practices.
  • Work collaboratively with team mates to contribute to the continuous improvement of our working culture.
  • Act as a technology leader for customers, as well as drive customer discussions on technology road maps.
  • Mentor and cross train teammates on technologies and processes, to improve the technical excellence of the team.


  • What do we need from you?
  • Solid understanding of microservices architecture and container technologies (Kubernetes is a must, Docker, lxc etc)
  • Experience working with at least one major cloud provider. Preferably Google but AWS or Azure would suffice(including infrastructure as code deployment with Cloud Formation, Terraform, Opsworks etc). Certifications on any of the cloud providers programs are a plus.
  • Clear understanding of software development lifecycles and best practices from an infrastructure point of view
  • Understanding the end to end operations of a ‘Business System’ vs components.
  • Comprehensive systems hardware and network troubleshooting experience
  • Common Linux distribution platform installation, configuration and performance tuning
  • TCP/IP networking, NIC bonding and OS network services configuration (DNS, NTP, DHCP, SMTP, etc.)
  • Operation and administration of virtual infrastructure, including experience with at least one hypervisor (VMware, Hyper-V, KVM, etc.) is a plus.
  • Ability to describe IaaS, PaaS, SaaS, pros and cons of each, use cases for virtualization and cloud
  • Administration of web servers and supporting technologies, including network load balancers
  • Scripting and automation of administrative tasks using bash, python, ruby, go etc.
  • Experience with the design, development and deployment of at least one major configuration management framework (i.e. Puppet, Ansible, Chef, Salt)
  • System and application error investigation, troubleshooting of access/availability issues including deep multi-system root cause analysis
  • Solid understanding of DevOps tools, processes, and culture
  • Knowledge of Elasticsearch technology including architecting, deploying, monitoring and troubleshooting issues is a big plus.
  • Experience or at the very least strong interest in working with AI and using different tools that AI provides in order to optimize and speed up tasks for our customers.
  • Ability to pick up new technologies quickly
  • Ability to provide accurate work scheduling and task estimations for work delivery

  • What do you get in return?
  • Competitive total rewards package
  • Flexible work environment: Work from home! Why commute? Work remotely from your home, there’s no daily travel requirement to the office!
  • Outstanding people: Collaborate with the industry’s top minds.
  • Substantial training allowance: Hone your skills or learn new ones; participate in professional development days, attend conferences, become certified, whatever you like!
  • Amazing time off: Start with a minimum 3 weeks paid time off, 7 sick days, and 2 professional development days!
  • Office Allowance: Device of your choosing for day one and options to personalize your work environment!
  • Fun, fun, fun: Blog during work hours; take a day off and volunteer for your favorite charity.
  • Why Pythian?

    Pythian excels at helping businesses use their data and cloud to transform how they compete and win in this ever-changing environment by delivering advanced on-prem, hybrid, cloud and multi-cloud solutions to solve the toughest data challenges faster and better than anyone else. Founded and headquartered in Ottawa, Canada in 1997, Pythian now has more than 300 employees located around the globe with over 350 clients spanning industries from SaaS, media, and gaming to financial services, e-commerce and more. Pythian is known for its technology-enabled data expertise covering everything from ETL to ML. We pride ourselves on our ability to deliver innovative solutions that meet the specific data goals of each client and have built meaningful partnerships with major cloud vendors AWS, Google and Microsoft. The powerful combination of our extensive expertise in data and cloud and our ability to keep on top of the latest leading edge technologies make us the perfect partner to help mid and large-sized businesses transform to stay ahead in today’s rapidly changing digital economy.


    Intrigued to see what a job is like at Pythian? Check us out @Pythian and #pythianlife. Follow @PythianJobs on Twitter and @loveyourdata on Instagram!

    Required profile

    Experience

    Level of experience: Mid-level (2-5 years)
    Industry :
    Information Technology & Services
    Spoken language(s):
    English
    Check out the description to know which languages are mandatory.

    Other Skills

    • Collaboration
    • Mentorship
    • Task Planning
    • Problem Solving

    Site Reliability Engineer (SRE) Related jobs