Site Reliability Engineer

Work set-up: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Strong understanding of system reliability and infrastructure design., Experience with scripting and automation using Python, Bash, or Go., Knowledge of containerization with Docker and orchestration with Kubernetes., Familiarity with cloud platforms such as AWS, Azure, or Google Cloud..

Key responsibilities:

  • Collaborate across development and operations teams to ensure system performance and uptime.
  • Design and implement automated monitoring and alerting systems.
  • Troubleshoot incidents to minimize service disruptions.
  • Conduct capacity planning and performance analysis.

Launchpad Technologies Inc. logo
Launchpad Technologies Inc. Scaleup https://www.golaunchpad.io/
51 - 200 Employees
See all jobs

Job description

Are you passionate about building resilient, scalable infrastructure and ensuring system reliability? Join our Talent Community and stay on our radar for future opportunities with top-tier global clients.

⚠️ This is not an active opening. By applying, you'll be considered when a suitable role becomes available.


🔍 Are you skilled in…?

• Collaborating across Dev and Ops to ensure system performance and uptime
• Designing and implementing automated monitoring and alerting systems
• Troubleshooting incidents and minimizing service disruptions
• Using Infrastructure as Code for configuration and deployments
• Conducting capacity planning and performance analysis
• Written and verbal English communication
• Problem-solving and teamwork

 

💡 Do you have experience with…?

• Scripting and automation using Python, Bash, or Go
• Containerization with Docker and orchestration with Kubernetes
• Working with cloud platforms such as AWS, Azure, or Google Cloud
• Infrastructure automation and CI/CD pipelines
• Monitoring and logging tools like Prometheus, Grafana, or the ELK stack

 

➕ Bonus points for:

• Understanding of networking principles and security best practices
• Hands-on experience with incident response and post-mortem analysis
• Certifications like AWS Certified DevOps Engineer or Kubernetes Administrator
• Experience designing fault-tolerant and highly available systems

 

Does our work culture resonate with you?

• 100% remote
• People-first culture
• Excellent compensation in US Dollars
• Hardware setup for working from home
• Work with global teams and prominent brands in North America, Europe, and Asia
• Training allowances
• Personal time off (PTO) for vacation, study leave, personal time, etc.
• ...and more!


Then apply now!

We’ll contact you when a matching role opens up!

 

 

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Troubleshooting (Problem Solving)
  • Teamwork
  • Communication
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs