DOXA Talent is redefining offshoring with a commitment to ethical practices and exceptional employee experiences. We help small and medium-sized business owners save up to 70% in payroll costs while providing their offshore and nearshore teams with a fully remote work environment. Our employees, whom we call VIPs, receive comprehensive benefits, including healthcare, vacation, and retirement plans, ensuring their well-being and professional growth. At DOXA Talent, we believe ethical outsourcing is essential. Our Conscious Offshoring model prioritizes our team’s needs and development. Here’s how we stand out: - Conscious Employer: We directly employ our team members, ensuring they receive full benefits and a flexible work-from-home framework. - Economic Alignment: Our model has no hidden costs, with a Build-Operate-Transfer (BOT) basis and a flexible 30-day termination policy. - Sustainable Practices: We operate without physical offices, promoting a fully digital work environment and environmental sustainability. - Bespoke Solutions: We deliver custom-fit solutions to ensure the right talent is placed in the right role, aligned with culture and fit. - Client Training: As remote work experts, we guide clients through the transition to remote operations. - Data & Security: We take the security of your data very seriously. We are proud to announce that DOXA Talent has been awarded the prestigious “Great Place to Work” certification. This award is based entirely on what our employees say about their experiences working here. This year, an impressive 96% of our employees affirmed that DOXA Talent is a great place to work, compared to just 57% at the average U.S. company. At DOXA, we’re not just about outsourcing; we’re about conscious offshoring that benefits businesses, employees, and the planet. Join us in creating a sustainable, ethical, and efficient future for work.

Job description

Role Summary

Our client is looking for a Site Reliability Engineer to join the client’s rapidly growing company in support of multiple SaaS applications. You will be responsible for cloud infrastructure, availability, reliability, performance, and security of production applications and systems.

SCHEDULE: 9:00 AM – 6:00 PM Pacific Daylight Time (12:00 AM – 9:00 AM Philippine Standard Time), follows Philippine holidays

POSITION TYPE: Full Time

WORK ARRANGEMENT: Remote

Essential Functions

Create, deploy, and maintain production infrastructure within the AWS accounts, using IAC/Terraform
Utilize various AWS services, including EC2, EKS, RDS, RedShift, S3, and IAM
Create, implement, and maintain automated application releases using Bitbucket Pipelines
Create, implement, and maintain application and infrastructure performance monitoring using Datadog or Prometheus/Loki/Grafana
Create, implement, and maintain application and infrastructure availability monitoring using Datadog or Prometheus/Loki/Grafana
Apply security practices and policies to identify and remediate security vulnerabilities
Oversee incident response procedures, including analysis and documentation of incidents to prevent future occurrences

Qualifications

A 4-year college degree (technical or quantitative science) is preferred or equivalent work experience with evidence of proficiency and achievement in virtual infrastructure management
3+ years experience in cloud computing and Infrastructure as Code (IaC) (e.g., Terraform, etc.) or related field
Experience with cloud-native tooling (Helm Charts, ArgoCD, HashiCorp Vault, Harbor, Reloader, Grafana, Prometheus, and Loki) is a plus
Experience with cloud native analytics tools (ElasticSearch, MongoDB, RedShift/SnowFlake, and Looker)
Any AWS certification is a big plus
Proficient in Linux system administration and security
Proficient with containerization technologies, especially Kubernetes
Proficient with code versioning tools (e.g., Git, Bitbucket, etc.)
Proficient with CI/CD tools (e.g., Bitbucket Pipelines, etc.)
Proficient in scripting languages such as Bash and Python
Exposure to Open Telemetry and Distributed Tracing
Awareness of recent industry trends related to observability and monitoring
Strong troubleshooting and problem-solving skills, with the ability to quickly diagnose and resolve complex issues
Excellent oral and written communication skills

Required profile