Site Reliability Engineer

Work set-up: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Bachelor's degree in Computer Science or related field or equivalent experience., 0–2 years of experience in software development, Linux system administration, or performance engineering., Familiarity with RESTful APIs, source control systems like Git, and containerization tools such as Docker and Kubernetes., Good communication skills in both Spanish and English, with understanding of compliance environments like FedRAMP and HIPAA..

Key responsibilities:

  • Troubleshoot and resolve issues across the system stack.
  • Design and improve logging, monitoring, and alerting systems.
  • Automate manual tasks and develop software to enhance system reliability.
  • Support compliance standards and collaborate with security teams for regulated environments.

Nextiva logo
Nextiva Large https://www.nextiva.com
1001 - 5000 Employees
See all jobs

Job description

Redefine the future of customer experiences. One conversation at a time.

We’re changing the game with a first-of-its-kind, conversation-centric platform that unifies team collaboration and customer experience in one place. Powered by AI, built by amazing humans.

Our culture is forward-thinking, customer-obsessed and built on an unwavering belief that connection fuels business and life; connections to our customers with our signature Amazing Service®, our products and services, and most importantly, each other. Since 2008, 100,000+ companies and 1M+ users rely on Nextiva for customer and team communication.

If you’re ready to collaborate and create with amazing people, let your personality shine and be on the frontlines of helping businesses deliver amazing experiences, you’re in the right place. 

Build Amazing - Deliver Amazing - Live Amazing - Be Amazing

 

We are looking for a Site Reliability Engineer to enhance, support, and troubleshoot our SaaS platform. We’re seeking someone with a wide breadth of knowledge, experience, and interest in a range of technology domains. The ideal candidate thrives as a generalist—comfortable operating between development and systems—with the ability to dive deep when needed.

In this role, you will also provide critical support for compliance-driven environments including healthcare, SLED (state, local, and education), and federal customers. You will contribute to initiatives related to FedRAMP authorization, security hardening, and industry-specific compliance standards such as HIPAA and CJIS.

Key Responsibilities

  • Triage, troubleshoot, and fix production problems in every layer of the stack
  • Design, develop, improve, and tune logging, monitoring, and alerting systems
  • Identify manual tasks, document fixes via runbooks, and drive automation
  • Write software to improve the reliability and recoverability of production systems
  • Perform and automate system administration tasks
  • Participate in on-call rotation supporting production systems
  • Collaborate with compliance and security teams to meet standards for FedRAMP, HIPAA, and other regulatory frameworks
  • Ensure platform reliability and availability for regulated customer environments, including healthcare and government sectors
  • Support infrastructure and deployments aligned with the needs of SLED and federal clients

Qualifications

  • Bachelor's degree in Computer Science or related field, or equivalent work experience
  • Bilingual Spanish and English
  • Experience in or exposure to compliance-focused environments (e.g., FedRAMP, HIPAA, CJIS, SOC 2) is preferred

Competencies

  • 0–2 years of software development experience
  • 0–2 years of Linux system administration experience
  • 0–2 years of performance engineering experience
  • Experience working with RESTful APIs
  • Experience troubleshooting complex systems
  • Experience working with source control systems (e.g., Git)
  • Familiarity with containerization and orchestration (e.g., Docker, Kubernetes)
  • Familiarity with front-end technologies
  • Familiarity with application performance monitoring tools
  • Familiarity with relational databases and SQL
  • Familiarity with microservices and distributed system design
  • Ability to clearly communicate technical concepts
  • Working knowledge of general SRE concepts and DevOps principles
  • Understanding of or experience supporting regulated environments and public sector clients

Nice to have

  • Datadog 
  • Atlassian Suite (Jira, Confluence, BitBucket) 
  • Java/Spring
  • Python
  • Javascript/React
  • SQL
  • Ansible
  • Jenkins
  • Tomcat
  • Git
  • Redis
  • RabbitMQ
  • Splunk/Kibana
  • Terraform

Typical Office Environment: Requires extensive sitting with periodic standing and walking. May be required to lift up to 35 pounds unassisted. May be required to lift over 35 pounds using an assistive device and/or team lift. Requires significant use of a personal computer, phone, and general office equipment. Needs adequate visual acuity, ability to grasp and handle objects. Needs the ability to communicate effectively through reading, writing, and speaking in person or on the telephone.

Nextiva DNA (Core Competencies)

Nextiva’s most successful team members share common traits and behaviors:

  • Drives Results: Action-oriented with a passion for solving problems. They bring clarity and simplicity to ambiguous situations, challenge the status quo, and ask what can be done differently. They lead and drive change, celebrating success to build more success.
  • Critical Thinker: Understands the "why" and identifies key drivers, learning from the past. They are fact-based and data-driven, forward-thinking, and see problems a few steps ahead. They provide options, recommendations, and actions, understanding risks and dependencies.
  • Right Attitude: They are team-oriented, collaborative, competitive, and hate losing. They are resilient, able to bounce back from setbacks, zoom in and out, and get in the trenches to help solve important problems. They cultivate a culture of service, learning, support, and respect, caring for customers and teams.

Total Rewards

Our Total Rewards offerings are designed to allow Nexties to take care of themselves and their families so they can do their best.

Our compensation packages are tailored to each role and candidate's qualifications. We consider a wide range of factors, including skills, experience, training, and certifications, when determining compensation. We aim to offer competitive salaries or wages that reflect the value you bring to our team. Depending on the position, compensation may include base salary, incentives, or bonuses.

  • Health 🍏 - Comprehensive medical coverage, including dental care
  • Insurance 💼 - Life insurance, covering life and disability
  • Work-Life Balance ⚖️ - PTO and Paid Sick time as per CBA, paid parental leave
  • Financial Security 💰 - Private pension plan available
  • Wellness 🤸‍ - Employee Assistance Program and comprehensive wellness initiatives
  • Growth 🌱 - Access to ongoing learning and development opportunities and career advancement

At Nextiva, we're committed to supporting our employees' health, well-being, and professional growth. Join us and build a rewarding career!

#LI-SC1 #LI-REMOTE

Required profile

Experience

Spoken language(s):
EnglishSpanish
Check out the description to know which languages are mandatory.

Other Skills

  • Troubleshooting (Problem Solving)
  • Critical Thinking
  • Collaboration
  • Communication
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs