Job Summary
The Site Reliability Engineer (SRE) supports the Electronic Submission of Medical Documentation (eSMD) program by ensuring the reliability, availability, performance, and security of applications and infrastructure. The role focuses on system monitoring, automation, incident response, and operational excellence across cloud and on-premises environments supporting CMS healthcare data exchanges.
Key Responsibilities
- Monitor application and infrastructure health, performance, and availability.
- Implement and maintain observability solutions, dashboards, alerts, and logging.
- Support incident management, root cause analysis, and problem resolution.
- Automate operational tasks and deployment processes using DevOps tools and scripting.
- Collaborate with development, security, and infrastructure teams to improve system reliability and performance.
- Support CI/CD pipelines and release management activities.
- Manage system capacity planning, scalability, and disaster recovery processes.
- Ensure compliance with CMS, HIPAA, FISMA, and federal security requirements.
- Support cloud and containerized environments, including Kubernetes and AWS services.
- Maintain operational documentation, runbooks, and standard operating procedures.
Required Qualifications
- Bachelor's degree in Computer Science, Information Technology, Engineering, or related field.
- 3+ years of experience in Site Reliability Engineering, DevOps, System Administration, or Production Support.
- Experience with Linux/Unix administration and scripting.
- Knowledge of monitoring and logging tools.
- Experience supporting enterprise applications in cloud environments.
Preferred Qualifications
- Experience supporting CMS, Medicare, Medicaid, or federal healthcare programs.
- Experience with AWS cloud services.
- Knowledge of Kubernetes, Docker, and container orchestration.
- Familiarity with eSMD, healthcare interoperability, or healthcare data exchange programs.
- Experience with CI/CD tools such as Jenkins, GitHub Actions, or Azure DevOps.
Key Skills
- Site Reliability Engineering (SRE)
- Application Monitoring & Observability
- Incident Management & Root Cause Analysis
- AWS Cloud Services
- Kubernetes & Docker
- CI/CD & DevOps Automation
- Linux Administration & Scripting
- Performance & Capacity Management
- Security & Compliance
Residency Requirement
Must be eligible to obtain and maintain a U.S. Government Public Trust clearance. Candidates must have resided in the United States for at least three (3) of the last five (5) years to satisfy federal background investigation requirements.
Residency Requirement:
Candidate must be able to obtain Public Trust clearance and must have lived in the United States for at least three (3) out of the last five (5) years.
Salary & Benefits Information:
- The actual salary offer will carefully consider a wide range of factors, including your skills, qualifications, experience, and location.
- C-HIT offers Healthcare Benefits, Remote Working Options, Paid Time Off, PTO cash-out, Training/Certification opportunities, Healthcare Savings Account & Flexible Savings Account, Paid Life Insurance, Short-term & Long-term Disability, 401K Match & Profit sharing, Employee Assistance Program, Paid Holidays, and much more perks and Voluntary benefits!
- Employees of C-HIT shall, as an enduring obligation throughout their term of employment, adhere to all information security requirements as documented in company policies and procedures.
C-HIT, a CMMI Maturity Level 5 company, focuses on delivering information technology and professional services to Federal and State agencies.
"C-HIT is an EOE, including disability and veteransβ