Match score not available

Site Reliability Engineer

Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

3+ years in SRE, DevOps or similar role, Proven experience with cloud platforms (AWS, GCP), Proficiency in Ruby on Rails or Python, Solid understanding of relational and NoSQL databases, Familiarity with CI/CD tools and agile methodologies.

Key responsabilities:

  • Design and implement monitoring and alerting systems for system availability
  • Automate operational tasks and infrastructure management
  • Analyze performance and plan for future scaling
  • Collaborate with software engineers for smooth deployments
  • Develop disaster recovery plans for high availability
HONK Technologies logo
HONK Technologies Information Technology & Services Scaleup https://www.honkforhelp.com/
51 - 200 Employees
See more HONK Technologies offers

Job description

HONK is a fast growing technology company disrupting the roadside assistance space. We are a group of out of the box thinkers and doers, driven by an immense passion to challenge the old ways by working together to bring innovative changes that impact the lives of others. We work in a creative environment where everyday is rewarding knowing we’re assisting people in their true moment of need, stuck on the side of roads, helping them get back to conquering their day. 

We are looking for a dynamic fully remote Site Reliability Engineer to join our engineering team and play a critical role in ensuring the stability, reliability, and scalability of our systems and applications. The SRE will collaborate closely with our development teams to build and maintain resilient systems, automate operational tasks, and implement robust monitoring and alerting solutions. The goal of this position is to minimize downtime, optimize performance, and drive improvements in system efficiency.

Responsibilities
  • Design and implement monitoring and alerting systems to ensure 24/7 system availability, including automating detection of outages, latency, and performance bottlenecks.
  • Automate repetitive operational tasks, deployments, and infrastructure management.
  • Analyze system performance and plan for future scaling to handle growth, ensure infrastructure meets performance needs, and adjust resources accordingly.
  • Work closely with software engineers to ensure smooth deployments and identify areas for operational improvement.
  • Assist in maintaining the security of systems by enforcing best practices and compliance with industry standards and regulatory requirements.
  • Develop and maintain disaster recovery plans, and ensure high availability and redundancy across systems.
  • Identify and drive improvements in system architecture, operational practices, and tooling to enhance reliability, security, and scalability.

  • Preferred Experience
  • 3+ years of experience in a SRE, DevOps or similar role.
  • Proven experience with cloud platforms (e.g., AWS, GCP), including container orchestration (we use ECS)
  • Proficiency in Ruby on Rails or Python and RESTful API development.
  • Experience with real-time systems, location-based services, and/or telephony apps (e.g., Twilio) is a plus.
  • A healthy hatred of all things Windows.
  • Solid understanding of relational databases (PostgreSQL) and NoSQL databases (Redis).
  • Familiarity with system scalability, performance management, and agile methodologies.
  • Experience with version control (Git), CI/CD tools, and project management software (Jira, Bitbucket).
  • At HONK, we're a community of diverse and passionate individuals who believe in the power of remote work and the strength of inclusivity. As a remote-first company, we embrace the boundless possibilities of collaboration and flexibility, allowing our team members to thrive from anywhere in the US.

    HONK is proud to be an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Employment decisions at HONK are based on merit, qualifications, and business needs without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, or any other protected characteristic as outlined by law.

    Required profile

    Experience

    Level of experience: Mid-level (2-5 years)
    Industry :
    Information Technology & Services
    Spoken language(s):
    English
    Check out the description to know which languages are mandatory.

    Site Reliability Engineer Related jobs