Match score not available

Senior Site Reliability Engineer (Metal Team)

unlimited holidays - fully flexible
Remote: 
Full Remote
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Proficiency in Golang for observability development, Experience with Kubernetes and Helm, Expertise in Prometheus and Grafana, Familiarity with log management tools like Splunk.

Key responsabilities:

  • Define observability requirements and develop solutions
  • Collaborate with teams to deploy observability tools
Semrush logo
Semrush Large https://www.semrush.com/
1001 - 5000 Employees
HQ: Boston
See more Semrush offers

Job description

Hi there!

We are Semrush, a global IT company developing our own product – a platform for digital marketers. New stars are born here, so don’t miss your chance.

Our role Site Reliability Engineer for those who want to ensuring the company's IT ecosystem runs smoothly and reliably

Tasks in the role:

  • Collaborate with cross-functional teams to define observability requirements and develop robust solutions
  • Configure and maintain Prometheus and VictoriaMetrics for monitoring and alerting
  • Utilize Grafana to create customized dashboards and visualizations for performance and system health monitoring
  • Implement Grafana Tempo for distributed tracing and enhanced observability
  • Develop and maintain log management and analysis solutions using Splunk
  • Collaborate closely with product teams to ensure seamless deployment of observability tools and practices
  • Configure and maintain Sentry for error tracking and real-time error monitoring
  • Investigate and troubleshoot complex issues related to observability
  • Automate and streamline observability system setup and configuration
  • Stay updated with industry best practices and emerging observability technologies
  • Participate in on-call rotation to address critical incidents and outages of Observability services


Who we are looking for:

  • Proficiency in Golang for custom observability solution development
  • Strong experience working with Kubernetes (K8s) and Helm for container orchestration and deployment
  • Proven expertise in Prometheus and Grafana for monitoring and visualization
  • Familiarity with distributed tracing and tracing instrumentation
  • Experience with Splunk or similar log analysis and management tools
  • Strong understanding of system and application performance metrics and observability
  • Effective team collaboration and communication skills
  • Excellent problem-solving and troubleshooting abilities


Not required, but a plus:

  • Prior experience in a DevOps, SRE, or observability-related role is advantageous
  • You share our common values: Trust, because we prefer to speak up and be our true selves; Sense of Ownership, because it’s not worth wasting time on something you don’t believe in; and enthusiasm for Constant Changes, because we are always looking to make things better


A bit about the team:

You can get to know the team better at one of the interviews, but some brief information about future colleagues will be useful now.

Our primary goal is to design and maintain a robust and scalable observability infrastructure that empowers other teams to meet their monitoring and observability needs, ensuring the company's IT ecosystem runs smoothly and reliably. Additionaly, this team have night shifts (on calls) and have access to some security data

We will try to create all the right conditions for you to work and rest comfortably:

  • This offer stands for the remote work format. Digital nomadism, #wfh – call it what you like ;)
  • Flexible working day start
  • Unlimited PTO
  • Hobby benefit
  • Breakfast, snacks, and coffee at the office
  • Corporate events
  • Training, courses, conferences
  • Gifts for employees


Finally, a little more about our company:

Semrush is a leading online visibility management SaaS platform that enables businesses globally to run search engine optimization, pay-per-click, content, social media and competitive research campaigns and get measurable results from online marketing.

We’ve been developing our product for 16 years and have been awarded G2s Top 100 Software Products, Global and US Search Awards 2021, Great Place to Work Certification, Deloitte Technology Fast 500 and many more. In March 2021 Semrush went public and started trading on the NYSE with the SEMR ticker.

10,000,000+ users in America, Europe, Asia, and Australia have already tried Semrush, and over 1,000 people around the world are working on its development. The Semrush team is constantly growing.

Our new colleague, we are waiting for you!

Semrush is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based upon race, religion, creed, color, national origin, sex, pregnancy, sexual orientation, gender identity, gender expression, age, ancestry, physical or mental disability, or medical condition including medical characteristics, genetic identity, marital status, military service, or any other classification protected by applicable local, state or federal laws. All employment decisions are based on business needs, job requirements, merit, and individual qualifications.

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Communication
  • Problem Solving
  • Troubleshooting (Problem Solving)

Site Reliability Engineer (SRE) Related jobs