Sr. Site Reliability Engineer II

Work set-up: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Bachelor’s degree in computer science, engineering, or related field., Minimum 4 years of experience in software engineering roles such as SRE, DevOps, or related areas., At least 3 years of experience with SRE/DevOps practices and automation tools., Proficiency in scripting languages like Python, PHP, Perl, Ruby, or Shell, and experience with infrastructure as code tools like Terraform..

Key responsibilities:

  • Develop and maintain system monitoring tools, alerts, and dashboards.
  • Analyze system and application data for performance tuning and fault isolation.
  • Collaborate with development teams to implement reliable, secure, and high-performance features.
  • Mentor junior team members and promote best practices across teams.

Shutterfly logo
Shutterfly Computer Software / SaaS XLarge Unknown
10001 Employees
See all jobs

Job description

Description

At Shutterfly, we make life’s experiences unforgettable. We believe there is extraordinary power in the self-expression. That’s why our family of brands helps customers create products and capture moments that reflect who they uniquely are.

At Shutterfly, we make life’s experiences unforgettable. We believe there is extraordinary power in the self-expression. That’s why our family of brands helps customers create products and capture moments that reflect who they uniquely are.
We are in the process of doing a comprehensive consumer website re-platforming effort, with the SRE team being pivotal in establishing the new shared infrastructure while paving the way to future efficiencies and supportability. This Senior SRE role is ultimately responsible for ensuring the reliability, availability, and performance of our technology and systems directly supporting our end customers and internal customers. They will work closely with the product development and platform engineering teams to build and maintain scalable systems and robust automation that supports the company's business goals.

The ideal candidate will have a history of successfully implementing and using tools like Terraform, Packer, Splunk, SignalFx, and other observability/IAC tools supporting systems with around the clock availability requirements. In addition, the ideal candidate will possess sufficient software skills to properly scrutinize and troubleshoot applications supporting our customers. They should have a strong aptitude for learning new technologies, embracing and driving solutions to challenging projects and problems. This role requires a seasoned engineer with the ability to collaborate across multiple cross-functional teams while exhibiting a rich set of problem-solving skills, along with being self-motivated and have a passion for quality!
 
Responsibilities: 
  • Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
  • Proactively gather and analyze both metric and log data from systems and applications to perform anomaly detection, performance tuning, capacity planning and fault isolation.
  • Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability, security and performance standards.
  • Partner closely with other teams on enterprise standards/best practices. Identify options for problem resolution and initiate corrective actions. Mentor junior members, document and share solutions.
Qualifications: 
  • Minimum 4 years’ experience in any combination of software engineering roles of some type: SRE, DevOps, applications, services, tools/automation, release, etc.
  • Minimum 3 years’ experience with SRE/DevOps practices and automation tooling Experience with observability solutions tools like Splunk, Datadog, SignalFx, etc.
  • Experience deploying, maintaining and supporting software applications/services in the AWS ecosystem Proactive approach to identifying problems and solutions
  • Experience writing code with one or more interpreted languages such as: Python, PHP, Perl, Ruby, Linux Shell
  • Experience with Terraform or Cloud Formation scripting
  • Experience with configuration management tools like Ansible, Chef or Puppet
  •  Experience with standard software development best practices and tools such as code repositories (Git preferred)
  • Experience executing in an agile software development environment
  • Good understanding of pricing/cost models across AWS services, especially compute, storage, and database offerings
  • Must be able to multitask and work well with changing priorities in a fast paced, 24x7 environment Must be highly collaborative and be able to work in a team environment consisting of both technical and business people
  • Excellent communication, problem solving and customer service skills A strong ability to learn and adapt to new technologies
  • Education: Bachelor’s degree in computer science, science, engineering or workforce equivalent technical certifications preferred

Supporting a diverse and inclusive workforce is important to Shutterfly not only because it directly reflects our value of Embracing our Differences, but also because it’s the right thing to do for our business and for our people. We welcome all applicants and evaluate them based on their qualifications, without regard to age, race, creed, color, national origin, ancestry, marital status, affectional or sexual orientation, gender identity or expression, disability, nationality, sex, or other characteristic covered by law. Learn more about our commitment to Diversity, Equity, and Inclusion on our Career Site.

This position will accept applications on an ongoing basis until filled.

The compensation package for this role is based on multiple factors, such as job level, responsibilities, location, and candidate experience. The base pay ranges included below are specific to the locations listed, and may not be applicable to other locations.

California : [$106,000-151,000]

Connecticut and New York: [$106,000-138,250]

Colorado, Illinois, Minnesota and Washington: [$106,000-128,000]

Nevada: [$99,750-138,250]

Maryland and New Jersey: [$114,500-138,250]

Hawaii : [$99,750-120,250]

This position may be eligible for a bonus incentive, health benefits, a 401K program, and other employee perks. More details about our company benefits can be found at https://shutterflyinc.com/benefits/.

This opportunity can be remote, but candidates must reside in a state in which Shutterfly is registered to do business. This includes all US states except District of Columbia, North Dakota, Mississippi, Rhode Island, Vermont, and Wyoming.

This position will accept applications on an ongoing basis until filled.

#SFLYTechnology

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Computer Software / SaaS
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Customer Service
  • Collaboration
  • Communication
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs