Match score not available

Staff Reliability Engineer

Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

8+ years in site reliability engineering, Experience with cloud platforms (AWS, GCP, Azure), Familiarity with Kubernetes, Docker, Terraform, Strong background in workflow automation, Proficient with monitoring tools like Prometheus.

Key responsabilities:

  • Enhance platform reliability and availability
  • Monitor and optimize platform performance
  • Implement monitoring and alerting systems
  • Collaborate across product and development teams
  • Mentor junior engineers in reliability practices
Blinq logo
Blinq TPE https://blinq.me/
11 - 50 Employees
See more Blinq offers

Job description

WHAT IS BLINQ?

🤝 The first interaction two people have is the bedrock of all strong business relationships. If you can make that experience special, you can start to build a great second interaction, and so on. Blinq is the tool to help people do that. We're building a platform that allows you to share a snapshot of who you are with anyone, anywhere via digital business cards, dynamic email signatures and virtual backgrounds. Join us on our mission to help the world connect.

(We will get to the fun perks part at the bottom, keep going!)


Let's dive into what makes Blinq an extraordinary product:

🚀 We're on an incredible growth trajectory, doubling our ARR every few months. Get ready to soar to new heights as we make waves in the industry!
😃 Our app is trusted and loved by employees at renowned companies like Patreon, Tesla, Uber, and Google. Rub shoulders with industry leaders and be part of the Blinq revolution.
🙌 Backed by Australia's top venture capitalists, Blackbird and Square Peg, we've brought together their investment prowess since Canva's seed round. It's a testament to our potential and the caliber of our vision.
❤️ With over 100K reviews and a stellar 4.9/5 star rating on the App Store, we've become one of the top 65 Business apps. Join a team that's making waves and be a part of our success story!

Role Overview: 

As a Staff Reliability Engineer, you will be at the forefront of maintaining the reliability and performance of our digital business card platform. You will play a key role in designing systems that ensure high availability, optimizing our infrastructure to handle increasing demand, and automating processes that keep our services resilient. Your expertise will be vital in ensuring that our users can exchange digital business cards and manage their profiles without interruption. You’ll also work closely with product and development teams to integrate reliability as a core aspect of our platform's growth.

This is a hybrid role working from our Melbourne or Sydney locations. 


What You Will Own:
  • Ensure platform reliability: Lead efforts to enhance the reliability and availability of our digital business card platform, ensuring users have a seamless experience when sharing and managing their information.
  • Monitor and optimize performance: Continuously improve platform performance, making sure that it scales efficiently and remains responsive as our user base grows.
  • Incident detection and response: Implement robust monitoring and alerting systems to detect and resolve issues swiftly, minimizing downtime for users during critical networking moments.
  • Collaborate with cross-functional teams: Work with product, development, and operations teams to integrate reliability engineering into the product lifecycle, ensuring that reliability is considered from design through deployment.
  • Automation and scaling: Automate manual processes and optimize system scalability, reducing human intervention and ensuring the platform remains stable under increased user demand.
  • Leadership and mentoring: Mentor junior engineers in reliability best practices, fostering a culture of reliability across engineering teams.
  • Post-incident analysis: Perform root cause analysis for incidents and outages, driving initiatives to prevent future occurrences and improve system resiliency.

  • What We Look For In You:
  • 8+ years experience in site reliability engineering within SaaS or digital products.
  • Experience with cloud platforms (AWS, GCP, Azure), Kubernetes, Docker, Terraform, and infrastructure-as-code.
  • Strong expertise in automating workflows with Typescript, Node or similar programming languages to improve efficiency and system resilience.
  • Experience with monitoring tools (e.g., Prometheus, Grafana, Datadog) to implement effective observability and alerting systems.
  • Demonstrated ability to lead incident response processes, manage critical outages, and implement long-term improvements.
  • Excellent communication skills and a collaborative mindset for working with cross-functional teams.
  • Now, let's talk about our inspiring work environment:

    🇦🇺 Based in Melbourne, Australia, our vibrant team of 40 (and growing rapidly) is making waves in the industry.
    🍺 Fun-fact: Our office overlooks the oldest building in Australia, an enchanting old Irish pub. Experience history and innovation coming together!
    🎲 We believe in fostering a healthy work-life balance, board games, and top-notch stand-up desk workstations. It's all about creativity and collaboration.
    🏡 Autonomy is our guiding principle, which is why we embrace hybrid work. Come in when you need to, or work at your optimal hours—whether that's burning the midnight oil or rising with the sun.

    And here's what we offer:
    😎As an early member of the Blinq family, you'll enjoy a one-of-a-kind chance to influence the company's direction in a dynamic, self-managed, and results-driven startup environment. Say goodbye to corporate nonsense and micro-management because here, you are your own boss. We believe in empowering our team members to unleash their full potential.
    💸Equity in the business and a competitive salary: We value our team members and want to ensure they share in our success.
    ✨But here's the real magic: at Blinq, we're not just creating innovative solutions – we're creating a culture that thrives on transparency, autonomy, collaboration, and big ideas. We believe in celebrating individuality and encouraging out-of-the-box thinking. With us, you'll be inspired to push boundaries, drive innovation, and ultimately leave a lasting impact on our users on both B2C and B2B realms

    🚨 If you do not check all the boxes above, that is okay - we enthusiastically encourage you to apply!
    We welcome individuals at all experience levels and take pride in being an equal opportunity employer committed to creating an inclusive and diverse workforce. Join us on this remarkable journey as we reshape the way people connect and network

    #LI-RM1

    Required profile

    Experience

    Level of experience: Senior (5-10 years)
    Spoken language(s):
    English
    Check out the description to know which languages are mandatory.

    Other Skills

    • Collaboration
    • Leadership
    • Mentorship
    • Verbal Communication Skills

    Site Reliability Engineer (SRE) Related jobs