Senior Software Development Engineer in Test (SDET) Chaos Engineering Specialist (Brazil)

Work set-up: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related field.
  • At least 5 years of experience in software testing and quality assurance, with a focus on chaos engineering.
  • Proficiency in programming languages such as Python, Go, or Rust.
  • Experience with chaos engineering tools, monitoring systems, and cloud platforms.

Key responsibilities:

  • Design and maintain advanced test automation frameworks incorporating chaos engineering.
  • Create and execute chaos experiments to test system resilience.
  • Implement monitoring solutions and establish observability practices.
  • Collaborate with development teams to enhance system resilience and lead post-mortem analyses.

Articul8 AI https://www.articul8.ai
51 - 200 Employees

Job description

Company Overview

At Articul8 AI, we're building the next generation of resilient, scalable software systems that help organizations transform their operations. Our commitment to quality and reliability drives our engineering culture, where we continuously test and improve our systems under real-world conditions.

Why Join Articul8 AI?
  • Make an Impact: Shape the resilience and reliability of AI-driven systems at scale.

  • Build with Modern Tech: Leverage cutting-edge tools and platforms (Multi-cloud, AI-first tooling).

  • Ownership & Growth: Take ownership of chaos engineering initiatives and influence engineering culture across teams.

  • Continuous Learning: Collaborate with top engineers, participate in mentoring, and stay ahead in chaos engineering and SRE practices.

Position Summary

We are seeking a Senior SDET specializing in chaos engineering and monitoring to join our Quality Engineering team. You will design and implement sophisticated test automation frameworks, create and run chaos experiments to validate our systems' resilience against real-world failures, while ensuring comprehensive monitoring capabilities that provide actionable insights during both testing and production scenarios.

Key Responsibilities
  • Design, develop, and maintain advanced test automation frameworks that incorporate chaos engineering principles

  • Create and execute chaos experiments that simulate various failure modes and edge cases in our distributed systems

  • Implement monitoring solutions that effectively track system performance, resilience, and failure recovery

  • Establish observability practices that provide deep insights into system behavior during chaos experiments

  • Collaborate with development teams to build resilience into our applications from the ground up

  • Develop metrics and dashboards to visualize system reliability and the impact of chaos experiments

  • Lead post-mortem analyses to identify system weaknesses discovered through chaos testing

  • Integrate chaos testing into CI/CD pipelines to validate system resilience continuously

  • Mentor engineers through code reviews, technical sessions, and hands-on guidance in test automation, chaos engineering, and monitoring best practices

  • Contribute to the company's overall testing strategy and quality assurance practices

Qualifications

Required
  • Bachelor's degree in Computer Science, Engineering, or related field

  • 5+ years of experience in software testing and quality assurance, with at least 2 years focused on chaos engineering

  • Strong programming skills in languages such as Python, Go, and/or Rust

  • Experience with chaos engineering tools such as Chaos Monkey, Gremlin, or similar frameworks

  • In-depth knowledge of monitoring systems like Prometheus, Grafana, ELK Stack, or similar tools

  • Experience implementing observability practices (metrics, logging, tracing) in distributed systems

  • Familiarity with container orchestration platforms like Kubernetes and related chaos tools

  • Experience with SRE practices and principles

  • Strong understanding of CI/CD pipelines and how to integrate testing workflows

  • Experience with cloud platforms (AWS, GCP, Azure) and their monitoring capabilities

  • Excellent communication skills with the ability to present technical findings to various stakeholders

Preferred
  • Master’s degree in Computer Science, Engineering, or related field

  • Knowledge of statistical analysis for evaluating test results and system performance

  • Experience with distributed systems and microservice architectures

  • Contributions to open-source testing or chaos engineering projects

  • Familiarity with AI/ML systems and their unique testing challenges

  • Relevant certifications in cloud platforms, testing methodologies, or chaos engineering

Ready to shape the future of resilient software systems? Apply now and help drive the reliability of tomorrow’s AI at Articul8 AI!

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English

Other Skills

  • Communication
