Associate Principal Engineer, Performance and Site Reliability

Remote: Full Remote

Offer summary

Qualifications:

  • BS, MS, or PhD in Computer Science or equivalent work experience
  • Strong knowledge of Java/.NET, SQL/NoSQL, and event-driven architectures
  • Hands-on experience with performance testing tools like Locust, JMeter, and Gremlin
  • Working knowledge of HTTP load generation and measurement software, APM, and observability tools

Key responsibilities:

  • Design scalable cloud-native architecture for billions of daily transactions
  • Develop performance and reliability test plans, analyze results, and optimize enterprise software
  • Collaborate with teams to set performance goals and milestones, and evaluate system architectures
  • Execute stability, scalability, stress, reliability, and longevity tests, and communicate results and risks
Nagarro Information Technology & Services XLarge https://www.nagarro.com/
10001 Employees

Job description

Company Description

We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (15000+ experts across 26 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!

Job Description

Must-have skills: Performance Tuning (general experience), Performance Testing (general experience), Cloud Architecture (strong), Observability

Job Description: The ISV practice focuses on building new offerings for the diverse market segments in the horizontal tech space, creating new competencies in niche technology areas, and supporting the sales team in making convincing pitches of Nagarro’s capabilities in the ISV market. As a rapidly growing company, we are seeking geeks who want to squeeze out the last bit of performance and maintain every second of uptime for an application. We are looking for a talented and motivated Principal Performance and Reliability Engineer to join our dynamic team and contribute to the success of our cloud-based projects, where performance, scalability, reliability, and resiliency are critical to supporting billions of online transactions every day.

Profile requirements:

  • Strong knowledge of Java/.NET, SQL/NoSQL, and event-driven architectures
  • Hands-on experience with Locust, JMeter, Gremlin and other performance and reliability tools
  • Performance engineering experience with a distinguished track record on technically demanding projects
  • Experience with performance testing, analysis, troubleshooting, and optimizing enterprise software
  • BS, MS, or PhD in Computer Science or equivalent work experience
  • Outstanding attention to detail and communication skills; ability to analyze and interpret technical data, identify and resolve system bottlenecks, identify potential limitations in software architectures, and write clear and concise technical documents; excellent troubleshooting and reasoning skills
  • Understanding of throughput, latency, memory and CPU utilization, concepts of multi-threading, cache, memory management, Java Virtual Machine, JIT
  • Working knowledge of HTTP load generation / measurement software (e.g. LoadRunner, JMeter, TestMaker, Silk Performer, etc.)
  • Working knowledge of APM and observability tools like Splunk, New Relic, Prometheus and Grafana
  • Working knowledge of AJAX, HTTP, REST, JSON
  • Working knowledge of database tuning and administration (e.g. Oracle, DB2, Oracle AWR / Statspack)
  • Working knowledge of operating system design / tuning / tools (e.g. Linux, Solaris, OProfile, VTune)

Roles and responsibilities:

The Performance and Reliability engineering architect must demonstrate the ability to understand novel system architectures, pinpoint problem areas, and develop architectures and succinct test plans across multiple hardware and software tiers.

  • Design modern, scalable cloud-native architecture capable of processing billions of online transactions every day
  • Orchestrate SLO/SLI design workshops and set performance goals with product owners and engineering leaders
  • Work closely with management, product owners, developers, and quality engineers to understand product requirements and design appropriate performance and reliability test plans that verify each feature's performance and scalability requirements
  • Provide accurate scoping, clear strategies, and milestones for performance and reliability tests
  • Communicate status and risks to test completion dates to cross-functional teams, and offer mitigation strategies to ensure on-time delivery of all committed projects
  • Participate in discussions and planning with development teams
  • Design and implement scalability, stability, stress, reliability, availability, and longevity tests using industry-standard tools and custom-developed software written in Java or .NET
  • Review automation implementations and encourage sound design principles
  • Review test plans, results analyses, and capacity planning guides for in-flight releases
  • Work closely with the Production Success Performance Team to provide product and automation expertise for use in patches, research projects, sizing, and capacity planning
  • Coordinate technical presentations and training for capacity planning guides
  • Report weekly trends highlighting product performance and team status against committed milestones and projects

Required profile

Experience

Industry: Information Technology & Services
Spoken language(s): English

Other Skills

  • Troubleshooting (Problem Solving)
  • Detail Oriented
  • Open Mindset
  • Verbal Communication Skills
  • Questioning Skills
