Proficiency in at least one programming language (Java, C#, JavaScript, Python, or Ruby).
Experience configuring and administering Windows or Linux systems.
Strong Azure Cloud experience with cloud architecture design using PaaS and DevOps practices.
Experience with CI/CD, source code repositories, and configuration management tools and best practices.
Requirements:
Design scalable systems and services, connecting distributed components using a broad range of tools.
Apply evidence-based methods to solve real-time service problems for fastest recovery.
Write code to test, load, instrument, and analyze system properties to identify bottlenecks and reliability issues.
Configure and tune systems at scale using configuration management tools (Puppet, Chef, Ansible) and optimize networks, storage, databases, web apps, containers, and messaging systems.
Job description
Azure Site Reliability Engineer - 100% Remote
Location - Sandy Springs GA
Rate - DOE
Duration - 1 year
Start - 2/27/23
Job Description
Required Skills for all SREs
You program at a high level in at least one language such as: Java, C#, JavaScript, Python or Ruby.
You configure and administrate systems using either Windows or Linux.
You design scalable systems and services, connecting distributed systems together using a broad range of skills and tools.
You apply an evidence based approach to solving service problems in real time to provide the fastest path to recovery.
Cloud Skills
Strong Azure Cloud.
Experience with designing cloud implementation architectures and solutions using PaaS, DevOps & Advanced Application coding
Experience with application transformation and modernization & data migrations projects.
Experience with various Continuous Integration and Continuous Delivery (CI/CD), Source Code Repos and configuration management tools, technologies, and best practices
Systems Engineering Skills
You write code to test systems, generate load, instrument, analyze, profile and Client system properties and attributes.
You use configuration management (tools; puppet, chef or ansible) to expertly manage configuration at scale.
You investigate system components discovering and removing performance bottlenecks and sources of unreliability.
You applying the scientific method to system components to identify improvements to the configuration and design to improve reliability, performance and operability.
You select, configure, analyze and tune [Network, Storage, Database, Web Applications, Application Containers, Message Queuing] systems.