Bachelor's degree in Computer Science and Engineering or a related field, or equivalent experience.
5+ years of professional experience as a DevOps Engineer or Site Reliability Engineer.
Experience in managing a small team of DevOps / SRE engineers in an Agile environment.
Strong hands-on AWS experience, preferably including (but not limited to): applying the Well-Architected Framework to managing multiple AWS accounts, managing multiple production EKS clusters, EC2/Auto Scaling, load balancers, Secrets Manager/KMS, and measuring and delivering CIS compliance for a multi-region, multi-account platform.
AWS Certified Solutions Architect (Associate or Professional) and/or AWS Certified Security - Specialty certification is a bonus.
Experience with global or multi-region architecture design is a strong bonus, for example AWS Global Accelerator, multi-region EKS setups, and networking (VPC, subnets, peering, Transit Gateway, Route 53).
Strong knowledge of, or experience in, establishing secure and reliable designs and tooling for AWS, such as: Amazon Inspector and GuardDuty, Well-Architected practices for DNS security, VPC (security groups, NACLs), WAF, firewalls, data access, least privilege, need-to-know, and strong IAM and policy skills. Able to design secure solutions involving networking, encryption, secret management, DNS security, and data privacy while ensuring availability, resilience, and security.
Ability to review resources and recommend security improvements. Awareness of risk as a key driver for availability, reliability, and security.
3-5 years of hands-on experience with containerization (Docker, EKS, Kubernetes): managing multiple EKS workloads, debugging and troubleshooting EKS/Kubernetes, using Helm charts, and applying Kubernetes best practices.
Hands-on experience securing a Kubernetes environment.
Experience in setting up robust CI tooling and flows, preferably using CircleCI (workflows, contexts, orbs).
Experience in using CD (continuous delivery) tools and flows for Kubernetes, preferably Argo CD (Rollouts, rollout strategies, Applications, ApplicationSets, Argo CD RBAC and permissions, managing repositories and repository credentials).
Experience integrating security into the development process and CI/CD pipelines to find and fix security vulnerabilities early in the development lifecycle, including SAST and DAST tooling.
Experience in Infrastructure as Code (IaC), preferably using Terraform for AWS: Terraform Cloud workspaces, state management, Terraform modules (KMS, EKS, VPC, etc.), creation of custom modules, and setup for multiple accounts and environments.
Experience in observability and monitoring using Datadog, Elastic Cloud Observability, Amazon CloudWatch, and AWS CloudTrail, and in integrating third-party threat monitoring tools with AWS.
Experience in using one or more artifact repository managers (JFrog Artifactory, npm repositories, Helm chart repositories, etc.).
Java knowledge and troubleshooting experience in a microservices environment would be a great plus.
Programming experience or knowledge in Node.js, Python, or Go is a plus.
Strong expertise in software development and agile methodologies, with full-stack software development experience as a plus.
Ability to strategize and weigh options, considering pros and cons of new technologies.
Track record of coordinating and communicating effectively with diverse teams and individuals (e.g., solution architects, scrum masters, developers, and clients).