Looking for a Senior DevOps & Infrastructure Engineer to join a fast-growing startup focused on developing innovative casino video slot games. They offer a diverse portfolio of games, including virtual soccer betting and crash games.
As a trusted game provider, they partner with online gaming platforms to deliver high-quality, engaging gaming experiences to players worldwide.
As a DevOps & Infrastructure Engineer you will manage and support our partner's AWS Kubernetes-based infrastructure ensuring system reliability, security, and scalability, while supporting CI/CD pipelines, monitoring and incident response.
This role may evolve into multiple positions based on workload and requirements.
Infrastructure Management
• Maintain and update existing AWS Kubernetes infrastructure;
• Perform version upgrades for Kubernetes, Terraform, and other core infrastructure
components;
• Ensure high availability and reliability of the system through proactive monitoring and
scaling;
• Manage Nginx configurations, including load balancing and sticky sessions;
Monitoring & Observability
• Set up and maintain monitoring for CPU, memory, and disk usage;
• Monitor and maintain logging and metrics using industry-standard tools;
• Define and implement alerting strategies for infrastructure and services;
• Monitor external dependencies such as exchange rates, Auth0, and third-party APIs;
Security & Access Control
• Manage user access across different services, ensuring proper permissions and least
privilege principles;
• Secure infrastructure against threats by implementing best practices (network security,
IAM policies, vulnerability management);
• Rotate API keys and credentials regularly;
• Define and enforce security policies for cloud infrastructure and deployments;
CI/CD & Release Management
• Enhance and maintain CI/CD pipelines to improve deployment efficiency;
• Assist in the release process, monitoring and signing off on production deployments;
• Ensure infrastructure changes are version-controlled and follow best practices;
Incident Management & Response
• Set up and manage incident response processes, including escalation procedures;
• Define SLAs for system availability and response times;
• Conduct root cause analysis (RCA) and implement preventative measures;
Cost Management & Optimization
• Implement cost monitoring and reporting for AWS infrastructure;
• Set up alerts for cost anomalies to prevent unexpected budget overruns;
• Optimize cloud resources for cost efficiency;
Provisioning & Environment Management
• Automate provisioning of environments for platform integrations;
• Maintain consistency across different environments (development, staging, production);
• Implement infrastructure as code (IaC) best practices using Terraform;
Internal Tools Management
• Manage and support internal tools such as Slack, Office 365, Jira, Confluence, and
GitHub;
• Ensure seamless integration of internal tools with the overall infrastructure;
• Maintain security and access control for internal collaboration tools;
Canal & River Trust
IFS
Unisys
Binnies
Veeva Systems