Logo for Provenir

Cloud Operations Engineer

Roles & Responsibilities

  • 6-10 years of industry experience in cloud operations or related fields
  • Strong Linux skills for production environments, including patch management, performance tuning, JVMs, and system monitoring
  • Experience with SaaS hosting on AWS and incident/problem management following ITIL practices
  • Proficiency with infrastructure automation and monitoring tools (Git, Jenkins, Terraform, scripting) and containerization/Kubernetes familiarity

Requirements:

  • Product issue resolution and incident management to ensure high availability and performance of hosting infrastructure
  • Collaborate with cross-functional teams and clients, providing guidance on hosting and infrastructure requirements and participating in ITIL-aligned incident, change, and problem management
  • Infrastructure setup and automation for multi-vendor cloud hosting using Git, Jenkins, Terraform, and scripting
  • Cloud optimization and observability using tools like DataDog, NewRelic, Splunk, Grafana, and Prometheus to monitor, diagnose, and improve performance

Job description

Who We Are

Provenir is a global fintech company with offices across North America, the UK, and Singapore backed by talented teams across APAC, EMEA, and LATAM. Provenir helps fintechs, financial institutions, and payment providers make smarter decisions, faster. We are passionate about technology and empowering businesses to become industry leaders. As a leading provider of decisioning and analytics products for financial services and other industries, we empower businesses to create digital-first decisioning solutions that drive business growth. If you’d like to work at an innovative fintech with a global footprint that is redefining the industry, then we want you!

What You'll Do

As a Cloud Operations Engineer, you will play a crucial role in managing and supporting the infrastructure necessary for hosting our products on the AWS cloud. Your expertise in resolving product-related issues, coupled with your ability to automate tasks and optimize performance, will ensure our hosting operations are seamless and efficient. This is 24/7 rotational shift role.

Your responsibilities will include, but are not limited to, the following:

Product Issue Resolution: Tackle technical challenges identified through monitoring tools or reported by customers, ensuring timely resolution of issues related to product hosting, infrastructure setup, networking, security, and more. Work closely with cross-functional teams, product development, and clients to maintain high availability and performance.

Collaboration: Engage with cross-functional teams, product development, DevOps, and clients to comprehend their requirements, offer technical guidance, and address product hosting and infrastructure concerns. Participate actively in incident, change, and problem management, adhering to ITIL best practices.

Infrastructure Setup and Automation: Establish and maintain essential infrastructure components and services for product hosting on multi-vendor cloud platforms. Utilize tools such as GIT, Jenkins, Terraform, and scripting/automation to automate setup processes, enhancing deployment, configuration efficiency, and overall operational effectiveness.

Cloud Optimization: Utilize cloud observability tools like DataDog, NewRelic, Splunk, Prometheus, etc., to monitor and enhance the hosting environment's performance and health. Identify and resolve performance bottlenecks, ensuring optimal performance and availability.

Task Automation: Leverage scripting and automation tools to automate routine tasks, including backups, scaling, monitoring, and maintenance, thereby boosting operational efficiency and minimizing manual efforts.

Documentation and Reporting: Keep comprehensive documentation of infrastructure setups, configurations, and best practices. Regularly report on infrastructure performance, issue resolutions, and automation efforts to stakeholders.

Qualifications, Strengths, and Skills

  • Experience 6-10 yrs of Industry experience

  • Strong Linux Skills: Overseeing the day-to-day operations of cloud-based applications running on production Linux environments, ensuring their stability, performance, and security. This includes patch management, performance tuning, and system monitoring., including familiarity with JVMs, heap dumps, system performance analysis, installations, configurations, upgrades, and proficient command-line usage.

  • Cloud Support Operations: Proven background in service operations roles, especially in daily customer interactions, technical issue resolution through ticket triaging, and independent RCA drafting.

  • SaaS and AWS Experience: Demonstrated experience with SaaS solutions and services, particularly managing enterprise applications on AWS cloud, focusing on availability and performance. In-depth knowledge of AWS services like Storage, Databases, IAM, ECS, EKS, and CloudWatch.

  • Troubleshooting and Problem-Solving: Exceptional skills in diagnosing and resolving technical issues related to product hosting and infrastructure.

  • Cloud Observability and Monitoring: Experience with tools like Datadog, Splunk, Grafana, and Prometheus for cloud observability, monitoring, and alerting.

  • Infrastructure Management: Experience managing cloud infrastructure upgrades and change management. Proficiency in using tools like Jenkins, Terraform, and scripting/automation for infrastructure setup, automation, release, and task management.

  • Kubernetes and Containerization: Experience with Kubernetes and familiarity with containerization technologies is highly desirable.

Our employees are our top priority; we offer comprehensive health and wellness plans. You will enjoy paid time off and company holidays, flexible and remote-friendly opportunities, and maternity/paternity leave.

At Provenir, we recognize that diversity and inclusion make our teams stronger. We are committed to equal employment opportunity and welcome everyone regardless of race, colour, ancestry, religion, national origin, age, sex, gender identity, sexual orientation, disability, marital status, domestic partner status, citizenship, or veteran status or medical condition. We encourage people from all backgrounds to apply.

Cloud Engineer Related jobs

Other jobs at Provenir

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.