Production Engineer (f/m/x)

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Minimum 5 years of IT experience with a focus on production support and SRE practices., Proficiency in Linux/Unix systems, scripting languages like Python and Shell., Experience with automation tools such as Ansible, SSH, and monitoring tools like Prometheus and Grafana., Strong understanding of incident, problem, and change management processes, preferably with ITIL V3 certification..

Key responsibilities:

  • Ensure the reliability and performance of production systems through monitoring and incident response.
  • Develop and maintain automation tools to streamline operational tasks.
  • Act as a primary responder to system outages, conducting post-mortem analyses.
  • Collaborate with development teams to design reliable and scalable features.

Deutsche Postbank Group logo
Deutsche Postbank Group Banking XLarge https://www.postbank.de/
10001 Employees
See all jobs

Job description

Job Description:

DB Global Technology is Deutsche Bank’s technology centre in Central and Eastern Europe. Since its set-up in 2013, Bucharest Technology Centre (BEX) has constantly proven its capacity to deliver global technology products and services, playing a dynamic role in the Bank’s technology transformation.

We have a robust, hands-on engineering culture dedicated to continuous learning, knowledge-sharing, technical skill development and networking. We are an essential part of the Bank’s technology platform and develop applications for many important business areas.

This role is part of Risk-RFT Production Integrity SRE portfolio. Risk SRE team is based out of India and Cary,US. This role will add to the follow the sun support model.

Risk IT is an integral function of DB providing Risk Management capabilities. The Current team supports the below Risk domains:

Core Risk provides single access solution for all Risk/P&L and consolidates valuation data.

Market Risk reports daily Value at Risk and Limit management with tight SLAs.

NFRM manages the bank’s Non- Financial Risk Management profile and improve the controls.

Responsibilities

  • System Reliability: Ensure the reliability, availability, and performance of production systems by implementing best practices in monitoring, alerting, and incident response.
  • System Maintenance: Understand thoroughly the end to end application support process and escalation procedures, become fully conversant with all support tools. Maintain an end to end view of the application and infrastructure landscape.
  • Automation: Develop and maintain automation tools and scripts to streamline deployment, scaling, and operational tasks.
  • Incident Management: Act as a primary responder to system outages and incidents, ensuring rapid resolution and thorough post-mortem analysis to prevent recurrence.
  • Monitoring & Alerting: Design and implement robust monitoring and alerting systems to proactively identify and address potential issues.
  • Performance Optimization: Identify and resolve performance bottlenecks across the stack, from application code to infrastructure.
  • Collaboration: Work closely with development teams and other stakeholders to ensure that new features and services are designed with reliability and scalability in mind.
  • Documentation: Maintain comprehensive documentation of systems, processes, and procedures to ensure knowledge sharing and continuity.
  • Continuous Improvement: Continuously evaluate and improve our infrastructure, tools, and processes to enhance system reliability and operational efficiency.

Experience

  • 5+ years overall IT experience resource, with an ability to drive the right level of SRE Production engagement and controls within the Change organization, and from production support standpoint.
  • Ability to work in a fast paced environment with competing and alternating priorities with a constant focus on delivery.
  • Ability to balance business demands and IT fulfilment in terms of standardization, reducing risk and increasing IT flexibility.

Skills

Tech Stack:

MQ, DWEB, JMS, JAVA, Oracle, Hadoop, Unix and PL/SQL, Google Cloud Platform(GCP), Monitoring and Automation Tools.

Technical/Functional Skills:

• Dev Ops – Experience with Linux/Unix systems and scripting languages such as, Python and shell

• Automation with tools such as Ansible, SSH, and Shell

• Monitoring Experience with the design and implementation of AI/ML and RPA/ automation tools or tools like Geneos, Prometheus, Grafana etc.

• Problem analysis and solving in multiple layers such as hardware, Linux, networking and application

• Hosting services (PaaS) DHSO, VHS, DAP, DWEB, etc.

• Working knowledge of networks and load balancing and ssh keys.

• Expertise in Unix command line and shell scripting.

• Deep knowledge of the Incident, Problem and Change Management processes within the ITSM framework at minimum must be ITIL V3 Foundation certified. Proficient at using Service Management tools (e.g. ServiceNow, JIRA, etc.) and service monitoring tools.

• Exposure to GCP and monitoring tools such as NewRelic will be preferred.

• Exposure to SRE Model and execution/maintain deliveries within the SRE model.

Soft Skills:

• Excellent communication and collaboration skills

• Able to adapt to a changing environment and drive change

• Able to successfully interface with various stakeholders

• Self-motivated, delivery focused with the ability to work independently where required.

• Able to own and drive solution understanding the real issues behind Business Requirements.

• Committed and demonstrate a strong ownership.

Well-being & Benefits

Emotionally and mentally balanced:

  • Empowering managers who value your ideas and decisions. Show your positive attitude, determination, and open-mindedness.
  • A professional, passionate, and fun workplace with flexible Work from Home options.
  • A modern office with fun and relaxing areas to boost creativity.
  • Continuous learning culture with coaching and support from team experts.

Physically thriving:

  • Private healthcare and life insurance with premium benefits for you and discounts for your loved ones.

Socially connected:

  • Kids@TheOffice - support for unexpected events requiring you to care for your kids during work hours.
  • Enjoy retailer discounts, cultural and CSR activities, employee sport clubs, workshops, and more.

Financially secure:

  • Competitive income, performance-based promotions, and a sense of purpose.
  • 24 days holiday, loyalty days, and bank holidays (including weekdays for weekend bank holidays).

We strive for a culture in which we are empowered to excel together every day. This includes acting responsibly, thinking commercially, taking initiative and working collaboratively.

Together we share and celebrate the successes of our people. Together we are Deutsche Bank Group.

We welcome applications from all people and promote a positive, fair and inclusive work environment.

Required profile

Experience

Industry :
Banking
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Reliability
  • Collaboration
  • Adaptability
  • Communication
  • Self-Motivation
  • Problem Solving

Related jobs