CodersBrain

Prometheus SME (8 years)

Requirements:

  • Minimum of 8 years of experience in Enterprise monitoring.
  • At least 4 years of hands-on experience with the Prometheus platform.
  • Strong knowledge of designing and implementing IT Operations monitoring solutions.
  • Experience with integrating monitoring tools into ITSM applications and managing large-scale analytics.

Roles & Responsibilities

  • Manage and maintain Prometheus monitoring and alerting infrastructure.
  • Design, develop, and implement monitoring solutions with integrations to other ITSM tools.
  • Configure and optimize Prometheus components, exporters, and dashboards.
  • Collaborate with support teams to resolve infrastructure monitoring issues.

Job description

Minimum of 8 years of experience in the area of Enterprise monitoring.
Minimum of 4 years of hands-on experience with the Prometheus platform.
Good experience in the design, development, and implementation of IT Operations monitoring solutions with integration into other ITSM applications.
Expertise in installing and configuring Prometheus components.
Experience with time-series databases.
Experience managing large amounts of product analytics.
Manage the day-to-day maintenance and evolution of the Prometheus monitoring and alerting infrastructure.
Experience with Grafana is a must.
Expertise in installing and configuring the required exporters on the targets.
Expertise in configuring thresholds for servers, network, storage, backup, and databases.
Experience in configuring dashboards and reports.
Experience integrating Prometheus with other tools.
Experience with custom exporters.
Experience in event management functionality. Experience with Netcool Omnibus, ScienceLogic, LogicMonitor, Zabbix, or other event management tools is an added advantage.
Experience integrating with service management tools such as ServiceNow and BMC Remedy.
Experience integrating with notification, collaboration, and automation tools such as xMatters, Everbridge, Slack, and Ansible.
Knowledge of third-party discovery tools such as ServiceNow and BMC Discovery is an added advantage.
Strong understanding of Infrastructure network concepts and protocols.
Experience in remediating discovery and monitoring issues in the infrastructure.
Good analytical, problem-solving, and logical-thinking skills. Standby support during non-office hours is required.
Coordinate with support teams to resolve issues.
Collect, generate, or help refine high-level requirements, and create the implementation strategy, acceptance criteria (with input from the customer), and test cases.
Knowledge of programming languages such as Python or NodeJS is an additional advantage.
Interest in working in a multi-skilled environment and willingness to learn new technologies that support any enterprise infrastructure.
Customer-facing experience is a must.
Prometheus certifications are an added advantage.
