Logo for Diverse Lynx

Site Reliability Engineer (SRE) – Data Analytics & Observability

Key Facts

Remote From: 
Full time
Senior (5-10 years)
English

Other Skills

  • Problem Solving
  • Teamwork

Roles & Responsibilities

  • Bachelor’s degree or equivalent experience
  • 5 plus years in SRE, DevOps, or Production Support
  • Strong knowledge of SRE principles and reliability engineering practices
  • Hands-on experience with Dynatrace, Splunk, Power BI, SQL across multiple platforms

Requirements:

  • Apply SRE principles to improve system reliability
  • Implement proactive monitoring, alerting, and self-healing capabilities
  • Design and deliver operational dashboards and reports using Power BI
  • Integrate observability tools with enterprise reporting and ITSM systems

Job description

Job Title: Site Reliability Engineer (SRE) – Data Analytics & Observability
Location: –
REMOTE

JD Below:-
Position Summary
We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong focus on Data Analytics, Observability, and Reporting to support and enhance enterprise production systems. This role combines SRE principles with deep expertise in Dynatrace, Splunk, Power BI, and Snowflake, along with strong knowledge of Oracle and MS SQL Server, to drive system reliability, performance optimization, and proactive incident management.
The ideal candidate will have hands-on experience with Power BI reporting, SQL development across multiple platforms, PowerShell automation, ServiceNow, Managed File Transfer (MFT), and enterprise scheduling tools, with a strong emphasis on data-driven operational insights.

Key Responsibilities
Site Reliability Engineering
  • Apply SRE principles (SLIs, SLOs, error budgets) to improve system reliability
  • Implement proactive monitoring, alerting, and self-healing capabilities
  • Lead incident response, RCA, and postmortems
  • Drive continuous improvement in availability, scalability, and resilience

Data Analytics, Reporting & Observability
  • Design and deliver operational dashboards and reports using Power BI
  • Leverage Splunk and Dynatrace to analyze logs, metrics, and traces
  • Correlate data across platforms to identify trends, anomalies, and risk patterns
  • Use Snowflake, Oracle, and MS SQL Server SQL to query, transform, and analyze operational datasets
  • Build data models and curated datasets to support reporting and analytics
  • Translate operational data into actionable insights for engineering and leadership

Monitoring & Tooling
  • Administer and optimize:
    • Dynatrace (APM, Grail, DQL, synthetic monitoring)
    • Splunk (SPL queries, dashboards, ingestion pipelines)
  • Create alerting strategies aligned to SLOs and business priorities
  • Integrate observability tools with enterprise reporting and ITSM systems

Power BI & Data Integration
  • Develop and maintain Power BI dashboards, reports, and semantic models
  • Integrate Power BI with Snowflake, Oracle, MS SQL Server, Splunk, and operational data sources
  • Optimize query performance, data refresh, and dataset design
  • Implement row-level security and governance controls
  • Support enterprise reporting standards and governance

Data Platforms (Snowflake, Oracle, MS SQL Server)
  • Write and optimize SQL across:
    • Snowflake (advanced analytics, semi-structured data)
    • Oracle (PL/SQL, performance tuning, indexing strategies)
    • MS SQL Server (T-SQL, stored procedures, query optimization)
  • Perform cross-platform data analysis and reconciliation
  • Support data modeling (views, marts, transformations) for analytics
  • Troubleshoot data performance issues across heterogeneous platforms
  • Partner with data engineering teams to improve data quality, lineage, and availability

Automation & Scripting
  • Develop automation using PowerShell (primary), Python, or REST APIs
  • Build automation workflows for:
    • Monitoring enhancements
    • Incident enrichment
    • Data extraction, transformation, and reporting
  • Create self-service tooling for operations teams
  • Integrate automation with ServiceNow, schedulers, and observability tools

IT Operations & Service Management
  • Integrate monitoring with ServiceNow (incident, event, change management)
  • Automate ticket creation, enrichment, and routing workflows
  • Ensure alignment with ITIL best practices

File Transfer & Scheduling
  • Support and optimize Managed File Transfer (MFT) platforms
  • Monitor and troubleshoot file transfer failures, protocol issues, and throughput
  • Manage and support enterprise schedulers:
    • Control-M
    • Stonebranch
    • Redwood
  • Analyze batch workflows, dependencies, and SLA adherence

Required Qualifications
  • Bachelor’s degree or equivalent experience
  • 5+ years in SRE, DevOps, or Production Support

Technical Expertise
  • Strong knowledge of SRE principles and reliability engineering practices
  • Hands-on experience with:
    • Dynatrace (APM, DQL, observability)
    • Splunk (search, SPL, dashboards)
    • Power BI (data modeling, DAX, performance tuning)
    • SQL across multiple platforms:
      • Snowflake
      • Oracle
      • MS SQL Server
    • PowerShell automation and scripting
    • ServiceNow integration

Platforms & Tools
  • Experience with:
    • Snowflake data platform
    • Oracle and SQL Server databases in enterprise environments
    • MFT tools (Axway, Globalscape, JSCAPE, Boomi MFT)
    • File transfer protocols (SFTP, FTPS, HTTPS, AS2)
    • Enterprise schedulers (Control-M, Stonebranch, Redwood)
  • Knowledge of cloud and hybrid architectures

Preferred Qualifications
  • Experience integrating Power BI with Snowflake, Oracle, and SQL Server
  • Strong understanding of cross-platform data architecture and ETL/ELT patterns
  • Familiarity with Dynatrace Davis AI and automation workflows
  • Advanced Splunk data modeling and ingestion optimization
  • Exposure to Chaos Engineering (e.g., Gremlin)
  • Certifications:
    • Dynatrace
    • Splunk
    • Snowflake
    • Microsoft (Power BI / SQL Server)
    • Oracle
    • ITIL






Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.

Site Reliability Engineer (SRE) Related jobs

Other jobs at Diverse Lynx

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.