Logo for Rapinno Health Care

Site Reliability Engineer

Roles & Responsibilities

  • 3-5 years managing and administering middleware technologies (WebLogic, WebSphere, Tomcat)
  • 3+ years hands-on experience with Solaris, Linux (RHEL, CentOS, Ubuntu) in bare-metal and cloud-based infrastructure (AWS, OpenStack)
  • Experience with CI/CD tools such as Jenkins and Ansible
  • Experience with AWS cloud platforms including Auto Scaling, EC2, EFS, EBS, S3, KMS

Requirements:

  • Design and development of medium to highly complex systems, including infrastructure design, configuration, deployment of applications, and advanced troubleshooting.
  • Deployment, middleware administration and operational support of production, staging, test, and development environments using WebSphere, WebLogic, and Tomcat.
  • Monitor capacity and performance, plan and execute disaster recovery procedures, and provide Tier 2 technical support.
  • Collaborate with development, QA, and production support teams to resolve open issues/defects and communicate production issues to upper management; contribute to release engineering and automation (CI/CD, configuration management) using Jenkins and Ansible.

Job description

Role: Site Reliability Engineer

Location: Piscataway, NJ

Duration: Long Term Contract

Domain: Largest Enterprise Telecom Client

Middleware tech WebSphere/WebLogic/tomcat, Shell scripting, AWS, Ansible/jenkins - Must have Some Production support exp

Description
As a member of the Platform as a Service team, you will be responsible for the design and development of medium to highly complex systems. This includes the design and implementation of infrastructure from specifications, configuration and deployment of applications, connecting to back-end resources, and advanced troubleshooting of moderately complex software applications. Deployment, middleware administration and operational support of (production, staging, test and development) environments for multiple projects using WebSphere, Weblogic, and Tomcat Application Server. Monitors systems capacity and performance, plans and executes disaster recovery procedures, and provides Tier 2 technical support.

In addition, this role requires the candidate to be highly flexible in hours of work because of its customer-facing, highly available infrastructure requirements. Work closely with Dev, QA and production support team members to align and orchestrate resolutions on open issues/defects. Provides high level written communications to upper management regarding production issues.

Required Skills

3-5 years managing and administrating middleware technologies(Weblogic, Websphere, Tomcat).
3+ years hands-on experience with Solaris, Linux (RHEL, CentOS, Ubuntu), in bare-metal and Cloud-based infrastructure (AWS, OpenStack)
Experience with cloud platforms AWS( Auto scaling , AVI, security, EC2 , EFS , EBS , S3 , KMS)
Strong experience with Installing IBM WebSphere MQ and creating multi instance Queue manager in AWS by using EBS/EFS volumes, creating MQ objects, clusters, channels etc.
Experience with configuring the clustered Queue managers for HA and load-balancing as well troubleshooting in clustered environment
Installing open source Rabbit MQ on AWS EC2 instances with the use of CFTs/ansible and automating it by using Jenkins. Also creating Classic Load balancer to distribute traffic among those Rabbit MQ instances
Experience with migrating applications from monolithic to kubernetes container platform
Experience with APIGEE Proxy configurations and troubleshooting
Hands on experience with CI/CD tools such as Jenkins, Ansible
Working knowledge of monitoring tools like CA Wily, New Relic, and Datadog
Experience with Elasticsearch, Kibana, and Logstash
Execution on all release engineering aspects of DevOps including the configuration management , Build and Deployment Management, Continuous Integration and Delivery
Ansible based deployment and configuration automation solutions.
Experience with web based services and protocols ( HTTP , HTTPS, REST , Apache , Tomcat)
Experience with micro-service architectures and deployment.
Knowledge on L2/L3 protocols , IPv4/IPv6 and TCP/IP stack .
Proficiency in high level script languages (Python preferred) as well as script environments like bash Experience with DevOps workflow automation (Jenkins, Ansible, Puppet)
Strong analytical & troubleshooting skills.
Experience with tools like JIRA, Confluence, Stash
M.S. or relevant experience required.

Preferred to have:
AWS Certification

Site Reliability Engineer (SRE) Related jobs

Other jobs at Rapinno Health Care

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.