As a Reliability Engineer IV, you will be responsible for the availability, performance, monitoring, and incident response of the cloud platforms and services, among other things. You will ensure that everything released to production complies with a set of general requirements, such as diagrams, dependencies on other services, monitoring and logging plans, backups, and possible high-availability setups. You will manage uncaught exceptions, hardware degradation, networking problems, high resource usage, and slow responses, any of which can occur at any time, and you will track reliability using metrics such as mean time to recover (MTTR) and mean time to failure (MTTF). This role is considered an emerging authority that applies extensive technical expertise, develops technical solutions to complex problems, and exercises considerable latitude in determining the objectives of and approaches to assignments.
Evaluate and analyze products, components, materials, and equipment to predict failures and improve reliability
Review product designs, material specifications, and manufacturing processes to assess dependability and recommend improvements
Create prototypes and conduct product tests to gather and analyze reliability data
Interpret test results using statistical distributions and reliability models to recommend design changes
Recommend modifications to product designs, manufacturing processes, and quality controls to enhance reliability
Monitor production equipment diagnostics and review maintenance records to predict and prevent downtime
Document findings, conduct root cause analysis, and implement corrective actions to maintain product and equipment reliability
Collaborate with engineering and development teams to design and implement process and product improvements
Determine maintenance requirements and schedules for products and equipment
Review subcontractors’ proposals for reliability programs and provide evaluations for decision-making
Assess engineering specifications and drawings, proposing design modifications to improve reliability while meeting cost and performance targets
Observe testing at supplier, plant, or field locations to evaluate reliability factors, including causes of unit failures
Monitor failure data generated by customer usage to determine potential requirements for product improvements
Provide technical support for operational strategies that optimize processes, enhance productivity, and ensure quality across all program functions
Ensure 100% of planned hours are worked and recorded
Identify and escalate opportunities for growth within the work area to leadership
Participate in growth initiatives as requested
Ensure all contractual deliverables are met or exceeded to customer satisfaction
Complete personal PDP and attend Staff Meeting and Storytime (with camera on)
Build productive and positive professional relationships with clients within the program
Execute all contract requirements in accordance with contract-specific LCAT and requirements
Perform other related duties as assigned
Clearance: Active Secret Clearance
Education and Years of Experience: Bachelor’s degree with 8 years of experience, or a Master’s degree with 6 years of experience
Recognized as an emerging authority in reliability engineering with expertise in solving complex technical challenges
Ability to develop and implement reliability solutions while exercising significant autonomy in decision-making
Strong analytical skills for interpreting test data, conducting root cause analysis, and optimizing reliability models
Effective problem-solving and leadership skills to drive reliability initiatives and process improvements
Proficiency in reliability modeling, failure mode analysis, and predictive maintenance methodologies
DoD 8570 / 8140 IAT Level II certification
At least one cloud certification
Experience leading reliability programs in manufacturing, aerospace, defense, or similar industries
Familiarity with statistical analysis tools and software for reliability modeling and data analysis
Hands-on experience with predictive maintenance techniques, performance testing, and diagnostics tools
Knowledge of industry reliability standards, compliance requirements, and best practices
Background in quality management systems (QMS), Six Sigma, and other process improvement methodologies
