Hello All,
Greetings from Rootshell Inc.
Rootshell Enterprise Technologies Inc. is a recognized provider of professional IT consulting services in the US. We are actively seeking a Site Reliability Engineer with Root Cause Analysis (RCA) experience for one of our clients. Please share your resume along with your current location and full contact information.
Role: Site Reliability Engineer with Root Cause Analysis (RCA)
Location: Boston, Massachusetts (Remote)
Job Summary:
We are seeking a passionate Site Reliability Engineer to address operational challenges in automated warehouse solutions. This role focuses on enhancing system stability and scalability while leading Root Cause Analysis (RCA) meetings to identify and resolve resiliency gaps. Key responsibilities include analyzing critical incident information, hosting RCA calls, troubleshooting VMware, Kubernetes, custom software, and infrastructure issues, and implementing improvements to major software components and systems.
You will engage in the entire service lifecycle, from design to deployment, and work closely with Operations, IT, and Software Engineering teams. The role demands strong technical expertise, with a Bachelor's degree in software engineering or a related field, 12+ years of experience with ITSM tools like Jira, and 8+ years in infrastructure engineering and production system operations. Experience in leading technical RCAs and proficiency with tools such as Prometheus, Grafana, and Elastic are essential.
Successful candidates will demonstrate the ability to prioritize tasks, communicate effectively with both technical and non-technical audiences, and lead complex RCAs to conclusion. Success measures include completing technical training, initiating and leading RCAs, collaborating with cross-functional teams, and maintaining up-to-date problem tickets. This role requires occasional out-of-hours work based on workload demands.
With regards,
Naveen | Talent Acquisition
Rootshell Enterprise Technologies Inc.
[email protected] | www.rootshellinc.com