Job Overview:
We are seeking a skilled and experienced Service Reliability Analyst to join our diverse team as part of newly created Service Reliability Centre (SRC). In this role, you will help improve the availability and performance of Arm infrastructure by utilising Arms AI Operations (AIOPS) and observability platforms. You will collaborate closely with development and platform teams to build and maintain robust observability and response processes.
Responsibilities:
- Lead the analysis and resolution of infrastructure incidents across physical and virtual servers, storage, identity, and engineering platforms.
- Work with platform and engineering teams to expand monitoring coverage, define alert thresholds, and onboard new applications and services into SRC support.
- Drive proactive monitoring, tuning, and optimization of systems using Dynatrace and other observability tools.
- Look for opportunities to adapt automation to support the AIOps platform
- Conduct root cause analysis of incidents and implement preventive measures.
- Management of incidents to suppliers and Arms technical on-call rotas as appropriate
- To log all issues in the Service Management Tool and manage them to completion within EIT service levels and quality criteria matrix
- Work on a shift pattern, on a 24/7/365 operating model, while being able to work independently and flexibly in response to emergencies or critical issues
Required Skills and Experience:
- 3–6 years of hands-on experience in Platform Operations, or Infrastructure Support roles.
- Solid experience with observability tools managing and optimising an enterprise observability (e.g., Dynatrace, Datadog, Splunk) for real-time monitoring, alerting, and diagnostics.
- Proficiency in one or more scripting or programming languages (e.g., Python, Java, .NET, Node.js, Ansible or JavaScript).
- Practical knowledge of infrastructure automation using Ansible, including writing and managing playbooks.
- Understanding of UAM and IAM across on Premise, OUD/LDAP and Azure AD, including fault finding and access issues.
- Experience supporting Windows and Linux operating systems
- Experience with engineering tools such as Github, Jira, and Confluence
- Virtualization and Storage infrastructure, High Performance computing and Cloud services in an enterprise environment.
- Proficient in ticket management via an ITSM platform such as ServiceNow
- Experience leading incident response, driving service restoration and coordinating root cause analysis under pressure.
- Effective communicator within a team with a proactive approach and personal accountability for outcomes.
- Ability to analyze incident patterns and metrics to proactively recommend reliability improvements.
“Nice To Have” Skills and Experience:
- Exposure to high performance computing or cloud-native services
- Proven background in automation and DevOps practices
In Return:
Accommodations at Arm
At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email accommodations@arm.com. To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.
Equal Opportunities at Arm
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don’t discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Hybrid Working at Arm
Arm’s hybrid approach to working is centred around flexibility, where we split our time between the office and other locations to get our work done. Within that framework, we empower groups and teams to determine their own particular hybrid working pattern, depending on the work and the team’s needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
#LI-LK2
Accommodations at Arm
At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email accommodations@arm.com. To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.
Hybrid Working at Arm
Arm’s approach to hybrid working is designed to create a working environment that supports both high performance and personal wellbeing. We believe in bringing people together face to face to enable us to work at pace, whilst recognizing the value of flexibility. Within that framework, we empower groups/teams to determine their own hybrid working patterns, depending on the work and the team’s needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Equal Opportunities at Arm
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don’t discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.