L2 Linux Engineer Ops Centre – Associate Manager / Senior Analyst
Salary: Competitive salary and package (depending on level of experience).
Location: London (must be willing to travel to client sites throughout the UK on an ad hoc basis).
Accenture is partnering with scaled UK AI compute pioneers to lead the charge on next‑generation infrastructure for sovereign AI. To support this endeavour, we’re building a high‑performance compute operations team in London.
Our work will be sensitive, secure, 24×7, and on the most up‑to‑date high‑density compute stacks available. Shift teams will be set up and operate 24×7; successful candidates working on shift will receive a shift premium for non‑standard unsociable shift hours that will be part of that rota.
Any offer of employment is subject to satisfactory BPSS and SC security clearance, which requires 5 years continuous UK address history (typically including no periods of 30 consecutive days or more spent outside of the UK) at the point of application. Eligibility for UK Government security clearance is required.
Key Responsibilities
* Managing and maintaining Linux, including installation, configuration, and troubleshooting.
* Managing and supporting hypervisors.
* Deployment and configuration of clusters on‑premises and private cloud platforms.
* Deploying clusters in a containerised environment.
* Performing system administration, networking, scripting, and automation to ensure efficient system operations.
* Effective line and shift management, people development, and leadership for junior team members.
* Responding to real‑time alerts and dashboards for compute, storage, and networking resources to detect service‑impacting events.
* Performing initial triage and isolation for incidents following established runbooks and procedures.
* Documenting incidents, actions, and outcomes accurately in the incident management system; supporting service delivery and SLA compliance.
* Ensuring thorough shift handovers documenting operational status and ongoing incidents.
* Investigating and resolving incidents escalated from L1.5 Engineers, conducting in‑depth analysis of compute, storage, and network issues.
* Developing and refining troubleshooting guides, runbooks, and knowledge base articles.
* Coordinating with engineering, automation, and vendor teams for persistent or complex technical problems.
* Monitoring and analysing performance metrics and incident trends, recommending proactive measures for reliability improvements.
* Mentoring L1.5 Engineers to support skill development and knowledge sharing.
* Participating in shift rotations and on‑call schedules as required.
Required Skills
* Proficient Linux experience.
* Technical experience in networking, storage, compute and related infrastructure.
* Strong understanding of network protocols, configurations, and security measures within a Linux environment.
* Understanding of storage systems (e.g. NAS, Cloud).
* Ability to write and utilises shell scripts (e.g., Bash) and automation tools like Ansible for efficient system management.
* Ability to monitor, diagnose, and optimise system performance for efficiency.
* Expertise in scripting languages such as Python, PowerShell & Shell.
* Expertise in Kubernetes for container orchestration and managing containerised applications.
* Experience in incident management, advanced troubleshooting, and operational best practices.
* Experience with Docker, Kubernetes, EKS.
* Familiarity with ITSM, Agile methodologies and tools e.g. ServiceNow, Jira.
* Bachelor’s Degree in Electrical Engineering (relevant experience will be considered).
Seniority Level
Mid‑Senior level
Employment Type
Full‑time
Job Function
Information Technology
Industries
Information Services
Referrals increase your chances of interviewing at Accenture UK & Ireland by 2x.
#J-18808-Ljbffr