Job Description: Senior Network SRE (London)
Role Overview:
We are seeking a highly experienced Senior Network Site Reliability Engineer (SRE) to join our global network operations team. This role is critical in ensuring the reliability, scalability, and performance of our network infrastructure. You will lead incident response, troubleshoot complex issues, and drive automation initiatives to maintain world-class network services.
Required Skills:
Minimum 10 years of hands-on experience in network engineering and operations.
Deep expertise in routing, switching, firewalling, and wireless across multiple vendors.
Strong troubleshooting skills, including overlay/underlay network understanding.
Proficiency in Linux/Unix environments.
Experience with automation and monitoring platforms.
Ability to work independently, set technical direction, and mentor others.
Tools
NetBox/Nautobot
Prometheus/VictoriaMetrics
Salt
Networking (either one of the following)
EVPN or Segment Routing (candidates with significant MPLS depth on their resume would also be considered)
Key Responsibilities
Lead Incident Management: Own and resolve critical network incidents, manage outages, and provide expert guidance during high-pressure situations.
Advanced Troubleshooting: Diagnose and resolve complex issues across routing, switching, firewalling, and wireless domains.
Technical Leadership: Set technical direction, mentor junior engineers, and foster a culture of operational excellence.
24/7 Operations: Participate in a shift-based model to ensure continuous availability of critical network services.
Multi-Vendor Expertise: Operate across diverse environments including Arista, Cisco, Cumulus, Spectrum Ethernet, InfiniBand, Palo Alto, Check Point, Mist, Aruba, A10, NetScaler, and F5.
Security & Segmentation: Support network segmentation, policy enforcement, and VPN solutions (GlobalProtect, AnyConnect).
Automation & Observability: Utilize tools like Grafana, BigPanda, ServiceNow, ITMP, syslog, Splunk, Salt, Ansible, and Prometheus to enhance monitoring and automation.
Innovation Projects: Collaborate on wireless design and AI cluster deployments to support cutting-edge initiatives.
Preferred Skills
Experience with InfiniBand and AI cluster deployments.
Familiarity with network asset management systems (e.g., Nautobot).
Wireless design experience with Cisco, Mist, and Aruba.