Alexander Ash is seeking Network Operations Engineers on behalf of our global, technology-driven financial services client. As part of the global infrastructure team, you will play a key role in managing and maintaining the firm’s network infrastructure and automation systems. The team’s mission is to ensure the availability, reliability, and security of the platform while driving operational excellence.
Key responsibilities include:
* Supporting and enhancing existing network infrastructure
* Developing observability tools and self-healing/event-driven automation
* Performing advanced troubleshooting and incident resolution
* Contributing to the evolution of a high-performance compute datacentre
Skills Required:
* Monitor and resolve incidents proficiently across diverse environments
* Apply strong diagnostic skills to network infrastructure, collaborating closely with vendor support teams on in-depth investigations when needed
* Lead the creation and enhancement of monitoring dashboards, proactively addressing alerts based on priority and driving continuous improvements in alerting mechanisms and system observability
* Facilitate post-incident reviews to identify and implement improvements that enhance infrastructure reliability and availability
* Implement BAU changes with a focus on automation, fostering close collaboration with research and infrastructure engineering teams
* Conduct trend analysis using multiple data sources to identify potential issues, enhance data correlation, and address capacity challenges
* Provide robust support for network automation, using tools and practices such as CI/CD pipelines, orchestration frameworks, Ansible, Python, and GitOps
* Take ownership of production deployments, ensuring rigorous code reviews and reliable release processes for new changes and features
* Proactively identify opportunities to automate repetitive tasks and lead initiatives to deliver cross-functional automation solutions involving Network Engineering and DevOps teams
* Demonstrate a solid understanding of infrastructure monitoring and visualization tools, including Kibana, Splunk, Prometheus, and Grafana
* Maintain up-to-date knowledge of industry trends and new technologies to ensure automation practices remain advanced and relevant
* Possess strong foundational networking expertise equivalent to Cisco CCNP level