The team is seeking an automation and observability engineer who will focus on automation and on building the observability and alerting solutions needed to meet the Firm's data protection strategy.
Tasks will involve working with internal teams to identify opportunities to improve the reliability of the data protection plant. This will be done by addressing the underlying issues through automation, by improving alerting and observability to accelerate resumption of service, or both.
You will work closely with the operations team to understand pain points and find solutions. You must have strong experience with automation and observability using a modern technology stack, while also being comfortable working with legacy technology where required.
Required Skills:
* Excellent programming skills with Python. Extra points for experience with Perl and/or PowerShell
* Strong experience with Prometheus, Grafana, Loki, Cortex
* Strong experience with Ansible
* Strong experience with REST APIs
* Excellent ability to debug complex and novel issues, understanding the need to go beyond the documentation provided with a product
* Excellent analytical skills, capable of fast decision making using sound judgement, and not afraid to explore new ideas
* Excellent interpersonal skills in dealing with customers with differing technical specializations
* Good organizational and English communication skills are required, including prioritization of multiple projects and objectives
* Experience of backup and data protection platforms, in particular Veritas NetBackup
* Understanding of data deduplication technology
* Systems administration experience in UNIX and/or Windows Server environments
* Experience in other areas of storage: SAN, NAS, S3 object storage