Site Reliability Engineer (Python Preferred)
We are looking for a skilled and adaptable Site Reliability Engineer (SRE) to join our team. This role is a blend of scripting and operational responsibilities, ideal for someone who enjoys both building automation and engaging in hands-on support to ensure system reliability and performance.
London hybrid working - Contract Opportunity - 3 days in Battersea
Must-haves
1. Python scripting (candidates with Go will also be considered)
2. Automation experience
3. Prometheus / Grafana / PromQL
4. CI/CD
5. AWS
6. Splunk
Key Responsibilities
7. Develop and maintain automation scripts, primarily in Python (Go experience also considered).
8. Respond to and resolve incidents, manage changes, and perform problem analysis to maintain system uptime and reliability.
9. Collaborate with internal teams and customers to troubleshoot and resolve infrastructure and application issues.
10. Operate and enhance observability tooling, including Prometheus, Grafana, and Splunk, with a strong focus on PromQL.
11. Participate in an on-call rotation to support critical production systems.
12. Improve and maintain CI/CD pipelines and deployment processes.
13. Work with AWS cloud infrastructure to support scalable, secure, and resilient systems.
14. Operate within a GitOps workflow and support Kubernetes-based environments.
Required Skills & Experience
15. Strong scripting skills in Python (Go, Bash, or SQL also beneficial).
16. Proven experience with automation and infrastructure-as-code practices.
17. Deep understanding of monitoring and observability, particularly with Prometheus, Grafana, and PromQL.
18. Experience with CI/CD tools and modern deployment strategies.
19. Solid hands-on experience with AWS services in a production environment.
20. Proficiency with Splunk for log analysis and monitoring.
21. Familiarity with GitHub, GitOps, and Kubernetes operations.
Nice to Have
22. Experience in customer-facing or operational support roles.
23. Exposure to container orchestration and microservices architecture.
24. Ability to work effectively in a fast-paced, collaborative environment.
Location
London, UK
Rate/Salary
- GBP Daily
Trading as TEKsystems. Allegis Group Limited, Maxis 2, Western Road, Bracknell, RG12 1RT, United Kingdom. No. 2876353. Allegis Group Limited operates as an Employment Business and Employment Agency as set out in the Conduct of Employment Agencies and Employment Businesses Regulations 2003. TEKsystems is a company within the Allegis Group network of companies (collectively referred to as "Allegis Group"). Aerotek, Aston Carter, EASi, Talentis Solutions, TEKsystems, Stamford Consultants and The Stamford Group are Allegis Group brands.