Mission: Installation, configuration, and maintenance of data center infrastructure, including servers, storage systems, and network devices.
Essential Duties and Responsibilities:
DC Technician: As a Data Center Technician, you will serve as the Directly Responsible Individual (DRI) for daily operations within the data center. You will lead hands-on installation, maintenance, and troubleshooting of compute and network infrastructure critical to Groq's high-performance AI workloads.
Your responsibilities will include:
1. Hardware Operations:
2. Receive, unpack, and move servers and other equipment to the data center floor.
3. Install, cable, and maintain servers, network switches, and power distribution units (PDUs) in racks.
4. Perform hardware-level bring-up and testing using Linux command-line tools.
5. Ensure proper accountability for equipment and assets through inventory management.
6. Troubleshooting & Support:
7. Troubleshoot and resolve complex technical issues related to rack and node failures.
8. Run scripts to debug and repair rack cabling and other hardware problems.
9. Create, update, and resolve tickets in Groq's ticketing system to document all work.
10. Participate in an on-call rotation to provide 24/7 support for data center operations.
11. Infrastructure & Collaboration:
12. Execute final test sign-offs for newly built racks.
13. Collaborate with other engineering teams to design and implement data center upgrades and expansions.
14. Develop and maintain technical documentation, including diagrams and procedures, to ensure operational consistency.
15. Ensure compliance with data center standards, policies, and procedures.
Ideal candidates have:
16. 2+ years of experience in data center operations or a related field
17. Strong knowledge of data center infrastructure, including servers, storage systems, and network devices
18. Experience with data center management software, such as DCIM or BMS
19. Strong problem-solving and analytical skills
20. Excellent communication and teamwork skills
21. Ability to work in a fast-paced environment and prioritize tasks effectively
22. Strong attention to detail and ability to maintain accurate records
23. Experience with scripting languages, such as Python or Bash
24. Familiarity with virtualization technologies, such as Kubernetes
25. Advanced fiber optic cabling skills
26. Intermediate Linux skills
27. Intrinsic curiosity and drive to stay up-to-date with the latest technologies and trends in data center infrastructure and operations
28. Familiarity with Macbooks, Slack and Google docs
29. Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent experience
30. Ability to travel up to 50% of the time
Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, salary range is determined by your location, skills, qualifications, experience and internal benchmarks. Compensation for candidates outside the USA will be dependent on the local market.