Datacenter Operations Manager
Location: Remote - PST timezone or PHX DC
Posted On: 03/17/2026
Requirement Code: 73460
Requirement Detail
We are looking for a contractor to join our Data Center Capacity Management team. In this role, you will bridge the gap between physical infrastructure and logical reliability. The role demands a blend of traditional data center management (racking, cabling, and power planning) with modern SRE principles and advanced network fabric design. You will be responsible for ensuring that our global data center footprint is scalable, redundant, and manageable through "lights-out" remote operations.
Your Role
• Capacity & Power Engineering: Monitor and analyze capacity metrics, aligning hardware requirements with physical data center resources. Perform complex rack-based 3-phase power calculations (kVA loads) to balance redundant bus feeds and ensure electrical stability.
• Network Fabric & Connectivity: Design and configure network fabric architectures, ensuring high availability through redundant network connections at the node level.
• Remote Operations & Troubleshooting: Manage remote ticketing and troubleshooting for lights-out environments, utilizing Out-of-Band (OOB) management for full remote operational control.
• SRE & Automation: Apply Site Reliability Engineering (SRE) principles to data center operations, focusing on automation (IaC), latency reduction, and the overall reliability of physical and virtual infrastructure.
• Technical Execution: Execute hands-on tasks, including server builds, hardware assembly, and troubleshooting. Coordinate with Data Center Operations to support physical deployments, including racking and precision cabling.
• Workflow Management: Manage incoming requests through Jira/ServiceNow, prioritizing tasks and coordinating with cross-functional teams to maintain 24/7 uptime.
• Logistics & Vendor Relations: Coordinate procurement and vendor relations to ensure timely hardware orders, addressing shipment discrepancies and maintaining accurate documentation.
• Process Optimization: Identify and implement process improvements to enhance operational efficiency, scalability, and "time-to-compute" metrics.