Description
The Data Center Maintenance Manager oversees critical data center infrastructure's maintenance, operation, and reliability, including HVAC, normal, emergency, UPS electrical supply, and mechanical systems. This role ensures that all equipment and facilities operate at peak efficiency, with minimal downtime and maximum resilience, supporting continuous and secure data center operations. The Maintenance Manager manages the technical teams, including leave planning, performance management, and the disciplinary process on an operational level. Manages preventive and corrective maintenance per PLPM, PLCM, and ADHOC schedules on SAP and HHD to ensure SLA delivery. Enforces compliance with the Occupational and Safety Act, building/electrical, compliance, and the Data Centre industry standards and safety regulations.
Maintenance Planning and Management
o Drive timeous execution of effective maintenance strategies i.e. planned, preventative, corrective and emergency maintenance for:
o Medium voltage systems.
o Low voltage power distribution.
o Emergency power distribution.
o UPS power and distribution.
o EPS (Generators).
o Solar systems.
o Fire and Access systems.
o Gas suppression systems.
o HVAC systems.
o Mechanical systems.
o Infrastructure.
o Environmental control and monitoring systems.
o Data center power distribution up to cabinet PDUs.
Facility Operations Management
o Manage daily operations of data center facilities to ensure the optimal performance of power, cooling, and security systems.
o Monitor data center infrastructure and equipment using Building Management Systems (BMS) and Data Center Infrastructure Management (DCIM) tools.
o Identify and address potential risks proactively.
o Manage risks to maintain the required availability levels.
o Demonstrate and instill effective adherence to processes on infrastructure maintenance.
o Explore and recommend innovative methods, based on best practices, to bring about cost-effective solutions.
Team Leadership and Vendor Management
o Lead and develop a team of technicians, facilitating training and skills development. Monitor and report on team performance and utilization.
o Collaborate with procurement on external vendors and service providers for maintenance and repairs, ensuring compliance with service level agreements (SLAs) and quality standards.
o Ensure timely delivery and appropriateness of parts and spares for effective maintenance execution.
o Manage back-to-back SLA agreements with suppliers and contractors.
o Monitor service providers’ (contractors) performance and effect corrective action on any deviations to the SLA.
o Organize monthly meetings with critical service suppliers and vendors.
o Manage back-to-back SLA agreements with suppliers and contractors.
Compliance and Safety Standards
o Ensure compliance with the Occupational Health and Safety Act (85) and Regulations.
o Ensure adherence to all South African regulatory requirements and safety standards for all systems in a data center environment.
o Conduct regular safety audits to verify compliance with industry best practices and local codes.
o Participate in health and safety protocols, including the proper handling of refrigerants and hazardous materials.
o Manage change control requests, including detailed scope of work documentation.
o Prepare for internal and external audits, issuing all necessary work permits (e.g., hot work, MV permits).
o Maintain comprehensive records of all compliance and health, safety, and environmental (HSE) documents, including monthly reports and updates to the Integrated Management System (IMS).
o Ensure that medium voltage switching procedures are compiled and adhered to by relevant competent person/s.
Power and Environmental System Management
o Oversee the maintenance and operation of critical power systems, including low-voltage, medium voltage, UPS, generators, and power distribution units (PDUs).
o Manage HVAC systems to ensure appropriate temperature, humidity, and airflow within the data center.
o Monitor energy efficiency and implement initiatives to reduce power and cooling costs.
Emergency and Incident Response
o Develop and maintain incident response and disaster recovery protocols for maintenance-related emergencies.
o Coordinate rapid response efforts to infrastructure failures, collaborating with IT teams to restore full functionality with minimal downtime.
o Review incident reports and implement continuous improvement initiatives to mitigate future occurrences.
o Conduct regular emergency drills and engage in disaster recovery planning to minimize operational risks.
Budgeting and Reporting
o Prepare and manage the maintenance budget, optimizing resources for cost-effective operations.
o Track maintenance costs, forecast expenditures, and implement cost-control measures.
o Generate detailed reports on maintenance activities, equipment status, and key performance indicators (KPIs) to inform facility management.
o Explore and recommend innovative methods, based on best practices, to bring about cost-effective solutions.
People Management
o Ensure adequate staff placement i.e. prepare suitable shift-roster and manage/approve applicable leave for subordinate employees.
o Manage technical staff performance and facilitate improvement through regularly monitoring performance and providing required coaching, support, and feedback.
o Manage performance and conflicts within subordinate employees and effect corrective actions, in line with company policies/procedure.,
o Ensure optimum utilization of available resources in various maintenance works.
o Continuously monitor and evaluate maintenance work performed by technical staff to ensure quality, cost optimization and timely execution as per SLA, work instructions and Client’s instructions.
o Identify gaps and deficiencies in services, advise and effect solutions as part of
Continuous Improvement on FM services rendered.
o Facilitate operational meetings and toolbox talks.
o Compile personal development plans, and disciplinary processes.
o Conduct investigations into incidents, including near misses and medical or fatal incidents.
o Perform tool inspections and monitor vehicle usage, conducting vehicle inspections as needed.
o Capture and approve overtime, ensuring effective time management.
o Responsible for training, coaching, mentoring and development of technical staff.
Project involvement
o Assist in the management of technical projects and provide technical support, where applicable
o Witness commissioning of installed equipment.
o Site specific requirements.
Asset management and end-of-life management.
o Ensuring that equipment assets are managed in maintenance management system (SAP).
o Ensuring that equipment life cycle is managed, and client informed about equipment replacement requirements.
o Manage maintenance records and ensure all activities are accurately documented and accessible.
Qualifications and experience required:
B-Tech or Degree in Engineering: Mechanical/Electrical or related formal qualification
Licenses Valid SA Driver’s License
10yrs relevant engineering experience in maintenance engineering, CRM, and Property Management
Engineering maintenance, CRM, and Property Management
IT Training (General MS etc.) MS Word, MS Excel, MS PowerPoint, MS Project & MS Outlook (Intermediate skill level), SAP knowledge
Statutory Requirements OHS Act, ISO 9001 Quality Management and Risk Management Systems
Security clearance Comply with National Key point security and client requirements
Salary: R500 - R680 000pa CTC
Send cv to cindy@toptalentps.co.za