Certified Data Centre Facilities Operations Manager
Overview:
Introduction:
This course is designed to expose participants to in-depth knowledge in managing data center operation which includes the following key subject matters such as; capacity planning, latest green initiatives, how to properly commission and de-commission equipment, compliance to safety standards, statutory compliance, and international standards, managing people.
Additionally vendor management, handling incident/crisis management as well the how to keep operations really simple, manageable, effective, and efficient will receive full attention in this course.
Course Objectives:
At the end of this course the participants will be able to:
- Setup a data center facility operations team.
- Manage and motivate your facilities management team.
- Setup SLA’s and manage them including liabilities, KPI’s, etc..
- Manage vendors and measure their performance.
- Manage physical security taking into account requirements of standards such as ANSI/TIA-942 etc..
- Manage safety & statutory requirements.
- Effectively and efficiently manage data center operations.
- Manage documents.
- Setup equipment life cycle including testing.
- Define data center design limits and set-up and manage a proper capacity management plan.
- Manage day-to-day data center operations.
Targeted Audience:
- The primary audience for this course is an IT, Facilities or Data Centre Operations professional working in and around the data center (representing both end-customers and/or service provider/facilitators) and having a responsibility to achieve and improve hi-availability and manageability of the Data Centre, such as Datacenter managers, Operations / Floor / Facility managers, data center engineers, network/system engineers/data center sales/consultants.
Course Outlines:
Unit 1: The Data Centre Operations Team:
- Leadership criteria and attributes.
- How to set-up up an efficient and effective facility management operations team structure.
- Defining roles, responsibilities, and skill metrics.
- Key Performance Objectives (KPO) and appraisals.
- Job rotation, reward, promotion, and succession planning as strategies to grow and retain talent.
- Training and assessments and Shift management, scheduling, and roster planning.
Unit 2: Vendor Management:
- Vendor selection and qualification.
- Managing risk and dealing with non-compliance, public liability, legal, escalation, and complaint procedures.
- Key considerations of a vendor agreement for services.
- Performance measurement and reporting.
Unit 3: Facilities Maintenance:
- Maintenance options.
- Main considerations for maintenance agreements.
- The practicality in deciding between comprehensive/non-comprehensive maintenance regimes.
- Warranty pitfalls.
- Service reports alignments with maintenance agreements.
- Tiered maintenance considerations.
- Preventive, Predictive, Condition, and Reliability Centred (RCM) based maintenance.
- Managing on-site/on-sites spares and how to determine which spares to keep on-site.
Unit 4: Managing Safety & Statutory Requirements:
- Statutory and industry compliance/regulations.
- Emergency response and safety policies and procedures.
- PTW (Permit To Work) requirements and procedures.
- General rules and regulations for the data center.
- Ergonomic workspace.
- SOP’s for a power outage, fire, bomb threat, etc.
Unit 5: Service Level Agreement (SLA) Management:
- Defining data center design limitations.
- Defining measurement criteria and reporting.
- Alignment of business SLA with vendor SLA.
- Defining change management procedure for installation and de-installation of new equipment.
- Reporting and escalation management.
Unit 6: Managing Physical Security:
- Guidelines from standards; ANSI/TIA-942, ISO/IEC- 27001/02, SS507.
- SOP (Standard Operating Procedures) in managing day to day security access control, such as:
- Entry/exit control and access management.
- Permit-To-Work (PTW) and contractor work in progress.
- Delivery of goods.
- Customer access.
- Etc. - Effective patrols routing and how to ensure 24x7 vigilance.
- Handling external threats; crisis/emergency situations.
- Security incident management.
Unit 7: Managing Daily Data Centre Operations/Floor Management:
- ITSM/ITIL (IT service management) in the data center.
- Shift hand-over requirements and procedures.
- Asset and inventory management for hardware, software, spares, consumables, etc..
- Floor management procedures and duties such as rack space allocations, management of installers.
- Pre-installation analysis for power, cooling, weight, EMF, fire protection, and other in_uencing factors.
- From truck to rack.
- Handling of incoming equipment.
- Inspection, unpacking, and security procedures.
- Staging procedure and requirements.
- Equipment movement into the computer room.
- Finishing up the installation.
- De-installation/commissioning procedures.
Unit 8: Capacity Management:
- Defining the design limits of the data center.
- Setting up thresholds, monitoring, and reporting.
- Business review and future capacity planning.
- Technical solutions aiding capacity planning such as Computational Fluid Dynamics (CFD), capacity and configuration management solutions.
Unit 9: Cable Management:
- Overview of ANSI/TIA-942, ANSI/TIA-606 requirements.
- Cabling specification & labeling based on ANSI/TIA-606.
- In-rack power and network cabling.
- Labeling requirements.
- Cabling/cable tray layout documentation.
Unit 10: Data Centre Cleaning and Pest Control:
- Types of pollution found in data centers such as H2S, air-particulates, etc..
- Common causes of pollution in the data center.
- Standards, policies, and techniques to reduce and clean up dust, pests, and other pollution and disturbances.
Unit 11 Data Centre Monitoring and Automation:
- Datacenter monitoring requirements.
- Threshold setting and reporting requirements.
- Notification and escalation requirements.
- Automated 24hrs helpdesk ticketing systems.
- Incident and customer complaint management & change management.
- Performance measurement and monitoring requirements such as fuel and water consumption, PUE/DCiE, etc..
Unit 12: Managing Documentation/Archives:
- Document management standards.
- Document management process requirements.
- Minimum and desired design documentation set.
- Operational management documents.
Unit 13: Equipment Life-Cycle Management:
- Policies and procedures governing life cycle management.
- Asset management including software and firmware.
- Service situations.
- Review, triggers, and reporting.
- Test life cycle.