DCS Daily - Compute

All the checks should be completed on the day !!

vSphere & UCS

    You can get a consolidated view of all the vCenter alarms by clicking on the "Alarms" tab at the bottom left hand corner of vSphere Client
    This info is available under the "Alarm Actions State Report (vSphere and UCS)" Tab of the daily health report @ http://dcsaklmgmt1.dcs.local

    This info is available under the "Clusters with HA/DRS Turned Off" Tab of the daily health report @ http://dcsaklmgmt1.dcs.local

    AK Cisco DPAY 1, WLG Cisco DPAY 1 and HLZ Cisco BYOL 4 should be there as Partially Automated for DRS.
    No other Clusters should be listed

    This info is available under the "Datastores with Storage I/O Control Disabled" Tab of the daily health report @ http://dcsaklmgmt1.dcs.local
    This info is available under the "VMs With One or More Snapshots" & "VMs that need Consolidation" Tab of the daily health report @ http://dcsaklmgmt1.dcs.local
    You would have received an email with Subject "dvSwitch Backups Completed", check the log file in that email.

    This info is available under the "Zombie Files" Tab of the daily health report @ http://dcsaklmgmt1.dcs.local

    Make sure the file / folder is safe to delete and Remove the files.

    Check the recommendations and apply them as required:

    1) Before applying balance datastore space usage alerts please check the SDRS cluster view, if all Datastores are 80% or less utilized then we can simply clear this alert. 

    2) Some (not all) affinity rules are purposely violated to avoid large disks bursting too hight within the same Datastore etc.

    3) Before applying balance datastore I/O workload, please check that it's not suggesting to separate VMDK's that currently reside within the same datastore for burst performance. Other than this, all I/O workload balance rules should be applied (as long as they are not going to fill up the datastore over 80%).

    1. Please put the appropriate VM's into their related groups.

Portal

    Log into Portal, go into an ORG that you have admin access to and run the Inventory Differences Report. Check for any VMs that are in vSphere but not in Portal and vice versa. Ignore any network/portgroup mismatch entries.

VEEAM

    Talk to Elsa or Sam D initially to understand how this works

SPECTRUM

    Please login to Spectrum(see keepass for details) and check the "Datacom Cloud Services - DCS All Devices" for any alarms.

    Please action or log an incident in R12 and clear the alerts.

    Note: Recommend using the java console (start console) rather than the web client.

    Please also help to check for any brown(maintenance mode = yes) devices under each of the DC's. - you can take a device out of maintenance mode by clicking on the device then under the information tab, change "In Maintenance" to "No".

ZERTO

BACKUPS

Daily Diff Check: Check for any Failed Backups - Watch out for Time Since Last Backup

    This info is available under the "Daily Backup Report - VMs That Failed Last Backup" Tab of the daily health report @ http://dcsaklmgmt1.dcs.local - Inform the On- Call person about any re-runs needed.

Investigate any Backups that failed more than once or have a pattern

Check Tape Libraries and any Tape Requirements

Check Replication Status

    This check needs to be performed from two locations:

    1) https://DCSHLZDDMC1.dcs.local - check replication status

    2) sign into a master/media server in each location to check for any failed replications in NBU.


Check DataDomain Capacity and Cleaning Schedule and investigate any Alerts

    Log into https://DCSHLZDDMC1.dcs.local and check status of all the DDs:


    WLG-EMCDD6300-1

    WLG-EMCDD2500-1 (soon to be replaced by WLG-EMCDD3300-1)

    HLZ-EMCDD9300-1

    HLZ-EMCDD6300-1

    CHC-EMCDD7200-1 - (pending decom)

    CHC-EMCDD6800-1

    AKL-EMCDD9800-1

    AKL-EMCDD2500-2 (soon to be replaced by AKL-EMCDD6300-2)

    Please notify your team leader if any DD is exceeding 60% capacity.

Check all Tape Libraries have Cleaning Tapes and 10+ cleans remaining (if not please order more)

    Please make sure EVERY Tape Library has a cleaning tape and check it has more than 10x "Cleanings Remaining" in NBU.

    If you find we need a new cleaning tape, please email the Datacenter Ops team in the first instance and Cc DCSEngineers. (they should have spares more often than not).


Weekly/Monthly Backups Check: check for any failed backups that have not been re-run successfully

    Please sign into the master/media server in each location and check for any weekly/monthly full backups that have failed and have not been re-run successfully

Use this template in Manifestly

Start a Free 14 Day Trial
Use Slack? Start your trial with one click

Ready to take control of your recurring tasks?

Start Free 14-Day Trial


Use Slack? Sign up with one click

With Slack