Server Security Checklist
Quarterly server security review run by the IT operations or MSP engineering team. Covers identity hygiene, network controls, patching, backup verification, and SIEM coverage on a roughly two-week cadence.
User and Access Management
-
Reconcile accounts against the HR roster
Pull the active user list from Entra ID (or AD) and reconcile against the current HR roster. Flag any account whose owner left more than 30 days ago — orphaned accounts with cached Kerberos tickets are the offboarding gap auditors look for first.
-
Audit privileged group membership
Review Domain Admins, Enterprise Admins, Schema Admins, and any custom privileged groups. The bar is named humans only — no service accounts, no "temporary" elevations from six months ago. Membership changes since last review go in the audit log.
-
Verify MFA enforcement on admin accounts
Run the Entra ID conditional access report (or Okta admin export) and confirm every privileged account has MFA enforced via phishing-resistant factor (FIDO2 / WHfB), not SMS. Confirm legacy basic-auth endpoints (IMAP, POP, SMTP AUTH) are blocked org-wide so attackers can't sidestep MFA entirely.
-
Disable accounts inactive 30+ days
-
Rotate service account credentials
Pull the service account inventory and rotate any credential older than the policy window (typically 90 days). Coordinate with app owners — rotating a service account that's hardcoded in a forgotten scheduled task is the #1 way to cause a self-inflicted Monday outage.
Network Security
-
Review firewall rules for unused entries
Export rules from FortiGate / Palo Alto / Meraki and check the hit-counter or last-used timestamp. Rules with zero hits in 90 days are candidates for removal. Document the business justification for any any-any or wide-open inbound rule before next quarter's review.
-
Verify VLAN segmentation boundaries
From a host on the user VLAN, attempt to reach the server, IoT, and (if applicable) PCI VLANs on prohibited ports. Flat networks expand PCI and HIPAA scope; the segmentation test is what proves the boundary is real, not just configured.
-
Patch firewall and switch firmware
-
Audit VPN access list against active employees
-
Confirm legacy authentication is blocked
Verify conditional access policies block legacy auth protocols (IMAP, POP, MAPI, EWS basic, SMTP AUTH). Run the M365 sign-in log filter for legacy clients over the last 30 days; any non-zero count is a finding worth investigating before close.
System and Software Updates
-
Review pending CVEs in the vuln scanner
Pull the latest Tenable / Qualys / Rapid7 scan and triage anything CVSS 7.0+ with known exploitation. Critical and KEV-listed CVEs go on the emergency change track; the rest land in the next standard maintenance window.
-
Deploy patches to the test ring
Push the patch bundle from WSUS / Intune / Automox to the test ring (typically 5-10% of endpoints, including one of every server role). Wait 48 hours for telemetry before promoting. The Friday-afternoon-direct-to-prod approach is how 800 users can't log in Monday morning.
-
Confirm test ring stability after 48 hoursCollects list
-
Pause rollout and file a patch exception
Halt promotion to production. Roll back the affected patches in the test ring, open a vendor case (KB number, repro steps, event log excerpts), and document the exception in the change record. Schedule a re-test once the vendor publishes a fix.
-
Promote patches to production
-
Document deviations and rollback plan
Backup and Recovery
-
Verify nightly backup job success
Pull the last 30 days of Veeam / Datto / Rubrik job reports. Investigate any consecutive failures, not just last-night green. A green dashboard with a silently-failing offsite replication leg is the classic ransomware-day surprise.
Collects file -
Confirm the immutable offsite copy is current
Verify the 3-2-1 picture: object-locked S3 bucket / immutable Datto cloud / write-once tape — whichever you run. The immutable copy is the only one that survives a backup-aware ransomware attack that pivots from the production network into the backup repository.
-
Run a restore drill to an isolated VM
Restore one production server's most recent backup into an isolated test network. Boot it, confirm services start, and time the recovery against your stated RTO. A backup that hasn't been restored is a hypothesis, not a backup.
Collects list -
Open an incident for restore failure
File a P2 incident in the PSA / ITSM. Capture the failure mode (corrupt media, missing credential, key-management gap, vendor format change) and assign an owner. Re-run the drill within 14 days; until then the documented RTO is unverified.
-
Update the DR runbook with measured RPO/RTO
Logging, Monitoring, and Sign-Off
-
Confirm SIEM ingestion from critical hosts
Run the Sentinel / Splunk / QRadar source-coverage report. Every domain controller, file server, EDR console, firewall, and identity provider should be reporting within the last 24 hours. A silent source is usually a forwarder that died after the last patch.
-
Tune alert thresholds to reduce noise
Review the top 10 noisiest alert rules from the last 30 days. Suppress, retune, or retire — alert fatigue is what causes the on-call to mute the channel and miss the real one. Keep a tuning log so changes are auditable.
-
Review IOC hits from the last quarter
-
Verify log retention meets policy
SOC 2 typically expects 12 months of security-relevant logs available for review; PCI DSS requires 1 year with 90 days immediately available. Confirm the SIEM hot/warm/cold tiers match what your auditor was told last cycle.
-
Sign off on the quarterly security reviewCollects list Collects paragraph Collects signature
Use this template
Copy it to your account, customize the steps, and run it with your team in minutes.
Browse hundreds of free templates across every team and industry.
Back to template libraryRun Server Security Checklist with your team
Customize the steps, assign roles, set a schedule, and keep a complete record for every run.