Release Management Checklist
Change Planning and RFC
Capture the RFC summary in ServiceNow / Jira Service Management / ConnectWise: what is changing, the business reason, and which application or infrastructure components are touched. Vague RFCs ("upgrade middleware") get bounced at CAB — name the version, the host, and the customer-visible behavior.
Pull the CMDB record and trace upstream / downstream dependencies — load balancers, scheduled jobs, integrations, monitoring agents. Common gotcha: a "single app" upgrade silently breaks the nightly batch job that pulls from its API.
Document the exact rollback procedure: snapshot revert commands, package downgrade syntax, config restore steps, and the named decision point at which the engineer aborts and rolls back. "We can roll back" is not a plan — the runbook is the plan.
Standard = pre-approved, repeatable, low-risk (e.g., routine WSUS patch ring). Normal = requires CAB review. Emergency = bypasses normal CAB cadence with expedited approval. Misclassifying a normal change as standard is the most common audit finding in SOX ITGC reviews.
Check the change calendar for blackout periods (month-end close, retail freeze, fiscal cutover). Pick a window that gives the on-call team daylight to roll back if smoke tests fail — Friday 5pm is the runbook-author's enemy.
Pre-Deployment Validation
Use the staging or pilot OU / VLAN / cluster that mirrors production topology. Test rings that diverge from prod (different OS build, different agents installed) provide false confidence — patch Tuesday horror stories almost always trace to a staging environment that wasn't really staging.
Hit the named user-facing checks: SSO login, primary report runs, scheduled job fires, monitoring agent reports in. The smoke-test list lives in the runbook — if it's not written down, it's not a test, it's vibes.
Confirm Veeam / Datto / Rubrik shows a successful job within the past 24 hours and that the restore point is mountable — "green dashboard" is not the same as "restorable." If the change touches a database, take a fresh backup before the window starts; do not rely on last night's job.
Walk the rollback runbook step by step on the staging system after the change is applied there. The first time you discover the rollback depends on a credential that rotated should not be at 2am on production.
CAB Review and Communication
Most CABs require RFCs in the queue 48-72 hours before the meeting. Late-submitted RFCs get deferred to the next cycle by default, which usually means slipping the deployment window by a week.
Walk the board through scope, blast radius, rollback plan, and test evidence. Be ready for the question "who owns the rollback decision during the window?" — name the engineer and their backup.
Notify via the standing channel — status page, Teams / Slack #announcements, email to affected DLs. Include start time, expected duration, services impacted, and where to file tickets if something is broken after the window closes.
Page the primary, secondary, and the named escalation owner via PagerDuty / Opsgenie before the window opens. For MSP work, confirm the client's after-hours contact is reachable in case decisions need their sign-off mid-window.
Deployment Execution
Take VM-level snapshots in vCenter / Hyper-V / Proxmox immediately before the deployment runbook starts. Note: snapshots are not backups — they expire on day-counters and they balloon storage. Tag the snapshot with the change number and a 7-day expiry.
Follow the runbook exactly — do not improvise during the window. If a step fails or surfaces unexpected behavior, stop and call the change owner before continuing. Off-script execution is the most common cause of post-mortem findings.
Execute the same smoke-test list used on staging. Verify monitoring (PRTG, Datadog, LogicMonitor) shows agents reporting and no new alerts. Customer-facing endpoints should be checked from outside the corporate network.
Follow the rollback runbook attached to the RFC. Restore from snapshot or run the documented downgrade steps, re-run smoke tests, and re-open the change as failed. Notify users on the same channel where the maintenance notice was posted.
Post-Release Closure
Watch SIEM, EDR, and APM dashboards for 48-72 hours post-cutover. Track any new tickets tagged to the change in the PSA / ITSM queue. Spike in helpdesk volume on day 2 is often the first signal of a partial regression.
Walk through what went per plan, what deviated, and which runbook steps need correction. Capture action items with owners and due dates — a PIR with no follow-through is theater.
Push CI updates into ServiceNow / IT Glue / Hudu — version, host, dependencies, owner. Stale CMDB entries are how the next change owner trips on the same dependency you just discovered.
Set final status, attach evidence (smoke test output, monitoring screenshots, communication artifacts), and link the PIR notes. SOX, SOC 2, and HIPAA audits sample closed change records — a half-closed ticket is an audit finding.
Use this template in Manifestly
- Cloud Migration Checklist
- Cloud Security Checklist
- User Access Review Checklist
- Data Recovery Checklist
- Containerization Rollout Checklist
- Database Backup Checklist
- Password Management Checklist
- Backup and Restore Checklist
- Network Upgrade Checklist
- Server Backup Checklist
- Business Continuity Plan Checklist
- Problem Management Checklist
- Server Decommissioning Checklist
- Cloud Monitoring Checklist
- Hardware Inventory Checklist
- IT Regulatory Compliance Review
- Server Maintenance Checklist
- Rollback Plan Checklist
- Customer Support Ticket Workflow
- Software Upgrade Checklist
- Quarterly Compliance Reporting Checklist
- Patch Management Checklist
- Hardware Maintenance Checklist
- Server Security Checklist
- IT Emergency Response Checklist
- Incident Management Checklist
- Disaster Recovery Plan Checklist
- User Role Management Checklist
- Software Installation Checklist
- Compliance Audit Checklist
- Access Control Checklist
- Cloud Cost Management Checklist
- IT Staff Performance Review
- Change Management Checklist
- Firewall Configuration Checklist
- Security Audit Checklist
- Quarterly Network Security Review
- Database Migration Checklist
- Employee Onboarding Checklist
- Capacity Planning Checklist
- IT Budgeting Checklist
- Network Monitoring Checklist
- Cloud Deployment Checklist
- Database Installation Checklist
- IT Service Request Checklist
- Database Security Checklist
- System Monitoring Checklist
- Hardware Troubleshooting Checklist
- IT Strategy Checklist
- Patch Deployment Checklist
- Hardware Upgrade Checklist
- Performance Tuning Checklist
- Application Performance Monitoring Checklist
- Employee Training Checklist
- User Onboarding Checklist
- IT Vendor Management Checklist
- Server Build and Hardening Checklist
- IT Policy Review Checklist
- Help Desk Ticket Handling Checklist
- Infrastructure as Code Checklist
- Hardware Disposal Checklist
- IT Resource Allocation Checklist
- Incident Response Checklist
- Network Troubleshooting Checklist
- User Offboarding Checklist
- Change Management Checklist
- Software Installation Checklist
- Software Update Checklist
- Network Upgrade Checklist
- Server Decommissioning Checklist
- Rollback Plan Checklist
- Software Upgrade Checklist
- Software Installation Checklist
- Change Management Checklist
- Cloud Deployment Checklist
- Database Installation Checklist
- Patch Deployment Checklist
- Hardware Upgrade Checklist
- Performance Tuning Checklist
- Release Checklist
- User Acceptance Testing (UAT) Checklist
- Deployment Plan Checklist
- Release Planning Checklist
- Software Update Checklist
- Post-Deployment Testing Checklist
- Feature Development Checklist
- Regression Testing Checklist
- Deployment Checklist
- Release Notes Checklist
- User Acceptance Testing Checklist
Ready to take control of your recurring tasks?
Start Free 14-Day TrialUse Slack? Sign up with one click
