Home > Docs > Production Checklist

CSA-in-a-Box: Production Checklist¶

Note

Quick Summary: Comprehensive pre-production readiness checklist covering environment promotion, infrastructure hardening, security, data platform validation, observability, disaster recovery, governance, operational readiness, cost management, and compliance — with sign-off table for stakeholder approval.

Use this checklist when promoting CSA-in-a-Box from development to production. Each section includes actionable items with cross-references to detailed guides where available.

🌍 1. Environment Promotion¶

See also: ENVIRONMENT_PROTECTION.md

Separate Azure subscriptions for dev, staging, and production
Environment-specific parameter files created (params.dev.json, params.prod.json)
GitHub Environment protection rules configured (required reviewers, wait timers)
Branch protection on main — require PR reviews, status checks, signed commits
Deployment pipeline tested end-to-end in staging before production
Rollback procedure validated in staging (see ROLLBACK.md)

🏗️ 2. Infrastructure Hardening¶

🔒 3. Security¶

See also: Secret rotation functions in csa_platform/functions/secretRotation/

📊 4. Data Platform¶

📈 5. Observability¶

🔄 6. Disaster Recovery¶

See also: DR.md, ROLLBACK.md

RTO and RPO targets defined and documented per tier:
- Tier 1 (Gold analytics): RPO 1h, RTO 4h
- Tier 2 (Silver/Bronze): RPO 24h, RTO 8h
- Tier 3 (Raw/Seed): RPO 7d, RTO 24h
Storage: GRS or RA-GRS enabled for production ADLS accounts
Cosmos DB: multi-region writes configured (if streaming is Tier 1)
Key Vault: soft-delete + purge protection enabled
Databricks workspace: cross-region cluster policies configured
Delta Lake: time travel retention set to 30 days minimum
Backup validation: restore procedure tested quarterly
Infrastructure-as-Code: all resources rebuildable from Bicep + parameter files
Deployment tags created on every production deployment (see deploy.yml)

📋 7. Governance¶

Purview catalog bootstrapped (collections, glossary, scan sources)
- Run: python scripts/purview/bootstrap_catalog.py
Business glossary terms defined for all data products
Data classification rules applied (PII, Financial, Confidential)
ADF → Purview lineage integration enabled (purviewId in ADF resource)
Data product contracts reviewed and approved by domain owners
Cross-domain data sharing agreements documented
Data retention policies defined per domain (Bronze: 2yr, Silver: 5yr, Gold: 7yr)
Access request workflow defined (who can request, who approves)

⚙️ 8. Operational Readiness¶

See also: TROUBLESHOOTING.md

💰 9. Cost Management¶

🏛️ 10. Compliance¶

✍️ Sign-Off¶

Role	Name	Date	Signature
Data Engineering Lead
Security / InfoSec
Platform / SRE
Domain Owner (Shared)
Domain Owner (Finance)
Domain Owner (Inventory)

Note

This checklist complements the Quick Start Guide for development and the DR / Rollback guides for incident response.

See also:

← Previous: Developer Pathways
→ Next: AI Copilot
⌂ Index: Documentation home