🛡️ Security Incident Response Runbook¶ Runbook

Last Updated: 2026-05-05 | Version: 1.0 Audience: Security operations, on-call SRE, compliance officers, identity administrators Purpose: Detect unauthorized access, investigate audit logs, rotate compromised credentials, triage Purview alerts, and contain security breaches in Microsoft Fabric.

Trigger Conditions¶

Use this runbook when any of the following conditions are observed:

#	Condition	Detection Source
1	Unauthorized user accessed a workspace or Lakehouse	Entra ID sign-in logs; Fabric audit log `GetWorkspace` from unknown principal
2	Anomalous sign-in (impossible travel, unfamiliar location, risky sign-in)	Entra ID Protection → Risky sign-ins
3	Bulk data export or download from OneLake	Fabric audit log `ExportData`, `DownloadReport` in high volume
4	Service principal or Workspace Identity secret exposed	Secret scanning alert (GitHub Advanced Security, Defender for Cloud)
5	Purview sensitivity label violation — highly sensitive data accessed by unauthorized role	Purview alert → Data Loss Prevention policy match
6	RBAC change on production workspace by non-admin	Fabric audit log `AddWorkspaceMember`, `UpdateWorkspaceAccess`
7	SQL Audit Log shows `ALTER`, `DROP`, or `GRANT` from unexpected principal	SQL audit logs in Lakehouse
8	Data exfiltration pattern — large query results sent to external email or storage	Microsoft Defender for Cloud Apps alert

Severity Classification¶

Severity	Condition	Example	Response SLA
SEV1	Confirmed data breach; PII / PHI / PCI data accessed by unauthorized party; credential compromise confirmed	SSN data exported from Gold Lakehouse by unknown service principal; admin credential found in public GitHub repo	5 min page
SEV2	Suspicious access detected; no confirmed exfiltration yet; RBAC change on prod workspace	Impossible travel sign-in to Fabric admin portal; new member added to prod workspace by non-admin	15 min page
SEV3	Purview DLP policy informational match; minor RBAC anomaly in non-prod	Sensitivity label mismatch on dev dataset; service principal granted Viewer on staging workspace	2 hr ack
SEV4	Failed authentication attempts; informational alerts	Multiple failed sign-in attempts from known IP range; password spray with no success	24 hr ack

Decision Flowchart¶

flowchart TD
    A([Security Alert Received]) --> B{Alert source?}
    B -->|"Entra ID<br/>risky sign-in"| C{Sign-in<br/>succeeded?}
    C -->|No| D[SEV4 — Log failed attempts,<br/>monitor for escalation]
    C -->|Yes| E{User account<br/>or service principal?}
    E -->|User| F[Revoke sessions<br/>→ Step 5]
    E -->|Service principal| G[Rotate secret<br/>→ Step 7]

    B -->|"Fabric audit log<br/>bulk export"| H{Volume > baseline<br/>or unknown principal?}
    H -->|No| I[SEV3 — Review and close]
    H -->|Yes| J[SEV1 — Contain immediately<br/>→ Step 5]

    B -->|Purview DLP alert| K{Sensitivity level?}
    K -->|"Highly Confidential<br/>or PII/PHI"| L[SEV1/SEV2 — Investigate<br/>→ Step 9]
    K -->|General or Internal| I

    B -->|RBAC change| M{Production<br/>workspace?}
    M -->|Yes| N[SEV2 — Revert change,<br/>investigate → Step 8]
    M -->|No| I

    B -->|Secret exposure| O[SEV1 — Rotate immediately<br/>→ Step 7]

Step-by-Step Procedure¶

Phase 1 — Detect and Contain (0–15 min)¶

Step 1. Receive the security alert and identify the alert source (Entra ID, Fabric audit log, Purview, Defender for Cloud, secret scanning).

Step 2. Classify severity using the table above. For SEV1 or SEV2, immediately open a secure incident channel (do not discuss details in general channels).

Step 3. Determine the compromised principal (user account, service principal, or Workspace Identity).

Step 4. Assess the blast radius: - Which workspaces did the principal have access to? - What data sensitivity levels are in those workspaces? - Was any data exported or modified?

Step 5. Contain the threat — revoke access immediately:

For user accounts:

# Revoke all active sessions
Revoke-MgUserSignInSession -UserId "<user-object-id>"

# Disable the account (if confirmed compromise)
Update-MgUser -UserId "<user-object-id>" -AccountEnabled:$false

For service principals:

# Remove service principal credentials
Remove-MgServicePrincipalPassword -ServicePrincipalId "<sp-id>" -KeyId "<key-id>"

For Fabric workspace access:

# Remove user from workspace via Fabric REST API
Invoke-RestMethod -Method Delete `
  -Uri "https://api.fabric.microsoft.com/v1/workspaces/$workspaceId/roleAssignments/$principalId" `
  -Headers @{ Authorization = "Bearer $token" }

Phase 2 — Investigate (15–60 min)¶

Step 6. Query the Fabric unified audit log to determine what the compromised principal accessed:

# Search Office 365 audit log for Fabric activities
Search-UnifiedAuditLog -StartDate (Get-Date).AddDays(-7) -EndDate (Get-Date) `
  -UserIds "<compromised-principal>" `
  -RecordType PowerBIAudit `
  -ResultSize 5000 | Export-Csv -Path "audit_export.csv"

Step 7. If a service principal secret or Workspace Identity was exposed, rotate credentials:

Generate a new secret in Entra ID → App registrations → Certificates & secrets.
Update the secret in all Fabric connections and pipelines that reference it.
Delete the old secret.
Verify all dependent workloads still function.

# Generate new secret
$newSecret = Add-MgApplicationPassword -ApplicationId "<app-id>" `
  -PasswordCredential @{ displayName = "rotated-$(Get-Date -Format yyyyMMdd)" }

# Remove old secret
Remove-MgApplicationPassword -ApplicationId "<app-id>" -KeyId "<old-key-id>"

Step 8. If an unauthorized RBAC change was detected, revert it:

Query the audit log for the original RBAC state.
Remove the unauthorized role assignment.
Investigate who made the change and whether their account is compromised.

Step 9. For Purview DLP alert triage:

Open the alert in the Microsoft Purview compliance portal.
Review the matched policy and the data that triggered the alert.
Determine whether the access was legitimate (false positive) or unauthorized.
If unauthorized, identify all copies of the sensitive data and revoke access.
If a false positive, tune the DLP policy to reduce noise.

Phase 3 — Remediate (1–4 hr)¶

Step 10. Based on the investigation, implement permanent fixes:

Compromised user: Force password reset, require MFA re-registration, review Conditional Access policies.
Compromised service principal: Rotate to certificate-based authentication, restrict IP ranges.
RBAC gap: Implement Privileged Identity Management (PIM) for workspace admin roles.
DLP gap: Tighten sensitivity labels, enable auto-labeling, restrict external sharing.

Step 11. Verify containment by confirming the compromised principal can no longer authenticate:

# Verify user is disabled and sessions revoked
Get-MgUser -UserId "<user-object-id>" | Select-Object AccountEnabled, SignInSessionsValidFromDateTime

Phase 4 — Close¶

Step 12. Notify affected stakeholders per the communication plan (see Escalation Path).

Step 13. Preserve forensic evidence: - Export audit logs for the incident timeframe. - Screenshot Purview alert details. - Document the timeline of compromise and containment.

Step 14. Complete the Post-Incident Review Checklist and schedule a blameless postmortem within 48 hours.

Audit Log Investigation¶

Key Fabric Audit Events¶

Event	Description	Investigation Value
`GetWorkspace`	User opened a workspace	Access scope
`ExportReport`	Report exported to file	Data exfiltration risk
`GetDatasource`	Connection string accessed	Credential exposure risk
`UpdateWorkspaceAccess`	RBAC changed	Unauthorized privilege escalation
`CreateDataflow`	New dataflow created	Potential data movement
`ExecuteQuery`	SQL/KQL query executed	Data access scope
`DeleteWorkspace`	Workspace deleted	Destructive action

Useful KQL Queries¶

// All actions by a specific user in the last 24 hours
FabricAuditLogs
| where Timestamp > ago(24h)
| where UserId == "<principal>"
| summarize count() by Operation
| order by count_ desc

// Bulk data exports in the last 7 days
FabricAuditLogs
| where Timestamp > ago(7d)
| where Operation in ("ExportReport", "ExportData", "DownloadReport")
| summarize ExportCount = count() by UserId, bin(Timestamp, 1h)
| where ExportCount > 10
| order by ExportCount desc

// RBAC changes on production workspaces
FabricAuditLogs
| where Timestamp > ago(30d)
| where Operation in ("AddWorkspaceMember", "UpdateWorkspaceAccess", "RemoveWorkspaceMember")
| where WorkspaceName contains "prod"
| project Timestamp, UserId, Operation, WorkspaceName, TargetPrincipal

Credential Rotation Procedures¶

Credential Type	Rotation Steps	Downstream Impact
User password	Entra ID → Users → Reset password → Require change at next sign-in	User must re-authenticate everywhere
Service principal secret	Entra ID → App registrations → New secret → Update connections → Delete old	All pipelines using old secret will fail until updated
Workspace Identity	Regenerate managed identity credential → Update RBAC assignments	Connections using WI may need re-authorization
On-premises gateway key	Gateway app → Regenerate key → Re-register gateway	Gateway goes offline during re-registration
Connection string / API key	Rotate at source → Update in Fabric connection manager	Dependent refreshes fail until updated

Purview Alert Triage¶

flowchart TD
    PA([Purview Alert Received]) --> PB{Alert type?}
    PB -->|DLP policy match| PC{Data sensitivity?}
    PC -->|Highly Confidential / PII / PHI| PD[SEV1/SEV2 — Investigate access<br/>and revoke if unauthorized]
    PC -->|General / Internal| PE[SEV3 — Review, tune policy<br/>if false positive]

    PB -->|"Sensitivity label<br/>mismatch"| PF{Production<br/>dataset?}
    PF -->|Yes| PG[Apply correct label,<br/>review who removed it]
    PF -->|No| PH[Apply correct label,<br/>log for awareness]

    PB -->|"Auto-labeling<br/>conflict"| PI[Review conflicting rules,<br/>resolve priority]

Escalation Path¶

Time Elapsed	Action	Contact
0 min	Security on-call begins triage; contain compromised principal	Security on-call (PagerDuty)
5 min	If SEV1 confirmed, page CISO and VP Engineering	CISO + VP Engineering
15 min	If PII/PHI breach confirmed, notify Legal and Compliance	Legal Counsel + Compliance Officer
30 min	If data exfiltration confirmed, engage Microsoft DART (Detection and Response Team)	Microsoft Unified Support (Sev A)
1 hr	If regulatory notification required (HIPAA, GDPR, state breach laws), begin notification process	Legal + Privacy Officer
4 hr	Executive briefing if breach is customer-impacting	CTO / CEO

Regulatory notification deadlines: HIPAA requires notification within 60 days. GDPR requires notification within 72 hours. State breach notification laws vary. Engage Legal immediately for SEV1 breaches.

Post-Incident Review Checklist¶

Document	Description
Identity & RBAC Patterns	RBAC design and Entra ID integration
OneLake Security	OneLake access control and encryption
Network Security	Private endpoints and network isolation
SQL Audit Logs Compliance	SQL-level audit logging
Monitoring & Observability	Alert setup and dashboards
Auth Failure Playbook	Authentication failure diagnosis
Incident Response Template	Master incident response structure

⬆️ Back to Top | 📋 Runbook Index | 🏠 Home

← PreviousData Quality IncidentRead more →Next →Disaster Recovery ExecutionRead more →