Home > Docs > Reference Architectures > Small-Medium Enterprise

🏢 Small-Medium Enterprise Reference Architecture¶

Single Capacity, Medallion Lakehouse, Direct Lake Power BI

Last Updated: 2026-05-05 | Version: 1.0.0

📑 Table of Contents¶

🎯 Architecture Overview
🏗️ Architecture Diagram
📦 Component Table
📏 Capacity Sizing Guidance
🔒 Network Architecture
💰 Cost Estimation Framework
🚀 Deploy This Architecture
⚖️ Tradeoffs and Limitations
📚 References

🎯 Architecture Overview¶

This architecture is designed for small-to-medium teams (5–20 data practitioners) running their first production Fabric deployment. A single Fabric capacity hosts 2–3 workspaces following a Dev/Test/Prod promotion model. Data flows through a medallion lakehouse (Bronze → Silver → Gold) within a single OneLake instance, and Power BI reports connect via Direct Lake for sub-second analytics without data duplication. Governance is handled at the workspace level using Fabric's built-in RBAC, sensitivity labels, and optional Purview integration.

🏗️ Architecture Diagram¶

graph TB
    subgraph Sources["Data Sources"]
        S1[Operational DBs]
        S2[Flat Files / APIs]
        S3[SaaS Applications]
    end

    subgraph FabricCapacity["Fabric Capacity (F64)"]
        subgraph DevWorkspace["Dev Workspace"]
            DEV_LH[Lakehouse<br/>Dev]
            DEV_NB[Notebooks<br/>Dev]
        end

        subgraph ProdWorkspace["Prod Workspace"]
            subgraph OneLake["OneLake"]
                BRONZE[Bronze Lakehouse<br/>Raw Ingestion]
                SILVER[Silver Lakehouse<br/>Cleansed & Validated]
                GOLD[Gold Lakehouse<br/>Business KPIs]
            end

            PIPE[Data Pipelines<br/>Orchestration]
            NB[Spark Notebooks<br/>Transformations]
            SQL[SQL Analytics Endpoint<br/>Ad-hoc Queries]
            SEM[Semantic Model<br/>Direct Lake]
        end

        subgraph BIWorkspace["BI Workspace"]
            PBI[Power BI Reports]
            DASH[Dashboards]
        end
    end

    subgraph Consumers["Report Consumers"]
        U1[Business Users]
        U2[Executives]
        U3[Analysts]
    end

    S1 --> PIPE
    S2 --> PIPE
    S3 --> PIPE
    PIPE --> NB
    NB --> BRONZE
    BRONZE --> NB
    NB --> SILVER
    SILVER --> NB
    NB --> GOLD
    GOLD --> SQL
    GOLD --> SEM
    SEM --> PBI
    SEM --> DASH
    PBI --> U1
    PBI --> U2
    DASH --> U3
    DEV_LH -.->|Deployment<br/>Pipeline| BRONZE

📦 Component Table¶

Component	Fabric Item	Purpose	Sizing Notes
Bronze Lakehouse	Lakehouse	Raw data ingestion, append-only Delta tables	Size grows with source volume; partition by date
Silver Lakehouse	Lakehouse	Cleansed, deduplicated, schema-enforced data	~60–80% of Bronze size after filtering
Gold Lakehouse	Lakehouse	Business aggregations, star schema, KPIs	~10–30% of Silver size; optimized for queries
Data Pipelines	Pipeline	Orchestrate ingestion and transformation jobs	Schedule based on freshness requirements
Spark Notebooks	Notebook	PySpark transformations between medallion layers	Concurrency limited by capacity CUs
SQL Analytics Endpoint	SQL Endpoint	Ad-hoc SQL queries against lakehouse tables	Auto-generated; no separate provisioning
Semantic Model	Semantic Model	Direct Lake model for Power BI consumption	One model per Gold lakehouse
Power BI Reports	Report	Interactive dashboards and paginated reports	Render CUs shared with compute workloads
Dev Lakehouse	Lakehouse	Development and testing environment	Use sample data; smaller scale
Deployment Pipeline	Deployment Pipeline	Promote items from Dev → Prod	Built-in Fabric deployment pipelines

📏 Capacity Sizing Guidance¶

Data Volume	Concurrent Users	Recommended SKU	Monthly Cost (est.)
< 500 GB	5–10	F16	~$1,050
500 GB – 2 TB	10–20	F32	~$2,100
2–5 TB	15–25	F64	~$4,200
5–10 TB	20–30	F128	~$8,400

Sizing considerations:

Spark workloads consume the most CUs — schedule heavy transforms during off-peak hours
Direct Lake is far more CU-efficient than Import or DirectQuery for BI
Pause capacity during non-business hours to reduce cost by 40–60%
Start with F32 and monitor utilization via Workspace Monitoring before scaling

🔒 Network Architecture¶

For small-medium deployments, a simplified network posture balances security with operational simplicity:

Layer	Approach	Notes
Authentication	Microsoft Entra ID (SSO)	All users authenticate via organizational Entra tenant
Authorization	Workspace roles (Admin/Member/Contributor/Viewer)	Map to Entra security groups
Data Access	Item-level permissions + Row-Level Security (RLS)	RLS defined in semantic model
Network	Public endpoint with Entra Conditional Access	No VNet required at this scale
Data at Rest	Microsoft-managed encryption (default)	Customer-managed keys optional
Sensitivity Labels	Microsoft Purview Information Protection labels	Apply to lakehouses and reports

When to add network isolation: If compliance requirements mandate private connectivity (HIPAA, FedRAMP), add private endpoints and a VNet data gateway. See Network Security.

💰 Cost Estimation Framework¶

Cost Component	Estimation Method	Typical Range
Fabric Capacity	SKU × hours running per month	$1,050–$8,400/mo
OneLake Storage	$0.023/GB/month (hot tier)	$12–$115/mo for 0.5–5 TB
Egress	Typically minimal for internal BI	< $50/mo
Purview (optional)	Free tier covers basic governance	$0 (basic)
Power BI Pro Licenses	$10/user/month for report consumers	$50–$200/mo
Total Estimate		$1,200–$9,000/mo

Cost optimization strategies:

Pause/Resume — Pause capacity during nights and weekends (saves 40–60%)
Reservations — 1-year commitment saves ~40% on capacity
Smoothing — Fabric's 24-hour CU smoothing lets you burst without immediate throttling
Direct Lake — Eliminates Import refresh CU cost entirely

🚀 Deploy This Architecture¶

Infrastructure as Code¶

Resource	Bicep Module	Description
Fabric Capacity	`infra/modules/fabric/fabric-capacity.bicep`	Deploy F16–F128 capacity
Storage Account	`infra/modules/storage/storage-account.bicep`	Landing zone for file ingestion
Log Analytics	`infra/modules/monitoring/log-analytics-workspace.bicep`	Monitoring and diagnostics
Alerts & Budgets	`infra/modules/monitoring/alerts-and-budgets.bicep`	Cost alerts and CU budget

Step-by-Step Tutorials¶

Step	Tutorial	What You'll Build
1	Environment Setup	Provision capacity, create workspaces
2	Bronze Layer	Ingest raw data into Bronze lakehouse
3	Silver Layer	Cleanse and validate data
4	Gold Layer	Build business KPIs and star schema
5	Direct Lake Power BI	Connect Power BI via Direct Lake
6	Data Pipelines	Orchestrate end-to-end data flow
7	Governance	Set up Purview and sensitivity labels
8	CI/CD	Implement deployment pipelines

Deployment Commands¶

# Deploy the capacity and supporting resources
az deployment sub create --location eastus2 \
  --template-file infra/main.bicep \
  --parameters infra/environments/dev/dev.bicepparam

# Validate before deploying
az deployment sub what-if --location eastus2 \
  --template-file infra/main.bicep \
  --parameters infra/environments/dev/dev.bicepparam

⚖️ Tradeoffs and Limitations¶

Tradeoff	Impact	Mitigation
Single capacity	All workloads share CUs; heavy Spark jobs can impact BI query performance	Schedule transforms off-peak; monitor CU utilization
No network isolation	Data traverses public endpoints (encrypted in transit)	Add private endpoints if compliance requires it
Limited blast radius	A misconfigured pipeline can affect the entire capacity	Use Dev workspace for testing; deploy via pipelines
Governance ceiling	Workspace-level RBAC may be insufficient for complex data domains	Upgrade to Large Enterprise Multi-Domain architecture
Single region	No built-in cross-region failover	Acceptable for non-critical workloads; see BCDR for DR options
Team scaling	Beyond ~25 users, contention for capacity and workspace sprawl become issues	Plan migration path to multi-capacity architecture

📚 References¶

Resource	Link
Microsoft Fabric Documentation	learn.microsoft.com/fabric
Capacity Planning Guide	Capacity Planning & Cost Optimization
Direct Lake Documentation	Direct Lake Feature Doc
Medallion Architecture	Medallion Deep Dive
Identity & RBAC	RBAC Patterns
Workspace Monitoring	Monitoring Feature Doc

Cost Component	Estimation Method	Typical Range
Fabric Capacity	SKU × hours running per month	\(1,050–\)8,400/mo
OneLake Storage	$0.023/GB/month (hot tier)	\(12–\)115/mo for 0.5–5 TB
Egress	Typically minimal for internal BI	< $50/mo
Purview (optional)	Free tier covers basic governance	$0 (basic)
Power BI Pro Licenses	$10/user/month for report consumers	\(50–\)200/mo
Total Estimate		\(1,200–\)9,000/mo