🏗️ Lakehouse vs Warehouse vs SQL Database — Decision Guide¶
Choose the Right Fabric Analytics Store for Every Workload
Last Updated: 2026-04-21 | Version: 1.0.0
📑 Table of Contents¶
- 🎯 Overview
- 📊 Feature Comparison Matrix
- 🔍 Deep Dive: Each Store
- 🌳 Decision Trees
- ⚡ Performance Characteristics
- 🔐 Security Model Comparison
- 🔄 Hybrid Patterns
- 🎰 Casino Implementation
- 🏛️ Federal Agency Implementation
- 🔀 Migration Paths
- ⚠️ Limitations
- 📚 References
- 🔗 Related Documents
🎯 Overview¶
Microsoft Fabric offers three distinct data stores — Lakehouse, Warehouse, and SQL Database — each optimized for different workload patterns. Choosing the wrong store leads to performance issues, governance gaps, or unnecessary complexity. This guide provides a structured decision framework.
The Three Stores at a Glance¶
flowchart TB
subgraph Lakehouse["🏠 Lakehouse"]
L1["Delta Lake / Parquet on OneLake"]
L2["Spark + SQL analytics endpoint"]
L3["Best for: Data engineering,<br/>medallion architecture, ML"]
end
subgraph Warehouse["🏢 Warehouse"]
W1["Managed columnar storage"]
W2["Full T-SQL DML/DDL"]
W3["Best for: BI serving,<br/>star schema, ad-hoc SQL"]
end
subgraph SQLDB["🗄️ SQL Database"]
S1["Transactional rowstore + columnstore"]
S2["Full SQL Server engine"]
S3["Best for: Operational apps,<br/>OLTP, microservices"]
end
OneLake["📦 OneLake<br/>Unified Storage"] --- Lakehouse
OneLake --- Warehouse
OneLake --- SQLDB
style Lakehouse fill:#27AE60,stroke:#1E8449,color:#fff
style Warehouse fill:#2471A3,stroke:#1A5276,color:#fff
style SQLDB fill:#8E44AD,stroke:#6C3483,color:#fff
style OneLake fill:#F39C12,stroke:#E67E22,color:#fff
📊 Feature Comparison Matrix¶
Core Capabilities¶
| Feature | Lakehouse | Warehouse | SQL Database |
|---|---|---|---|
| Storage Format | Delta Lake (Parquet) | Managed columnstore | Rowstore + optional columnstore |
| Write Support | Spark, Pipelines, Dataflows | T-SQL INSERT/UPDATE/DELETE | Full T-SQL DML |
| Read Interface | Spark + SQL analytics endpoint | T-SQL | T-SQL |
| Schema Enforcement | Schema-on-read + optional enforcement | Schema-on-write | Schema-on-write |
| Transactions (ACID) | Delta ACID (per table) | T-SQL transactions (multi-table) | Full SQL Server transactions |
| Concurrency | High read, moderate write | High read, moderate write | High read + write |
| Direct Lake Support | ✅ Native | ✅ Native | ❌ Import/DQ only |
| Stored Procedures | ❌ | ✅ | ✅ |
| Triggers | ❌ | ❌ | ✅ |
| Foreign Keys | ❌ | ✅ (informational) | ✅ (enforced) |
| Indexes | ❌ (file-level pruning) | Auto columnstore | Full index support |
| Change Data Capture | Delta Change Data Feed | ❌ | ✅ Native CDC |
| Mirroring Target | ✅ | ❌ | ✅ |
| Cross-DB Queries | ✅ (read-only) | ✅ (read/write) | ✅ (read/write) |
Query Language Surface¶
| T-SQL Feature | Lakehouse SQL Endpoint | Warehouse | SQL Database |
|---|---|---|---|
| SELECT | ✅ | ✅ | ✅ |
| INSERT/UPDATE/DELETE | ❌ | ✅ | ✅ |
| CREATE TABLE | ❌ | ✅ | ✅ |
| CREATE VIEW | ❌ | ✅ | ✅ |
| CREATE PROCEDURE | ❌ | ✅ | ✅ |
| CREATE FUNCTION | ❌ | ✅ | ✅ |
| Temp Tables | ✅ (limited) | ✅ | ✅ |
| CTEs | ✅ | ✅ | ✅ |
| Window Functions | ✅ | ✅ | ✅ |
| JSON Functions | ✅ | ✅ | ✅ |
| MERGE | ❌ | ✅ | ✅ |
| Constraints | ❌ | Informational only | ✅ Enforced |
Data Engineering Features¶
| Feature | Lakehouse | Warehouse | SQL Database |
|---|---|---|---|
| Spark Access | ✅ Native | ❌ | ❌ |
| Python/PySpark | ✅ | ❌ | ❌ |
| Delta Time Travel | ✅ | ❌ | ❌ |
| OPTIMIZE/VACUUM | ✅ | Auto-managed | Auto-managed |
| V-Order | ✅ | N/A | N/A |
| Shortcut Support | ✅ | ❌ | ❌ |
| Schema Evolution | ✅ (merge schema) | ALTER TABLE | ALTER TABLE |
| File Format Access | ✅ (CSV, JSON, Parquet) | ❌ | ❌ |
🔍 Deep Dive: Each Store¶
🏠 Lakehouse¶
When to use: Data engineering, medallion architecture, ML/AI workloads, multi-format ingestion, data science exploration.
Architecture:
flowchart TB
subgraph Lakehouse["🏠 Lakehouse"]
subgraph Tables["Managed Tables (Delta)"]
B["Bronze Tables"]
S["Silver Tables"]
G["Gold Tables"]
end
subgraph Files["Files Section"]
RAW["Raw Files<br/>CSV, JSON, Parquet"]
end
subgraph Endpoints["Access Points"]
SPK["Spark Engine<br/>Read + Write"]
SQL["SQL Analytics Endpoint<br/>Read-Only T-SQL"]
end
end
SPK --> Tables
SPK --> Files
SQL --> Tables
Key characteristics:
- Open Delta Lake format — no vendor lock-in
- Dual access: Spark (read/write) + SQL (read-only)
- Shortcuts bring external data without copying
- V-Order optimizes for both Spark and Direct Lake
- Files section supports arbitrary formats
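Every managed table in a Lakehouse resolves to a predictable OneLake URI that Spark jobs and shortcuts can address directly. A minimal sketch of that convention (workspace, lakehouse, and table names here are illustrative; confirm the exact URI shape against the OneLake documentation):

```python
def onelake_table_path(workspace: str, lakehouse: str, table: str, schema: str = "") -> str:
    """Compose the ABFS URI for a managed Delta table in a Lakehouse.
    Schema-enabled lakehouses add a schema segment under Tables/."""
    base = f"abfss://{workspace}@onelake.dfs.fabric.microsoft.com/{lakehouse}.Lakehouse"
    if schema:
        return f"{base}/Tables/{schema}/{table}"
    return f"{base}/Tables/{table}"

print(onelake_table_path("Sales", "Core", "fact_orders"))
# abfss://Sales@onelake.dfs.fabric.microsoft.com/Core.Lakehouse/Tables/fact_orders
```

Raw files follow the same pattern with `Files/` in place of `Tables/`, which is what makes shortcuts and multi-format ingestion work without copies.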
🏢 Warehouse¶
When to use: BI serving layer, star/snowflake schema, ad-hoc SQL analytics, stored procedures, T-SQL-heavy workloads.
Architecture:
flowchart TB
subgraph Warehouse["🏢 Warehouse"]
subgraph Schema["Schemas"]
DBO["dbo (default)"]
AN["analytics"]
STG["staging"]
end
subgraph Objects["Database Objects"]
TBL["Tables<br/>(auto columnstore)"]
VW["Views"]
SP["Stored Procedures"]
FN["Functions"]
end
subgraph Access["Access Points"]
TSQL["T-SQL Engine<br/>Full DML/DDL"]
DL["Direct Lake<br/>BI Access"]
end
end
TSQL --> Schema --> Objects
DL --> TBL
Key characteristics:
- Familiar SQL Server experience (T-SQL, schemas, procedures)
- Auto-managed columnstore — no index tuning needed
- Full DML support (INSERT, UPDATE, DELETE, MERGE)
- Custom schemas for logical organization
- Workload isolation from Spark jobs
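To make the star-schema serving role concrete, here is the shape of a typical gold-layer query. In-memory SQLite stands in purely for illustration; the dimension and fact names are ours, and the real query would run on the Warehouse T-SQL engine or through a Direct Lake model:

```python
import sqlite3

# SQLite stand-in for the Warehouse gold layer; the query shape
# (dim/fact join + GROUP BY) is what BI tools issue against it.
con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE dim_machine (machine_id INTEGER PRIMARY KEY, zone TEXT);
    CREATE TABLE fact_slot_daily (machine_id INTEGER, coin_in REAL);
    INSERT INTO dim_machine VALUES (1, 'North'), (2, 'South');
    INSERT INTO fact_slot_daily VALUES (1, 500.0), (1, 250.0), (2, 100.0);
""")
rows = con.execute("""
    SELECT d.zone, SUM(f.coin_in) AS total_coin_in
    FROM fact_slot_daily AS f
    JOIN dim_machine AS d ON d.machine_id = f.machine_id
    GROUP BY d.zone
    ORDER BY d.zone
""").fetchall()
print(rows)  # [('North', 750.0), ('South', 100.0)]
```

The Warehouse's auto-managed columnstore is what keeps this join-plus-aggregate pattern fast without manual index tuning.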
🗄️ SQL Database¶
When to use: Operational/transactional workloads, microservice backends, OLTP, applications requiring enforced constraints, CDC to Fabric.
Architecture:
flowchart TB
subgraph SQLDB["🗄️ SQL Database"]
subgraph Engine["SQL Server Engine"]
TXN["Transaction Manager<br/>Full ACID"]
QP["Query Processor"]
IDX["Index Engine<br/>B-tree + Columnstore"]
end
subgraph Objects["Full SQL Server Objects"]
TBL["Tables + Constraints"]
TRG["Triggers"]
SP["Stored Procedures"]
CDC["Change Data Capture"]
end
subgraph Replication["OneLake Integration"]
MIR["Auto-mirroring to<br/>OneLake Delta"]
end
end
TBL --> MIR
CDC --> MIR
Key characteristics:
- Full SQL Server engine with enforced referential integrity
- Triggers, constraints, foreign keys — all enforced
- Native CDC for streaming changes to analytics
- Auto-mirrors to OneLake for Lakehouse/Warehouse consumption
- Best for sub-second transactional workloads
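Enforced integrity is the key differentiator from the other two stores: bad writes are rejected by the engine itself, not caught downstream. A sketch using SQLite as a stand-in engine (table and trigger names are illustrative):

```python
import sqlite3

# SQLite stand-in: the behavior shown (enforced FKs, triggers rejecting
# bad writes) is what SQL Database provides and the Lakehouse SQL
# endpoint / Warehouse do not.
con = sqlite3.connect(":memory:")
con.execute("PRAGMA foreign_keys = ON")
con.executescript("""
    CREATE TABLE player_accounts (player_id INTEGER PRIMARY KEY, balance REAL NOT NULL);
    CREATE TABLE loyalty_events (
        event_id  INTEGER PRIMARY KEY,
        player_id INTEGER NOT NULL REFERENCES player_accounts(player_id),
        points    INTEGER NOT NULL
    );
    CREATE TRIGGER no_negative_points BEFORE INSERT ON loyalty_events
    WHEN NEW.points < 0
    BEGIN SELECT RAISE(ABORT, 'points must be non-negative'); END;
""")
con.execute("INSERT INTO player_accounts VALUES (1, 100.0)")
con.execute("INSERT INTO loyalty_events VALUES (1, 1, 50)")  # accepted

fk_error = trigger_error = None
try:
    con.execute("INSERT INTO loyalty_events VALUES (2, 999, 10)")  # unknown player
except sqlite3.IntegrityError as exc:
    fk_error = str(exc)
try:
    con.execute("INSERT INTO loyalty_events VALUES (3, 1, -5)")  # negative points
except sqlite3.IntegrityError as exc:
    trigger_error = str(exc)

print(fk_error)       # FOREIGN KEY constraint failed
print(trigger_error)  # points must be non-negative
```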
🌳 Decision Trees¶
Primary Decision Tree¶
flowchart TD
Start([What's the primary workload?]) --> Q1{Data engineering<br/>or data science?}
Q1 -->|Yes| LH["🏠 Lakehouse"]
Q1 -->|No| Q2{Operational app<br/>or microservice?}
Q2 -->|Yes| Q3{Need enforced<br/>constraints/triggers?}
Q3 -->|Yes| SQL["🗄️ SQL Database"]
Q3 -->|No| Q4{High write<br/>concurrency?}
Q4 -->|Yes| SQL
Q4 -->|No| WH["🏢 Warehouse"]
Q2 -->|No| Q5{BI serving<br/>with T-SQL?}
Q5 -->|Yes| WH
Q5 -->|No| Q6{Mixed Spark<br/>+ SQL?}
Q6 -->|Yes| LH
Q6 -->|No| WH
style LH fill:#27AE60,stroke:#1E8449,color:#fff
style WH fill:#2471A3,stroke:#1A5276,color:#fff
style SQL fill:#8E44AD,stroke:#6C3483,color:#fff
"Which Store for This Table?" Decision Tree¶
flowchart TD
Start([Where should this table live?]) --> Q1{Does an app<br/>write to it<br/>transactionally?}
Q1 -->|Yes| SQL["🗄️ SQL Database<br/>+ mirror to OneLake"]
Q1 -->|No| Q2{Is it a raw<br/>or staging table?}
Q2 -->|Yes| LH["🏠 Lakehouse<br/>(Bronze/Silver)"]
Q2 -->|No| Q3{Is it a dimension<br/>or fact for BI?}
Q3 -->|Yes| Q4{Need T-SQL<br/>stored procs<br/>for transforms?}
Q4 -->|Yes| WH["🏢 Warehouse<br/>(Gold/Star Schema)"]
Q4 -->|No| Q5{Need Spark<br/>for transforms?}
Q5 -->|Yes| LH2["🏠 Lakehouse<br/>(Gold layer)"]
Q5 -->|No| WH
Q3 -->|No| Q6{ML feature<br/>store?}
Q6 -->|Yes| LH3["🏠 Lakehouse"]
Q6 -->|No| WH
style LH fill:#27AE60,stroke:#1E8449,color:#fff
style LH2 fill:#27AE60,stroke:#1E8449,color:#fff
style LH3 fill:#27AE60,stroke:#1E8449,color:#fff
style WH fill:#2471A3,stroke:#1A5276,color:#fff
style SQL fill:#8E44AD,stroke:#6C3483,color:#fff
⚡ Performance Characteristics¶
Workload Profile Comparison¶
| Workload | Lakehouse | Warehouse | SQL Database |
|---|---|---|---|
| Batch ETL (TB-scale) | ⭐⭐⭐⭐⭐ | ⭐⭐⭐ | ⭐⭐ |
| Ad-hoc SQL queries | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ |
| Point lookups (by PK) | ⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ |
| BI dashboard queries | ⭐⭐⭐⭐ (Direct Lake) | ⭐⭐⭐⭐⭐ (Direct Lake) | ⭐⭐⭐ (Import only) |
| High concurrency writes | ⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ |
| Streaming ingestion | ⭐⭐⭐⭐ | ⭐⭐ | ⭐⭐⭐ |
| ML model training | ⭐⭐⭐⭐⭐ | ⭐ | ⭐ |
| Complex joins (star schema) | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ |
| Full-text search | ⭐⭐ | ⭐⭐ | ⭐⭐⭐⭐⭐ |
Concurrency Benchmarks (F64 SKU)¶
| Metric | Lakehouse SQL Endpoint | Warehouse | SQL Database |
|---|---|---|---|
| Max concurrent queries | ~20 | ~30 | ~100 |
| Max concurrent writers | N/A (Spark only) | ~10 | ~50 |
| Query timeout (default) | 5 min | 30 min | Configurable |
| Max result set | 80 GB | 80 GB | Configurable |
🔐 Security Model Comparison¶
| Security Feature | Lakehouse | Warehouse | SQL Database |
|---|---|---|---|
| Workspace roles | ✅ | ✅ | ✅ |
| Object-level permissions | ✅ (SQL endpoint) | ✅ (GRANT/DENY) | ✅ (GRANT/DENY) |
| Row-Level Security (RLS) | ✅ | ✅ | ✅ |
| Column-Level Security (CLS) | ✅ | ✅ | ✅ |
| Dynamic Data Masking | ❌ | ✅ | ✅ |
| OneLake security (file-level) | ✅ | N/A | N/A |
| Sensitivity labels | ✅ | ✅ | ✅ |
| Managed Identity access | ✅ | ✅ | ✅ |
| Entra-only auth | ✅ | ✅ | ✅ |
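RLS in the Warehouse and SQL Database is defined with `CREATE SECURITY POLICY` and an inline predicate function. The filtering effect can be sketched with a plain filtered view, using SQLite as a stand-in (names are illustrative; the real mechanism is a security policy evaluated per caller, not a view):

```python
import sqlite3

# SQLite stand-in: a filtered view approximates the *effect* of an RLS
# predicate restricting an analyst to one player tier.
con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE dim_player (player_id INTEGER, tier TEXT, lifetime_value REAL);
    INSERT INTO dim_player VALUES (1, 'Gold', 900.0), (2, 'Silver', 200.0), (3, 'Gold', 400.0);
    -- What a Gold-tier-scoped analyst would see once the predicate applies
    CREATE VIEW dim_player_gold AS
        SELECT * FROM dim_player WHERE tier = 'Gold';
""")
visible = con.execute(
    "SELECT player_id FROM dim_player_gold ORDER BY player_id"
).fetchall()
print(visible)  # [(1,), (3,)]
```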
🔄 Hybrid Patterns¶
Pattern 1: Lakehouse for Engineering + Warehouse for BI¶
The most common pattern. Use the Lakehouse for medallion processing and the Warehouse as a curated BI serving layer.
flowchart LR
subgraph Sources["📥 Sources"]
S1["Eventstreams"]
S2["Pipelines"]
S3["Dataflows"]
end
subgraph LH["🏠 Lakehouse"]
B["Bronze"]
SV["Silver"]
G["Gold (Spark)"]
end
subgraph WH["🏢 Warehouse"]
DIM["Dimensions"]
FACT["Facts"]
AGG["Aggregations"]
VW["Views + Procs"]
end
subgraph BI["📊 Power BI"]
DL["Direct Lake<br/>Semantic Model"]
end
Sources --> B --> SV --> G -->|"INSERT INTO"| WH --> DL
style LH fill:#27AE60,stroke:#1E8449,color:#fff
style WH fill:#2471A3,stroke:#1A5276,color:#fff
style BI fill:#F39C12,stroke:#E67E22,color:#fff
Pattern 2: SQL Database for Ops + Lakehouse for Analytics¶
Operational applications write to SQL Database; data mirrors to OneLake for analytics processing.
flowchart LR
subgraph App["📱 Application"]
API["REST API"]
UI["Web UI"]
end
subgraph SQLDB["🗄️ SQL Database"]
OPS["Operational Tables"]
CDC_["CDC Enabled"]
end
subgraph Mirror["🔄 Auto-Mirror"]
DLT["Delta Tables<br/>in OneLake"]
end
subgraph LH["🏠 Lakehouse"]
AN["Analytics Tables"]
end
App --> SQLDB --> Mirror --> LH
style SQLDB fill:#8E44AD,stroke:#6C3483,color:#fff
style LH fill:#27AE60,stroke:#1E8449,color:#fff
Pattern 3: All Three Together¶
Enterprise deployments often use all three stores, each for its optimal workload.
| Layer | Store | Tables | Rationale |
|---|---|---|---|
| Ingestion | Lakehouse (Bronze) | Raw events, files, API responses | Spark ingestion, schema-on-read |
| Processing | Lakehouse (Silver) | Cleansed, validated tables | PySpark transforms, Delta merge |
| BI Serving | Warehouse (Gold) | Star schema, aggregations | T-SQL procs, Direct Lake |
| Operational | SQL Database | Config, user state, app data | OLTP, enforced FK, triggers |
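The per-layer allocation above is the primary decision tree applied layer by layer. As an illustrative encoding of that tree (function and flag names are ours, not a Fabric API):

```python
def pick_store(
    data_engineering: bool = False,
    operational_app: bool = False,
    needs_constraints_or_triggers: bool = False,
    high_write_concurrency: bool = False,
    bi_with_tsql: bool = False,
    mixed_spark_and_sql: bool = False,
) -> str:
    """Walk the primary decision tree from the Decision Trees section,
    top to bottom, returning the recommended store."""
    if data_engineering:
        return "Lakehouse"
    if operational_app:
        if needs_constraints_or_triggers or high_write_concurrency:
            return "SQL Database"
        return "Warehouse"
    if bi_with_tsql:
        return "Warehouse"
    if mixed_spark_and_sql:
        return "Lakehouse"
    return "Warehouse"

print(pick_store(operational_app=True, high_write_concurrency=True))  # SQL Database
```

Note that the tree defaults to the Warehouse when no stronger signal applies, matching the matrix's use of it as the general-purpose serving layer.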
🎰 Casino Implementation¶
Recommended Store Allocation¶
| Data Domain | Store | Rationale |
|---|---|---|
| Slot telemetry (raw) | 🏠 Lakehouse Bronze | High-volume streaming, Spark processing |
| Table game hands (raw) | 🏠 Lakehouse Bronze | Semi-structured JSON, schema evolution |
| Player sessions (cleansed) | 🏠 Lakehouse Silver | PySpark dedup, validation |
| Compliance filings (CTR/SAR/W-2G) | 🏠 Lakehouse Gold | Delta time travel for audit trail |
| Machine dimensions | 🏢 Warehouse | Star schema, joined in BI queries |
| Player dimensions | 🏢 Warehouse | RLS by player tier, Direct Lake |
| Revenue fact (daily) | 🏢 Warehouse | Aggregation table, T-SQL procs |
| Machine configuration | 🗄️ SQL Database | Operational app writes config changes |
| Player account state | 🗄️ SQL Database | Real-time balance, loyalty points |
| Alert rules | 🗄️ SQL Database | Application CRUD, triggers for notifications |
Casino Architecture Summary¶
flowchart TB
subgraph Floor["🎰 Casino Floor"]
SLOT["5,000 Slot Machines"]
TABLE["500 Table Games"]
KIOSK["Self-Service Kiosks"]
end
subgraph LH["🏠 Lakehouse"]
B["Bronze<br/>Raw telemetry"]
S["Silver<br/>Cleansed sessions"]
GC["Gold<br/>Compliance tables"]
end
subgraph WH["🏢 Warehouse"]
DIM["dim_machine<br/>dim_player<br/>dim_date"]
FACT["fact_slot_daily<br/>fact_table_daily"]
AGG["agg_floor_hourly"]
end
subgraph SQL["🗄️ SQL Database"]
CFG["machine_config"]
PLR["player_accounts"]
ALR["alert_rules"]
end
Floor --> B --> S --> GC
S -->|"INSERT INTO"| FACT
SQL -->|"Mirror"| LH
WH --> BI["📊 Direct Lake"]
GC --> BI
style LH fill:#27AE60,stroke:#1E8449,color:#fff
style WH fill:#2471A3,stroke:#1A5276,color:#fff
style SQL fill:#8E44AD,stroke:#6C3483,color:#fff
🏛️ Federal Agency Implementation¶
Per-Agency Recommendations¶
| Agency | Primary Store | Secondary Store | Rationale |
|---|---|---|---|
| USDA | 🏠 Lakehouse | 🏢 Warehouse (BI) | Large batch datasets (crop production, SNAP); Spark processing |
| SBA | 🏢 Warehouse | 🗄️ SQL Database | Loan analytics with T-SQL; operational loan tracking app |
| NOAA | 🏠 Lakehouse | 🏢 Warehouse (BI) | Streaming weather data; time-series in Lakehouse, summaries in WH |
| EPA | 🏠 Lakehouse | 🏢 Warehouse (BI) | Sensor data ingestion via Spark; regulatory reporting via WH |
| DOI | 🏠 Lakehouse | 🗄️ SQL Database | Geospatial data processing; permit management app in SQL DB |
Federal Hybrid Architecture¶
flowchart TB
subgraph Agency["🏛️ Agency Workspace"]
subgraph LH["🏠 Lakehouse"]
AB["Bronze (raw API data)"]
AS["Silver (validated)"]
AG["Gold (analytics)"]
end
subgraph WH["🏢 Warehouse"]
DIM["Agency Dimensions"]
FACT["KPI Facts"]
VW["Reporting Views"]
end
subgraph SQL["🗄️ SQL Database"]
APP["Application Tables"]
CDC_["CDC → OneLake"]
end
end
subgraph CrossAgency["🔗 Cross-Agency"]
XDB["Cross-Database Queries"]
SM["Shared Semantic Model"]
end
AG --> WH --> SM
SQL --> CDC_ --> LH
WH --> XDB
style LH fill:#27AE60,stroke:#1E8449,color:#fff
style WH fill:#2471A3,stroke:#1A5276,color:#fff
style SQL fill:#8E44AD,stroke:#6C3483,color:#fff
Federal Compliance Alignment¶
| Requirement | Lakehouse | Warehouse | SQL Database |
|---|---|---|---|
| Data retention (7+ years) | ✅ Delta time travel + archival | ✅ Table lifecycle policies | ✅ Backup + retention policies |
| Audit trail | ✅ Delta history + unified audit log | ✅ Unified audit log | ✅ SQL audit + unified audit log |
| PII protection | ✅ OneLake security + CLS | ✅ CLS + Dynamic Data Masking | ✅ DDM + CLS + encryption |
| FedRAMP boundary | ✅ US Gov region | ✅ US Gov region | ✅ US Gov region |
| FISMA logging | ✅ Activity logs | ✅ Activity logs | ✅ SQL audit logs |
🔀 Migration Paths¶
Between Fabric Stores¶
| From | To | Method | Complexity |
|---|---|---|---|
| Lakehouse | Warehouse | Cross-DB INSERT INTO ... SELECT | ⭐ Low |
| Warehouse | Lakehouse | Spark spark.sql("SELECT ... FROM wh.dbo.table") | ⭐ Low |
| SQL Database | Lakehouse | Auto-mirror (continuous) or Pipeline (batch) | ⭐ Low |
| SQL Database | Warehouse | Cross-DB INSERT INTO ... SELECT | ⭐ Low |
| Lakehouse | SQL Database | Pipeline or custom app INSERT | ⭐⭐ Medium |
| Warehouse | SQL Database | Pipeline or custom app INSERT | ⭐⭐ Medium |
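The low-complexity rows into the Warehouse rely on cross-database INSERT ... SELECT with three-part names. A small helper sketching how such a statement is composed (helper name and bracket quoting are ours; schema is assumed to be dbo on both sides, so verify naming rules in the cross-database query docs):

```python
def cross_db_insert(target_db: str, source_db: str, table: str, columns: list) -> str:
    """Compose a cross-database INSERT ... SELECT to promote a table,
    e.g. from a Lakehouse SQL endpoint into a Warehouse."""
    cols = ", ".join(columns)
    return (
        f"INSERT INTO [{target_db}].[dbo].[{table}] ({cols})\n"
        f"SELECT {cols}\n"
        f"FROM [{source_db}].[dbo].[{table}];"
    )

print(cross_db_insert("GoldWH", "CoreLakehouse", "fact_slot_daily",
                      ["machine_id", "coin_in"]))
```

Because the Lakehouse side of such a statement is read-only, the INSERT must always be executed from the Warehouse.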
From External Systems¶
| Source | Recommended Target | Method |
|---|---|---|
| Azure SQL Database | SQL Database (mirror) or Lakehouse (shortcut) | Mirroring or Shortcut |
| Azure Synapse (dedicated pool) | Warehouse | Pipeline migration |
| Azure Data Lake Gen2 | Lakehouse | Shortcut (zero-copy) |
| Databricks Delta Lake | Lakehouse | Shortcut or Pipeline |
| On-premises SQL Server | SQL Database or Warehouse | Pipeline + Gateway |
| Snowflake | Warehouse or Lakehouse | Pipeline |
⚠️ Limitations¶
In the matrix below, ✅ means the limitation applies to that store; N/A means it does not.
| Limitation | Lakehouse | Warehouse | SQL Database |
|---|---|---|---|
| No T-SQL writes | ✅ (SQL endpoint is read-only) | N/A | N/A |
| No Spark access | N/A | ✅ (T-SQL only) | ✅ (T-SQL only) |
| No enforced FKs | ✅ | ✅ (informational only) | N/A |
| No triggers | ✅ | ✅ | N/A |
| No Direct Lake | N/A | N/A | ✅ (Import/DQ only) |
| Limited concurrency | Write concurrency limited | Write concurrency limited | N/A |
| No Delta time travel | N/A | ✅ | ✅ |
| Storage cost | OneLake pricing | OneLake pricing | OneLake pricing |
| Preview features | Some features in preview | Stable | GA (since Nov 2024) |
| Gov Cloud | ✅ Available | ✅ Available | Limited availability |
📚 References¶
| Resource | URL |
|---|---|
| Lakehouse Overview | https://learn.microsoft.com/fabric/data-engineering/lakehouse-overview |
| Warehouse Overview | https://learn.microsoft.com/fabric/data-warehouse/data-warehousing |
| SQL Database in Fabric | https://learn.microsoft.com/fabric/database/sql/overview |
| Direct Lake Overview | https://learn.microsoft.com/fabric/get-started/direct-lake-overview |
| Cross-Database Queries | https://learn.microsoft.com/fabric/data-warehouse/cross-database-query |
| OneLake Shortcuts | https://learn.microsoft.com/fabric/onelake/onelake-shortcuts |
| Mirroring Overview | https://learn.microsoft.com/fabric/database/mirrored-database/overview |
| Fabric Decision Guide | https://learn.microsoft.com/fabric/get-started/decision-guide |
🔗 Related Documents¶
- Cross-Database Queries -- Query across all three stores
- Direct Lake -- BI connectivity for Lakehouse and Warehouse
- Fabric SQL Database -- SQL Database deep dive
- Mirroring -- Replicate external databases into Fabric
- Medallion Architecture Deep Dive -- Lakehouse layering patterns
- Migration Patterns -- Migrating from external platforms
📝 Document Metadata¶
- Author: Documentation Team
- Reviewers: Data Engineering, Data Warehouse, Architecture, Security
- Classification: Internal
- Next Review: 2026-07-21