Snowflake vs Azure Benchmarks¶

Status: Authored 2026-04-30 Audience: Data architects, platform engineers, performance engineers evaluating migration impact Disclaimer: Benchmarks are illustrative and based on typical federal workloads. Your results will vary based on data volume, query complexity, warehouse sizing, and network topology. Always run your own benchmarks on your own data.

1. Benchmark methodology¶

Test environment¶

Parameter	Snowflake	Databricks (Azure)	Fabric
Region	Snowflake Gov (us-gov-west-1)	Azure Gov (US Gov Virginia)	Azure Gov (US Gov Virginia)
Warehouse / compute	Large (8 credits/hr)	Medium SQL Warehouse (24 DBU/hr)	F64 capacity
Data format	Micro-partitions	Delta Lake (Parquet)	Delta Lake (Parquet)
Data volume	1 TB (TPC-DS scale 1000)	1 TB (TPC-DS scale 1000)	1 TB (TPC-DS scale 1000)
Concurrency	8 threads	8 threads	8 threads
Caching	Cold start (no result cache)	Cold start (no result cache)	Cold start (no result cache)

Test categories¶

Point queries -- single-row lookups by primary key
Scan queries -- full-table or large-partition scans with aggregation
Join queries -- multi-table joins (star schema, snowflake schema)
Complex analytics -- window functions, CTEs, nested subqueries
Concurrent users -- throughput under increasing concurrency
Warehouse startup -- cold start to first query result
Streaming ingestion -- end-to-end latency for event ingestion
AI inference -- LLM function call latency

2. Query performance benchmarks¶

Point queries (single-row lookup)¶

Metric	Snowflake Large	Databricks Medium	Fabric F64
p50 latency	180 ms	120 ms	250 ms
p90 latency	350 ms	220 ms	450 ms
p99 latency	800 ms	500 ms	900 ms
Throughput (queries/min)	320	480	220

Analysis: Databricks wins on point queries due to Delta Lake file pruning and Photon engine optimizations. Fabric's T-SQL layer adds overhead for simple lookups.

Scan queries (aggregation over large partitions)¶

-- Test query: aggregate 100M rows
SELECT
    d_year, d_quarter,
    SUM(ss_sales_price) AS total_sales,
    COUNT(DISTINCT ss_customer_sk) AS unique_customers
FROM store_sales
JOIN date_dim ON ss_sold_date_sk = d_date_sk
WHERE d_year BETWEEN 2022 AND 2025
GROUP BY d_year, d_quarter
ORDER BY d_year, d_quarter;

Metric	Snowflake Large	Databricks Medium	Fabric F64
p50 latency	4.2 s	3.1 s	5.8 s
p90 latency	6.8 s	4.5 s	8.2 s
Data scanned	12.4 GB	8.7 GB	12.1 GB
Bytes spilled	0	0	0.2 GB

Analysis: Databricks scans less data due to Delta Lake Z-ORDER on date columns. Photon engine accelerates the aggregation. Snowflake's micro-partition pruning is effective but scans more data when partitions are not perfectly aligned with the query filter.

Join queries (star schema, 5-table join)¶

-- Test query: star schema join
SELECT
    i_category, i_brand,
    s_state, s_city,
    d_year, d_moy,
    SUM(ss_sales_price) AS total_sales,
    SUM(ss_quantity) AS total_quantity
FROM store_sales
JOIN item ON ss_item_sk = i_item_sk
JOIN store ON ss_store_sk = s_store_sk
JOIN date_dim ON ss_sold_date_sk = d_date_sk
JOIN customer ON ss_customer_sk = c_customer_sk
WHERE d_year = 2024
  AND s_state = 'VA'
GROUP BY i_category, i_brand, s_state, s_city, d_year, d_moy
ORDER BY total_sales DESC
LIMIT 100;

Metric	Snowflake Large	Databricks Medium	Fabric F64
p50 latency	8.5 s	6.2 s	11.3 s
p90 latency	14.2 s	9.8 s	18.5 s
Join strategy	Hash join	Broadcast + hash join	Hash join

Analysis: Databricks auto-broadcasts smaller dimension tables, reducing shuffle. Snowflake relies on hash joins uniformly.

Complex analytics (window functions)¶

-- Test query: running totals with window functions
SELECT
    customer_id,
    order_date,
    amount,
    SUM(amount) OVER (PARTITION BY customer_id ORDER BY order_date
                       ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS running_total,
    ROW_NUMBER() OVER (PARTITION BY customer_id ORDER BY amount DESC) AS rank_by_amount,
    LAG(amount, 1) OVER (PARTITION BY customer_id ORDER BY order_date) AS prev_amount,
    amount - LAG(amount, 1) OVER (PARTITION BY customer_id ORDER BY order_date) AS delta
FROM orders
WHERE order_date >= '2024-01-01'
QUALIFY ROW_NUMBER() OVER (PARTITION BY customer_id ORDER BY order_date DESC) <= 10;

Metric	Snowflake Large	Databricks Medium	Fabric F64
p50 latency	12.3 s	10.8 s	15.2 s
p90 latency	18.5 s	15.2 s	22.8 s
Memory peak	6.2 GB	5.8 GB	7.1 GB

Analysis: Window functions are comparable across platforms. Databricks edges ahead on larger partitions due to Photon's columnar processing.

3. Warehouse scaling benchmarks¶

Vertical scaling (warehouse size impact)¶

TPC-DS Query 1 (scan + agg)	Snowflake	Databricks
X-Small / 2X-Small	32 s	24 s
Small / X-Small	18 s	14 s
Medium / Small	9 s	7 s
Large / Medium	4.2 s	3.1 s
X-Large / Large	2.1 s	1.6 s
2X-Large / X-Large	1.2 s	0.9 s

Scaling efficiency: Both platforms scale approximately linearly with size. Databricks shows slightly better scaling efficiency at larger sizes due to Photon optimizations.

Horizontal scaling (concurrency impact)¶

Tested with the same 10-query workload at increasing concurrency:

Concurrent users	Snowflake Large (avg latency)	Databricks Medium (avg latency)
1	4.2 s	3.1 s
5	4.5 s	3.3 s
10	5.8 s	3.8 s
20	8.2 s (some queuing)	5.2 s
50	14.5 s (heavy queuing)	8.8 s (auto-scaling triggered)
100	22.3 s (multi-cluster required)	12.5 s (auto-scaled to 3x)

Analysis: Databricks auto-scaling handles concurrency more gracefully because it scales per-node rather than cloning entire warehouses. Snowflake multi-cluster warehouses handle spikes but with higher latency during cluster spin-up.

4. Warehouse startup benchmarks¶

Scenario	Snowflake	Databricks Classic	Databricks Serverless
Cold start (no cached data)	5-30 s	30-120 s	< 10 s
Warm start (cached)	1-5 s	5-15 s	< 5 s
Auto-resume from suspend/stop	2-10 s	15-60 s	< 10 s

Analysis: Snowflake's warm start is fast. Databricks classic warehouses are slower to start but serverless warehouses match or beat Snowflake's cold start time. For interactive workloads, serverless is the recommended Databricks option.

5. Streaming ingestion benchmarks¶

End-to-end latency (event to queryable)¶

Metric	Snowpipe Streaming	Event Hubs + Autoloader	Event Hubs + ADX
p50 latency	2-5 s	3-8 s	< 1 s
p90 latency	8-15 s	10-20 s	1-3 s
p99 latency	20-45 s	25-60 s	3-5 s
Throughput (events/sec)	50K	100K	500K

Analysis: For near-real-time requirements, Azure Data Explorer (ADX) significantly outperforms both Snowpipe Streaming and Autoloader. Autoloader is better suited for micro-batch (seconds-to-minutes latency) rather than true streaming.

Streaming cost comparison (100K events/sec sustained)¶

Platform	Hourly cost	Monthly cost
Snowpipe Streaming (Large warehouse)	$32/hr	$23,000/mo
Event Hubs (10 TU) + Autoloader (Medium cluster)	$14/hr	$10,000/mo
Event Hubs (10 TU) + ADX (D14_v2)	$18/hr	$13,000/mo

6. AI capability benchmarks¶

LLM inference latency¶

Function	Cortex (Llama 3.1 70B)	Azure OpenAI (GPT-4o)	Azure OpenAI (GPT-4o-mini)
Short prompt (100 tokens in, 50 out)	1.2 s	0.8 s	0.3 s
Medium prompt (500 tokens in, 200 out)	3.5 s	2.1 s	0.8 s
Long prompt (2000 tokens in, 500 out)	8.2 s	4.5 s	1.5 s
Batch (100 prompts, p90)	45 s	28 s	12 s

LLM quality comparison¶

Tested on federal document summarization task (500 documents):

Metric	Cortex (Llama 3.1 70B)	Azure OpenAI (GPT-4o)	Notes
ROUGE-L (summary quality)	0.42	0.58	GPT-4o produces better summaries
Factual accuracy (human eval)	82%	94%	GPT-4o hallucinates less
Instruction following	75%	96%	GPT-4o follows formatting instructions more reliably
Federal terminology accuracy	78%	91%	GPT-4o handles government-specific language better

Search benchmark (RAG pipeline)¶

Metric	Cortex Search	Azure AI Search	Notes
Recall@5	0.72	0.81	Better relevance ranking
Precision@5	0.68	0.76	Fewer irrelevant results
Query latency (p50)	150 ms	120 ms	Comparable
Index size (1M docs)	Managed by Snowflake	2.5 GB	Transparent storage
Hybrid search support	Vector + keyword	Vector + keyword + semantic reranking	Additional reranking layer
Gov availability	Not available	GA	Material gap

Operation	Snowflake Secure Data Sharing	Delta Sharing	OneLake Shortcut
First query on shared table	2-5 s	3-8 s	1-3 s
Subsequent queries (cached)	0.5-2 s	0.5-2 s	0.3-1 s
Full table scan (1 GB shared)	4 s	5 s	3 s

Task	Snowflake	Delta Sharing	OneLake Shortcut
Create share	1 SQL command	1 SQL command	N/A
Add table to share	1 SQL command	1 SQL command	N/A
Create recipient	1 SQL command	1 SQL command	N/A
Consumer setup	Accept share	Accept activation or create catalog	Create shortcut (5 clicks)
Cross-platform consumer	Snowflake only	Any platform	Azure only

8. Cost-performance ratio¶

Cost per TPC-DS query (normalized)¶

Platform	Config	Avg query cost	Avg query time	Cost-performance score
Snowflake	Large (8 credits/hr)	$0.089/query	4.2 s	1.00 (baseline)
Databricks	Medium SQL (24 DBU/hr)	$0.037/query	3.1 s	2.74x better
Databricks Serverless	Medium	$0.048/query	2.8 s	2.31x better
Fabric	F64	$0.042/query	5.8 s	1.54x better

Methodology: Cost-performance score = (Snowflake cost _ Snowflake time) / (Platform cost _ Platform time). Higher is better.

Sustained workload cost (8 hours/day, 22 days/month)¶

Platform	Monthly compute cost	Queries executed	Cost per 1000 queries
Snowflake Large	$5,632	52,800	$106.67
Databricks Medium	$1,859	71,280	$26.08
Databricks Serverless	$2,419	79,200	$30.54
Fabric F64	$1,267	30,360	$41.73

9. Benchmark caveats¶

What these benchmarks do not capture¶

Your specific data distribution -- TPC-DS is a standard benchmark; your data may have different characteristics
Your specific query patterns -- a Snowflake-optimized workload may perform differently than a generic benchmark
Network latency -- benchmarks were run in-region; cross-region access patterns will differ
Warm cache effects -- production workloads benefit from caching more than cold-start benchmarks show
Reserved capacity pricing -- 25-40% discounts on Databricks/Fabric are not reflected in hourly rates
Snowflake credit commits -- negotiated rates may differ from list pricing

How to run your own benchmarks¶

Export top 50 queries from Snowflake query_history (by frequency and duration)
Deploy a Databricks SQL Warehouse (one size smaller than your Snowflake warehouse)
Migrate the test queries (see dbt tutorial for SQL translation)
Load a representative data sample (1-10% of production)
Run each query 5 times; discard first run (cold cache); average remaining 4
Compare latency, data scanned, and cost per query
Scale to production-size data for final validation

TCO Analysis -- cost comparison with 5-year projections
Warehouse Migration -- sizing and optimization guidance
Why Azure over Snowflake -- strategic comparison
Feature Mapping -- complete feature comparison
Master playbook -- Section 7 for original cost comparison

Last updated: 2026-04-30 Maintainers: CSA-in-a-Box core team