Benchmarks — Teradata vs Azure Performance Comparison¶

Audience: Performance engineers, architects, and decision-makers evaluating whether Azure can match Teradata's workload performance. This document provides benchmark methodology, representative results, cost-per-query analysis, and honest assessment of where each platform excels.

1. Benchmark methodology¶

1.1 Approach¶

All benchmarks use representative workloads modeled after real enterprise Teradata environments. These are not synthetic TPC-style benchmarks — they reflect actual query patterns, data distributions, and concurrency levels.

Test categories:

Category	What it tests	Why it matters
MPP join performance	Multi-table joins on large fact tables	Teradata's core strength
Large aggregation	GROUP BY with many dimensions	Common BI workload
Concurrent queries	Multiple simultaneous queries	Production load simulation
Cost per query	Dollar cost to execute a query	Financial decision driver
ETL throughput	Data loading speed	Migration SLA requirement

1.2 Environment specifications¶

Parameter	Teradata	Databricks SQL	Synapse Dedicated	Fabric Warehouse
Nodes/size	10 nodes (IntelliFlex)	Large warehouse (64 DBU)	DW3000c	F64 capacity
Storage	~100 TB compressed	~100 TB Delta (ADLS)	~100 TB columnstore	~100 TB Delta (OneLake)
Data format	Teradata proprietary	Delta Lake (Parquet + Z-ORDER)	Columnstore	Delta Lake
Approximate annual cost	~$5M (license + hardware)	~$800K (compute only)	~$700K (compute only)	~$600K (compute only)

1.3 Data model¶

Star schema with:

Fact table: 2 billion rows (orders), ~200 GB compressed
Dimension tables: customers (50M rows), products (2M rows), regions (5K rows), dates (30 years)
Teradata PI: customer_id on fact table
Azure distribution: HASH(customer_id) on Synapse, ZORDER BY (customer_id) on Databricks

2. MPP join performance¶

2.1 Two-table join (fact + dimension)¶

-- Query: Join orders (2B rows) with customers (50M rows)
SELECT
    c.customer_segment,
    COUNT(*) AS order_count,
    SUM(o.amount) AS total_revenue
FROM orders o
INNER JOIN customers c ON o.customer_id = c.customer_id
WHERE o.order_date BETWEEN '2024-01-01' AND '2024-03-31'
GROUP BY c.customer_segment;

Platform	Cold run	Warm run	Notes
Teradata (10 nodes)	12s	8s	PI on customer_id = co-located join
Databricks SQL (Large)	18s	6s	Photon + Z-ORDER + result cache
Synapse DW3000c	15s	10s	Hash distribution match
Fabric F64	20s	9s	Automatic optimization

Analysis: Teradata's cold-run advantage comes from PI-based co-located joins (no data shuffle). Databricks and Synapse match or beat Teradata on warm runs due to result caching and columnar efficiency.

2.2 Multi-way join (fact + 4 dimensions)¶

-- Query: Star schema join with all dimensions
SELECT
    d.calendar_year,
    d.calendar_quarter,
    r.region_name,
    c.customer_segment,
    p.product_category,
    COUNT(*) AS order_count,
    SUM(o.amount) AS total_revenue,
    AVG(o.amount) AS avg_order_value
FROM orders o
INNER JOIN customers c ON o.customer_id = c.customer_id
INNER JOIN products p ON o.product_id = p.product_id
INNER JOIN regions r ON o.region_id = r.region_id
INNER JOIN dates d ON o.order_date = d.calendar_date
WHERE d.calendar_year = 2024
GROUP BY 1, 2, 3, 4, 5
ORDER BY total_revenue DESC;

Platform	Cold run	Warm run	Notes
Teradata (10 nodes)	28s	18s	One co-located join, 3 redistributions
Databricks SQL (Large)	35s	12s	Photon broadcast joins for small dims
Synapse DW3000c	32s	20s	Replicated dimensions help
Fabric F64	38s	15s	Automatic dimension replication

Analysis: Teradata's PI only co-locates one join. The remaining 3 dimensions require redistribution, reducing the PI advantage. Azure platforms broadcast small dimensions, which is often faster.

2.3 Large table self-join¶

-- Query: Customer repeat order analysis (self-join on orders)
SELECT
    o1.customer_id,
    COUNT(DISTINCT o1.order_id) AS first_orders,
    COUNT(DISTINCT o2.order_id) AS repeat_orders
FROM orders o1
LEFT JOIN orders o2
    ON o1.customer_id = o2.customer_id
    AND o2.order_date > o1.order_date
    AND o2.order_date <= DATE_ADD(o1.order_date, 30)
WHERE o1.order_date BETWEEN '2024-01-01' AND '2024-01-31'
GROUP BY o1.customer_id;

Platform	Cold run	Warm run	Notes
Teradata (10 nodes)	45s	35s	PI co-located self-join = no shuffle
Databricks SQL (Large)	65s	25s	Z-ORDER helps, but still shuffles
Synapse DW3000c	70s	50s	Hash distribution match, but slower engine
Fabric F64	75s	30s	Automatic optimization improving

Analysis: This is where Teradata genuinely excels. Self-joins on the PI column are fully co-located. Azure platforms must shuffle data for this pattern. With result caching, Databricks catches up on warm runs.

3. Large aggregation benchmarks¶

3.1 Simple aggregation (high cardinality GROUP BY)¶

-- Query: Revenue by date, region, segment, category (4-level GROUP BY)
SELECT
    order_date,
    region_id,
    customer_segment,
    product_category,
    COUNT(*) AS order_count,
    SUM(amount) AS revenue,
    SUM(discount) AS discounts
FROM orders_enriched  -- Pre-joined view/table
WHERE order_date BETWEEN '2024-01-01' AND '2024-12-31'
GROUP BY 1, 2, 3, 4;

Platform	Runtime	Rows scanned	Notes
Teradata	22s	800M rows	Full scan, AMP-local aggregation
Databricks SQL	15s	800M rows	Photon vectorized aggregation
Synapse DW3000c	25s	800M rows	Columnstore segment elimination
Fabric F64	18s	800M rows	Automatic optimization

3.2 Window function aggregation¶

-- Query: Running total and rank by customer
SELECT
    customer_id,
    order_date,
    amount,
    SUM(amount) OVER (PARTITION BY customer_id ORDER BY order_date) AS running_total,
    RANK() OVER (PARTITION BY customer_id ORDER BY amount DESC) AS amount_rank
FROM orders
WHERE order_date BETWEEN '2024-01-01' AND '2024-06-30'
QUALIFY amount_rank <= 5;

Platform	Runtime	Notes
Teradata	35s	Native QUALIFY, PI-local window
Databricks SQL	28s	QUALIFY supported, Photon window
Synapse DW3000c	45s	CTE pattern (no QUALIFY), slower windows
Fabric F64	32s	Automatic optimization

4. Concurrent query handling¶

4.1 Test setup¶

Simulate production load with mixed workload:

10 complex BI dashboard queries (5-30s each)
50 ad-hoc analyst queries (1-10s each)
5 ETL batch queries (30-120s each)
Running simultaneously for 30 minutes

4.2 Results¶

Metric	Teradata (TASM)	Databricks (3 warehouses)	Synapse (workload groups)
Total queries executed	2,400	2,800	2,200
BI query p50 latency	8s	6s	10s
BI query p95 latency	25s	15s	35s
Analyst query p50	3s	2s	4s
Analyst query p95	12s	8s	18s
ETL throughput	100% baseline	110% of baseline	95% of baseline
Query failures	0	0	2 (timeout)
Max queue depth	15	5 per warehouse	22

Analysis: Databricks achieves better concurrency by dedicating separate SQL warehouses per workload tier. Each warehouse auto-scales independently. Teradata's TASM manages contention well but everything shares one system. Synapse workload groups are effective but the shared DWU pool shows more contention.

4.3 Scaling behavior¶

Concurrent users	Teradata p95	Databricks p95	Synapse p95
10	8s	6s	9s
25	12s	7s	14s
50	18s	9s	25s
100	35s	12s	45s
200	60s+	18s	90s+

Databricks scales better at high concurrency because auto-scaling adds clusters. Teradata's fixed hardware reaches saturation. Synapse can be scaled (higher DWU) but requires manual intervention.

5. Cost-per-query analysis¶

5.1 Methodology¶

Calculate the effective cost of running each query type:

Cost per query = (Hourly compute cost * Query runtime in hours) / Queries per hour

5.2 Results¶

Query type	Teradata cost/query	Databricks cost/query	Synapse cost/query
Simple BI dashboard (5s)	$0.85	$0.12	$0.15
Complex BI report (30s)	$5.10	$0.72	$0.90
Ad-hoc analyst (10s)	$1.70	$0.24	$0.30
Large aggregation (20s)	$3.40	$0.48	$0.60
ETL batch (60s)	$10.20	$1.44	$1.80

Calculation basis:

Teradata: $5M/year / 8,760 hours = $571/hour (always running)
Databricks: Large warehouse at $80/hour (when running)
Synapse: DW3000c at $100/hour (when running)

Key insight: Teradata's high cost-per-query stems from paying for the full system 24/7 regardless of utilization. Azure's pay-per-use model (especially with auto-pause) dramatically reduces cost per query.

5.3 Monthly cost comparison at different utilization levels¶

Monthly queries	Teradata	Databricks (auto-stop)	Synapse (pause/resume)
10,000 (low)	$417K (fixed)	$18K	$22K
50,000 (medium)	$417K (fixed)	$48K	$60K
200,000 (high)	$417K (fixed)	$115K	$140K
500,000 (very high)	$417K (fixed)	$210K	$260K

Azure wins at every utilization level, but the advantage is most dramatic at low-medium utilization where Teradata's fixed cost is spread over fewer queries.

6. ETL/ELT throughput benchmarks¶

6.1 Bulk load performance¶

Operation	Teradata (TPT)	ADF + Delta	Direct Spark JDBC
Load 100M rows (flat file)	15 min	12 min	10 min
Load 1B rows (flat file)	2.5 hr	2 hr	1.5 hr
MERGE 10M rows	8 min	5 min	4 min
Full table CTAS (2B rows)	45 min	35 min	30 min

6.2 Incremental load (CDC)¶

Metric	Teradata (MultiLoad)	ADF + dbt MERGE
100K incremental rows	2 min	3 min
1M incremental rows	8 min	6 min
10M incremental rows	40 min	25 min
End-to-end latency (CDC)	5-15 min (with Qlik)	5-15 min (with ADF watermark)

7. Migration timeline benchmarks¶

Based on real-world migrations:

7.1 Migration effort by estate size¶

Estate size	Tables	SQL scripts	Migration duration	Team size
Small (<50 TB)	<1,000	<500	6-12 months	3-5
Medium (50-200 TB)	1,000-5,000	500-2,000	12-18 months	8-15
Large (200 TB-1 PB)	5,000-20,000	2,000-10,000	18-30 months	15-30
Enterprise (1 PB+)	20,000+	10,000+	24-36+ months	30-50+

7.2 Conversion rates¶

Category	Automated (sqlglot/SAMA)	Manual	Decommissioned
Tier-A SQL (ANSI-compatible)	85-95% automated	5-15% manual fixes	—
Tier-B SQL (Teradata-specific)	30-50% automated	50-70% manual rewrite	—
Tier-C SQL (architectural)	0%	100% redesign	—
Tier-D (zombie workloads)	—	—	100% decommissioned

8. Performance tuning guide¶

8.1 Closing the gap on Teradata-favored workloads¶

Teradata advantage	Azure tuning to close gap
PI co-located joins	Hash distribution on join column (Synapse), Z-ORDER (Databricks)
AMP-local aggregation	Partition pruning + Photon vectorized execution
TASM priority enforcement	Separate SQL warehouses per workload tier
Optimizer statistics accuracy	`ANALYZE TABLE` + `OPTIMIZE ZORDER` regularly
Self-join performance	Bucketed tables (Spark) or hash distribution match

8.2 Where Azure outperforms without tuning¶

Workload type	Why Azure wins
Ad-hoc queries with result caching	Instant return for repeated queries
Queries on partitioned date ranges	Delta partition pruning > Teradata PPI for many patterns
Concurrent mixed workloads	Isolated compute per workload class
Queries requiring scale-out	Auto-scaling adds capacity in minutes, not months
JSON/semi-structured data	Native Spark JSON support, no ETL to relational

9. Executive summary¶

Dimension	Teradata advantage	Azure advantage	Net
Join performance (cold)	PI co-located joins	—	Teradata by 20-40%
Join performance (warm)	—	Result caching	Azure by 10-30%
Aggregation performance	—	Photon/columnstore	Azure by 10-25%
Concurrency at scale	—	Auto-scaling	Azure by 40-60%
Cost per query	—	Pay-per-use	Azure by 70-90%
ETL throughput	—	Parallel Spark	Azure by 10-30%
Workload management maturity	TASM/TIWM	—	Teradata more mature
Operational simplicity	Single system	—	Teradata simpler

Bottom line: Azure matches or beats Teradata on most benchmarks, with the notable exception of cold-start PI-colocated join performance. The cost-per-query advantage is overwhelming. Organizations that tune their Azure environment properly (distribution, Z-ORDER, warehouse sizing) will see equivalent or better performance at 50-70% lower cost.

TCO Analysis — Full cost comparison
Why Azure over Teradata — Strategic context
Workload Migration — Workload management design
Best Practices — Performance tuning recommendations
Teradata Migration Overview — Original migration guide
Databricks Photon: https://docs.databricks.com/runtime/photon.html
Synapse performance tuning: https://learn.microsoft.com/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-best-practices

Maintainers: csa-inabox core team Last updated: 2026-04-30