
Industry — Retail & CPG

Scope: Brick-and-mortar retail, e-commerce, omnichannel, consumer packaged goods. Customer experience as competitive advantage, demand volatility, supply-chain complexity, payment data sensitivity.

Top scenarios

| Scenario | Pattern | Latency | Reference |
| --- | --- | --- | --- |
| Customer 360 | Multi-source identity resolution + medallion gold + reverse-ETL | minutes | Reference Arch — Data Flow |
| Real-time recommendation | Feature store + online inference + click feedback loop | sub-100ms | Example — ML Lifecycle (adapt) |
| Demand forecasting | Sales + weather + promotions + ML | daily | Example — ML Lifecycle |
| Inventory optimization | Real-time stock + demand forecast + replenishment | hours | Tutorial 11 — Data API Builder for serving |
| Pricing optimization | Competitive scrape + elasticity model + scenario eval | daily | Use Case — Anomaly Detection (similar streaming patterns) |
| Fraud / chargeback | Transaction streaming + ML scoring | sub-second | Industries — Financial Services |
| Conversational commerce | RAG + product catalog + checkout integration | seconds | Tutorial 08 — RAG, Example — AI Agents |
| Marketing attribution | Touchpoint ingest + multi-touch model | daily | Tutorial 02 — Data Governance |
| Loyalty / personalization | CDP + ML segments + activation channels | minutes | Tutorial 11 — Data API Builder |

Regulatory landscape

| Framework | Where in CSA-in-a-Box |
| --- | --- |
| PCI-DSS v4.0 (any payment data) | Compliance — PCI-DSS — strongly recommend tokenization at the edge |
| GDPR (EU customers) | Compliance — GDPR |
| CCPA / CPRA (California) | Same patterns as GDPR; "do not sell" preference + DSR handling |
| SOC 2 Type II | Compliance — SOC 2 — table stakes for B2B SaaS commerce |
| State privacy laws (VA, CO, CT, UT, TX, etc.) | Mostly track GDPR principles; one consent management platform usually serves all |
| COPPA (under-13 users) | If applicable, age-gate at signup; segregate child accounts |

Reference architecture variations

  • CDP layer: customer 360 in gold + segment exports to activation channels (Marketo, Braze, Salesforce, Meta Ads). Tutorial 11 — Data API Builder provides the REST/GraphQL surface.
  • Edge POS integration: tokenize payment at the POS device; analytics receives only token + last-4 + transaction context. Keeps PCI scope at the edge.
  • Conversational commerce: AOAI + product catalog as RAG corpus. Add Azure AI Content Safety before sending model output to customers — never let an LLM make a price commitment without a guard.
  • Headless commerce: gold tables expose product / inventory / pricing via DAB → consumed by storefronts (Next.js, mobile apps) via GraphQL.

Why the standard CSA-in-a-Box pattern works for retail

  • Medallion + Purview = catalog of customer attributes with classification (PII / sensitive)
  • dbt = reproducible CDP (no more "the customer count differs between Marketing and Finance")
  • AOAI + AI Search + Content Safety = safe customer-facing GenAI
  • Data API Builder = headless commerce data layer without bespoke API code
  • Power Apps + Power BI = store-manager and merchandiser apps without app-dev cycles

What's specific to retail / CPG

  • Identity resolution is the hardest data problem. Customers have multiple emails, devices, household memberships, loyalty accounts, in-store interactions. Build identity resolution as a first-class silver-layer asset, not as a one-off.
  • Demand volatility is brutal. Forecast accuracy matters more than model sophistication; ensemble simple models + external signals (weather, holidays, social) usually beats one complex model.
  • PCI scope minimization is everything. Tokenize at the POS / payment gateway; never let raw PAN reach analytics. See Compliance — PCI-DSS.
  • Real-time matters at checkout, not for analytics. Recommendation, fraud scoring, dynamic pricing — sub-100ms or it doesn't get used. Other analytics can be batch.
  • Promotions are messy. Promo lift measurement is the most commonly botched analysis in retail; almost everyone over-attributes lift to promotions. Use causal inference (DML, synthetic control) for promo evaluation.

Getting started

  1. Read Reference Architecture — Data Flow
  2. Pick one scenario from the top list — Customer 360 is the highest-value starting point for most retailers
  3. Walk Tutorial 02 — Data Governance so customer data is properly classified before you build anything
  4. Adapt Example — Commerce (federal commerce stats but the data patterns transfer)
  5. Layer Example — Data API Builder for the headless commerce surface
  6. Before rolling out customer-facing GenAI: review Patterns — LLMOps & Evaluation

Omnichannel analytics reference architecture

The following diagram shows how data from POS systems, e-commerce platforms, and mobile apps converge into a unified customer graph that powers analytics, personalization, and activation.

```mermaid
flowchart TB
    subgraph Sources[Source Channels]
        POS[POS Systems<br/>in-store transactions]
        Ecom[E-Commerce Platform<br/>web orders + clickstream]
        Mobile[Mobile App<br/>app events + location]
        Loyalty[Loyalty Program<br/>enrollments + redemptions]
        Support[Customer Support<br/>calls + chat + email]
    end

    subgraph Ingest[Ingestion Layer]
        EH3[Event Hubs<br/>real-time streams]
        ADF3[ADF / Fabric Pipeline<br/>batch extracts]
    end

    subgraph Medallion3[Medallion Lakehouse]
        Bronze3[(Bronze<br/>raw events per channel)]
        Silver3[(Silver<br/>conformed + identity-resolved)]
        Gold3[(Gold<br/>Customer 360<br/>+ product analytics)]
    end

    subgraph Analytics3[Analytics & Activation]
        ML3[Azure ML<br/>recommendation<br/>+ demand forecast]
        PBI3[Power BI<br/>merchandising<br/>+ store ops dashboards]
        CDP[CDP / Activation<br/>Braze / Marketo /<br/>Meta Ads]
        DAB[Data API Builder<br/>headless commerce API]
    end

    POS --> EH3
    Ecom --> EH3
    Mobile --> EH3
    Loyalty --> ADF3
    Support --> ADF3
    EH3 --> Bronze3
    ADF3 --> Bronze3
    Bronze3 --> Silver3
    Silver3 --> Gold3
    Gold3 --> ML3
    Gold3 --> PBI3
    Gold3 --> CDP
    Gold3 --> DAB

    style Sources fill:#f5f5f5
    style Ingest fill:#fff4cc
    style Medallion3 fill:#cce4ff
    style Analytics3 fill:#ccffe4
```

Note

Identity resolution happens at the bronze-to-silver transition. Every channel produces a different identifier (POS loyalty number, e-commerce email, mobile device ID). The silver layer resolves these into a single customer_id using deterministic matching (email, phone, loyalty number) first, then probabilistic matching (address, name fuzzy match) for the remainder.

Supply chain optimization

Demand sensing

Demand sensing improves short-term forecast accuracy (1-14 days) by incorporating signals beyond historical sales:

  • POS sell-through data — actual consumer purchases, not retailer orders (reduces bullwhip effect)
  • Weather forecasts — temperature, precipitation, severe weather warnings correlated with category demand
  • Promotional calendar — planned promotions, competitor promotions (scrape or third-party data)
  • Social signals — trending products, viral moments, sentiment shifts
  • Event calendar — holidays, sports events, school schedules by region

Implement demand sensing as a daily dbt + ML pipeline: dbt prepares the feature tables in gold, Azure ML trains and scores the forecast model, results write back to gold for downstream consumption.
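The feature-assembly step of that pipeline amounts to joining daily sales with the external signals listed above. A minimal sketch (all field and table shapes here are illustrative, not taken from the reference architecture):

```python
# Illustrative daily feature assembly for demand sensing: sales rows
# keyed by (date, sku) enriched with weather, promo, and holiday
# signals. Field names are hypothetical.
def build_features(sales, weather, promos, holidays):
    """sales: {(date, sku): units}; weather: {date: {...}};
    promos: set of (date, sku); holidays: set of dates."""
    rows = []
    for (day, sku), units in sales.items():
        wx = weather.get(day, {})
        rows.append({
            "date": day,
            "sku": sku,
            "units": units,
            "temp_max": wx.get("temp_max"),
            "precip_mm": wx.get("precip_mm"),
            "on_promo": (day, sku) in promos,
            "is_holiday": day in holidays,
        })
    return rows
```

In practice this join lives in the dbt gold layer; the sketch just shows the row shape the forecast model consumes.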

Supplier risk scoring

Build a supplier risk score in your gold layer using:

| Risk dimension | Data source | Metric |
| --- | --- | --- |
| Delivery reliability | Purchase orders vs receipts | On-time-in-full (OTIF) rate, lead-time variability |
| Quality | Inspection records, returns | Defect rate, return rate by supplier |
| Financial health | Third-party credit data (D&B, Moody's) | Credit score, payment history |
| Concentration | Procurement data | % of category spend with single supplier |
| Geopolitical | Country risk indices | Sourcing-region instability score |

Combine into a composite score using weighted average or a simple ML model. Surface in Power BI for procurement teams. Alert when a critical supplier's score deteriorates beyond threshold.
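A minimal sketch of the weighted-average variant, assuming the five dimensions above are already normalized to a 0-1 risk scale (the weights are illustrative, not prescriptive):

```python
# Hypothetical composite supplier risk score: weighted average of
# normalized dimension scores where 0 = best and 1 = worst.
RISK_WEIGHTS = {
    "delivery": 0.30,       # OTIF rate, lead-time variability
    "quality": 0.25,        # defect / return rates
    "financial": 0.20,      # third-party credit score
    "concentration": 0.15,  # % of category spend with the supplier
    "geopolitical": 0.10,   # sourcing-region instability
}

def composite_risk(scores):
    """Weighted average of per-dimension risk scores in [0, 1]."""
    return sum(RISK_WEIGHTS[dim] * scores[dim] for dim in RISK_WEIGHTS)

def breach_alert(score, threshold=0.6):
    """Flag suppliers whose composite risk crosses the alert threshold."""
    return score >= threshold
```

The threshold that drives the procurement alert is a policy choice; 0.6 here is a placeholder.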

Inventory turn analysis

Inventory turns (COGS / average inventory) is the fundamental retail efficiency metric. Build it as a dbt model hierarchy:

  1. stg_inventory_snapshots — daily inventory positions by SKU and location
  2. int_avg_inventory — rolling 30/60/90-day average inventory at cost
  3. fct_inventory_turns — turns by SKU, category, location, and time period
  4. rpt_slow_movers — SKUs below target turn rate, with aging buckets (30/60/90/180+ days)
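The underlying math is simple enough to sanity-check outside dbt. A sketch that mirrors the fct_inventory_turns and rpt_slow_movers logic above (the production version runs as dbt SQL over the snapshot tables):

```python
# Turns = COGS / average inventory at cost, plus the aging buckets
# used by rpt_slow_movers.
def inventory_turns(cogs, avg_inventory_at_cost):
    """Inventory turns for a period; higher is more efficient."""
    if avg_inventory_at_cost <= 0:
        raise ValueError("average inventory must be positive")
    return cogs / avg_inventory_at_cost

def aging_bucket(days_on_hand):
    """Assign a slow-mover aging bucket (30/60/90/180+ days)."""
    for limit, label in ((30, "0-30"), (60, "31-60"),
                         (90, "61-90"), (180, "91-180")):
        if days_on_hand <= limit:
            return label
    return "180+"
```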

Tip

For Fabric-native implementations, see Fabric Lakehouse patterns for optimizing Delta table layouts for inventory snapshot workloads.

Customer 360

Identity resolution

Identity resolution is the hardest and most valuable data engineering problem in retail. Build it as a first-class silver-layer asset, not a one-off script.

Deterministic matching (high confidence, run first):

  • Exact email match (normalize: lowercase, trim, remove dots in Gmail)
  • Exact phone match (normalize to E.164 format)
  • Loyalty number match
  • Payment token match (same tokenized card across channels)

Probabilistic matching (lower confidence, run second on unmatched records):

  • Name + address fuzzy match (Jaro-Winkler similarity > 0.92)
  • Name + ZIP + last-4-of-card
  • Device fingerprint clustering

Store the identity graph in silver with match confidence scores. Downstream gold models join on the resolved customer_id. Periodically review low-confidence matches with a human-in-the-loop process.
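Deterministic matching lives or dies on normalization. A minimal sketch of the email and phone normalization described above, plus a fuzzy-score stand-in (difflib's ratio rather than true Jaro-Winkler; production code would use a dedicated string-matching library, and the Gmail +tag stripping is an assumption beyond the text):

```python
import re
from difflib import SequenceMatcher

def normalize_email(email):
    """Lowercase, trim, and drop dots in the local part for Gmail."""
    email = email.strip().lower()
    local, _, domain = email.partition("@")
    if domain in ("gmail.com", "googlemail.com"):
        local = local.replace(".", "")
        local = local.split("+")[0]  # strip +tags (assumption)
    return f"{local}@{domain}"

def normalize_phone(phone, default_country="1"):
    """Rough E.164 normalization for 10-digit NANP numbers; a real
    pipeline would use a library such as phonenumbers."""
    digits = re.sub(r"\D", "", phone)
    if len(digits) == 10:
        digits = default_country + digits
    return "+" + digits

def name_similarity(a, b):
    """Fuzzy name score in [0, 1]; stand-in for Jaro-Winkler."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()
```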

RFM scoring

Recency-Frequency-Monetary (RFM) scoring segments customers by purchase behavior. Implement as a dbt model:

```sql
-- Simplified RFM scoring in dbt
with rfm_base as (
    select
        customer_id,
        datediff(day, max(order_date), current_date) as recency_days,
        count(distinct order_id) as frequency,
        sum(order_total) as monetary
    from {{ ref('fct_orders') }}
    where order_date >= dateadd(year, -2, current_date)
    group by customer_id
),
rfm_scored as (
    select *,
        -- descending order so the most recently active customers get r_score = 5
        ntile(5) over (order by recency_days desc) as r_score,
        ntile(5) over (order by frequency) as f_score,
        ntile(5) over (order by monetary) as m_score
    from rfm_base
)
select *,
    -- e.g. 555 = best customers, 111 = lapsed low spenders
    r_score * 100 + f_score * 10 + m_score as rfm_segment
from rfm_scored
```

Customer lifetime value (CLV)

CLV prediction uses historical purchase data to estimate future value. Two common approaches:

  • Probabilistic (BG/NBD + Gamma-Gamma model) — works well with transaction-level data, requires only purchase dates and amounts, implemented via the lifetimes Python library in Azure ML
  • ML-based — train a regression model on features including RFM scores, category preferences, channel mix, tenure, and engagement metrics; better accuracy but requires more feature engineering

Store CLV predictions in the gold layer, refresh monthly, and use them for:

  • Marketing budget allocation (invest more in high-CLV segments)
  • Acquisition channel evaluation (which channels bring high-CLV customers?)
  • Churn intervention prioritization (high-CLV + high-churn-risk = top priority)
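Before investing in either approach, a naive baseline (average order value × monthly purchase rate × forecast horizon) is useful for sanity-checking the fitted models. This is not BG/NBD — just an illustrative heuristic:

```python
# Naive CLV baseline: assumes the customer's historical purchase rate
# and order value persist over the horizon. Useful only as a
# sanity check against the probabilistic or ML models.
def heuristic_clv(order_totals, tenure_months, horizon_months=24.0):
    """order_totals: list of past order amounts; tenure_months: time
    since first observed purchase."""
    if not order_totals or tenure_months <= 0:
        return 0.0
    avg_order_value = sum(order_totals) / len(order_totals)
    monthly_rate = len(order_totals) / tenure_months
    return avg_order_value * monthly_rate * horizon_months
```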

Demand sensing patterns

Seasonal decomposition

Retail demand has strong seasonal patterns. Decompose time series into trend, seasonal, and residual components before feeding to ML models. Use STL decomposition (Seasonal and Trend decomposition using LOESS) in the feature engineering pipeline:

  • Trend — long-term direction (category growth/decline)
  • Seasonal — repeating calendar patterns (weekly, monthly, annual)
  • Residual — the signal your ML model should focus on (demand shocks, promotions, weather)

Compute decomposition in Databricks/Synapse Spark using statsmodels.tsa.seasonal.STL. Store components as features in gold for downstream ML models.
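To illustrate the idea without the statsmodels dependency, here is a naive additive decomposition (centered moving-average trend, period-mean seasonal); STL itself adds LOESS smoothing, robustness to outliers, and proper edge handling, so use it in the real pipeline:

```python
# Naive additive decomposition: series = trend + seasonal + residual.
# Assumes an odd period (e.g. 7 for weekly); STL handles the general case.
def decompose(series, period):
    n = len(series)
    half = period // 2
    # Trend: centered moving average over one full period.
    trend = [None] * n
    for i in range(half, n - half):
        trend[i] = sum(series[i - half : i - half + period]) / period
    # Seasonal: mean detrended value at each position in the cycle.
    buckets = [[] for _ in range(period)]
    for i in range(n):
        if trend[i] is not None:
            buckets[i % period].append(series[i] - trend[i])
    cycle = [sum(b) / len(b) if b else 0.0 for b in buckets]
    seasonal = [cycle[i % period] for i in range(n)]
    # Residual: the signal left for the ML model to explain.
    resid = [series[i] - trend[i] - seasonal[i] if trend[i] is not None
             else None for i in range(n)]
    return trend, seasonal, resid
```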

Promotional lift measurement

Measuring true promotional lift requires causal inference, not just before/after comparison (which confounds promotion with seasonality, trend, and other factors).

| Method | When to use | Complexity |
| --- | --- | --- |
| Difference-in-differences | You have control stores that didn't run the promotion | Low |
| Synthetic control | No clean control group; construct a synthetic counterfactual from a weighted donor pool | Medium |
| Double ML (DML) | High-dimensional confounders; want a causal estimate of the promotion effect | High |
| Bayesian structural time series | Single time series with pre/post periods; Google's CausalImpact approach | Medium |

Warning

Simple lift calculations (promoted sales minus baseline) systematically overstate promotion effectiveness. At minimum, use difference-in-differences with a control group. For high-stakes promo evaluation, invest in synthetic control or DML.
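The difference-in-differences arithmetic itself is one line; the hard part is choosing valid control stores:

```python
# DiD promo lift: the change in treated stores minus the change in
# control stores over the same window, netting out shared
# seasonality and trend.
def did_lift(treat_pre, treat_post, ctrl_pre, ctrl_post):
    return (treat_post - treat_pre) - (ctrl_post - ctrl_pre)
```

For example, if treated stores go from 100 to 130 units while control stores drift from 100 to 110 over the same window, the naive before/after lift is +30 but the DiD estimate is +20.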

Weather impact modeling

Weather affects retail demand more than most teams realize. Key patterns:

  • Temperature — drives seasonal categories (beverages, ice cream, apparel transitions)
  • Precipitation — suppresses foot traffic, boosts e-commerce
  • Severe weather warnings — drive pantry-loading behavior (water, batteries, bread/milk)
  • Day length — affects outdoor activity and associated product categories

Ingest weather data from NOAA (free, daily resolution) or a commercial provider (hourly, hyperlocal). Join with store locations and sales data in silver. Add weather features to demand forecast models. See Use Case — NOAA Climate Analytics for the weather data ingestion pattern.

Price and markdown optimization

Elasticity models

Price elasticity measures how demand responds to price changes. Estimate it per SKU-store using regression on historical price/demand data:

  • Own-price elasticity — % change in demand for a 1% change in price (typically -1.5 to -3 for branded CPG)
  • Cross-price elasticity — how a price change in one SKU affects demand for substitutes and complements
  • Promotional elasticity — demand response to temporary price reductions (usually 2-5x higher than everyday elasticity)

Use log-log regression (ln(demand) ~ ln(price) + controls) for constant-elasticity estimates. For more flexibility, use gradient-boosted trees with price as a feature and compute elasticity via finite differences.
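For the single-regressor case, the log-log slope has a closed form (covariance of the logs over variance of log price), which makes the estimator easy to sketch without a stats library; the controls mentioned above are omitted here:

```python
import math

# Own-price elasticity from log-log OLS with one regressor:
# slope = cov(ln p, ln q) / var(ln p). Real models add controls
# for seasonality, promotions, and stockouts.
def own_price_elasticity(prices, demands):
    lp = [math.log(p) for p in prices]
    lq = [math.log(q) for q in demands]
    mp = sum(lp) / len(lp)
    mq = sum(lq) / len(lq)
    cov = sum((x - mp) * (y - mq) for x, y in zip(lp, lq))
    var = sum((x - mp) ** 2 for x in lp)
    return cov / var
```

On synthetic constant-elasticity data (demand proportional to price^-2) the estimator recovers -2 exactly.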

Markdown cadence

Seasonal merchandise (fashion, holiday) requires a markdown strategy that balances sell-through against margin:

  1. Initial price — set based on target margin and competitive positioning
  2. First markdown (typically 4-6 weeks into season) — 20-30% off; triggered when sell-through falls below plan
  3. Subsequent markdowns — deeper cuts at 2-3 week intervals based on remaining inventory and weeks to end-of-season
  4. Final clearance — 60-70% off to clear residual inventory before new season

Optimize markdown timing and depth using a dynamic programming model that maximizes total margin subject to sell-through targets. Implement in Azure ML; score weekly; surface recommendations in Power BI for merchandisers.
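A toy version of that dynamic program, with illustrative price, demand, elasticity, and salvage assumptions (a real model would be calibrated per category and add sell-through constraints):

```python
from functools import lru_cache

# Illustrative constants: list price, weekly base demand at full price,
# promotional elasticity, and per-unit salvage value at season end.
BASE_PRICE, BASE_DEMAND, ELASTICITY, SALVAGE = 50.0, 20.0, -2.5, 5.0
DISCOUNTS = (0.0, 0.2, 0.3, 0.5, 0.7)  # markdowns only deepen over the season

def plan_markdowns(inventory, weeks):
    """Return (total revenue incl. salvage, per-week discount path)."""
    @lru_cache(maxsize=None)
    def best(week, stock, min_disc_idx):
        if week == weeks or stock == 0:
            return stock * SALVAGE, ()  # salvage whatever is left
        candidates = []
        for i in range(min_disc_idx, len(DISCOUNTS)):
            d = DISCOUNTS[i]
            price = BASE_PRICE * (1 - d)
            # constant-elasticity demand response, capped by stock on hand
            demand = min(stock,
                         round(BASE_DEMAND * (price / BASE_PRICE) ** ELASTICITY))
            future, path = best(week + 1, stock - demand, i)
            candidates.append((demand * price + future, (d,) + path))
        return max(candidates)

    value, path = best(0, inventory, 0)
    return value, list(path)
```

The non-decreasing-discount constraint (min_disc_idx) encodes the cadence rule above: once marked down, merchandise never goes back up.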

Competitive pricing

Monitor competitor prices using:

  • Third-party price intelligence (Competera, Prisync, Intelligence Node) — automated scraping + matching
  • MAP (Minimum Advertised Price) compliance — ensure your prices comply with manufacturer MAP policies
  • Price-index dashboards — your price relative to competitors at category, brand, and SKU level

Surface competitive price intelligence alongside elasticity estimates in a Power BI dashboard so pricing analysts can make informed decisions. Automate only non-sensitive repricing (e.g., matching marketplace prices for commodity items); keep strategic pricing decisions human-led.

Marketing attribution

Multi-touch attribution models

Marketing attribution assigns credit for conversions across touchpoints. Implement as a gold-layer dbt model.

| Model | Logic | Pros | Cons |
| --- | --- | --- | --- |
| Last-touch | 100% credit to the last touchpoint before conversion | Simple, easy to implement | Ignores awareness and consideration stages |
| First-touch | 100% credit to the first touchpoint | Values acquisition channels | Ignores nurture and conversion channels |
| Linear | Equal credit to all touchpoints | Fair across stages | Ignores relative importance of each touch |
| Time-decay | More credit to touchpoints closer to conversion | Reflects increasing purchase intent | Undervalues brand-building touchpoints |
| Position-based (U-shaped) | 40% first, 40% last, 20% split across middle | Balances acquisition + conversion | Arbitrary weight distribution |
| Data-driven (Shapley value) | ML-computed marginal contribution of each channel | Most accurate | Requires large data volume; harder to explain |

Implement data-driven attribution using the Shapley value approach: for each conversion, compute the marginal contribution of each channel by comparing conversion probability with and without that channel in the path. Use Azure ML for the computation; store attribution results in gold for Power BI dashboards.
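The Shapley computation itself can be sketched directly from its definition, assuming a conversion-probability function over channel subsets has already been fitted from path data (the function passed in here is a hypothetical stand-in for that fitted model):

```python
from itertools import combinations
from math import factorial

# Shapley-value attribution: each channel's credit is its average
# marginal contribution to conversion probability over all orderings.
def shapley_credit(channels, conv_prob):
    """conv_prob maps a frozenset of channels to conversion probability."""
    n = len(channels)
    credit = {c: 0.0 for c in channels}
    for c in channels:
        others = [x for x in channels if x != c]
        for k in range(n):
            for subset in combinations(others, k):
                weight = factorial(k) * factorial(n - k - 1) / factorial(n)
                marginal = (conv_prob(frozenset(subset) | {c})
                            - conv_prob(frozenset(subset)))
                credit[c] += weight * marginal
    return credit
```

The exact enumeration is exponential in the number of channels, which is why production implementations sample orderings or restrict to observed path prefixes.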

Incrementality testing

Attribution models tell you which channels get credit; incrementality testing tells you which channels actually cause conversions. Run geo-based or user-based holdout experiments:

  1. Define test and control — randomly assign geographic regions (or user cohorts) to treatment (see ads) and control (don't see ads)
  2. Measure conversion difference — the gap between treatment and control conversion rates is the true incremental lift
  3. Calculate iROAS — incremental Return on Ad Spend = (incremental revenue) / (ad spend in treatment)

Store experiment configurations and results in your gold layer for longitudinal tracking.
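The arithmetic behind steps 2 and 3 is straightforward:

```python
# Incrementality math: the treatment/control conversion-rate gap gives
# incremental conversions; iROAS divides the incremental revenue by
# the ad spend that produced it.
def incremental_lift(treat_conv_rate, ctrl_conv_rate, treat_population):
    return (treat_conv_rate - ctrl_conv_rate) * treat_population

def iroas(incremental_conversions, avg_order_value, ad_spend):
    return incremental_conversions * avg_order_value / ad_spend
```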

Trade-offs

| Give | Get |
| --- | --- |
| Real-time identity resolution (streaming matching) | More responsive personalization but higher infrastructure cost and complexity |
| Probabilistic matching (fuzzy name/address) | Higher match rate but risk of false matches polluting Customer 360 |
| Centralized CDP (gold layer serves all channels) | Single source of truth but latency for activation channels |
| Data-driven attribution (Shapley values) | More accurate credit assignment but requires significant data volume and compute |
| Tokenization at POS (no raw PAN in analytics) | Minimal PCI scope but lose ability to do card-level analytics across merchants |

Retail demand forecasting example

For a complete end-to-end walkthrough of demand forecasting on this platform, see:

Example — Retail Demand Forecasting