Skip to content

🛠️ Azure Analytics Services Documentation

Status Services Coverage

Comprehensive documentation for all Azure analytics services, organized by service category.


🎯 Service Categories Overview

This section provides detailed documentation for Azure analytics services, organized into logical categories based on their primary function and use cases.

graph TB
    subgraph "Analytics Compute"
        AC1[Azure Synapse Analytics]
        AC2[Azure Databricks]
        AC3[HDInsight]
    end

    subgraph "Streaming Services"
        SS1[Stream Analytics]
        SS2[Event Hubs]
        SS3[Event Grid]
    end

    subgraph "Storage Services"
        ST1[Data Lake Gen2]
        ST2[Cosmos DB]
        ST3[Azure SQL Database]
    end

    subgraph "Orchestration Services"
        OS1[Data Factory]
        OS2[Logic Apps]
    end

    AC1 --> ST1
    AC2 --> ST1
    SS1 --> ST1
    SS1 --> ST2
    SS2 --> SS1
    OS1 --> AC1
    OS1 --> ST1

💾 Analytics Compute Services

🎯 Azure Synapse Analytics

Complexity Tier

Unified analytics service combining data integration, data warehousing, and analytics.

Key Features:

  • Serverless SQL Pools: Query data directly from data lake
  • Dedicated SQL Pools: Enterprise data warehousing
  • Spark Pools: Big data processing and ML
  • Data Integration: Built-in ETL/ELT pipelines

Documentation Sections:

Best For: Enterprise data warehousing, unified analytics workspaces, large-scale data processing


🧪 Azure Databricks

Complexity Tier

Collaborative analytics platform optimized for machine learning and data science.

Key Features:

  • Collaborative Notebooks: Multi-language data science environment
  • Delta Live Tables: Declarative ETL framework
  • MLflow Integration: End-to-end ML lifecycle management
  • Unity Catalog: Unified data governance

Documentation Sections:

Best For: Data science & ML, collaborative analytics, advanced data engineering


🐘 HDInsight

Complexity Tier

Managed Apache Hadoop, Spark, and Kafka clusters in Azure.

Key Features:

  • Multiple Cluster Types: Hadoop, Spark, HBase, Kafka, Storm
  • Enterprise Security: ESP integration with Active Directory
  • Custom Applications: Support for custom Hadoop ecosystem tools
  • Hybrid Connectivity: Integration with on-premises systems

Documentation Sections:

Best For: Hadoop migration to cloud, custom big data applications, cost-optimized processing


🔄 Streaming Services

Azure Stream Analytics

Complexity Type

Real-time analytics service for streaming data processing.

Key Features:

  • SQL-based Queries: Familiar SQL syntax for stream processing
  • Windowing Functions: Tumbling, hopping, and sliding windows
  • Anomaly Detection: Built-in ML-based anomaly detection
  • Edge Deployment: Run analytics on IoT Edge devices

Documentation Sections:

Best For: IoT analytics, real-time dashboards, fraud detection, operational monitoring


📨 Azure Event Hubs

Complexity Type

Big data streaming platform and event ingestion service.

Key Features:

  • High Throughput: Millions of events per second
  • Kafka Compatibility: Drop-in replacement for Apache Kafka
  • Capture Feature: Automatic data archival to storage
  • Schema Registry: Centralized schema management

Documentation Sections:

Best For: High-volume event ingestion, Kafka migration, event-driven architectures


🌐 Azure Event Grid

Complexity Type

Event routing service for building event-driven applications.

Key Features:

  • Event Routing: Intelligent event routing to multiple destinations
  • Custom Topics: Create custom event publishers
  • System Topics: Built-in events from Azure services
  • Event Filtering: Route events based on content

Documentation Sections:

Best For: Event-driven applications, serverless workflows, system integration


🗃️ Storage Services

🏞️ Azure Data Lake Storage Gen2

Complexity Type

Hierarchical namespace storage optimized for big data analytics.

Key Features:

  • Hierarchical Namespace: Directory and file-level operations
  • Fine-grained ACLs: POSIX-compliant access control
  • Multi-protocol Access: Blob and Data Lake APIs
  • Lifecycle Management: Automated data tiering and archival

Documentation Sections:

Best For: Data lake implementations, big data analytics storage, data archival


🌌 Azure Cosmos DB

Complexity Type

Globally distributed, multi-model NoSQL database service.

Key Features:

  • Multiple APIs: SQL, MongoDB, Cassandra, Gremlin, Table
  • Global Distribution: Multi-region writes and reads
  • Analytical Store: HTAP capabilities with Synapse Link
  • Change Feed: Real-time change data capture

Documentation Sections:

Best For: Globally distributed applications, real-time low-latency apps, HTAP workloads


🗄️ Azure SQL Database

Complexity Type

Fully managed relational database service.

Key Features:

  • Hyperscale: Massively scalable database architecture
  • Elastic Pools: Shared resources across multiple databases
  • Built-in Intelligence: Automatic tuning and threat detection
  • Always Encrypted: Column-level encryption

Documentation Sections:

Best For: Relational data workloads, transactional applications, data marts


🔧 Orchestration Services

🏗️ Azure Data Factory

Complexity Type

Cloud-based data integration service for creating ETL/ELT pipelines.

Key Features:

  • Code-free ETL: Visual pipeline designer
  • Data Flows: Transformation logic with Spark execution
  • Hybrid Integration: On-premises and cloud data sources
  • CI/CD Support: Azure DevOps and GitHub integration

Documentation Sections:

Best For: Data integration pipelines, ETL/ELT processes, data migration


Azure Logic Apps

Complexity Type

Serverless workflow automation service.

Key Features:

  • Visual Designer: Drag-and-drop workflow creation
  • 300+ Connectors: Pre-built connectors for popular services
  • B2B Integration: EDI and AS2 support
  • Event-driven: Trigger-based workflow execution

Documentation Sections:

Best For: Business process automation, system integrations, event-driven workflows


🎯 Service Selection Matrix

By Use Case

Use Case Primary Service Supporting Services Architecture Pattern
Real-time Analytics Stream Analytics Event Hubs, Cosmos DB Lambda Architecture
Enterprise Data Warehouse Synapse Dedicated SQL Data Lake Gen2, Data Factory Batch Architectures
Data Science & ML Databricks Data Lake Gen2, MLflow Architecture Patterns
IoT Analytics Stream Analytics + Event Hubs Data Lake Gen2, Cosmos DB Streaming Architectures
Data Lake Implementation Data Lake Gen2 + Synapse Data Factory, Purview Medallion Architecture

By Data Volume & Complexity

Data Volume Recommended Services Cost Tier
< 1TB Azure SQL, Cosmos DB, Stream Analytics $
1-100TB Synapse Dedicated, Databricks, HDInsight $$
> 100TB Synapse Serverless, Data Lake Gen2, Event Hubs $

📊 Getting Started Recommendations

🚀 Beginners

Start with these services for simpler implementations:

  1. Azure SQL Database - Familiar relational database
  2. Azure Data Factory - Visual ETL pipeline designer
  3. Event Grid - Simple event routing
  4. Stream Analytics - SQL-based stream processing

🔧 Intermediate Users

Move to these for more complex scenarios:

  1. Synapse Serverless SQL - Query data lake without infrastructure
  2. Event Hubs - High-throughput event streaming
  3. Cosmos DB - Multi-model NoSQL database
  4. Data Lake Storage Gen2 - Scalable data lake foundation

🎯 Advanced Users

Leverage these for enterprise-scale implementations:

  1. Synapse Dedicated SQL Pools - Enterprise data warehousing
  2. Databricks - Advanced analytics and ML
  3. HDInsight - Custom big data solutions
  4. Event Hubs Dedicated Clusters - Maximum performance and isolation

🔗 Quick Navigation

📖 By Documentation Type

🎯 By Use Case


Last Updated: 2025-01-28
Total Services Documented: 11
Coverage: 95%