Tutorial 14: CI/CD Setup¶

Overview¶

This tutorial covers implementing CI/CD (Continuous Integration/Continuous Deployment) for Azure Synapse Analytics, including Git integration, Azure DevOps pipelines, and deployment automation for notebooks, pipelines, and SQL scripts.

Prerequisites¶

Completed Tutorial 13: Monitoring and Diagnostics
Azure DevOps or GitHub account
Git fundamentals
Understanding of deployment concepts

Learning Objectives¶

By the end of this tutorial, you will be able to:

Configure Git integration for Synapse
Set up Azure DevOps pipelines
Implement deployment strategies
Automate testing and validation
Manage environment promotions

Section 1: Git Integration¶

Connecting Synapse to Git¶

```text┌─────────────────────────────────────────────────────────────────┐ │ Synapse Git Integration │ ├─────────────────────────────────────────────────────────────────┤ │ │ │ Development Workspace │ │ └── Connected to: feature/xyz branch │ │ ├── notebooks/ │ │ ├── pipelines/ │ │ ├── dataflows/ │ │ ├── linkedServices/ │ │ └── sqlscripts/ │ │ │ │ ┌────────────────┐ │ │ │ Collaborate │ ─── feature/xyz ───▶ Pull Request │ │ └────────────────┘ ▼ │ │ main │ │ ▼ │ │ ┌─────────────┐ │ │ │ Publish to │ │ │ │ Production │ │ │ └─────────────┘ │ │ │ └─────────────────────────────────────────────────────────────────┘

### Repository Structure

```textsynapse-workspace/
├── .gitignore
├── workspace.json
├── publish_config.json
├── notebook/
│   ├── DataProcessing.json
│   ├── ETL_Pipeline.json
│   └── Reporting.json
├── pipeline/
│   ├── DailyIngestion.json
│   ├── WeeklyAggregation.json
│   └── DataQuality.json
├── dataflow/
│   ├── TransformSales.json
│   └── CleanseCustomer.json
├── linkedService/
│   ├── AzureDataLakeStorage.json
│   ├── AzureSQLDatabase.json
│   └── PowerBI.json
├── integrationRuntime/
│   └── SelfHostedIR.json
├── sqlscript/
│   ├── CreateTables.json
│   ├── StoredProcedures.json
│   └── Views.json
├── credential/
│   └── ManagedIdentity.json
└── trigger/
    ├── DailyTrigger.json
    └── EventTrigger.json

Branch Strategy¶

# Branch naming convention
main:           # Production-ready code
  └── release/*   # Release branches for staging
      └── develop   # Integration branch
          └── feature/*  # Feature development
          └── bugfix/*   # Bug fixes
          └── hotfix/*   # Production hotfixes

Section 2: Azure DevOps Pipeline Setup¶

Service Connections¶

# Create service principal for deployment
az ad sp create-for-rbac \
    --name "synapse-cicd-sp" \
    --role "Synapse Administrator" \
    --scopes "/subscriptions/<sub>/resourceGroups/<rg>/providers/Microsoft.Synapse/workspaces/<workspace>"

# Grant additional permissions
az role assignment create \
    --assignee <sp-client-id> \
    --role "Storage Blob Data Contributor" \
    --scope "/subscriptions/<sub>/resourceGroups/<rg>/providers/Microsoft.Storage/storageAccounts/<account>"

CI Pipeline (azure-pipelines-ci.yml)¶

# CI Pipeline - Validate and Build
trigger:
  branches:
    include:
      - develop
      - feature/*
  paths:
    include:
      - synapse-workspace/*

pool:
  vmImage: 'ubuntu-latest'

variables:
  workspaceName: 'synapse-dev'
  resourceGroup: 'rg-synapse-dev'
  subscriptionId: '$(AZURE_SUBSCRIPTION_ID)'

stages:
  - stage: Validate
    displayName: 'Validate Synapse Artifacts'
    jobs:
      - job: ValidateArtifacts
        displayName: 'Validate Workspace Artifacts'
        steps:
          - checkout: self

          - task: AzureCLI@2
            displayName: 'Install Synapse CLI Extension'
            inputs:
              azureSubscription: 'synapse-service-connection'
              scriptType: 'bash'
              scriptLocation: 'inlineScript'
              inlineScript: |
                az extension add --name synapse --yes

          - task: AzureCLI@2
            displayName: 'Validate Workspace'
            inputs:
              azureSubscription: 'synapse-service-connection'
              scriptType: 'bash'
              scriptLocation: 'inlineScript'
              inlineScript: |
                cd synapse-workspace

                # Validate JSON syntax
                for file in $(find . -name "*.json"); do
                  echo "Validating: $file"
                  python -m json.tool "$file" > /dev/null
                done

                # Validate pipeline definitions
                for pipeline in pipeline/*.json; do
                  echo "Checking pipeline: $pipeline"
                  jq '.properties.activities | length' "$pipeline"
                done

  - stage: Test
    displayName: 'Run Tests'
    dependsOn: Validate
    jobs:
      - job: UnitTests
        displayName: 'Run Unit Tests'
        steps:
          - task: UsePythonVersion@0
            inputs:
              versionSpec: '3.9'

          - script: |
              pip install pytest pytest-cov pyspark
              pytest tests/ -v --junitxml=test-results.xml
            displayName: 'Run Python Tests'

          - task: PublishTestResults@2
            inputs:
              testResultsFiles: '**/test-results.xml'
              testRunTitle: 'Synapse Unit Tests'

  - stage: Build
    displayName: 'Build Artifacts'
    dependsOn: Test
    jobs:
      - job: BuildArtifacts
        displayName: 'Package Artifacts'
        steps:
          - task: CopyFiles@2
            displayName: 'Copy Synapse Artifacts'
            inputs:
              SourceFolder: 'synapse-workspace'
              Contents: '**'
              TargetFolder: '$(Build.ArtifactStagingDirectory)/synapse'

          - task: CopyFiles@2
            displayName: 'Copy Deployment Scripts'
            inputs:
              SourceFolder: 'deployment'
              Contents: '**'
              TargetFolder: '$(Build.ArtifactStagingDirectory)/deployment'

          - task: PublishBuildArtifacts@1
            displayName: 'Publish Artifacts'
            inputs:
              PathtoPublish: '$(Build.ArtifactStagingDirectory)'
              ArtifactName: 'synapse-artifacts'

CD Pipeline (azure-pipelines-cd.yml)¶

# CD Pipeline - Deploy to Environments
trigger: none

resources:
  pipelines:
    - pipeline: ci-pipeline
      source: 'Synapse-CI'
      trigger:
        branches:
          include:
            - develop
            - main

pool:
  vmImage: 'ubuntu-latest'

variables:
  - group: synapse-variables

stages:
  - stage: DeployDev
    displayName: 'Deploy to Development'
    condition: and(succeeded(), eq(variables['Build.SourceBranch'], 'refs/heads/develop'))
    variables:
      environment: 'dev'
      workspaceName: 'synapse-dev'
      resourceGroup: 'rg-synapse-dev'
    jobs:
      - deployment: DeployToDevWorkspace
        displayName: 'Deploy to Dev Workspace'
        environment: 'synapse-dev'
        strategy:
          runOnce:
            deploy:
              steps:
                - template: templates/deploy-synapse.yml
                  parameters:
                    workspaceName: $(workspaceName)
                    resourceGroup: $(resourceGroup)
                    environment: $(environment)

  - stage: DeployStaging
    displayName: 'Deploy to Staging'
    dependsOn: DeployDev
    condition: and(succeeded(), eq(variables['Build.SourceBranch'], 'refs/heads/develop'))
    variables:
      environment: 'staging'
      workspaceName: 'synapse-staging'
      resourceGroup: 'rg-synapse-staging'
    jobs:
      - deployment: DeployToStagingWorkspace
        displayName: 'Deploy to Staging Workspace'
        environment: 'synapse-staging'
        strategy:
          runOnce:
            deploy:
              steps:
                - template: templates/deploy-synapse.yml
                  parameters:
                    workspaceName: $(workspaceName)
                    resourceGroup: $(resourceGroup)
                    environment: $(environment)

  - stage: DeployProd
    displayName: 'Deploy to Production'
    dependsOn: DeployStaging
    condition: and(succeeded(), eq(variables['Build.SourceBranch'], 'refs/heads/main'))
    variables:
      environment: 'prod'
      workspaceName: 'synapse-prod'
      resourceGroup: 'rg-synapse-prod'
    jobs:
      - deployment: DeployToProdWorkspace
        displayName: 'Deploy to Production Workspace'
        environment: 'synapse-prod'
        strategy:
          runOnce:
            deploy:
              steps:
                - template: templates/deploy-synapse.yml
                  parameters:
                    workspaceName: $(workspaceName)
                    resourceGroup: $(resourceGroup)
                    environment: $(environment)

Deployment Template (templates/deploy-synapse.yml)¶

parameters:
  - name: workspaceName
    type: string
  - name: resourceGroup
    type: string
  - name: environment
    type: string

steps:
  - download: ci-pipeline
    artifact: synapse-artifacts

  - task: AzureCLI@2
    displayName: 'Install Dependencies'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        az extension add --name synapse --yes
        pip install azure-synapse-artifacts

  - task: AzureCLI@2
    displayName: 'Stop Triggers'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        # Stop all triggers before deployment
        triggers=$(az synapse trigger list \
          --workspace-name ${{ parameters.workspaceName }} \
          --query "[?properties.runtimeState=='Started'].name" -o tsv)

        for trigger in $triggers; do
          echo "Stopping trigger: $trigger"
          az synapse trigger stop \
            --workspace-name ${{ parameters.workspaceName }} \
            --name "$trigger"
        done

  - task: AzureCLI@2
    displayName: 'Deploy Linked Services'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        cd $(Pipeline.Workspace)/ci-pipeline/synapse-artifacts/synapse/linkedService

        for file in *.json; do
          name="${file%.json}"
          echo "Deploying linked service: $name"

          # Apply environment-specific overrides
          cat "$file" | envsubst > "${file}.temp"

          az synapse linked-service create \
            --workspace-name ${{ parameters.workspaceName }} \
            --name "$name" \
            --file "@${file}.temp"
        done

  - task: AzureCLI@2
    displayName: 'Deploy Datasets'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        cd $(Pipeline.Workspace)/ci-pipeline/synapse-artifacts/synapse/dataset

        for file in *.json; do
          name="${file%.json}"
          echo "Deploying dataset: $name"

          az synapse dataset create \
            --workspace-name ${{ parameters.workspaceName }} \
            --name "$name" \
            --file "@$file"
        done

  - task: AzureCLI@2
    displayName: 'Deploy Notebooks'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        cd $(Pipeline.Workspace)/ci-pipeline/synapse-artifacts/synapse/notebook

        for file in *.json; do
          name="${file%.json}"
          echo "Deploying notebook: $name"

          az synapse notebook import \
            --workspace-name ${{ parameters.workspaceName }} \
            --name "$name" \
            --file "@$file"
        done

  - task: AzureCLI@2
    displayName: 'Deploy Pipelines'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        cd $(Pipeline.Workspace)/ci-pipeline/synapse-artifacts/synapse/pipeline

        for file in *.json; do
          name="${file%.json}"
          echo "Deploying pipeline: $name"

          az synapse pipeline create \
            --workspace-name ${{ parameters.workspaceName }} \
            --name "$name" \
            --file "@$file"
        done

  - task: AzureCLI@2
    displayName: 'Deploy SQL Scripts'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        cd $(Pipeline.Workspace)/ci-pipeline/synapse-artifacts/synapse/sqlscript

        for file in *.json; do
          name="${file%.json}"
          echo "Deploying SQL script: $name"

          az synapse sql-script import \
            --workspace-name ${{ parameters.workspaceName }} \
            --name "$name" \
            --file "@$file"
        done

  - task: AzureCLI@2
    displayName: 'Deploy Triggers'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        cd $(Pipeline.Workspace)/ci-pipeline/synapse-artifacts/synapse/trigger

        for file in *.json; do
          name="${file%.json}"
          echo "Deploying trigger: $name"

          az synapse trigger create \
            --workspace-name ${{ parameters.workspaceName }} \
            --name "$name" \
            --file "@$file"
        done

  - task: AzureCLI@2
    displayName: 'Start Triggers'
    inputs:
      azureSubscription: 'synapse-service-connection'
      scriptType: 'bash'
      scriptLocation: 'inlineScript'
      inlineScript: |
        # Start triggers after deployment
        triggers=$(az synapse trigger list \
          --workspace-name ${{ parameters.workspaceName }} \
          --query "[?properties.runtimeState=='Stopped'].name" -o tsv)

        for trigger in $triggers; do
          echo "Starting trigger: $trigger"
          az synapse trigger start \
            --workspace-name ${{ parameters.workspaceName }} \
            --name "$trigger"
        done

Section 3: SQL Pool Deployment¶

Database Project Structure¶

```textsql-database/ ├── dbo/ │ ├── Tables/ │ │ ├── fact.Sales.sql │ ```s -- -- -- -- -- -- IF IF

Area	Recommendation
Branching	Use GitFlow or trunk-based development
Testing	Automate validation and integration tests
Secrets	Use Key Vault, never commit secrets
Environments	Maintain parity between dev/staging/prod
Deployments	Use incremental deployments when possible
Rollback	Always have a rollback strategy

Tutorial 14: CI/CD Setup¶

Overview¶

Prerequisites¶

Learning Objectives¶

Section 1: Git Integration¶

Connecting Synapse to Git¶

Branch Strategy¶

Section 2: Azure DevOps Pipeline Setup¶

Service Connections¶

CI Pipeline (azure-pipelines-ci.yml)¶

CD Pipeline (azure-pipelines-cd.yml)¶

Deployment Template (templates/deploy-synapse.yml)¶

Section 3: SQL Pool Deployment¶

Database Project Structure¶

SQL Deployment Script¶

Section 4: Testing Framework¶

Pipeline Testing¶

Integration Testing¶

Section 5: Environment Configuration¶

Parameter Files¶

Environment Variable Substitution¶

Exercises¶

Exercise 1: Set Up CI/CD Pipeline¶

Exercise 2: Implement Database Migrations¶

Exercise 3: Add Testing¶

Best Practices Summary¶

Summary¶

Additional Resources¶