Purpose: This document outlines everything participants need to have in place before the workshop begins. Completing these steps ahead of time ensures you can focus on building, not troubleshooting setup issues during the session.
How the workshop runs: The workshop is delivered as a Databricks App backed by Lakebase (managed PostgreSQL) that guides participants through the framework step by step — from Gold layer design through GenAI agent deployment. Participants use an AI-powered IDE alongside the App to build production-grade Databricks solutions.
| # | Prerequisite | Owner | Estimated Time |
|---|---|---|---|
| 1 | Workspace access for participants | Admin | 1–2 days (AD group provisioning) |
| 2 | Unity Catalog access with schema creation privileges | Admin | 30 min |
| 3 | Serverless SQL Warehouse access | Admin | 15 min |
| 4 | Serverless General Compute access (budget policy) | Admin | 15 min |
| 5 | Databricks Apps enabled with Lakebase access | Admin | 30 min |
| 6 | Install an AI-powered IDE | Participant | 15 min |
| 7 | Provision IDE licenses / AI model access | Admin + Participant | Varies |
| 8 | Access to Claude Sonnet 4.5 (or higher) | Participant | 10 min |
| 9 | Install the Databricks CLI | Participant | 10 min |
| 10 | Install Node.js v22+ and Git | Participant | 10 min |
| 11 | Authenticate to the workspace and validate connectivity | Participant | 10 min |
These steps are typically performed by a Workspace Admin or Account Admin in advance.
Participants must be able to log in to the Databricks workspace used for the workshop.
- Recommended approach: Create a dedicated AD (Active Directory) group (e.g.,
workshop-participants) and assign all attendees to it. - Grant the AD group access to the target Databricks workspace.
- Verify that each participant can successfully log in to the workspace URL.
Tip: Send participants the workspace URL and ask them to confirm login access at least 48 hours before the workshop.
Each participant (or team) needs the ability to create and manage their own schema within a designated catalog.
- Create a workshop catalog (e.g.,
workshoporworkshop_<date>) or designate an existing one. - Grant the AD group the following privileges:
-- Grant catalog-level usage
GRANT USE CATALOG ON CATALOG workshop TO `workshop-participants`;
-- Grant the ability to create schemas within the catalog
GRANT CREATE SCHEMA ON CATALOG workshop TO `workshop-participants`;- Each participant will create their own schema during the workshop (e.g.,
workshop.john_doe).
Why: The workshop involves creating Bronze, Silver, and Gold layer tables, metric views, and other assets. Participants need their own isolated schema to avoid conflicts.
Participants need access to a Serverless SQL Warehouse for running queries, creating metric views, TVFs, and Genie Spaces.
- Create a shared Serverless SQL Warehouse (e.g.,
Workshop SQL Warehouse) or use an existing one. - Grant the AD group
CAN USEpermission on the warehouse. - Ensure the warehouse is set to Serverless (not Classic or Pro).
Sizing guidance: A
SmallServerless SQL Warehouse is typically sufficient for workshop-sized workloads.
Participants need access to Serverless General Compute for running notebooks and jobs.
- Enable Serverless compute for the workspace (if not already enabled).
- Create a budget policy for workshop participants to control cost:
- Navigate to Compute > Budget Policies in the workspace.
- Create a policy (e.g.,
workshop-budget-policy) with appropriate limits. - Assign the policy to the
workshop-participantsAD group.
- Grant the AD group permission to create and use Serverless compute.
Important: Without a budget policy, participants may not be able to launch Serverless compute. Verify this is configured before the workshop.
The workshop is delivered as a Databricks App that guides participants through the framework step by step. This requires Databricks Apps and Lakebase (managed PostgreSQL) to be enabled in the workspace.
- Navigate to Workspace Settings > Compute > Databricks Apps.
- Ensure Apps are enabled for the workspace.
- Grant the AD group the Consumer entitlement so participants can access deployed Apps:
- Navigate to Workspace Settings > Identity and access > Groups.
- Select the
workshop-participantsgroup. - Under Entitlements, enable Consumer.
Note: Databricks Apps run on dedicated Serverless compute. No additional cluster configuration is required.
The workshop App uses Lakebase as its backend database for tracking participant progress, storing session state, and managing the guided workflow.
- Ensure Lakebase is enabled for the workspace:
- Navigate to Workspace Settings > Compute > Lakebase.
- Enable the feature if not already active.
- Grant the AD group permission to create and access Lakebase databases:
-- Grant Lakebase usage (if using Unity Catalog-governed access)
GRANT USE CATALOG ON CATALOG workshop TO `workshop-participants`;
GRANT CREATE SCHEMA ON CATALOG workshop TO `workshop-participants`;Why Lakebase: The workshop App uses a managed PostgreSQL instance (Lakebase) to persist application state, guide participants through each stage, and store configuration. This eliminates the need for external database provisioning.
Before the workshop, the organizer should deploy the workshop App and verify it starts successfully:
# Deploy the workshop app (run from the project root)
databricks apps create --name workshop-guide --profile DEFAULT
# Verify the app is running
databricks apps get --name workshop-guide --profile DEFAULTParticipants should be able to access the App URL from their browser once it's deployed.
Sizing guidance: Lakebase instances for workshop use are lightweight — the default configuration is sufficient for up to 50 concurrent participants.
These steps must be completed by each participant on their own machine.
Install one of the following AI-powered IDEs on your machine:
| IDE | Download Link | Notes |
|---|---|---|
| Cursor (recommended) | cursor.com | Built-in AI coding assistant |
| Windsurf | windsurf.com | AI-native code editor |
| VS Code + GitHub Copilot | code.visualstudio.com | Requires Copilot extension |
Corporate environments: If your organization restricts software installation, submit a software exception request through your IT portal before the workshop.
Depending on your chosen IDE, you may need to provision licenses:
| IDE | License Requirement |
|---|---|
| Cursor | Pro or Business plan recommended for full AI features |
| Windsurf | Check your organization's SSO portal for access |
| VS Code + Copilot | GitHub Copilot Individual or Business subscription |
Verify your license is active by opening the IDE and confirming the AI assistant responds to prompts.
The workshop leverages advanced AI models for code generation. You need access to Claude Sonnet 4.5 (or higher, such as Claude Opus) through your IDE.
- Cursor: Go to Settings > Models and ensure Claude Sonnet 4.5 (or higher) is enabled.
- Windsurf: Verify model availability in your plan settings.
- VS Code + Copilot: Copilot uses its own models; ensure your subscription is active.
Why Claude Sonnet 4.5+: The workshop's agent skills and prompts are optimized for advanced reasoning capabilities. Lower-tier models may produce less reliable results.
The Databricks CLI is required for authentication and deploying Asset Bundles.
brew tap databricks/tap
brew install databrickswinget install Databricks.DatabricksCLIcurl -fsSL https://raw.githubusercontent.com/databricks/setup-cli/main/install.sh | shdatabricks --versionYou should see output like Databricks CLI v0.x.x. Any recent version is acceptable.
Troubleshooting: If you encounter security-related installation issues in a corporate environment, submit a software exception request through your IT portal.
The AppKit workshop (Path A) requires Node.js v22+ for scaffolding and running Databricks Apps locally. Git is required for cloning the template and installing agent skills.
Download and install from nodejs.org (LTS recommended).
node --versionYou should see v22.x.x or higher.
Download and install from git-scm.com if not already present.
git --versionNote: If you are only following Path B (Data Product Accelerator) and not building a Databricks App, Node.js is not required. Git is still needed for cloning the template.
Once the CLI is installed, configure it to connect to your workshop workspace.
- Log in to your Databricks workspace in a browser.
- Click your user icon (top-right corner) > Settings.
- Navigate to Developer > Access Tokens.
- Click Generate New Token.
- Provide a comment (e.g.,
workshop-token) and set an expiration (e.g., 7 days). - Copy the token immediately — you won't be able to see it again.
Open a terminal in your IDE and run:
databricks configure --profile DEFAULTWhen prompted, enter:
| Prompt | Value |
|---|---|
| Databricks Host | https://<your-workspace-url> (provided by workshop organizer) |
| Personal Access Token | The token you generated in Step 11a |
Run the following command to confirm authentication:
databricks auth env --profile DEFAULTExpected output (values will vary):
DATABRICKS_HOST=https://<your-workspace-url>
DATABRICKS_TOKEN=dapi********************************
Additionally, verify you can list workspace contents:
databricks workspace list / --profile DEFAULTYou should see a list of top-level workspace folders (e.g., /Repos, /Users, /Shared).
If authentication fails:
- Double-check the workspace URL (no trailing slash).
- Regenerate your PAT and reconfigure.
- Ensure your AD group membership has been provisioned (Step 1).
- If using OAuth instead of PAT, refer to the Databricks CLI authentication docs.
Use this checklist to confirm everything is ready:
- AD group created and all participants added
- Participants can log in to the workspace
- Workshop catalog created with
CREATE SCHEMAprivileges granted - Serverless SQL Warehouse provisioned and accessible
- Serverless General Compute enabled with budget policy assigned
- Databricks Apps enabled in workspace settings
- Lakebase enabled in workspace settings
- AD group granted Consumer entitlement
- Workshop App deployed and accessible at its URL
- AI-powered IDE installed and licensed
- Claude Sonnet 4.5 (or higher) model accessible in IDE
- Databricks CLI installed (
databricks --versionworks) - Node.js v22+ installed (
node --versionreturns v22+) — required for Path A (AppKit) - Git installed (
git --versionworks) - CLI authenticated (
databricks auth env --profile DEFAULTreturns valid credentials) - Workspace accessible (
databricks workspace list / --profile DEFAULTreturns results) - Workshop App URL loads successfully in browser
| Issue | Resolution |
|---|---|
| Cannot log in to workspace | Confirm AD group membership with your admin. Allow up to 24 hours for provisioning. |
| "Permission denied" on catalog | Admin needs to run the GRANT statements from Step 2. |
| Serverless compute not available | Admin needs to enable Serverless compute and assign a budget policy (Step 4). |
| Databricks Apps not available | Admin needs to enable Apps in Workspace Settings > Compute > Databricks Apps (Step 5). |
| Lakebase not available | Admin needs to enable Lakebase in Workspace Settings > Compute > Lakebase (Step 5). |
| Workshop App fails to start | Check that Lakebase is enabled, the AD group has the Consumer entitlement, and the app was deployed with correct permissions. Review app logs via databricks apps get-logs --name workshop-guide. |
| CLI install blocked by corporate policy | Submit a software exception request through your IT portal. |
databricks auth env returns an error |
Regenerate your PAT and reconfigure with databricks configure --profile DEFAULT. |
| IDE AI features not working | Verify your license/subscription is active and the correct model is selected. |
If you run into issues completing these prerequisites, please reach out to the workshop organizers before the session so we can troubleshoot together. We want everyone ready to build on day one.