Skip to content

bhavink/databricks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

499 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

I design and implement secure, production-grade Data and AI platforms across Azure, AWS, and GCP. Specializing in Databricks architecture, zero-trust security, and infrastructure automation.

๐ŸŽฏ What I Do

  • ๐Ÿ—๏ธ Build secure data lakehouses with Private Link, Unity Catalog, and data exfiltration protection
  • โ˜๏ธ Multi-cloud Databricks architecture for regulated industries (finance, healthcare, government)
  • โš™๏ธ Infrastructure as Code with modular Terraform templates and automation frameworks
  • ๐Ÿ“ Share knowledge through technical articles and open source contributions

๐Ÿ“š Recent Work

Latest Articles (13+ published on Databricks Blog):

๐Ÿ’ก Core Expertise

Security           Infrastructure        Multi-Cloud
โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”  โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”  โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”
โ€ข DEP Frameworks   โ€ข Terraform Modules   โ€ข Azure (ADB)
โ€ข Unity Catalog    โ€ข CI/CD Pipelines     โ€ข AWS (DB)
โ€ข Private Link     โ€ข Config Management   โ€ข GCP (DB)
โ€ข CMK/Encryption   โ€ข Custom Agents       โ€ข VNet/VPC/VPC-SC
โ€ข Network Security โ€ข Automation          โ€ข Cross-Cloud

๐Ÿ“ซ Connect


"Building secure, scalable data platforms that enable innovation while protecting what matters most."


Repository Contents: All Things Databricks โœ…

This repository contains production-ready infrastructure templates, ready-to-use code samples, how-to guides, and deployment architectures to help you learn and operate the Databricks Lakehouse on Azure, AWS, and GCP.


Quick Links ๐Ÿ”—

Cloud Description Path
๐Ÿ“– Guides Cross-cloud guides (authentication, networking, troubleshooting) guides
๐Ÿค– AI Governance Authentication & authorization for Agent Bricks, Genie, Databricks Apps ai-governance
๐Ÿ”ท Azure Production-ready security & modular Terraform deployment patterns adb4u
โ˜๏ธ AWS Private Link workspace templates with DEP controls awsdb4u
๐ŸŸข GCP VPC-SC, Private Service Connect, CMEK implementations gcpdb4u
๐Ÿ› ๏ธ Utils Utilities and helper scripts databricks-utils
๐Ÿ“ฆ Archive Legacy content and code samples archive

๐Ÿ“– Cross-Cloud Guides (Start Here!)

New to Databricks infrastructure? Check out our comprehensive guides:

Building AI Applications? Check out our AI governance guide:

  • AI Governance Guide - Production-ready authentication & authorization patterns for:
    • ๐Ÿ”ฎ Genie Space - Multi-team access, 1000+ users with complex UC governance
    • ๐Ÿค– Agent Bricks - Knowledge Assistant, Information Extraction, Multi-Agent Supervisor, Custom LLM
    • ๐Ÿ“ฑ Databricks Apps - App authorization vs user authorization patterns
    • Includes real-world scenarios mapped to official use cases

๐ŸŒฉ๏ธ Databricks Deployment Guides by Cloud

๐Ÿ”ท Azure (adb4u)

Production-Ready Modular Terraform Templates

  • โœ… Focus: Security, governance, and production-ready deployment patterns
  • ๐Ÿ—๏ธ Architecture: Non-PL, Full Private (air-gapped), Hub-Spoke with firewall
  • ๐Ÿ” Security: Unity Catalog, Private Link, NPIP/SCC, CMK, Service Endpoints
  • ๐Ÿ“š Documentation: 2,300+ lines with UML diagrams, traffic flows, troubleshooting guides
  • ๐Ÿ“ Path: adb4u/

Key Features:

  • Modular Terraform structure (Networking, Workspace, Unity Catalog, Key Vault)
  • BYOV (Bring Your Own VNet/Subnet/NSG) support
  • Automated NSG rule management for SCC workspaces
  • Customer-Managed Keys with auto-rotation
  • Comprehensive deployment checklists and troubleshooting

Quick Start: See adb4u/docs/01-QUICKSTART.md


โ˜๏ธ AWS (awsdb4u)

Private Link Workspace Templates with DEP Controls

  • ๐ŸŽฏ Focus: Deploying and operating Databricks on AWS with best practices
  • ๐Ÿ” Security: VPC design, Private Link, PrivateLink endpoints, data exfiltration protection
  • ๐Ÿ“Š Topics: S3 data access patterns, IAM roles and policies, cross-account setups
  • ๐Ÿ› ๏ธ Automation: Infrastructure templates and configuration management
  • ๐Ÿ“ Path: awsdb4u/

Key Features:

  • Private Link workspace deployments
  • Data Exfiltration Protection (DEP) controls
  • VPC and subnet design patterns
  • IAM role and policy automation
  • Cross-account setup guidance

๐ŸŸข GCP (gcpdb4u)

VPC-SC, Private Service Connect, CMEK Implementations

  • ๐ŸŽฏ Focus: GCP-specific guidance with emphasis on data plane security
  • ๐Ÿ” Security: VPC-SC perimeters, Private Service Connect, KMS integration
  • ๐ŸŒ Networking: VPC and subnet design, private connectivity patterns
  • ๐Ÿ”‘ Identity: IAM & service accounts, Workload Identity Federation
  • ๐Ÿ“ Path: gcpdb4u/

Key Features:

  • VPC Service Controls (VPC-SC) integration
  • Private Service Connect (PSC) for workspace connectivity
  • Google KMS integration for encryption
  • GCS connectors and data access patterns
  • Data exfiltration prevention patterns

๐Ÿ”ง How to Use This Repository

1. Choose Your Cloud Platform

Pick the folder that matches your target environment:

2. Select Deployment Pattern

Each cloud folder contains multiple deployment patterns:

  • Non-Private Link: Public control plane + private data plane (NPIP)
  • Full Private: Private Link for both control and data planes
  • Hub-Spoke: Centralized networking with egress control

3. Follow Deployment Guides

  • Read the README in your chosen folder
  • Review architecture diagrams and documentation
  • Follow step-by-step deployment instructions
  • Use provided Terraform modules and templates

4. Explore Additional Resources

  • Cross-Cloud Guides: guides/ - Authentication, networking, troubleshooting
  • Utility Scripts: databricks-utils/ - Helper tools and scripts
  • Archive: archive/ - Legacy code samples and REST API collections

๐ŸŒŸ Highlighted Features

Production-Ready Templates

  • โœ… Modular Terraform code with conditional logic
  • โœ… Support for BYOV (Bring Your Own VNet/VPC)
  • โœ… Automated network security group rules
  • โœ… Unity Catalog with regional metastore management

Comprehensive Documentation

  • ๐Ÿ“š 2,300+ lines of detailed guides
  • ๐Ÿ“Š UML architecture and sequence diagrams
  • ๐Ÿ” Traffic flow analysis with cost breakdowns
  • โš ๏ธ Troubleshooting guides and deployment checklists

Security Best Practices

  • ๐Ÿ” Data Exfiltration Protection (DEP) frameworks
  • ๐Ÿ”‘ Customer-Managed Keys (CMK) with auto-rotation
  • ๐ŸŒ Private Link, VPC-SC, and network isolation
  • ๐Ÿ›ก๏ธ Zero-trust architectures for regulated industries

โœจ Contributing

Contributions are welcome! Please:

  1. Open issues for bugs, questions, or feature requests
  2. Submit pull requests for:
    • Documentation improvements
    • Additional cloud scenarios
    • New deployment templates
    • Bug fixes or enhancements

๐Ÿ“„ License

This repository follows the licensing described in the project. Please see the LICENSE file (if present) or reach out for clarification.


๐Ÿ”— Additional Resources