Governance Atlas: Databricks-Native Data Governance with Unity Catalog, Genie, and Lakebase
Every serious governance project eventually reaches the same uncomfortable moment: the platform has the metadata, but the organization still does not have a product. There is a catalog. There are tags. There are comments, owners, lineage events, audit rows, dashboards, policies, and a dozen local rituals around who is allowed to change what. Yet when a steward asks, “Can I safely change this field?”, the answer still arrives as a meeting, a spreadsheet, and a prayer.
Building an AI Billing Agent on Databricks: Anomaly Detection, Genie Analytics, and Governed Write-Back at Scale
Inside the Customer Billing Accelerator from Entrada and Databricks, an agentic AI stack that detects anomalies, answers finance questions in plain English, and writes back to source systems, all governed through Unity Catalog.
DataPact 3.0: Validation, Genie, and the discipline of a curated room
A field report on what changed between DataPact 2.9 and 3.0, why we put a managed Genie space at the centre of the release, and the engineering it takes to make a conversational data quality surface trustworthy enough to call a product.
From Cost Visibility to Action: Scaling FinOps Intelligence with Databricks System Tables and Genie
This post walks through the architecture Entrada built around that observation, the Serverless Cost Control Accelerator, and, more importantly, the design principles behind it. Regardless os whether we’re a platform engineer, SRE, or FinOps lead trying to decide where to invest, the principles matter more than the product.
Building Secure, AI-Ready Medical Research Platforms on Databricks
Research organizations need faster, more reliable ways to prepare sensitive data for analysis without loosening their grip on governance and privacy. Across the medical research platforms we’ve built on Databricks, the same patterns keep proving their worth: cleaner ingestion, standardized de-identification, simpler access to research-ready datasets, and a foundation that holds up when analytics and AI ambitions grow. Here’s what we’ve learned about designing these environments well.
Lakebase: The Death of the Siloed Application Database
Every enterprise manages two separate, expensive database systems: OLTP for real-time transactions and OLAP for analytics. The pipeline connecting them is the most fragile thing in the entire stack. Databricks’ Lakebase makes that pipeline optional, offering a strategic opportunity to collapse two stacks into one and finally deliver the near-real-time data that critical business applications need.
Serverless by Workload Shape: Entrada’s Databricks Playbook for Real Price/Performance
Databricks is directionally right to push serverless. Its current guidance recommends serverless for supported workloads because it is the simplest, most reliable option for notebooks, jobs, and Lakeflow Spark Declarative Pipelines, and its compute selection guidance recommends serverless for most automated workloads while steering SQL tasks toward serverless SQL warehouses.
Fraud Detection at Scale: What It Really Takes
Fraud detection at scale is not just about catching suspicious activity faster. It is about building the data, AI, and governance foundation needed to detect risk reliably, explain decisions, and stay cost-efficient.
Monitoring ML Models in Production with Databricks
Most ML models do not break in development. They break quietly in production, when data changes, performance drifts, and no one notices until business trust is already slipping. That is why monitoring is not an afterthought. It is one of the foundations of enterprise AI.
True CI/CD for the Lakehouse: Infrastructure as Code (IaC) & DABs
There is a conversation I have had more times than I can count. A client tells me their team “already has CI/CD.” When I ask them to walk me through it, the answer usually sounds like this: a developer runs a notebook to completion, exports it, uploads it to a shared folder, and notifies the production team via Slack to “pull the latest version.” That is not CI/CD. That is a deployment ceremony wrapped in good intentions.
Race to the Lakehouse
AI + Data Maturity Assessment
Unity Catalog
Rapid GenAI
Modern Data Connectivity
Gatehouse Security
Health Check
Sample Use Case Library