Skip to main content

DBT Transformation Pipeline

The dbt (data build tool) project transforms raw clinical trial data through a Medallion Architecture into analytics-ready datasets and CDISC-compliant domains.

Enterprise Feature

DBT pipeline and semantic views are available in Enterprise editions.


Overview

DBT Pipeline Architecture


Medallion Architecture

LayerSchemaPurposeModels
BronzebronzeRaw 1:1 source copystg_* staging models
SilversilverCleaned, validated, enrichedint_* intermediate models
GoldgoldAnalytics-ready facts & dimensionsdim_*, fact_* models
SemanticgoldPre-joined business viewssem_* models
CDISCcdiscRegulatory-compliant domainscdisc_* models
Bronze (Raw)  →  Silver (Clean)  →  Gold (Analytics)  →  Semantic (BI)
stg_* int_* dim_*/fact_* sem_*

Key Models

Dimension Tables (dim_*)

ModelDescription
dim_studyStudy master data
dim_siteSite information
dim_subjectSubject demographics
dim_patientPatient records
dim_dateDate dimension

Fact Tables (fact_*)

ModelDescription
fact_enrollmentEnrollment events
fact_adverse_eventAdverse events
fact_visitStudy visits
fact_vital_signVital measurements
fact_lab_resultLab test results

Semantic Views (sem_*)

Pre-joined, business-friendly views for analytics:

ModelDescription
sem_clinical_summaryOne-row-per-subject with all key data
sem_adverse_eventsAE metrics aggregated by subject
sem_enrollment_metricsMonthly enrollment with cumulative totals

CDISC Domains

Regulatory-compliant domains following CDISC SDTM standards:

DomainDescription
cdisc_dmDemographics
cdisc_aeAdverse Events
cdisc_vsVital Signs
cdisc_lbLaboratory Results
cdisc_cmConcomitant Medications

Exposures

DBT exposures document how models are consumed by downstream systems:

ExposureConsumerModels
Cube Semantic LayerCube.devAll dim_*, fact_*, sem_*
CDISC ExportRegulatory submissionsAll cdisc_*
BI DashboardsMetabase, Supersetsem_* views

Quick Commands

# Run full pipeline
dbt run

# Run by layer
dbt run --select staging
dbt run --select dimensions
dbt run --select facts
dbt run --select tag:cdisc

# Run tests
dbt test

# Generate docs
dbt docs generate && dbt docs serve