What is CDISC 360: an enterprise architecture perspective Sam Hume, DSc - VP, Data Science, CDISC Tianna Umann – Cross Domain Solution Architect, Microsoft 28 April 2020

Page 1

What is CDISC 360: an enterprise architecture perspective

Sam Hume, DSc - VP, Data Science, CDISC

Tianna Umann – Cross Domain Solution Architect, Microsoft

28 April 2020

Page 2

Business

Metadata

Application

Technology

Page 5

CDISC 360 Metadata Inventory

Available Today

Additional CDISC 360 Content

Controlled Terminology

Foundational Standards

Templates

Derivations and Mappings

Tables, Listings, and Figures

TAUG metadata

Codelist Subsets

Study design metadata

Analysis Concepts

Biomedical Concepts

Page 6

Study Metadata

Library

Enhance Standards

Publish Standards

Define Build Execute

Industry

CDISC 360 Perspective 1: Metadata Flow within the Project

Page 7

CDISC 360 Perspective 2: End-to-end Data Flow

Increased Automation & Reduced Variability

Data Flow Diagram: Data State and Data Flow

Page 8

CDISC 360 Perspective 2: Code as Metadata

• Data flow metadata to capture logic needed to drive data transformations and derivations

• Capture pseudocode descriptions

• Capture input and output variables

• Add multiple executable reference implementations (e.g. SAS, R, Python)

• Represented in ODM and Define-XML as a MethodDef
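The bullets above can be sketched in Python. This is a minimal illustration, not the CDISC 360 implementation: the `Method` class, its field names, and the BMI example are hypothetical. The XML output is a simplified, namespace-free element shaped like an ODM `MethodDef` with `FormalExpression` children, and it is not schema-valid ODM or Define-XML. Input and output variables are kept only on the Python object here, because the slides do not say how they are serialized.

```python
from dataclasses import dataclass, field
import xml.etree.ElementTree as ET


@dataclass
class Method:
    """One derivation captured as metadata (hypothetical structure)."""
    oid: str
    name: str
    pseudocode: str
    inputs: list[str]
    outputs: list[str]
    # Reference implementations keyed by language, e.g. "SAS", "R", "Python".
    implementations: dict[str, str] = field(default_factory=dict)


def to_method_def(method: Method) -> ET.Element:
    """Render the method as a simplified MethodDef-like XML element."""
    elem = ET.Element("MethodDef", OID=method.oid, Name=method.name, Type="Computation")
    desc = ET.SubElement(elem, "Description")
    ET.SubElement(desc, "TranslatedText").text = method.pseudocode
    for language, code in method.implementations.items():
        # One FormalExpression per executable reference implementation.
        ET.SubElement(elem, "FormalExpression", Context=language).text = code
    return elem


# Illustrative example: a body mass index derivation.
bmi = Method(
    oid="MT.BMI",
    name="Derive BMI",
    pseudocode="BMI = weight in kg divided by the square of height in m",
    inputs=["WEIGHT", "HEIGHT"],
    outputs=["BMI"],
    implementations={
        "Python": "bmi = weight_kg / height_m ** 2",
        "R": "bmi <- weight_kg / height_m^2",
    },
)
bmi_xml = to_method_def(bmi)
print(ET.tostring(bmi_xml, encoding="unicode"))
```

Keeping the pseudocode and each implementation side by side in one record is what lets a tool pick the implementation that matches its runtime.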

Page 9

CDISC 360 Perspective 3: CDISC Library

• Production: W3C RDF

• Sandbox: JSON or XML Schema

M3: Meta-Meta-Model

• ISO 11179

M2: Meta-Model

• CDISC Standards

M1: Model

• Trial Metadata and Data

M0: Model Runtime

Page 10

CDISC 360 Perspective 3: CDISC Library ISO 11179-based Model

CDISC 360 Biomedical Concepts

CDISC IG Standards Metadata

Page 11

CDISC 360 Perspective 3: CDISC Library

AEOUT: Adverse Event Outcome

Model

Model Class

Dataset

Dataset Variable

Dataset Variable

IG

IG Class

Dataset

Class Variable

Events

AE: Adverse Events

SDTM IG 3.2

SDTM 1.4

Events

--OUT

CDISC Standards Model
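One way to read the diagram labels above is as two linked hierarchies: a model-level class variable (`--OUT` in the Events class of SDTM 1.4) and an implementation-guide dataset variable (`AEOUT` in the AE dataset of SDTM IG 3.2). The Python sketch below encodes that reading. The class and field names are assumptions made for illustration and do not reflect the CDISC Library schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClassVariable:
    """Model-level variable (hypothetical structure)."""
    model: str        # e.g. "SDTM 1.4"
    model_class: str  # e.g. "Events"
    name: str         # generic name with the "--" dataset placeholder


@dataclass(frozen=True)
class DatasetVariable:
    """Implementation-guide variable that implements a class variable."""
    ig: str
    ig_class: str
    dataset: str
    name: str
    label: str
    implements: ClassVariable


def expected_name(class_var: ClassVariable, dataset: str) -> str:
    """Substitute the dataset code for the "--" prefix of a class variable."""
    return class_var.name.replace("--", dataset, 1)


out = ClassVariable(model="SDTM 1.4", model_class="Events", name="--OUT")
aeout = DatasetVariable(
    ig="SDTM IG 3.2",
    ig_class="Events",
    dataset="AE",
    name="AEOUT",
    label="Adverse Event Outcome",
    implements=out,
)

# The dataset variable name follows from the class variable and the dataset code.
print(expected_name(aeout.implements, aeout.dataset))
```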

Page 12

CDISC 360 Perspective 4: External Project Data Flow

TransCelerate Digital Data Flow (DDF)

Page 13

Business

Metadata

Application

Technology

Page 14

CDISC 360 Applications Summary

Page 15

WS1 Enhance Standards: CMAP & Excel

Templates, Codelist Subsets, & Methods

Biomedical & Analysis Concepts

Proof-of-Concept Metadata Creation

Page 16

WS2 Publish Standards: CDISC Library

CDISC Foundational Standards & CT CDISC 360 Created Metadata

• Bespoke code to generate metadata

    • Python

    • RDF

• Generate machine-readable metadata published to a CDISC Library Sandbox

Publish Standards Metadata via the CDISC Library or Sandbox Library

Page 17

API Gateway

Developer Portal

Azure API Management

API Service #1 (Azure Function)

API Service #n (Azure Function)

...

Workstreams 4,5,6

POC Apps

Developers

Consume API Content

Develop App

Workstream 1

Concept Maps & Templates

Export

Azure Files

Current CDISC Library APIs

API Service #2 (Azure Function)

REST API

REST API

REST API

REST API

WS2 Publish Standards: Sandbox CDISC Library

Page 18

WS4 Define: Study Metadata Library

Study Metadata Library

Page 19

WS5 – Build: Study Designer

UI for creating study metadata

Page 20

WS5 Build: Study Builder UI/UX Demo

CDISC 2020 Europe Interchange | #CDISCEurope #ClearDataClearImpact

Page 21

WS5 Build: Metadata Artifacts

• Bespoke code to generate:

    • ODM CRFs / aCRFs

    • Define-XML

• ODM and Define-XML visualizations

    • Stylesheets

    • Formedix On

• End-to-end with ODM and Define-XML

    • Test driving automation with ODM and Define-XML metadata

Formedix CRF / aCRF Visualization

Page 22

WS6 Execute: Data Transformation Engine & SAS

Data Transformation Engine (DTE) from Noumena Solutions (Greg Steffens)

Source Dataset 1

Source Dataset 2

Source Dataset 3

DTE Processor with Map

Target dataset
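The diagram above shows several source datasets passing through a processor that applies a map to produce one target dataset. The Python sketch below illustrates that general idea only. The slides do not describe the DTE's actual map format or processing rules, so the `mapping` layout, the `transform` function, and the sample data are all hypothetical.

```python
def transform(sources, mapping, key):
    """Build target rows by pulling mapped variables from several sources.

    sources: dataset name -> list of row dicts
    mapping: target variable -> (source dataset name, source variable)
    key:     variable present in every source, used to align rows
    """
    target = {}
    for target_var, (dataset, source_var) in mapping.items():
        for row in sources[dataset]:
            record = target.setdefault(row[key], {key: row[key]})
            record[target_var] = row[source_var]
    return [target[k] for k in sorted(target)]


# Three illustrative source datasets keyed by subject identifier.
sources = {
    "demog": [{"SUBJID": "001", "SEX": "F"}, {"SUBJID": "002", "SEX": "M"}],
    "vitals": [{"SUBJID": "001", "WT": 70.5}, {"SUBJID": "002", "WT": 82.0}],
    "visits": [
        {"SUBJID": "001", "VISDT": "2020-01-15"},
        {"SUBJID": "002", "VISDT": "2020-01-16"},
    ],
}

# The map is data, so changing the target layout does not require new code.
mapping = {
    "SEX": ("demog", "SEX"),
    "WEIGHT": ("vitals", "WT"),
    "RFSTDTC": ("visits", "VISDT"),
}

target_dataset = transform(sources, mapping, key="SUBJID")
for row in target_dataset:
    print(row)
```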

Page 23

WS6 Execute: TLF Automation Engine

Study Level ADaM Data

CDISC 360 Enriched ARM Metadata

R Shiny

Select TFL of Interest

Review data

Customize TFL Layout & Metadata

Select TFL Layout (Template)

SAS

Generate SAS Program and Define.xml

Execute SAS Program

Validate and Deliver

Generate Output
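The "Generate SAS Program" step can be pictured as rendering program text from analysis metadata. The slides do not show the generated program or the metadata layout, so the Python sketch below is an assumption throughout: the `spec` keys, the `render_sas_program` function, and the choice of a simple `PROC FREQ` table are illustrative only. It does not cover Define.xml generation.

```python
def render_sas_program(spec: dict) -> str:
    """Render a small SAS frequency-table program from analysis metadata."""
    # The WHERE statement is optional and emitted only when a filter is given.
    where = f"  where {spec['where']};\n" if spec.get("where") else ""
    return (
        f"title \"{spec['title']}\";\n"
        f"proc freq data={spec['dataset']};\n"
        f"{where}"
        f"  tables {spec['row_var']} * {spec['col_var']} / nopercent norow;\n"
        "run;\n"
    )


# Hypothetical metadata for one table, using common ADaM variable names.
table_spec = {
    "title": "Sex by Planned Treatment",
    "dataset": "adsl",
    "where": "SAFFL = 'Y'",
    "row_var": "SEX",
    "col_var": "TRT01P",
}

program = render_sas_program(table_spec)
print(program)
```

Because the program is derived from metadata, a customized layout changes the `spec` rather than hand-edited SAS code.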

Page 24

Business

Metadata

Application

Technology

Page 25

Technology: Code Development and Execution

Azure Functions

Azure Jupyter Notebooks

Visual Studio Code

Code Development & Execution

Cypher Query Language

Page 26

CDISC 360: Cloud-based Computing

CDISC 360 Azure Cloud CDISC Library

CDISC collaboration tools

CMAP

Page 27

Future State

How does research change once we implement the learnings from CDISC 360?

Page 28

Proof of Concept to Production

Architecture Patterns

Evolution of Cloud platforms

Automation Multi Data Source ELT

Page 29

Monolithic N-Tier Microservices Serverless

Page 30

Evolution of Cloud Platforms

• Serverless is the culmination of several iterations of cloud platforms. The evolution began with physical metal in the data center and progressed through Infrastructure as a Service (IaaS) and Platform as a Service (PaaS).

Page 31

Serverless: Event Grid, Logic Apps, Functions

Page 34

Thank You!

Page 35

Monoliths

Advantages

• Easy to re-create an environment with multiple components and applications

Disadvantages

• Tends to break down at scale

• Difficult to work in parallel on code

• Any change requires deploying a new version of the application

Page 36

Microservices


Page 37

Parse a CSV file and insert into a database
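The task named on this slide can be sketched with the Python standard library. The slide does not show the cloud services or schema involved, so this example is an assumption: it uses an in-memory SQLite database as a stand-in for the real data store, and the `vitals` table, its columns, and the sample rows are hypothetical.

```python
import csv
import io
import sqlite3


def load_csv(conn: sqlite3.Connection, csv_file) -> int:
    """Parse CSV rows and insert them into the vitals table; return the row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS vitals (subjid TEXT, test TEXT, result REAL)"
    )
    # DictReader uses the header line to name each column.
    rows = [
        (r["subjid"], r["test"], float(r["result"]))
        for r in csv.DictReader(csv_file)
    ]
    with conn:  # commits on success, rolls back if an insert fails
        conn.executemany("INSERT INTO vitals VALUES (?, ?, ?)", rows)
    return len(rows)


# Illustrative input; in practice this would be an uploaded file.
sample = io.StringIO("subjid,test,result\n001,WEIGHT,70.5\n002,WEIGHT,82.0\n")
conn = sqlite3.connect(":memory:")
inserted = load_csv(conn, sample)
total = conn.execute("SELECT COUNT(*) FROM vitals").fetchone()[0]
print(inserted, total)
```

A function this small and self-contained is the kind of unit that fits an event-triggered, serverless deployment.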

Page 38

N-Layer applications

Page 39

Workstreams 4-6 POC Target Architecture - Draft

App Service #1 (Azure Function)

App Service #n (Azure Function)

...

App Service #2 (Azure Function)

Workstream 2 POC

Azure API Management

API Gateway

POC User Interface(s) (Azure Web Apps)

REST API REST API

Virtual Network

REST API

REST API?

?

Page 40

Azure Databricks

External Study Data

CDISC Library

Biomedical & Analysis Concepts

Sponsor Libraries

Page 42

Data pipeline