Upload
trinhhanh
View
218
Download
0
Embed Size (px)
Citation preview
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 1
Statistical Programming for Dummies
PhUSE University Day, 15th October 2013 Sven Greiner, Nicola Tambascia
Statistical Programming, Accovion GmbH
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 2
Contents
I. Statistical Programming in Pharma
II. The drug developement process
III. Working example 1: Trial evaluation
IV. Working example 2: CDISC data conversion
V. Other interesting tasks
VI. Profile of a Statistical Programmer
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 3
Statistical Programming in Pharma
" Statistical Programmers are involved in the evaluation of clinical trials
" Main task: Technical implementation of statistical analyses
" Where can I work as a Statistical Programmer? • Pharmaceutical company: Bayer, Lilly, Roche, …
• CRO: Quintiles, Parexel, Accovion, …
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 4
The drug development process
Molecule discovery
Pre-clinic
Drug approval
Marketing trial
Phase I - III trials
Clinical Trial Protocol
Clinical trial conduct
Medical Writing
Data Management
Statistical Programming & Biostatistics Phase IV trials
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 5
WE 1: Trial evaluation – Getting started
" Example of a trial evaluation for an ongoing trial
" Status: • Clinical Trial Protocol is available
• Original datasets (ODS) provided by Data Management
• (Draft) Statistical Analysis Plan (SAP) is available
" Goal: Create tables, figures and listings to analyse the trial endpoints
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 6
WE 1: Trial evaluation – ADS
" Create analysis datasets (ADS)
" About 10-30 ADS per trial evaluation
MH DM AE
ODS (SAS datasets)
Specifications
DEMO HIST EVE NTS
ADS (SAS datasets)
„data as collected“ „data for analysis“
SAS program
DATA dm; SET ods.dm; RUN;
* Create ADS; DATA ads.demo; SET dm; RUN;
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 7
WE 1: Trial evaluation – ADS " Example of an ADS (ADSL)
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 8
WE 1: Trial evaluation – Reporting " Reporting starts after the first ADS are programmed
" Up to several thousand outputs per trial
DEMO HIST EVE NTS
ADS
SAP & Table Shells
Statistical Output
Reporting program (SAS)
DATA all; SET ads.demo; RUN;
* Print output; PROC REPORT; RUN;
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 9
WE 1: Trial evaluation – Reporting " Example of an efficacy table
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 10
WE 1: Trial evaluation – Validation
" Validation is part of quality control
" “The process of evaluating software during or at the end of the development process to determine whether it satisfies specified requirements.”
Output review
1.
Code review
* Print output; PROC REPORT; RUN;
2.
Double programming
3.
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 11
WE 1: Trial evaluation – Delivery
" Final statistical outputs in different formats
" Final ADS and specifications
Recipient: Medical Writing, Regulatory, Medical, Management
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 12
WE2: CDISC data conversion
" Current trend to have clinical data in standardized format
" Driven by authorities that ask for standardized data
" Original data needs to be converted into standard format
" Standard currently recommended by authorities is provided by the Clinical Data Interchange Standards Consortium (CDISC)
" SDTM for raw data
" ADaM for derived data used for Analysis
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 13
WE2: Why CDISC SDTM/ADaM?
" General Benefits of Standards
• Improve communication
• Time/resource savings
• Comparison across submissions
" FDA‘s eStudy Data Draft Guidance (Feb 2012) • Written official recommendation that sponsors submit
study data in a standardized electronic format
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 14
à Study
à Data
à Tabulation
à Model
SDTM defines a standard structure for study data tabulations that are to be submitted as part of a product application to regular authority such as FDA. It based on material prepared by the Submissions Data Standard (SDS) Team of the CDISC.
SDTM V1.2 Study_Data_Tabulation_Model_v1.2.pdf
SDTM IG V3.1.2 SDTM_Implementation_Guide_V3_1_2.pdf
WE2: What is SDTM?
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 15
WE2: Process Flow – (Legacy) Data Conversion
Annotated CRF
SDTM
Specifications
Mapping programs
define.xml (SDTM)
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 16
WE2: CRF Annotation Guidelines " See CDISC Metadata Submission Guidelines (SDTM)
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 17
WE2: Sample SDTM Domain Specifications (EG)
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 18
WE2: CDISC SDTM Sample
" Selected variables from a SDTM questionnaire dataset
Source: CDISC SDTM/ADaM Pilot (qs.xpt)
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 19
WE2: Metadata – define.xml
Excerpt from the CDISC Draft Metadata Submission Guidelines define.xml Sample
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 20
WE2: Review Tool: OpenCDISC (www.opencdisc.org)
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 21
WE2: ADaM – General Principles
" Dataset design • Analysis ready
à "One Statistical Procedure away“
" Main Input: SDTM
" Documentation via standard metadata concept • Similar but not equal to SDTM define.xml • Facilitate clear and unambiguous communication
à Traceability (results => source data (SDTM) and vice versa) à Describe contents, source and quality
" Data & metadata • Usable by currently available software tools
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 22
WE2: Process Flow: ADaM
SDTM
SAP
ADaM
Results creation programs
Analysis results (TLGs)
- Analysis datasets
- Analysis results
define.xml (ADaM)
Datasets creation programs
ADaM Specifications
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 23
WE2: Retrospective vs. Prospective
" Retrospective ADaM Generation • Reported results based on a different data format
• Conversion to ADaM needed at all? • If yes,
à Focus on ADSL and key results + ensure reproducability à Focus on data requirements for integrated analyses à Good documentation/metadata to facilitate review
" Prospective ADaM Generation • Calculate study results based on ADaM datasets
• 2 Major Requirements à Efficient generation of Tables/Listings/Graphs (TLGs) à Good documentation/metadata to facilitate review
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 24
WE2: Specifications/Metadata in EXCEL
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 25
WE2: CDISC ADaM Sample
" Selected variables from an ADaM questionnaire dataset (adqsadas.xpt of CDISC SDTM/ADaM Pilot)
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 26
WE2: Analysis Results Sample
Display
Result 1
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 27
WE2: Analysis Results Metadata
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 28
Other interesting tasks
" Programming of integrated databases, e.g. ISS, ISE, ...
" Development and validation of generic macros, tools, …
" CDISC conversion consultancy
" Optional: presenting at conferences
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 29
Profile of a Statistical Programmer
" Programming skills and enjoy programming
" Some statistical knowledge and / or medical knowledge
" Ability to deal with data
" Communication skills
" English skills
" Office skills
" Team player
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 30
Beate Hientzsch Director, Statiscal
Programming Accovion GmbH Helfmann-Park 10 D-65760 Eschborn, Germany Tel. +49 6196 7709-283 [email protected] www.accovion.com
Senior Statistical Programmer
Nicola Tambascia
Beate Hientzsch Director, Statiscal
Programming Accovion GmbH Helfmann-Park 10 D-65760 Eschborn, Germany Tel. +49 6196 7709-288 [email protected]
www.accovion.com
Senior Statistical Programmer
Sven Greiner
Questions?