30
Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15 th October 2013, PhUSE University Day 1 Statistical Programming for Dummies PhUSE University Day, 15th October 2013 Sven Greiner, Nicola Tambascia Statistical Programming, Accovion GmbH

Statistical Programming for Dummies - phusewiki.org 2103 UD Streams/UD05.pdf · Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University

Embed Size (px)

Citation preview

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 1

Statistical Programming for Dummies

 PhUSE University Day, 15th October 2013 Sven Greiner, Nicola Tambascia

Statistical Programming, Accovion GmbH

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 2

Contents

I.  Statistical Programming in Pharma

II.  The drug developement process

III.  Working example 1: Trial evaluation

IV.  Working example 2: CDISC data conversion

V.  Other interesting tasks

VI.  Profile of a Statistical Programmer

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 3

Statistical Programming in Pharma

"  Statistical Programmers are involved in the evaluation of clinical trials

"  Main task: Technical implementation of statistical analyses

"  Where can I work as a Statistical Programmer? •  Pharmaceutical company: Bayer, Lilly, Roche, …

•  CRO: Quintiles, Parexel, Accovion, …

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 4

The drug development process

Molecule discovery

Pre-clinic

Drug approval

Marketing trial

 Phase I - III trials

Clinical Trial Protocol

  Clinical trial   conduct

Medical Writing

Data Management

Statistical Programming & Biostatistics  Phase IV trials

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 5

WE 1: Trial evaluation – Getting started

"  Example of a trial evaluation for an ongoing trial

"  Status: •  Clinical Trial Protocol is available

•  Original datasets (ODS) provided by Data Management

•  (Draft) Statistical Analysis Plan (SAP) is available

"  Goal: Create tables, figures and listings to analyse the trial endpoints

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 6

WE 1: Trial evaluation – ADS

"  Create analysis datasets (ADS)

"  About 10-30 ADS per trial evaluation

MH DM AE

 ODS  (SAS datasets)

Specifications

DEMO HIST EVE NTS

 ADS  (SAS datasets)

„data as collected“ „data for analysis“

 SAS program

 DATA dm;   SET ods.dm;  RUN;

 * Create ADS;  DATA ads.demo;   SET dm;  RUN;

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 7

WE 1: Trial evaluation – ADS "  Example of an ADS (ADSL)

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 8

WE 1: Trial evaluation – Reporting "  Reporting starts after the first ADS are programmed

"  Up to several thousand outputs per trial

DEMO HIST EVE NTS

 ADS

 SAP &  Table Shells

 Statistical Output

 Reporting program  (SAS)

 DATA all;   SET ads.demo;  RUN;

 * Print output;  PROC REPORT;  RUN;

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 9

WE 1: Trial evaluation – Reporting "  Example of an efficacy table

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 10

WE 1: Trial evaluation – Validation

"  Validation is part of quality control

"  “The process of evaluating software during or at the end of the development process to determine whether it satisfies specified requirements.”

 Output review

 1.

 Code review

* Print output;  PROC REPORT;  RUN;

 2.

 Double programming

 3.

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 11

WE 1: Trial evaluation – Delivery

"  Final statistical outputs in different formats

"  Final ADS and specifications

Recipient: Medical Writing, Regulatory, Medical, Management

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 12

WE2: CDISC data conversion

" Current trend to have clinical data in standardized format

" Driven by authorities that ask for standardized data

"  Original data needs to be converted into standard format

"  Standard currently recommended by authorities is provided by the Clinical Data Interchange Standards Consortium (CDISC)

"  SDTM for raw data

" ADaM for derived data used for Analysis

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 13

WE2: Why CDISC SDTM/ADaM?

"  General Benefits of Standards

•  Improve communication

•  Time/resource savings

•  Comparison across submissions

"  FDA‘s eStudy Data Draft Guidance (Feb 2012) •  Written official recommendation that sponsors submit

study data in a standardized electronic format

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 14

à Study

à Data

à Tabulation

à Model

SDTM defines a standard structure for study data tabulations that are to be submitted as part of a product application to regular authority such as FDA. It based on material prepared by the Submissions Data Standard (SDS) Team of the CDISC.

SDTM V1.2 Study_Data_Tabulation_Model_v1.2.pdf

SDTM IG V3.1.2 SDTM_Implementation_Guide_V3_1_2.pdf

WE2: What is SDTM?

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 15

WE2: Process Flow – (Legacy) Data Conversion

  Annotated CRF

  SDTM

  Specifications

  Mapping   programs

  define.xml (SDTM)

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 16

WE2: CRF Annotation Guidelines "  See CDISC Metadata Submission Guidelines (SDTM)

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 17

WE2: Sample SDTM Domain Specifications (EG)

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 18

WE2: CDISC SDTM Sample

"  Selected variables from a SDTM questionnaire dataset

  Source: CDISC SDTM/ADaM Pilot (qs.xpt)

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 19

WE2: Metadata – define.xml

  Excerpt from the CDISC Draft Metadata Submission Guidelines define.xml Sample

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 20

WE2: Review Tool: OpenCDISC (www.opencdisc.org)

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 21

WE2: ADaM – General Principles

"  Dataset design •  Analysis ready

à "One Statistical Procedure away“

"  Main Input: SDTM

"  Documentation via standard metadata concept •  Similar but not equal to SDTM define.xml •  Facilitate clear and unambiguous communication

à Traceability (results => source data (SDTM) and vice versa) à Describe contents, source and quality

"  Data & metadata •  Usable by currently available software tools

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 22

WE2: Process Flow: ADaM

  SDTM

  SAP

  ADaM

  Results   creation   programs

  Analysis   results   (TLGs)

- Analysis datasets

  - Analysis results

  define.xml (ADaM)

  Datasets   creation   programs

  ADaM Specifications

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 23

WE2: Retrospective vs. Prospective

"  Retrospective ADaM Generation •  Reported results based on a different data format

•  Conversion to ADaM needed at all? •  If yes,

à  Focus on ADSL and key results + ensure reproducability à  Focus on data requirements for integrated analyses à  Good documentation/metadata to facilitate review

"  Prospective ADaM Generation •  Calculate study results based on ADaM datasets

•  2 Major Requirements à  Efficient generation of Tables/Listings/Graphs (TLGs) à  Good documentation/metadata to facilitate review

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 24

WE2: Specifications/Metadata in EXCEL

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 25

WE2: CDISC ADaM Sample

"   Selected variables from an ADaM questionnaire dataset (adqsadas.xpt of CDISC SDTM/ADaM Pilot)

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 26

WE2: Analysis Results Sample

Display

Result 1

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 27

WE2: Analysis Results Metadata

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 28

Other interesting tasks

"  Programming of integrated databases, e.g. ISS, ISE, ...

"  Development and validation of generic macros, tools, …

"  CDISC conversion consultancy

"  Optional: presenting at conferences

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 29

Profile of a Statistical Programmer

"  Programming skills and enjoy programming

" Some statistical knowledge and / or medical knowledge

" Ability to deal with data

"  Communication skills

"  English skills

"  Office skills

"  Team player

Sven Greiner & Nicola Tambascia – Statistical Programming for Dummies, 15th October 2013, PhUSE University Day 30

 Beate Hientzsch  Director, Statiscal

 Programming  Accovion GmbH  Helfmann-Park 10  D-65760 Eschborn, Germany  Tel. +49 6196 7709-283  [email protected]  www.accovion.com

 Senior Statistical  Programmer

 Nicola Tambascia

 Beate Hientzsch  Director, Statiscal

 Programming  Accovion GmbH  Helfmann-Park 10  D-65760 Eschborn, Germany  Tel. +49 6196 7709-288  [email protected]

 www.accovion.com

 Senior Statistical  Programmer

 Sven Greiner

Questions?