27
Gregory Landrum, Ph.D. NIBR Informatics Novartis Institutes for BioMedical Research, Basel 8 th KNIME Users Group Meeting Berlin, 26 February 2015 KNIME Enterprise server usage and global deployment at NIBR

KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Gregory Landrum, Ph.D. NIBR Informatics Novartis Institutes for BioMedical Research, Basel

8th KNIME Users Group Meeting

Berlin, 26 February 2015

KNIME Enterprise server usage and global deployment at NIBR

Page 2: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

2

Novartis Institutes for BioMedical Research (NIBR) A global network of >6,000 scientists, physicians, and business professionals.

Page 3: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

R&D at Novartis Bringing innovative medicines to patients

3

The Drug Development Process

Source: http://www.nibr.com/cs/groups/public/@nibr_com/documents/document/n_prod_200363.pdf

Page 4: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

R&D at Novartis Bringing innovative medicines to patients

4 Source: http://www.nibr.com/cs/groups/public/@nibr_com/documents/document/n_prod_200363.pdf

Page 5: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Timelines and economics

Paul, S. M., Mytelka, D. S., Dunwiddie, C. T., Persinger, C. C., Munos, B. H., Lindborg, S. R., & Schacht, A. L. (2010). Nature reviews Drug discovery, 9(3), 203-214.

Page 6: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

NIBR: Making it work

§  Our model of research – connecting the laboratory to the clinic, and pursuing molecular pathways across a landscape of multiple diseases – means that we have to be a highly collaborative organization.

§  Every project is made up of cross-functional teams, drawn from pathways scientists, chemists, disease area specialists, informaticians, clinicians and more.

§  We’re doing scientific research, not making widgets

§  Lots of collaboration, lots of technology, lots of data

6

Page 7: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

| Presentation Title | Presenter Name | Date | Subject | Business Use Only 7

Lots of data

Page 8: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Lots of data Shape of the data generated for a project

8

Hit finding 106 rows, 1-2 columns

Hit-to-lead 103 rows, 5-10 columns

Lead optimization 102 rows, 102 columns Clinic

1 rows, 104 columns

Page 9: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

The role of NIBR Informatics (NX)

§  Identifying and driving new opportunities to accelerate science with leading-edge computing and informatics solutions.

§  Traditional IT stuff: service desk, hardware support, network, etc.

§  Designing, building, deploying, and supporting tools/systems for: •  portfolio management; document management; compliance and reporting •  lab informatics; sample management and logistics; electronic lab notebooks •  high-performance computing; large-scale data warehousing and mining,

machine learning •  scientific data analysis; visualization; reporting

§  Pushing the frontier: research and exploration

§  Combination of purchased and in-house developed systems, lots of different technologies, lots of integration work

9

Page 10: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

NIBR and KNIME

§  We believe KNIME can be really useful, so we want to make it available to all of our scientists

§  We’re supporting both people who are using KNIME to solve problems in their own labs/groups and people who want to make tools available to others.

§  Need to support exchange of workflows and information across all our sites

§  Need to be integrated into our data and software environment

10

Page 11: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Infrastructure

§  Internal node development

§  Enterprise servers + cluster integration

§  Standardized desktop releases for Windows, Linux, Mac

§  Nightly builds for users comfortable on the bleedingleading edge

§  Dev and test servers to support our node development

11

Page 12: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

12

NIBR’s KNIME servers

Page 13: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

KNIME for NIBR internal distribution

13

Standardized set of nodes and extensions

Customized preferences

Page 14: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

KNIME for NIBR make it supportable

14

Allow a reset to the default configuration without requiring a new install.

Page 15: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

In-house node development make it useful

§  Connections to internal data sources and applications

§  Wrappers around in-house developed algorithms

§  Connection to our web service framework for cheminformatics services

15

Page 16: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Open-source node development

§  Chemistry nodes based on the RDKit •  open-source cheminformatics toolkit •  useable from C++, Python, Java, C#

•  NIBR scientists/developers actively participate •  www.rdkit.org

§  Standard cheminformatics tasks + some nice extras

§  Developed both in-house and together with knime.com

16

Page 17: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Sponsored node development

§  Modifications to naïve Bayes nodes to support fingerprints

§  Fingerprint naïve Bayes supporting unbalanced datasets

§  Database schema browser

§  Improvements to database connector, readers

§  Ensemble tree classifier

§  New Python integration

17

Page 18: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Integration example 1: Descriptor calculation

18

Page 19: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Integration example 2: DART

19

Internal web-based tool used by project teams to do querying and reporting from our data warehouse

Page 20: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Integration example 2: DART

20

Internal web-based tool used by project teams to do querying and reporting from our data warehouse

Access to saved queries and views

URL contains full state of query/view

Page 21: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Integration example 2: DART + KNIME

21

Page 22: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Integration example 2: DART + KNIME

22

Access to saved queries and views

Page 23: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Usage snapshot

23

Unique users per month

Users by site

Notes: 1)  stats only include KNIME client 2)  December data incomplete

Overall: 240 unique users, mostly scientists

Page 24: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

What are those users doing with KNIME ?

§  Querying and reporting from our warehouse

§  virtual chemistry

§  processing usage statistics

§  mining medchem project data

§  processing and analyzing experimental data

§  machine learning

§  triaging high-throughput screening results

§  looking up chemical catalog numbers

§ …

24

in other words: a bit of everything

Page 25: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

KNIME Server usage

§  Primarily used to share workflows

§  Increasingly used as a quick and easy deployment platform for small application/services built in KNIME

§  This is mainly driven by the scientists themselves

§  Areas for improvement: • Would be nice if it were easier to sync between servers • Would be great if the server could do “RESTful” web services.

§  Still: enabling scientists to share workflows and make (hopefully) simple applications available to each other is great

25

Page 26: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Wrapping up

§  KNIME in heavy use to solve many different problems

§  Enterprise server used to exchange workflows globally

§  Web portal provides a way for scientists to deploy tools to each other

§  KNIME is a great platform for us to build upon

26

Page 27: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory

Acknowledgements

§  NIBR • Manuel Schwarze (NX) • Mark Duffield (NX)

•  David Nick (NX) • Marc Litherland (NX) •  John Davies (CPC) •  Richard Lewis (GDC) •  Remy Evard (NX)

27

§  knime.com