89
Cheminformatics workflows using the mobile + cloud platform Dr. Alex M. Clark October 2013 © 2013 Molecular Materials Informatics, Inc. http://molmatinf.com

Alex Clark : NETTAB 2013

Embed Size (px)

Citation preview

Page 1: Alex Clark : NETTAB 2013

Cheminformaticsworkflows using themobile + cloud platform

Dr. Alex M. Clark

October 2013

© 2013 Molecular Materials Informatics, Inc. http://molmatinf.com

Page 2: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Overview2

• Chemical structures• Mobile apps• Cloud computing

Page 3: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

3rd Revolution

• The mobile platform is revolutionary: a clean break

- entirely new user interface- no backward compatibility- highly constrained resources- applicable to entirely new situations

3

mainframesminicomputers

personal computersportable laptops

mobile tabletssmartphones

Page 4: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Two platform stacks4

desktop/laptop

file/databaseserver

computecluster

phone/tablet

webservices

cloudresources

Page 5: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Delivery5

• User experience

• Cached data

• Light calculations

• Sharing

• Web API

• Heavy calculations

• Large data

• Centralised storage

Apps Cloud

Page 6: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Challenges• Mobile user interfaces are completely

different to desktop products

• Expectations: functionality, extreme simplicity

• Complex interfaces much harder: drawing chemical structures requires redesign

• Visualisation is more difficult: screen size

• Storage limitations: big data is a problem

• Calculations split mobile vs. cloud

6

Page 7: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS 7

Mobile MolecularDataSheet (MMDS) MolSync SAR Table MolPrime+ MolPrime

Green Solvents Lab Solvents Approved Drugs Reaction101 Yield101

Living Molecules TB Mobile ChemSpider SPRESImobile Open DrugDiscovery Teams

App Catalog

Page 8: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Drawing Structures• Cheminformatics requires a user

interface for drawing structures

• Interface needs to be:

- capable for complex structures- publication quality- fast to use- phone-size form factor

• Traditional interface paradigm on touchscreen: unusable...

8

Page 9: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Drawing Reactions

• Components: reuse sketcher, iPhone friendly• Higher level of markup useful for lab notebooks• Used by MMDS, Reaction101, Yield101

9

Page 10: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Assembling DataSheets

• MMDS: molecular datasheet unit

- molecular structures- reaction schemes- scalar data (text, numbers)

• Table-like structure, roughly equivalent to SDfile/RDfile

• Operate on individual rows or whole datasheets

10

Page 11: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Scaffolds & Substituents

• Scaffold-centric model useful to drug discovery

• SAR Table app: optimised for data entry

• And: analysis, visualisation and calculation

11

Page 12: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Searching

• App interfaces for structure searching

12

Page 13: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Calculation

• Structures used to calculate properties

• Single molecules, or whole datasheets

• Simple calculations: done by app

• Difficult calculations: use webservice

13

Page 14: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Importing

• Receive recognised filetypes

- clipboard- email- web- other apps (IPC)- webservices- remote filesystems

• Documented formats, e.g. SketchEl, MDL, ChemDraw, CML

14

Page 15: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Exporting• Apps are only as good as their

ability to use the data

• Export actual data using cheminformatics formats

- clipboard- email- web sharing- twitter- webservices- remote filesystems

15

Page 16: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Graphics16

• Prepare bitmapped graphics for:

- clipboard- photo roll- email

• Can incorporate into other apps on the device (e.g. Keynote)

• Vector graphics: SVG, EPS, Microsoft Word & Excel

Page 17: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Sharing17

• Share data publicly or privately

• Upload to a webservice

- store raw data- dynamically render- hosted by molsync.com

• Tweet directly from the app: share the link

Page 18: Alex Clark : NETTAB 2013

Real world workflow: searching for new Tuberculosis drugs

Dr. Alex M. Clark

October 2013

© 2013 Molecular Materials Informatics, Inc. http://molmatinf.com

Page 19: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Outline• Workflow is representative of lead discovery

• Searching for new Mycobacterium tuberculosis drugs

• Open data, open science, collaborating with experimentalists

• Workflow involves:

- mobile apps- cloud-based webservices- traditional software

19

http://molmatinf.com/venice.html

Page 20: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

TB Mobile• Curated data for

tuberculosis drugs

• iOS & Android

• CDD, NIAID grant, Sean Ekins

20

• App presents a browseable list of structures

• Can filter by target, sort by structural similarity

• Visual data exportable....

Page 21: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

A Good Scaffold• Objective: use the data from TB Mobile to

find a scaffold, then use to infer gene target from a larger collection of SAR data

• What makes a good scaffold?

21

Page 22: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Methodology• Preliminary work done with desktop software: com.mmi

• Fragment TB Mobile structures, scaffold-like

• Perform scaffold-substructure vs. 7000 in vitro

• Derive R-groups, tidy, present graphically, browse...

22

http://molmatinf.com/venice.html

Page 23: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Source Materials

• Scaffold:

• Scaffold origin: inhibitor of Glf target

• 87 molecules with in vitro activity (yes/no)

• Scaffold seems to elicit an activity pattern

• Next step: load it into the app ecosystem...

23

Page 24: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Importing

• Email attachment to self: open in SAR Table

24

Page 25: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Raw DataSheet

• Just Molecule and Activity...

25

Page 26: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Draw Scaffold

• Select first row

• Use built in sketcher to draw scaffold

26

Page 27: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Draw Scaffold

• Select first row

• Use built in sketcher to draw scaffold

26

Page 28: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Substituent Fields

• R-group labels: offer to create fields

27

Page 29: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Scaffold & Molecule

• Need to assign substituents: automatically

28

Page 30: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Scaffold Matching

• Perform single scaffold:molecule match

29

Page 31: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Scaffold Matching

• Perform single scaffold:molecule match

29

Page 32: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Scaffold Matching

• Perform single scaffold:molecule match

29

Page 33: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Bulk Matching

• Duplicate scaffold, match to all molecules

30

Page 34: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Bulk Results

• Only unambiguous results are provided

31

Page 35: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Bulk Results

• Only unambiguous results are provided

31

Page 36: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Symmetry/Degeneracy

• Two possible results: note degeneracy R4,R6

32

Page 37: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Symmetry/Degeneracy

• Two possible results: note degeneracy R4,R6

32

Page 38: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Finishing Assignment

• Single & bulk scaffold matching: semi-automated assignment, human tiebreakers

• Have 87 compounds, 21 active against TB

• Based on scaffold from a binder of Glf enzyme (essential for mycobacterial growth)

• What next?

• Look for other compounds: same scaffold

33

Page 39: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Searching

• Search public databases using template

34

Page 40: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Searching

• Search public databases using template

34

Page 41: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Search Results

• Results come back with scaffolds assigned

35

Page 42: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Sources

• Data from ChemSpider, PubChem, ChEBI• Includes database links and vendor information

36

Page 43: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Importing

• Search results imported in marked-up form

37

Page 44: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Model Building

• Use known activities to build a SAR model

38

Page 45: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Model Application

• Predicted activities shown as coloured wedges

39

Page 46: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Selected Candidates

• Compile a list: predicted activity > 0.8• Open in Mobile Molecular DataSheet (MMDS) app

40

Page 47: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Selected Candidates

• Compile a list: predicted activity > 0.8• Open in Mobile Molecular DataSheet (MMDS) app

40

Page 48: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Sharing by Email

• Very easy to share using email attachments• Recipient can open on any device, including mobile

41

Page 49: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Sharing by Email

• Very easy to share using email attachments• Recipient can open on any device, including mobile

41

Page 50: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Sharing by Email

• Very easy to share using email attachments• Recipient can open on any device, including mobile

41

Page 51: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Sharing via Dropbox

• MMDS can interact with MolSync app• DataSheets are uploaded & synchronised in Dropbox

42

Page 52: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

MolSync & Dropbox

• MolSync app is a chemistry-aware file browser• Can make use of Dropbox's collaboration features

43

Page 53: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Lookup FDA Reference

• Open With the Approved Drugs app

44

Page 54: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Approved Drugs

• 1300 structures for FDA-approved drugs• Reference search by structural similarity

45

Page 55: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Matrix View

• Plot structures in a grid: R1 vs R5

46

Page 56: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Matrix View

• Plot structures in a grid: R1 vs R5

46

Page 57: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Matrix + Predictions

• Compounds with unknown activity predicted shown using wedge style

47

Page 58: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Matrix + Hypotheticals

• Empty squares: propose compounds & predict

48

Page 59: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Matrix + Hypotheticals

• Empty squares: propose compounds & predict

48

Page 60: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Proposing Compounds

• Tap on a square: prompt to create partially defined entry, defining R1 and R5

49

Page 61: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Hypothetical Compound

• New compound added: partial definition• R2, R3, R4 and R6 unspecified

50

Page 62: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Lookup in SPRESI

• Use SPRESImobile app to find similar compounds: looking for a synthesis...

51

Page 63: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Similar Backbone

• Found a compound with similar core, possible synthesis template: consult literature

52

Page 64: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Find Reactions

• Lookup synthetic routes: amide condensation• Open the reaction data in Yield101

53

Page 65: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Find Reactions

• Lookup synthetic routes: amide condensation• Open the reaction data in Yield101

53

Page 66: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Find Reactions

• Lookup synthetic routes: amide condensation• Open the reaction data in Yield101

53

Page 67: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Yield101

• Prototype synthetic chemistry lab notebook• Reaction components imported

54

Page 68: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Reaction Scheme

• Tidyup structures, add in reagent

55

Page 69: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Lookup Starting Material

• Use Mobile Reagents integration to find commercial source for starting material

56

Page 70: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Lookup in ChemSpider

• Paste into ChemSpider Mobile app to do structure search

• Open matching ChemSpider compound information page

57

Page 71: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Lookup in ChemSpider

• Paste into ChemSpider Mobile app to do structure search

• Open matching ChemSpider compound information page

57

Page 72: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Quantitative Details

• Enter starting quantity, density: preliminary estimate of amounts used in synthesis

58

Page 73: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Public Sharing

• Upload scheme to molsync.com• Generates publicly sharable link

59

Page 74: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Shared Content

• Stores chemical data in its native form:

- molecules- reaction schemes- datasheets

• Can share, tweet, etc.

• Download data in many different forms...

60

Page 75: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Shared Content

• Stores chemical data in its native form:

- molecules- reaction schemes- datasheets

• Can share, tweet, etc.

• Download data in many different forms...

60

Page 76: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Numerous Formats• Sharing site stores pure data, creates preview

graphics dynamically... and interconversions

61

Molecules• SketchEl• MDL MOL• CML

Reactions• DataSheet XML• MDL RXN• MDL RDF

Collections• DataSheet XML• MDL SDfile

Raster Graphics• PNG• PNG ZIP

Vector Graphics• SVG• SVG ZIP• HTML SVG• EPS• MS Word• MS Excel

Page 77: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Vector Graphics• Quality is perfect at all resolutions: screen,

projector, web, printer, PDF...

62

Vector Bitmap

Page 78: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Microsoft Office Documents• Independent implementation of OOXML spec• Embedded graphics use vector DrawingML

63

Page 79: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

SAR Table Documents

• Scaffold/substituent tables for manuscripts

64

Page 80: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Matrix PDF

• Multi-page layout: print directly or email PDF

65

Page 81: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Tweeting66

• Can tweet directly from apps like MMDS

• First upload data to molsync.com

• Then emit tweet with link

• Observe list of predefined hashtags:

#tuberculosis

Page 82: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Open Drug Discovery Teams

• Curation of open data, e.g. Twitter & RSS feeds• Rare & neglected diseases, precompetitive areas

67

Page 83: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Harvested Tweet

• Tweet got harvested into Tuberculosis topic• Inline preview browsed, with other thumbnails

68

Page 84: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Chemistry Aware

• The app understands chemical data• Users can use the data, open with in apps...

69

Page 85: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Rinse & Repeat• Defer to experimentalists: measure activity for

selected compounds with predictions

• Return to SAR Table app:

- provide activity results- visualise matrix plot with actual data added- rebuild model, regenerate predictions- iteratively refine hypothesis

• Done using $50 worth of software and tablet

• Workflow is appropriate for use by experimentalists

70

Page 86: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Living Molecules• Going beyond desktop-era functionality

• Devices have:

- long battery life- bluetooth- location awareness- camera!

• Observe the strange symbol: molecular glyph

71

Page 87: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Molecular Glyphs

• Recognition sequence: point camera• View content, import, manage, utilise, export

72

Page 88: Alex Clark : NETTAB 2013

MOLECULAR MATERIALS INFORMATICS

Conclusion• Cheminformatics workflows historically the role of

specialists: expensive and/or complex

• Mobile apps are much cheaper and much more accessible to experimentalists

• Mobile+cloud can:

- replace simple-to-medium tasks- coexist with complex tasks run on desktop software

• Other advantages:

- anywhere/anytime portability- excellent collaboration and sharing- non-existent installation or maintenance burden

73

Page 89: Alex Clark : NETTAB 2013

Acknowledgments

http://molmatinf.comhttp://molsync.comhttp://cheminf20.org

@aclarkxyz

• Antony Williams

• Sean Ekins

• Joel Freundlich

• NETTAB / Bioinformatics Italian Society

• InfoChem, Eidogen-Sertanty

• Inquiries to [email protected]

http://molmatinf.com/venice.html