19
Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Embed Size (px)

Citation preview

Page 1: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Pipeline Pilot Integration

Szilard Dorant

Solutions for Cheminformatics

Page 2: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

The Component Collection: Quick facts

• Provides access to ChemAxon tools from Pipeline Pilot

• Free of charge

• Open source : Java sources are also included

• Available from ChemAxon or Accelrys

• Latest version : 1.5– Released in May 2009– Requires JChem / Marvin 5.1.3 or newer– Requires Pipeline Pilot 6.1.1 or newer

Page 3: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Available functionality (1/2)

• Standardizer: structure canonicalization

• Chemical Terms expressions for filtering and calculations (including logP, logD, pKa, HBD, HBA, Isoelectric point, PSA and more)

• Reactor : “smart” virtual reaction processing

• Maximum Common Substructure (MCS) based clustering

• IUPAC Name <-> Molecule conversion (both directions)

• JChem chemical database: insertion, search and retrieval of structures; create and drop structure tables

Page 4: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Available functionality (2/2)

• Marvin applets: structure visualization and editing

• Major microspecies (major protonation form)

• Microspecies distribution

• Burden eigenvalue descriptor (BCUT)

• MolConverter: conversion of the wide range of structure formats supported by ChemAxon

• Markush enumeration: enumeration of Markush (generic) structures (e.g. R-groups, link nodes, atom- and bond lists and many more)

• Tautomerization: tautomer generation (all, dominant, major, canonical, generic)

Page 5: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Release history - major changes

• Version 1.5, May 2009– New components: "ChemAxon MolConverter", "ChemAxon

Tautomerization", "ChemAxon Markush Enumeration"

• Version 1.4, November 2008– New components: "LibMCS Clustering", "Molecule to IUPAC

Name", "Molecule from IUPAC Name“

– Major upgrade of "ChemAxon Reactor" component

• Version 1.3, July 2008– New component: “Chemical Terms Calculator”

– New, improved error reporting system

• Version 1.2, March 2008– New components: “ChemAxon Reactor”, “Drop JChem Base

Table”, “Create JChem Base Table”

– Upgraded: “ChemAxon Standardize”, “JChem Base Insert”, “JChemSearch”, documentation upgrade for most components

Page 6: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Calculator

Easy access for the most important calculations

Page 7: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Chemical Terms Calculator

• Use arbitrary Chemical Terms expressions

• Results stored to arbitrary properties

• A wide range of ChemAxon functionality can be accessed as Chemical Terms functions

New in 1.3Maximum freedom trough Chemical Terms Expressions for the expert user

Page 8: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Improved error reporting

• Helps to track down problems quicker, more efficient support

• More information is displayed in the error dialog: JChem version, the current structure, etc.

• An even more detailed error report including the full stack trace is appended to the file “pp_error_<date>.txt" in the “<user_home>\chemaxon\logs” directory (new file every day). The log directory can be changed be setting the environment variable CHEMAXON_PP_LOG_DIR.

From 1.3, upgraded in 1.5

Page 9: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Reactor

Improvements:

Upgraded in 1.4

• Synthesis code generation

• Output reaction mapping

• Advanced options:– Unambiguous only

– Ignore rules:• Reactivity and Exclude• Selectivity• Tolerance

Page 10: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Combinatorial Reactor Example

Page 11: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

IUPAC naming components

• IUPAC Name to Molecule

• Molecule to IUPAC Name

• Example “roundtrip” protocol:

New in 1.4

Page 12: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Clustering with LibMCS

Maximum Common Substructure (MCS) based clustering

• Size of smallest common substructure to consider

• Three levels of heuristics:– Exact (no heuristics)

– Fast

– Very Fast

• Bond type, atom type, charge can optionally be ignored

• Disallow “breaking” rings (default)

Options: New in 1.4

Page 13: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

• File input

• Enumeration type:

– Sequential

– Random

• Number of enumerated structures can be limited (per input structure)

• Valence filter

• Scaffold alignment

• Markush code generation. The scaffold ID can be:

– fetched from data field

– generated (prefix + number)

Markush Enumeration

Enumeration of generic structures New in 1.5

Page 14: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Tautomerization

Component for tautomer generation New in 1.5

• Calculation modes:

– All tautomers

– Canonical tautomer

– Generic tautomer

– Major tautomer

– Dominant tautomer distribution

• Options:

– Protect aromaticity, charge, double bond stereo, tetrahedral stereo

– Exclude antiaromatic compounds

– Single fragment mode

– Consider pH at specific value

Page 15: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

MolConverter

“Swiss army knife” for molecular format conversion New in 1.5

• Input and output can either be

– File

– Property

– Pipeline Pilot Molecule

• Specified input format or auto-detection

• Various output formats or custom format string

• Option to halt or continue on error, error messages put into property

• 2D cleaning (coordinate generation) only when needed (default). Unconditional 2D or 3D cleaning or no cleaning can also be selected

Page 16: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

New Java class: ChemAxonComponent

• Parent class for component classes, implements com.scitegic.pilot.Component

• Error reporting: all exceptions thrown in onInitializeBody(), onProcessBody(), onFinalizeBody() are caught, logged and reported (re-thrown with added information)

• Parameters and context always readily available

• Utility functions for conveniently accessing parameters and properties, e.g.: findStringParameter(String name)

• Reduced code redundancy, better readability

Page 17: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Planned development

• Conformer generation

• Improve Molecular Table Viewer (Marvin View)

• JChem Cartridge for Oracle

• Integration with Instant JChem

• Documentation

Node release cycle is fast and flexible. Please advise us on priority and additional functionality for future node development.

Page 18: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Resources

• Download:– http://www.chemaxon.com/integration/download.html– http://accelrys.org/pipelinepilot/chemistry.html

• Technical support forum:– http://www.chemaxon.com/forum/forum88.html

• E-mail:– [email protected]

Page 19: Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics

Find out more

• Product descriptions & linkswww.chemaxon.com/products.html

• Forumwww.chemaxon.com/forum

• Presentations and posterswww.chemaxon.com/conf

• Download

www.chemaxon.com/download.html