223
1 Structure Searching with STN Express

1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

Embed Size (px)

Citation preview

Page 1: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

1

Structure Searching with STN Express

Page 2: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

2

• The REGISTRY database contains chemical substance information.

• Bibliographic references and abstracts of papers discussing substances retrieved by a structure search are stored in the CAplus database. The L-number generated in the full-file structure search is the key to locating relevant references.

Introduction

Page 3: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

3

Why Do a Structure Search?

There are a number of ways to locate information

about a chemical compound:

Chemical name Molecular formula Etc. Structure

Each access point has unique advantages.

Introduction

Dictionary

Page 4: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

4

Dictionary

• Searching by dictionary is a powerful method for retrieving at least broad classes of substances in Registry, useful also in structure searching.

Introduction

Page 5: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

5

Dictionary - Chemical name

• Chemical names may be systematic or trade names. However, systematic names can be complex and difficult to derive:

Benzeneacetonitrile, α-[3-[[2-(3,4-dimethoxyphenyl)ethyl]methylamino]propyl]-3,4-dimethoxy-α-(1-methylethyl)-

Introduction

Page 6: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

6

Dictionary - Molecular formula

• Many compounds have the same molecular formula:

C27H38N2O4 is shared by >250 compounds in the CAS REGISTRY database

Introduction

Page 7: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

7

Dictionary - Chemical name, Molecular formula

• Name, and Molecular formula, fragments, are inside the BI.Name, and Molecular formula, fragments, are inside the BI.

Introduction

Page 8: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

8

Structure

• Each compound has an unique structure

Introduction

(CH2)3

Pr-i Me

OMeMeO

OMeOMe

CN

C N CH2 CH2

Page 9: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

9

Structure-Searchable STN Databases

• A number of STN databases are searchable by structure.

• The REGISTRY database, produced by CAS, is the world's largest collection of chemical compounds.

Introduction

Page 10: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

10

A sample of structure-searchable databases on STN:

Introduction

Structure-searchable databases Type of substance

REGISTRY Organic, inorganic, biochemical, biosequence

BEILSTEIN Organic

GMELIN Inorganic

MARPAT Markush structures from patent claims

CASREACT, CHEMINFORMRX, DJSMONLINE

Organic reactions

DCR Organic

Page 11: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

11

Basic Strategy for Structure Searching

• Locating substance information via structure searching is accomplished in four main steps:

1 Draw the structure2 Search the structure3 Display the structure matches retrieved4 Locate the desired information about the structure

matches of interest

Introduction

Page 12: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

12

Structure Drawing

Page 13: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

13

Accessing the Structure Drawing Screen

• The structure drawing screen is accessed via the STN Express main tool bar via the prepare query icon:

Structure Drawing

Page 14: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

14

Structure Drawing

Structure drawing screen has 7 main parts:

Page 15: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

15

Helpful HINT

Tool tips describe STN Express structure drawing tools. Position the cursor over the structure drawing tool of interest. The tool tip appears:

Structure Drawing

Page 16: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

16

• Most chemical structures can be drawn using the following techniques:

Drawing chains Drawing rings and ring systems Connecting rings and chains Specifying atoms Specifying shortcuts Specifying bonds Erasing and undoing

Structure Drawing

Page 17: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

17

• When structures are drawn using STN Express tools, the following defaults apply:

Atoms in the structure match the element in the current atom box. When the structure drawing screen is opened, the current atom is set to C (carbon).

Bonds in the structure match the bond in the current bond box. When the structure drawing screen is opened, the current bond is set to single.

PREFERENCES

Structure Drawing

Page 18: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

18

Drawing ChainsIf you want to Follow these steps:

Draw a chainfreehand

1. Click the pencil tool:

2. Place the cursor on the drawing screen atthe point where you want to start drawing.

3. Click, drag, and release to draw a linerepresenting an atom bonded to anotheratom.

4. Repeat step 3, as needed.

Draw a chainusing the chaintool

1. Click the chain tool:

2. The Chain Length option box appears.Scroll to select the desired length. ClickOK.

3. Click on the drawing screen to place thechain.

Structure Drawing

Page 19: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

19

Drawing Rings and Ring SystemsIf you want to Follow these steps:

Draw a single ring 1. Click the ring tool:

2. The Ring Description option box appears.Select the desired ring from the

Common ring icons, e.g.,

Scrollable list of ring descriptions

Click OK.

3. Click on the drawing screen to place thering.

(continued on next slide)

Structure Drawing

Page 20: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

20

If you want to Follow these steps:

Draw a single ringusing a template

1. Click the Template! pull-down menu.

2. The Open Template dialog box appears showinga list of files containing ring templates.Highlight the file of interest. Click Open.

3. The Template option box appears. Double-clickon a ring of interest.

4. Click on the drawing screen to place the ring.

Structure DrawingDrawing Rings and Ring Systems

Page 21: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

21

If you want to Follow these steps:

Draw a side-fusedring system

Draw the left-most ring:

1. Click the ring tool:

2. The Ring Description option box appears.Select the desired ring from the

Common ring icons, e.g.,

Scrollable list of ring descriptions

Click OK.

3. Click on the drawing screen to place the ring.

Draw the next ring:4. Repeat steps 1-2, selecting a different ring

component.5. Align the cursor with the middle of the bond

where fusion is to occur.

6. Click to place the ring.

Drawing Rings and Ring Systems

Structure Drawing

Page 22: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

22

If you want to Follow these steps:

Draw a spiro-fused ring system

Draw one ring:

1. Click the ring tool:

2. The Ring Description option box appears.Select the desired ring from the

Common ring icons, e.g.,

Scrollable list of ring descriptions

Click OK.

3. Click on the drawing screen to place the ring.

Draw the next ring:4. Repeat steps 1-2, selecting a different ring

component.5. Align the cursor with the atom where fusion is

to occur:

6. Click to place the ring.

Structure DrawingDrawing Rings and Ring Systems

Page 23: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

23

noteSide-fused systems can also be drawn using keyboardcommands in the Ring Description option box.

Keyboard commands are a series of numbers specifyingthe ring sizes to fuse together from left to right. A letter isspecified before a ring number (u or d) to indicate that thenext ring should be fused to the preceding ring in ahorizontal direction

u - one face up

d - one face down

This keyboardcommand

Creates this ring system

665

6u5

6d65

Structure Drawing

Page 24: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

24

Connecting Rings and ChainsIf you want to Follow these steps:

Connect a ring toan existing chain 1. Click the ring tool:

2. The Ring Description option box appears.Select the desired ring from the

Common ring icons, e.g.,

Scrollable list of ring descriptions

Click OK.

3. Align the cursor with the atom wherebonding is to occur:

4. Click to place the ring.

Structure Drawing

Page 25: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

25

If you want to Follow these steps:

Connect a chain toan existing ring 1. Click the chain tool:

2. The Chain Length option box appears.Scroll to select the desired length. ClickOK.

3. Align the cursor with the atom wherebonding is to occur:

4. Click to place the chain.

Structure Drawing

Connecting Rings and Chains

Page 26: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

26

Specifying Atoms

• When structures are drawn with the pencil tool, ring tool, or chain tool, the atoms in the structure are whatever element appears in the current atom box.

• Atoms may be changed.

Structure Drawing

Page 27: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

27

If you want to Follow these steps:

Change one or more atoms to one of the elements in the common atoms palette

1. Click the pencil tool:

2. From the common atoms palette, click the element

Once, to change 1 atom in the structure

Twice, to change >1 atom in the structure

3. Place the cursor over the atom(s) to be changed. Click.

Structure Drawing

Specifying Atoms

Page 28: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

28

If you want to Follow these steps:

Change one ormore atoms to anelement not in thecommon atomspalette

1. Click the pencil tool:

2. From the Draw pull-down menu, selectAtom.

3. The Atom Selection menu appears. Clickthe radio button associated with the atomof interest. Click Single Use or MultipleUse.

4. Place the cursor over the atom(s) to bechanged. Click.

Specifying Atoms

Structure Drawing

Page 29: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

29

TipsInserting an Atom• When inserting an atom in a structure, look for an "A" to

appear in the body of the pencil tool, indicating that the pencil tip is in correct position, directly over an atom.

• Illustration:

Structure Drawing

Page 30: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

30

• When structures are drawn with the pencil tool, ring tool, or chain tool, the atoms in the structure are whatever element appears in the current atom box.

• Atoms may be changed to any of the pre-drawn shortcuts.

Structure Drawing

Specifying Shortcuts

Page 31: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

31

If you want to Follow these steps:

Change one ormore atoms to oneof the shortcuts

1. Click the pencil tool:

2. Click the shortcut tool:

3. The Shortcuts menu appears. Click theradio button associated with the shortcutof interest. Click Single Use or MultipleUse.

4. Place the cursor over the atom(s) to bechanged. Click.

Structure Drawing

Specifying Shortcuts

Page 32: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

32

Specifying Bonds

• When structures are drawn with the pencil tool, ring tool, or chain tool, the atoms in the structure are whatever element appears in the current atom box.

• Bonds may be changed.

Structure Drawing

Page 33: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

33

If you want to Follow these steps:

Change one or more bonds to one of the bonds in the common bonds palette

1. Click the pencil tool:

2. From the common bonds palette, click the bond

Once, to change 1 bond in the structure

Twice, to change >1 bond in the structure

3. Place the cursor over the bond(s) to be changed. Click.

Structure Drawing

Specifying Bonds

Page 34: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

34

If you want to Follow these steps:

Change one ormore bonds to abond not in thecommon bondspalette

1. Click the pencil tool:

2. From the Draw pull-down menu, select Bond.

3. The Bond Selection menu appears. Click theradio button associated with the bond ofinterest. Click Single Use or Multiple Use.

4. Place the cursor over the bond(s) to bechanged. Click.

Structure DrawingSpecifying Bonds

Page 35: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

35

Correcting MistakesIf you want to Follow these steps:

Undo the lastmodification made Click the undo icon:

Erase an atom andits attached bond 1. Click the eraser tool:

2. Click the atom to be erased.

Clear the drawingwindow

From the Edit pull-down menu, selectClear All.

Structure Drawing

Page 36: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

36

TipsResetting Default Settings for Atoms/Bonds• When "multiple use" is selected for an atom or

shortcut, the default atom is changed to that selection and will remain in effect until defaults are reset by clicking the default atom/bond icon:

Structure Drawing

Page 37: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

37

TipsChanging atoms• Atoms can be changed by doing the following:

1 Click the selection tool.2 Click to highlight the atoms to be changed (shift).3 Select the new atoms.

Structure Drawing

Page 38: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

38

Displaying Carbon Atoms

• Carbon atoms in a structure can be displayed as

• To toggle between options, click the carbon display icon:

Structure Drawing

Page 39: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

39

Step Action

1

Click the save icon:

2 The Save Query As dialog box appears.

In the File Name: entry box, type a name forthe structure. For Windows, the nameshould have 8 or fewer characters.

Click OK.

After a structure is built, it must be saved prior to using it in an STN structure search

Structure Drawing

Page 40: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

40

Saving the StructureStep Action

3 A Saving dialog box appears. Deselect anychecked boxes.

Click Save.

Structure Drawing

Page 41: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

41

Saving the StructureStep Action

3 A Saving dialog box appears. Deselect anychecked boxes.

Click Save.

Structure Drawing

X

Page 42: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

42

Structure Drawing

Verify the Structure:

Page 43: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

43

Draw and save each structure below, using the techniques just discussed.

Skills Practice

Page 44: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

44

Structure Search Strategy

Page 45: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

45

In this section, you will learn how to

• Determine the type of structure search to do, based on your information

needs• Upload a structure to STN• Run a structure search in REGISTRY and display structure matches• Locate CAplus references to structures of interest

Structure Search Strategy

Page 46: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

46

Preview

• Structure searches can be run to locate

Exact matches of a structure Structures containing a structural

skeleton or fragment of interest Literature references discussing

structures of interest

• The steps involved in all structure searches follow a similar order.

Structure Search Strategy

Page 47: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

47

To locate research papers discussing a structure of interest

Step 1 Draw and save the structure.

Step 2 Logon to STN and enter a structure-searchable database.

Step 3 Upload the structure.

Step 4 Run a sample structure search.

Step 5 Evaluate structure matches.

Step 6 Run a full-file structure search.

Step 7 Find references discussing the structure matches of interest.

Preview

Structure Search Strategy

Page 48: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

48

Search Question:Locate references discussing the preparation of the following substance:

Structure Search Strategy

Page 49: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

49

Step 1: Draw and save the structure

Step 2: Logon to STN and enter a structure-searchable database

(The FILE command is used to enter a structure-searchable database)

FILE 'HOME' ENTERED

=> FILE REGISTRY

Structure Search Strategy

Page 50: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

50

Step 3: Upload the structure

Step Action

1 Click the Upload Structure Query icon:

2 The Structure File: dialog box appears. Click tohighlight the name of the structure file ofinterest. Click Open.

3 The structure is uploaded into STN. An L-number is assigned to the structure:

=>Uploading C:\Program

Files\Stnexp\Queries\pract1.str

L1 STRUCTURE UPLOADED

Structure Search Strategy

Page 51: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

51

S t e p A c t i o n

3 V e r i f y t h e s t r u c t u r e o n l i n e :

= > D L 1 L 1 H A S N O A N S W E R S L 1 S T R

C H 2C H 2

C H 2N

C H 2C H 2

M e O

M e O

i - P r C N

M e O

M e O

M e

Step 3: Upload the structure

Structure Search Strategy

Page 52: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

52

Step 4: Run a sample structure search• The perspective of the search question is to locate exact

matches on a given structure.

• A sample structure search searches a portion of the database. It is a no-cost option that is used to evaluate the effectiveness of a structure search by

Testing the structure search to ensureTesting the structure search to ensure it will run within system limitsit will run within system limits

Verifying that the types of answersVerifying that the types of answers retrieved are the types of answers retrieved are the types of answers

desireddesired

Structure Search Strategy

Page 53: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

53

• The SEARCH command is used to run a sample structure search. At the command line, 3 additional pieces of information are required:

A) L-number assigned to the structure during uploading

Structure Search Strategy

Page 54: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

54

B) Type of structure search

Three types of structure searches are possible, depending

on the types of substance matches desired:

Use this type of structure search: To retrieve this type of substance

match Exact Family Substructure

Exact substance X X X

Stereoisomers X X X

Salts X X

Mixtures X X

Substitution at open sites

X

Variability in bonds is allowed in all type of searches

Structure Search Strategy

Page 55: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

55

C) Scope of the structure search

Two scopes of structure searches are possible

Scope of structure search When you want to search

Sample Full

A portion of the database at no cost

X

The entire database X

Structure Search Strategy

Page 56: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

56

=> SS L1L1 EXACTEXACT SAMSAM

SAMPLE SEARCH INITIATED 13:45:28 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 6 TO ITERATE

100.0% PROCESSED 6 ITERATIONS 2 ANSWERSSEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 6 TO 266PROJECTED ANSWERS: 2 TO 124

L2 2 SEA EXA SAM L1

This search is projected to run to completion online within system limits.

Structure matches ("answers") are placed in an answer set that is given the next available L-number.

Structure Search Strategy

Page 57: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

57

Step 5: Evaluate structure matches

• The no-cost D SCAN feature is used to evaluate structure search results.

• D SCAN randomly selects an answer from the answer set and displays it:

Structure Search Strategy

Page 58: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

58

=> D SCAN

L2 2 ANSWERS REGISTRY COPYRIGHT 2001 ACS

IN Benzeneacetonitrile, a-[3-[[2-(3,4- dimethoxyphenyl)ethyl]methylamino]propyl]-3,4- dimethoxy-a-(1-methylethyl)-, (aS)- (9CI)

MF C27 H38 N2 O4

CI COM

Absolute stereochemistry.

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

Structure matches are in line with the desired results.

Specify here the number of additional answers you want to see.

(CH2)3

i-PrMe

OMe

OMe

OMe

MeO

CNN S

Structure Search Strategy

Page 59: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

59

L2 2 ANSWERS REGISTRY COPYRIGHT 2001 ACS

IN Benzeneacetonitrile, α-[3-[[2-(3,4-dimethoxyphenyl)

ethyl]methylamino]propyl]-3,4-dimethoxy-α-(1-

methylethyl)-, labeled with deuterium

(9CI)

MF C27 H34 D4 N2 O4

ALL ANSWERS HAVE BEEN SCANNED

(CH2)3

Pr-i Me

OMeMeO

OMeOMe

CN

C N CH2 CH2

Structure Search Strategy

Page 60: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

60

Step 6: Run a full-file structure search• A full-file structure search searches the entire

database. • The SEARCH command is also used to run a full-file

structure search. At the command line, 3 additional pieces of information are required:1 L-number assigned to the structure during uploading2 Type of structure search3 Scope of the structure search - e.g., full

Structure Search Strategy

Page 61: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

61

=> SS L1L1 EXACTEXACT FULLFULL

FULL SEARCH INITIATED 13:50:30 FILE 'REGISTRY'

FULL SCREEN SEARCH COMPLETED - 67 TO ITERATE

100.0% PROCESSED 67 ITERATIONS 17 ANSWERS

SEARCH TIME: 00.00.01

L3 17 SEA EXA FUL L1 The results of a full-file structure search are placed in a new answer set.

Structure Search Strategy

Page 62: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

62

Option: Evaluate answers

=> D SCAN

L3 17 ANSWERS REGISTRY COPYRIGHT 2001 ACS

IN Benzeneacetonitrile, -[3-[[2-(3,4- dimethoxyphenyl)ethyl]methylamino]propyl]-3,4- dimethoxy--(1-methylethyl)-, labeled with deuterium (9CI)

MF C27 H38 N2 O4

CI COM

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

(CH2)3

i-Pr Me

OMeMeO

OMeOMeCN

C N CH2 CH2

Structure Search Strategy

Page 63: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

63

Helpful HINT

There are fees associated with full-file searches of most structure-searchable databases. To see associated costs, type the following within the file of interest:

=> HELP COST

Structure Search Strategy

Page 64: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

64

Recall the Search Question:Locate references discussing the preparation of the following substance:

Structure Search Strategy

Page 65: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

65

=> FILE CAPLUS => S L3/PREP 7690 L3 2726608 PREP/RLL4 49 L3/PREP (L3 (L) PREP/RLL3 (L) PREP/RL)

Recall: L3 is the answer set resulting from the full-file structure search.

Syntax: Type a slash and the CAS Role after the L-number

• The no-cost D SCAN feature is also available in CAplus. When the answer set contains more than one answer, D SCAN randomly selects an answer from the

set and displays it.

Evaluating results

Structure Search Strategy

Page 66: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

66

Evaluating results

=> D SCAN L4 49 ANSWERS CAPLUS COPYRIGHT 2001 ACS

TI Investigation of radiochemical purity of 99mTc- verapamilST verapamil technetium 99m labeling radiochem purity

IT 52-53-9DP, Verapamil, 99mTc-labeled 14133-76-7DP, Technetium 99, verapamil labeled with, preparation RL: PNU (Preparation, unclassified); PRP (Properties); PREP (Preparation) (preparation and radiochem. purity of 99mTc- verapamil)

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

The CAS RN from the REGISTRY answer set that caused the answer to be retrieved is highlighted.

Structure Search Strategy

Page 67: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

67

Displaying CAplus answers in more detail

• The DISPLAY command is used to show additional information for the answers retrieved from CAplus. Several predefined display formats are available:

This predefined display format

Contains these parts of the CAplus record:

BIB Bibliographic information

IBIB Bibliographic information with field names spelled out

ABS Abstract

HITSTR Index term, CA index name, and structure for the hit CAS RN

ALL Full record, all citations (BIB + ABS + indexing)

Structure Search Strategy

Page 68: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

68

Step 1 Draw and save the structure.

Step 2 Logon to STN, and enter theREGISTRY file.

=> FILE REGISTRY

Step 3 Upload and verify the structurequery.

=> D L1

Step 4 Run a sample structure search => S L1 EXACT SAM

Structure Search Strategy

Summary

Page 69: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

69

Step 5 Evaluate results.

Option: Modify the originalstructure, as needed.

=> D SCAN

Repeat steps 1-5.

Step 6 Run a full-file structure search,and then evaluate results.

=> S L1 EXACT FULL=> D SCAN

Step 7 Enter the CAplus file, and findreferences citing the substances.

=> FILE CAPLUS=> S L2/PREP

=> D BIB ABS HITSTR

Structure Search Strategy

Summary

Page 70: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

70

1 Locate studies on the following substance, as well as stereoisomers and radiolabeled forms. Display the bibliographic information for the first 3 answers in the CAplus answer set.

2 Locate references reporting the preparation of the following specific substance or any of its salts. Display the BIB, ABS, and HITSTR information for the first 3 answers in the CAplus answer set.

Skills Practice

Page 71: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

71

Substructure Searching

Page 72: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

72

Using the structure in the Search Question, run the following searches:

Exact full

Family full

Substructure full

Skills Practice

Page 73: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

73

Use D SCAN to look at the answers retrieved in each search. Note the different kinds of

substances retrieved by each type of search.

Exact Family SSS

No. of answers

Unsubstituted structure

Radiolabeled forms,stereoisomers

Salts, mixtures,polymers

Substances withadditional substitution

Skills Practice

Page 74: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

74

Substructure Searching

In this section, you will learn how to draw structures for substructure searches to retrieve matches with the desired

Ring system fusion Bonding patterns Substitution at open positions

Page 75: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

75

Default Assumptions• STN Express assigns attribute information to the rings and chains in a

substructure query. These attributes determine the types of answers retrieved by a substructure search.

• Default assumptions can be modified.

For this structural feature in the search

query:

STN Express makes assumptions to retrieve structure matches with the

Ring systems Ring system as drawn

Ring system as part of a larger ring system

Atom in a chain Atom in a chain only

Bond in a chain

Bond in a chain only

Substructure Searching

Page 76: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

76

Ring Systems• The default assumption for ring systems is called

"isolated/embedded". STN automatically searches for each ring system in a structure

As drawn, without other rings attached to it via bonds (isolated)

As part of a larger ring system, with other rings fused to it (embedded)

• An option exists to specify that a ring system in a structure be isolated.

Substructure Searching

Page 77: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

77

Search Question: Locate substances with the following characteristics:

Requirements: R and R' = anything, including hydrogen No substitution at the carbon in the ring marked with an asterisk (*) The ring may be further substituted at the other open sites, but there may be no additional rings fused at these positions

Substructure Searching

Page 78: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

78

Building the StructureWorksheet

Structure search requirement

Action

R and R' = anything, including H

No substitution at the carbon in the ring marked with an asterisk (*)

The ring may be further substituted at the open sites

There may be no additional rings fused

Substructure Searching

Don’t draw any atom

Draw a Hydrogen

Don’t draw any atom

Look at the following procedure

Page 79: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

79

Ring isolation• To prevent the substructure search from retrieving the ring

system as part of a larger ring system, isolate the ring by doing the following:

Step Action

1 Position the cursor over an atom in the ring system that is to be isolated.

2 Right-click. A side-menu of structural items that you can change will display. Select Ring Isolation:

Substructure Searching

Page 80: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

80

Ring isolation

Step Action

3 The Ring Isolation dialog box appears. Select Isolated. Click OK.

4 (Option) Verify that a change was made to the structure.

Click the display attributes icon:

Substructure Searching

Page 81: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

81

Ring isolationStep Action

5 The Query Attributes pop-up menu appears. Move the cursor to any atom in the isolated ring system. The currently assigned value is shown.

Substructure Searching

Page 82: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

82

=> FILE REGISTRY

=> Uploading C:\Program Files\Stnexp\Queries\sss-ring.strL1 STRUCTURE UPLOADED

=> D L1L1 HAS NO ANSWERSL1 STR

N

N

O

O

H

Substructure Searching

Page 83: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

83

=> S L1 SSS SAM (or only: => S L1or only: => S L1)SAMPLE SEARCH INITIATED 14:49:50 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 380 TO ITERATE

100.0% PROCESSED 380 ITERATIONS 1 ANSWERSSEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 6431 TO 8769PROJECTED ANSWERS: 1 TO 80L2 1 SEA SSS SAM L1

Substructure Searching

Page 84: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

84

=> D SCAN L2 1 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 3-Pyridinecarboxylic acid, 1-ethoxy-1,4-dihydro- 4-(phenylimino)-, ethyl ester (9CI)MF C16 H18 N2 O3CI COM

ALL ANSWERS HAVE BEEN SCANNED

EtO OEt

Ph

C

O

N

N

Substructure Searching

Page 85: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

85

=> S L1 SSS FULL (or only: => S L1 FULLor only: => S L1 FULL)FULL SEARCH INITIATED 14:50:08 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED - 8011 TO ITERATE

100.0% PROCESSED 8011 ITERATIONS 21 ANSWERSSEARCH TIME: 00.00.01L3 21 SEA SSS FUL L1

=> D SCAN L3 21 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 3-Pyridinecarboxylic acid, 5-cyano-1,4-dihydro-4- imino-6-methyl-1-(phenylmethyl)-, ethyl ester (9CI)MF C17 H17 N3 O2

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

OEt

Me

Ph

NC

CH2

N

C

ONH

Substructure Searching

Page 86: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

86

If this structure was drawn and Ring Isolation set to Isolated

would the following substances be retrieved in a substructure search?

Retrieved Not Retrieved

Skills Practice

Page 87: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

87

Retrieved Not Retrieved

If this structure was drawn and Ring Isolation set to Isolated

would the following substances be retrieved in a substructure search?

Skills Practice

Page 88: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

88

Run the search for this fragment twice:o One with the ring isolated o One with the ring isolated embedded

Look at the differences

Skills Practice

Page 89: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

89

Atoms• Atoms in a substructure query may be

Ring Chain Ring/Chain

Substructure Searching

Page 90: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

90

Atoms• The default assumption for atom retrieval depends on the

location of an atom in the substructure query.

If, in the substructure query, an atom is part of a

Then, in the retrieved structure matches

Chain The atom will always be part of a chain

Ring The atom will always be part of a ring

Substructure Searching

Page 91: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

91

Search Question: Locate substances with the following characteristics:

R, R', and R" = anything, including H The ring system may be substituted at all of the open sites There may be additional rings fused to the ring system R' and R" may form a ring with the N, e.g.,

Substructure Searching

Page 92: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

92

Building the Structure

Worksheet

Structure search requirement

Action

R, R', and R" = anything, including H Ring system may be substituted at all of the open sites

There may be additional rings fused to the ring system

R' and R" may form a ring with the nitrogen, e.g.,

Substructure Searching

Don’t draw any atom

Don’t draw any atom

Rings isolated/embedded

Look at the following procedure

Page 93: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

93

Ring/chain atoms• To allow the substructure search to retrieve a chain atom as

part of a ring or a chain, do the following:

Step Action

1 Position the cursor over the atom of interest.

2 Right-click. A side-menu of structural items that you can change will display. Select Node Characteristics:

Substructure Searching

Page 94: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

94

Step Action

3 The Node Characteristics dialog box appears. Select Ring/Chain. Click OK.

4 (Option) Verify that a change was made to the structure.

Click the display attributes icon:

Ring/chain atoms

Substructure Searching

Page 95: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

95

=> FILE REGISTRY

=> Uploading C:\Program Files\Stnexp\Queries\sss-atom-rc.strL1 STRUCTURE UPLOADED

=> D L1L1 HAS NO ANSWERSL1 STR

NO

O

CH2N

Substructure Searching

Page 96: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

96

=> S L1 SSS SAMSAMPLE SEARCH INITIATED 15:06:29 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 15 TO ITERATE

100.0% PROCESSED 15 ITERATIONS 4 ANSWERSSEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 68 TO 532PROJECTED ANSWERS: 4 TO 200

L2 4 SEA SSS SAM L1

Substructure Searching

Page 97: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

97

=> D SCAN L2 4 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 3-Quinolinecarboxylic acid, 4-(4-hydroxy-3- methoxyphenyl)-6,7-dimethoxy-2-(1H-1,2,4-triazol -1-ylmethyl)-, ethyl ester (9CI)MF C24 H24 N4 O6

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

OEtMeO

OMe

MeO

C

O

CH2N

NN

N

OH

Note: The N atom is part of a ring in this substructure match.

Substructure Searching

Page 98: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

98

=> D SCAN L2 4 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 3-Quinolinecarboxylic acid, 2-[(dibutylamino) methyl]-4-(3,4-dimethoxyphenyl)-6,7-dimethoxy-, ethyl ester (9CI)MF C31 H42 N2 O6

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

Note: The N atom is part of a chain in this substructure match.

N(Bu-n)2

OEtMeO

OMe

MeO

OMe

C

O

CH2N

Substructure Searching

Page 99: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

99

=> S L1 SSS FULLFULL SEARCH INITIATED 15:06:52 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED - 288 TO ITERATE100.0% PROCESSED 288 ITERATIONS 114 ANSWERSSEARCH TIME: 00.00.01L3 114 SEA SSS FUL L1

=> D SCANL3 114 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 3-Quinolinecarboxylic acid, 4-(3,4-dimethoxy phenyl)-6,7-dimethoxy-2-(4-thiomorpholinylmethyl)-, ethyl ester (9CI)MF C27 H32 N2 O6 S

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

OEtMeO

OMe

MeO

OMe

C

O

CH2N

NS

Substructure Searching

Page 100: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

100

Helpful HINT

To change the characteristics of multiple atoms at one time, do the following:

1. Click the selection tool icon:

2. Hold down the SHIFT key while clicking on atoms whose characteristics are to be changed.

3. From the QueryDef pull-down menu, select Node Characteristics.

4. The Node Characteristics dialog box appears. Select Ring/Chain. Click OK.

Substructure Searching

Page 101: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

101

Bonding Patterns• Each bond in a substructure query has three

characteristics associated with it:

Bond type - Whether the bond is part of a ring or a chain

Bond structure - Whether the bond is single, double, triple, or unspecified

Bond value - Whether the bond is exact or part of an alternating single/double bond system (e.g., tautomers)

Substructure Searching

Page 102: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

102

Bond Type• Bonds in a substructure query may be

Ring Chain Ring/Chain

Substructure Searching

Page 103: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

103

Bond TypeIf, in the substructure

query, a bond is part of a Then, in the retrieved structure

matches

Chain The bond will always be part of a chain

Ring

The bond will always be part of a ring

note

Changing chain atoms to Ring/Chain, as in the previous example, does not automatically change the bonds to that atom to Ring/Chain.

Substructure Searching

Page 104: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

104

Bond Type

An option exists to specify that a bond type be ring/chain.

By default, the chain bonds in this substructure

Retrieve answers where the bonds are in a chain:

But not answers where the bonds are in a ring:

Substructure Searching

Page 105: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

105

Search Question: Expand the previous query to retrieve substructure matches where the O atoms and the CH2—N chain are also parts of rings fused onto the quinoline ring.

Requirements: There may be any type of substitution at all open sites There may be additional rings fused to the ring system

Substructure Searching

Page 106: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

106

Building the StructureWorksheet

Structure search requirement

Action

Any type of substitution at all open sites

Additional rings may be fused to the ring system, including those formed by the O atoms and the CH2—N chain

Substructure Searching

Don’t draw any atom

Ring isolated/embeddedLook at the following procedure

Page 107: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

107

• To allow the substructure search to retrieve a chain bond as part of a ring or a chain, do the following:

Step Action

1 Position the cursor over the bond of interest.

2 Right-click. The Bond Characteristics dialog box appears. Select Ring/Chain. Click OK.

3 (Option) Verify that a change was made to the structure.

Click the display attributes icon:

Bond type

Substructure Searching

Page 108: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

108

=> FILE REGISTRY

=> Uploading C:\Program Files\Stnexp\Queries\sss-bond-rc.strL1 STRUCTURE UPLOADED

=> D L1L1 HAS NO ANSWERSL1 STR

NO

O

CH2N

Substructure Searching

Page 109: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

109

=> S L1 SSS SAMSAMPLE SEARCH INITIATED 17:20:56 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 2128 TO ITERATE

47.0% PROCESSED 1000 ITERATIONS 3 ANSWERSINCOMPLETE SEARCH (SYSTEM LIMIT EXCEEDED)SEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 39795 TO 45325PROJECTED ANSWERS: 3 TO 278

L2 3 SEA SSS SAM L1

Substructure Searching

Page 110: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

110

=> S L1 SSS FULLFULL SEARCH INITIATED 17:23:15 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED - 42125 TO ITERATE100.0% PROCESSED 42125 ITERATIONS 133 ANSWERSSEARCH TIME: 00.00.01L3 133 SEA SSS FUL L1

=> D SCAN L3 133 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 1,4-Dioxino[2,3-g]quinoline-8-carboxylic acid, 9- (3,4-dimethoxyphenyl)-2,3-dihydro-7-(1- piperidinylmethyl)-, ethyl ester (9CI)MF C28 H32 N2 O6

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

OEt

MeOOMe

C

O

CH2NN

O

O

Substructure Searching

Page 111: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

111

Locate references discussing compounds containingthe following substructure

Requirements:

Any of the atoms may be part of a ring There may be any type of substitution at any

open site

Skills Practice

N

N NS

+

Have you found this answer?Have you found this answer?

S

N

N

N

Page 112: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

112

Bond Structure• Bonds in a substructure query may be drawn as

Single Double Triple Unspecified

Substructure Searching

Page 113: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

113

Bond StructureIf, in the substructure

query, the bond is Then, in the retrieved structure matches,

the bond will be

Single Single - In aromatic/tautomeric structures, these may display as double

Double Double - In aromatic/tautomeric structures, these may display as single

Triple Triple

Unspecified Single, double, or triple

Substructure Searching

Page 114: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

114

Search Question:Locate substances with the following characteristics:

Requirements: R is a carbon atom in a chain The bond may be single or double No additional ring fusion is desired on either ring system Any substitution may be present at all open sites

Substructure Searching

Page 115: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

115

Building the Structure

Worksheet

Structure search requirement

Action

R is a carbon atom in a chain

The bond is single or double

No additional ring fusion is desired on either ring system

Additional substitution may be present at all open sites

Substructure Searching

Draw a C atom

Look at the following procedure

Rings are isolated

Don’t draw any atom

Page 116: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

116

• To allow the substructure search to retrieve a bond that is single or double, do the following:

Step Action

1 Draw the structure with the bond drawn as a single bond.

2 From the Draw pull-down menu, select Bond.

Bond structure

Substructure Searching

Page 117: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

117

Step Action

3 The Bond Selection dialog box appears. Select Unspecified. Click Single Use.

Substructure Searching

Bond structure

Page 118: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

118

Step Action

4 Position the cursor over the bond to be changed. Click to place the bond.

Bond structure

Substructure Searching

Page 119: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

119

=> FILE REGISTRY

=> Uploading C:\Program Files\Stnexp\Queries\sss-unspec-.strL1 STRUCTURE UPLOADED

=> D L1L1 HAS NO ANSWERSL1 STR

N

H

H

H

H H

O

Both ring systems were isolated.

Substructure Searching

Page 120: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

120

=> S L1 SSS SAMSAMPLE SEARCH INITIATED 12:04:39 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 605 TO ITERATE

100.0% PROCESSED 605 ITERATIONS 9 ANSWERSSEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 10625 TO 13575PROJECTED ANSWERS: 9 TO 360

L2 9 SEA SSS SAM L1

Substructure Searching

Page 121: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

121

=> D SCAN L2 9 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN Isoquinoline, 1,2-dibenzoyl-6,7-diethoxy-1,2,3,4- tetrahydro- (7CI)MF C27 H27 N O4

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

Here, the bond to the O is a double bond. Note the double bonds in the fused ring!

EtO

EtO

PhPh

C

O

NC

O

Substructure Searching

Page 122: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

122

L2 9 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 1-Isoquinolinemethanol, 1,2,3,4-tetrahydro-6,7- dimethoxy-2-methyl--phenyl-, [S-(R*,S*)]- (9CI)MF C19 H23 N O3 Absolute stereochemistry.

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

Here, the bond to the O is a single bond.

MeMeO

MeO

Ph OH

N

HS

R

Substructure Searching

Page 123: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

123

=> S L1 SSS FULLFULL SEARCH INITIATED 12:05:40 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED - 12523 TO ITERATE

100.0% PROCESSED 12523 ITERATIONS 148 ANSWERSSEARCH TIME: 00.00.01

L3 148 SEA SSS FUL L1

Substructure Searching

Page 124: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

124

Bond Value• Bonds in a substructure query may be

Exact Normalized Exact/Normalized

Substructure Searching

Page 125: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

125

Bond Value

• STN Express automatically assigns bond values based on characteristics of a substructure.

• It often assigns a bond to be exact or normalized to maximize the answers retrieved by a query.

• Generally, the bond values assigned by the software do not lead to undesirable answers.

Substructure Searching

Page 126: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

126

Bond Value

Substructure Searching

Run three exact full searches for the previous structure.

With all exact bonds With all normalized bonds With all exact-normalized bonds

Look at the differences

Page 127: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

127

Bond Value

Substructure Searching

With all exact bonds

=> l1 exa fullFULL SEARCH INITIATED 09:27:14FULL SCREEN SEARCH COMPLETED - 4911 TO ITERATE 100.0% PROCESSED 4911 ITERATIONS 305 ANSWERSSEARCH TIME: 00.00.01 L2 305 SEA EXA FUL L1 => l2 not ids/ci 222437 IDS/CIL3 74 L2 NOT IDS/CI

Page 128: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

128

Bond Value

Substructure Searching

=> d scan L3 74 ANSWERS REGISTRY COPYRIGHT 2003 ACSIN Cyclohexane-d7 (6CI, 7CI, 9CI)MF C6 H5 D7  

With all exact bonds

Page 129: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

129

Bond Value

Substructure Searching

With all normalized bonds

=> l4 exa fullFULL SEARCH INITIATED 09:29:16FULL SCREEN SEARCH COMPLETED - 20679 TO ITERATE 100.0% PROCESSED 20679 ITERATIONS 1820 ANSWERSSEARCH TIME: 00.00.01 L5 1820 SEA EXA FUL L4 => l5 not ids/ci 222437 IDS/CIL6 182 L5 NOT IDS/CI

Page 130: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

130

Bond Value

Substructure Searching

With all normalized bonds=> d scan L6 182 ANSWERS REGISTRY COPYRIGHT 2003 ACSIN Benzene, tetracosamer, radical ion(1+) (9CI)MF (C6 H6)24CI PMS, RIS CM 1  

Page 131: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

131

Bond Value

Substructure Searching

With all exact-normalized bonds

=> l7 exa fullFULL SEARCH INITIATED 09:30:29FULL SCREEN SEARCH COMPLETED - 29534 TO ITERATE 100.0% PROCESSED 29534 ITERATIONS 2160 ANSWERSSEARCH TIME: 00.00.01 L8 2160 SEA EXA FUL L7 => l2 or l5L9 2125 L2 OR L5 => l8 not l9L10 35 L8 NOT L9

Page 132: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

132

Bond Value

Substructure Searching

With all exact-normalized bonds

=> d scan L10 35 ANSWERS REGISTRY COPYRIGHT 2003 ACSIN Cyclohexenylium-1-d (9CI)MF C6 H8 D  

D

1+

Page 133: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

133

REGISTRY has only these bond values

(Single, double, or triple) Exact (Single, double) Normalized

With STNExpress, you can search

Exact Exact or Normalized Normalized Unspecified

Substructure Searching

Page 134: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

134

To change the bond values assigned by STN Express, do the following:

Step Action

1 Click the selection tool icon:

2 Hold down the Shift key while clicking on the bonds whose values are to be changed.

3 From the QueryDef pull-down menu, select Bond Characteristics.

4 The Bond Characteristics dialog box appears. Click Exact. Click OK.

Bond value

Substructure Searching

Page 135: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

135

Normalized Bonds in REGISTRY

• In ringsIn rings, normalized bonds are used when – Alternating single and double bonds can be drawn all

the way around a ring– There are an even number of atoms in the ring

• In chains (or part of rings)In chains (or part of rings), normalized bonds are used only for certain tautomers

Structure drawing preview: STN Express determines normalized bonds as you draw.

Substructure Searching

Page 136: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

136

Bond Value

This bond value is

assigned

To substructures with

Normalized Certain tautomeric bonds,

Q------C====Q

Q = Heteroatoms with at least one hydrogen atom

Substructure Searching

Page 137: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

137

Normalized bonds are used for certain tautomeric chains:

• Central atom 2 = C, N, P, As, Sb, S, Se, Te, Cl, Br, I

• Hetero atoms 1 and 3 = N, O, S, Se, Te• One hetero atom must have a hydrogen

(or D, T, or a charge)

1 32 3H 21 H

Substructure Searching

Page 138: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

138

Bond Value

This bond value is

assigned

To substructures with

Normalized bonds in rings containing an even number of atoms and alternating single and double bonds

Substructure Searching

Page 139: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

139

Bond ValueThis bond value

is assigned To substructures with

Exact Atoms that are not part of a "tautomeric" or "aromatic" sequence. E.g., the C-C bonds in this cyclohexane are exact single bonds.

Substructure Searching

Page 140: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

140

Bond ValueThis bond value is

assigned

To substructures with

Normalized bonds in rings containing an even number of atoms and alternating single and double bonds (e.g., all of the bonds in the "benzo" part of the following structure, are normalized, including the "single bond" it shares with the cyclohexane ring.)

Substructure Searching

Page 141: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

141

Normalized bondsExamples Examples

Keto-enol tautomers do not fit the REGISTRY tautomer normalized bond rule, so you have to build a query to find both variations.

acidsamines

amides

CMe2

MeCHO

Pr-iMe C

O

Not normalized bonds

Substructure Searching

Page 142: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

142

Normalized bonds

benzenes

N+

cyclooctaquinolizinium

thiophenes

S

N

S

thiazoles

Normalized is not synonymous with aromatic.

Substructure SearchingNot normalized bonds

anthracenes

Page 143: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

143

Substructure Searching

Please run a Full Exact search for the following structures and look at the differences.

Exact bonds Normalized bonds Exact/Normalized bonds Exact bonds

10 6 16 5

Page 144: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

144

Substructure Searching

If you draw with STN Express the following structure how the software considers the value bonds? Why?

Page 145: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

145

Substructure Searching

If you run an Exact full search of this structure with all bonds Normalized you get 1 answer

If you run an Exact full search of this structure with all bonds Exact/Normalized do you get the previous answer?

Page 146: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

146

Do these rings have normalized bonds?

Me

CH2 Na

N

N

Substructure Searching

Page 147: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

147

Me

CH2 Na

N

N

YES YES

NOsingle exact bonds are denoted

Do these rings have normalized bonds?

Substructure Searching

Page 148: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

148

Search Question:STN Express assigned "exact/normalized" bonds to the highlighted bonds in the following structure:

Eliminate the answers where the C6 rings contain alternating single and double bonds.

Substructure Searching

Page 149: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

149

=> FILE REGISTRY

=> Uploading C:\Program Files\Stnexp\Queries\sss-bond-value.strL1 STRUCTURE UPLOADED

=> D L1L1 HAS NO ANSWERSL1 STR

H

H

H

H H

O

Substructure Searching

Page 150: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

150

=> S L1 SSS SAMSAMPLE SEARCH INITIATED 14:02:02 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 28 TO ITERATE

100.0% PROCESSED 28 ITERATIONS 3 ANSWERSSEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 243 TO 877PROJECTED ANSWERS: 3 TO 163

L2 3 SEA SSS SAM L1

Substructure Searching

Page 151: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

151

=> D SCANL2 3 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 1-Naphthalenemethanol, decahydro--[6-methoxy-2,3- bis(phenylmethoxy)phenyl]-2,5,5,8a-tetramethyl-, (1S,2S,4aS,8aS)- (9CI)MF C36 H46 O4

Absolute stereochemistry.

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

MeMe

OMe

Me Me

Ph

PhOH

O

O

H

HS

SSS

Substructure Searching

Page 152: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

152

OH O

Run two searches for the following structures. No ring fusions, other double bonds, or substitutions, are allowed. No salts or mixture desired. Which kind of search?

Skills Practice

Page 153: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

153

=> l1 exa fullL2 6

O

Page 154: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

154

O=> l3 exa fullL4 74=> l4 not l2L5 74

Page 155: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

155

Can you run only one search for retrieving both the structures?Which searching structure? Which kind of search?

O=> l6 exa fullL7 157

Skills Practice

Page 156: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

156

=> l7 not l2L8 151 L7 NOT L2 => l8 not l4L9 77 L8 NOT L4

OH

Page 157: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

157

S

N

S

N

Run two different searches for the reported structures (Full SSS, STN Express default).- Are the results different?- If yes,why?

Skills Practice

Page 158: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

158

=> l1 full L2 2531

H

H

H

H

H

NS

Page 159: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

159

=> l3 full L4 2184

H

H

H

H

H

NS

Page 160: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

160

=> l2 and l4L5 1486

EtO OEt

Ph

CH

NH

CHC

C

C

OH

O

S

O

Page 161: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

161

=> l2 not l4 and nc=1L6 832

EtS

Ph

C N CH2 CH CH2

Page 162: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

162

=> l4 not (l2 or m/els) and nc=1 L8 698

t-BuO

Me

PhCN

S

NHCCH2

O

C

O

Page 163: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

163

S

N

Which structure search should be run in order to collect all the previous results?

Skills Practice

Page 164: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

164

N

S

 => l7 full

L8 12146

Page 165: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

165

=> l8 not (l2 or l4)L11 1058

MeMe

CHS

NO

O

C+ -

Page 166: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

166

BONDS• Be carefull with cycles with odd number of atoms.

Expecially if they contain hetero atoms.• The bonds are not normalized usually (unless…).• The position of double bonds depends on

nomenclature.• Which the best way for searching them?

Substructure Searching

Page 167: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

167

Run two different searches for the reported structures (Full SSS, STN Express default).- Are the results different?- If yes,why?

N

N

N

Me

Ph

N

N

N

Me

Ph

Skills Practice

Page 168: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

168

N

N

N

Me

Ph

=> l1 fullL2 24

Page 169: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

169

N

N

N

Me

Ph

=> l3 fullL4 63

Page 170: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

170

=> l2 not l4L5 22 L2 NOT L4

Me Ph

Ph

CH2NN

N

Page 171: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

171

=> l4 not l2L6 61 L4 NOT L2

Me

Me

Ph CH CHN CH2N

N

Page 172: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

172

Which structure search should be run in order to collect all the previous results?

N

N

N

Me

Ph

Skills Practice

Page 173: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

173

N

N

N

Me

Ph

=> l7 fullL8 225

Page 174: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

174

=> l8 not l2L9 201 L8 NOT L2 => l9 not l4L10 140 L9 NOT L4

Page 175: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

175

MeMe

Ph

N

NN

E

Page 176: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

176

Pr-i

i-Pr

Me

Me

Me

Me

Ph

Cl

CN

NN+

Page 177: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

177

Skills Practice

CHCH

CHCH

CH

CH2NH2

CHCH

CHCH

CH

N

Which is the less expensive search for retrieving both of the previous compounds?

Page 178: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

178

BONDS

BE CAREFUL WITH BONDS!

Substructure Searching

Page 179: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

179

Substructure Searching

Extend

Page 180: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

180

Substructure Searching=> fil reg L1 STRUCTURE UPLOADED => dL1 HAS NO ANSWERSL1 STR 

N

F S

F O

Page 181: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

181

Substructure Searching

=> l1 FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 1 TO 80PROJECTED ANSWERS: 0 TO 0 L2 0 SEA SSS SAM L1 => l1 fullFULL SEARCH INITIATED 06:07:43FULL SCREEN SEARCH COMPLETED - 61 TO ITERATE 100.0% PROCESSED 61 ITERATIONS 0 ANSWERS61 ITERATIONS 0 ANSWERSSEARCH TIME: 00.00.01L3 0 SEA SSS FUL L1

Page 182: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

182

Substructure Searching

=> set extend on permSET COMMAND COMPLETED => l1 fullFULL SEARCH INITIATED 06:08:38 L4 61 SEA SSS FUL L1 EXTENDL4 61 SEA SSS FUL L1 EXTEND CANDIDATE STRUCTURE SEARCH COMPLETED - 61 TO ITERATE 100.0% PROCESSED 61 ITERATIONS 0 ANSWERSSEARCH TIME: 00.00.01 L5 0 SEA SSS FUL L1

Page 183: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

183

Substructure Searching

=> d scan l4

i-BuMeS SMe

F3C CHF2

C N

O

N

CH

i-Bu Me

MeS

CF3F2CH

NO

N

C

O

Page 184: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

184

Atom Substitution• There are a number of ways to control atom

substitution in substructure search matches:

If you want this kind of atom substitution

Build this into the substructure query:

A "class" of atoms, e.g., halogens

STN-defined variable atoms

A broadly defined list of substituents

G-group

Variable attachment positions for a ring substituent

Variable point of attachment

Substructure Searching

Page 185: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

185

Variable Atoms• A number of system-defined options are available

to allow variability in atom substitution:When this symbol is

part of a substructure These types of matches are

retrieved

X Any halogen

A Anything, except H

Q Anything, except C or H

M Any metal

(continued on next slide)

Substructure Searching

Page 186: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

186

Variable Atoms

When this symbol is part of a substructure

These types of matches are retrieved

(continued from previous slide)

Cb Any carbocyclic ring system

Cy Any ring system

Hy Any heterocyclic ring system

Id "Dummy node" - used in specialized polymer searches

Ak Any carbon chain

Substructure Searching

Page 187: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

187

Search Question:Locate substances with the following characteristics:

Requirements: R is any type of heterocyclic ring, with any type of substitution R' is any non-hydrogen ring or chain substituent No additional ring fusion is desired on either ring system Any substitution may be present at all open sites

Substructure Searching

Page 188: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

188

Building the Structure

Worksheet

Structure search requirement

Action

R is any type of heterocyclic ring with any kind of substitution

R' is any non-hydrogen ring or chain substituent

No additional ring fusion is desired on either ring system

Any substitution may be present at all open sites

Substructure Searching

Look at the following procedure

Draw a A variable (R/C)

Rings are isolated

Don’t draw any atom

Page 189: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

189

• To add a variable group to an existing structure, do the following:

Step Action

1

Click the variables icon:

2 The Variable Selection dialog box appears. Select the radio button labeled "Hy". Click Single Use.

Variable group

Substructure Searching

Page 190: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

190

Step Action

3 Click to place the variable in the structure.

Variable group

Substructure Searching

Page 191: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

191

=> FILE REGISTRY

=> Uploading C:\Program Files\Stnexp\Queries\sss-variable.strL1 STRUCTURE UPLOADED

=> D L1L1 HAS NO ANSWERSL1 STR

N

N

CH2CH2

CH2Hy

A

The rings were isolated, and the node characteristic of A was set to ring/chain.

Substructure Searching

Page 192: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

192

=> S L1 SSS SAMSAMPLE SEARCH INITIATED 16:29:55 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 2576 TO ITERATE

38.8% PROCESSED 1000 ITERATIONS 15 ANSWERSINCOMPLETE SEARCH (SYSTEM LIMIT EXCEEDED)SEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 48478 TO 54562PROJECTED ANSWERS: 400 TO 1144

L2 15 SEA SSS SAM L1

Substructure Searching

Page 193: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

193

=> D SCANL2 15 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 2,5-Pyrrolidinedione, 1-[3-[4-(4-methoxyphenyl)-1- piperazinyl]propyl]-(9CI)MF C18 H25 N3 O3CI COM

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

(CH2)3

OMe

N

N

N

OO Hy

A

Substructure Searching

Page 194: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

194

=> S L1 SSS FULL

FULL SEARCH INITIATED 16:30:53 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED - 51401 TO ITERATE

100.0% PROCESSED 51401 ITERATIONS 1028 ANSWERSSEARCH TIME: 00.00.03

L3 1028 SEA SSS FUL L1

Substructure Searching

Page 195: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

195

G-Groups• In addition to the STN system-defined variable groups (e.g., X), variable

groups containing substituents of your own definition may be created. • User-defined variable groups are called G-groups on STN. A G-group

can have up to 20 substituents comprising it. Those substituents may be:

Specific elements Shortcuts System-defined variable groups Structural fragments Other G-groups

Substructure Searching

Page 196: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

196

Search Question:Locate substances with the following characteristics:

Requirements: R = C, N R' = Me, Et, n-Pr, i-Pr, OH, SH, X No additional ring fusion is allowed on either of the 6-membered rings Additional substitution may be present at all open sites

Substructure Searching

Page 197: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

197

Building the Structure

Worksheet

Structure search requirement

Action

R = C, N

R' = Me, Et, n-Pr, i-Pr, OH, SH, X

No additional ring fusion on the 6-membered rings

Additional substitution at all open sites

Substructure Searching

Look at the following procedure

Look at the following procedure

Rings are isolated

Don’t draw any atom

Page 198: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

198

• To define and add a G-group to an existing structure, do the following:

Step Action

1 From the Draw pull-down menu, select G-groups.

G-group

Substructure Searching

Page 199: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

199

Step Action

2 The G-Group Definitions dialog box appears. Click New.

3 The Define New G-Group dialog box appears. Select Atoms, Shortcuts, or Variables, as needed.

G-group

Substructure Searching

Page 200: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

200

Step Action

4 The Shortcuts selection menu appears. Click to place a checkmark in the checkboxes associated with items of interest. Click OK.

G-group

Substructure Searching

Page 201: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

201

Step Action

5 The Define New G-Group dialog box reappears. Continue to define the G-group as needed. Click Save.

G-group

Substructure Searching

Page 202: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

202

Step Action

6 The G-Group Definitions dialog box reappears. Click Single Use.

G-group

Substructure Searching

Page 203: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

203

Step Action

7 Click to place the G-group in the structure.

G-group

Substructure Searching

Page 204: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

204

=> FILE REGISTRY

=> Uploading C:\Program Files\Stnexp\Queries\sss-ggroup.strL1 STRUCTURE UPLOADED

=> D L1L1 HAS NO ANSWERSL1 STR

N

G2

CH2CH2

CH2G2

NO

G1G1 Me,Et,n-Pr,i-Pr,OH,SH,XG2 C,N

Substructure Searching

Page 205: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

205

=> S L1 SSS SAMSAMPLE SEARCH INITIATED 11:59:54 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 11 TO ITERATE

100.0% PROCESSED 11 ITERATIONS 2 ANSWERSSEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 22 TO 418PROJECTED ANSWERS: 2 TO 124

L2 2 SEA SSS SAM L1

Substructure Searching

Page 206: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

206

=> D SCANL2 2 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 4-Piperidinol, 1-[3-(6-fluoro-1,2-benzisoxazol-3- yl)propyl]-4-(4-fluorophenyl)- (9CI)MF C21 H22 F2 N2 O2

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

(CH2)3

F

F

N

N

O OH G1 = FG2 = C, C

Substructure Searching

Page 207: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

207

Variable Points of Attachment on Rings

• A substituent can be tagged to appear at variable positions on a ring. The substituent may be a

Specific element Shortcut System-defined variable group Structural fragment G-group

Substructure Searching

Page 208: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

208

Search Question:Locate substances with the following characteristics:

Requirements: R is any type of heterocyclic ring with any type of substitution No additional ring fusion is desired on the phenyl or N-containing rings Any substitution may be present at all open sites There must be at least one halogen atom attached to the phenyl ring, at the

positions marked with asterisks

Substructure Searching

Page 209: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

209

Building the Structure

Worksheet

Structure search requirement

Action

R is any type of heterocyclic ring with any type of substitution

No additional ring fusion is desired on the phenyl or N-containing rings

Any substitution may be present at all open sites

There must be at least one halogen atom attached to the phenyl ring, at the positions marked with asterisks

Substructure Searching

Draw a Hy variable

Rings are isolated

Don’t draw any atom

Look at the following procedure

Page 210: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

210

• To add a substituent with a variable point of attachment to an existing structure, do the following:

Step Action

1 Draw the ring-containing component and the substituent as separate, unattached fragments.

2 Click the selection tool icon:

Variable point of attachment

Substructure Searching

Page 211: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

211

Step Action

3 Hold down the SHIFT key. Click to highlight the substituent and then each position on the ring where the substituent may be attached.

4 From the Draw pull-down menu, select Variable Points of Attachment.

Variable point of attachment

Substructure Searching

Page 212: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

212

Step Action

5 STN Express draws a "connection" from the substituent toeach possible attachment site.

Variable point of attachment

Substructure Searching

Page 213: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

213

=> FILE REGISTRY

=> Uploading C:\Program Files\Stnexp\Queries\sss-vpa.strL1 STRUCTURE UPLOADED

=> D L1L1 HAS NO ANSWERSL1 STR

N

N

CH2CH2

CH2Hy X

Substructure Searching

Page 214: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

214

=> S L1 SSS SAM

SAMPLE SEARCH INITIATED 14:20:48 FILE 'REGISTRY' SAMPLE SCREEN SEARCH COMPLETED - 2577 TO ITERATE

38.8% PROCESSED 1000 ITERATIONS 26 ANSWERSINCOMPLETE SEARCH (SYSTEM LIMIT EXCEEDED)SEARCH TIME: 00.00.01

FULL FILE PROJECTIONS: ONLINE **COMPLETE** BATCH **COMPLETE**PROJECTED ITERATIONS: 48497 TO 54583PROJECTED ANSWERS: 849 TO 1831

L2 26 SEA SSS SAM L1

Substructure Searching

Page 215: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

215

=> D SCANL2 26 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN Pyrrolo[3,2-c]azepin-4(1H)-one, 5-[3-[4-(4- fluorophenyl)-1-piperazinyl]propyl]-5,6,7,8- tetrahydro-8-hydroxy-1-methyl- (9CI)MF C22 H29 F N4 O2

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):1

F at the para position

(CH2)3

Me

F

N

OH

O

N

N

N

Substructure Searching

Page 216: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

216

L2 26 ANSWERS REGISTRY COPYRIGHT 2001 ACSIN 1H-Indole-2,3-dione, 1-[3-[4-(3-chlorophenyl)-1- piperazinyl]propyl]-, dihydrochloride (9CI)MF C21 H22 Cl N3 O2 . 2 Cl H

HOW MANY MORE ANSWERS DO YOU WISH TO SCAN? (1):0

=> S L1 SSS FULLFULL SEARCH INITIATED 14:21:19 FILE 'REGISTRY' FULL SCREEN SEARCH COMPLETED - 51428 TO ITERATE100.0% PROCESSED 51428 ITERATIONS 1310 ANSWERS SEARCH TIME: 00.00.03 L3 1310 SEA SSS FUL L1

Cl at the meta position(CH2)3

Cl

N N

OO

N

HCl2

Substructure Searching

Page 217: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

217

SummaryIf the

substructure contains

Consider the following:

Substructure search default setting

Atoms with open sites

Can sites be freely substituted?

Should substitution be blocked?

Any substitution at open sites

Ring systems Should rings stand alone?

Can rings be embedded in larger ring systems?

Isolated/embedded ring systems

Chain atoms Should the atoms appear only in chains?

Can the atoms appear in rings?

Chain node characteristics

(continued on next slide)

Substructure Searching

Page 218: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

218

SummaryIf the

substructure contains

Consider the following:

Substructure search default setting

Chain bonds Should the bonds appear only in chains?

Can the bonds appear in rings?

Chain bond characteristics

Sites where you want to force substitution

Are the substituents of interest defined by system-defined variables?

Are the substituents of interest user-defined?

Variables

G-groups

A ring with one or more positions where substitution may occur

Variable point of attachment

Substructure Searching

Page 219: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

219

1 Are there any patents dealing with the preparation of compounds with the following structures? If so, who holds those patents?

Requirements:

R = Any carbocyclic ring system. All the carbocyclic rings may be identical or they may be different and

they may be substituted or unsubstituted.

Skills Practice

Page 220: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

220

2 Locate papers discussing substances with the following structure. What have these substances been used for?

Requirements:

R = Ring or chain carbon with any type of substitution

Skills Practice

Page 221: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

221

3 What companies recently have been working on compounds with the following structure?HINT: Look at the CS (corporate source) index in the CAplus file record.

Requirements:

R = Hydrogen or any non-hydrogen substituent R' = Any non-hydrogen, non-carbon substituent, e.g., a

keto functionality All other open sites may have any type of substitution Rings may have other rings fused to them

Skills Practice

Page 222: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

222

4 Locate papers discussing the synthesis of the following substances:

Requirements:

R = X, H, -NO2, -CF3

R' = Anything but hydrogen R" = C or N Any substitution may be present at all other open sites Phenyl ring shown in the structure may not have additional

rings fused to it

Skills Practice

Page 223: 1 Structure Searching with STN Express. 2 The REGISTRY database contains chemical substance information. Bibliographic references and abstracts of papers

223

5 Locate references discussing compounds containing the following structure fragment:

Requirements: At least 2 -OH groups must be attached to the rings

labeled "A" and "B” Any substitution at the remaining open sites Ring system may have other rings fused to it

Skills Practice