Volume 1 Chapter 6 2D Searching Overview
Ex.1
Find all entries which contain the thiourea substructure:

The search question could be formulated as follows:
T1 *CONN AT1 N 1 AT2 C 3 0 AT3 N 1 AT4 S 1 0 BO 1 2 1 BO 2 3 1 BO 2 4 2 END QUES T1
Between these two records atom property records ATp and bond property records BO define the topology of the fragment.
Other types of records can be present to place special restrictions on the fragment and its environment. These will be introduced later in the chapter.
mca is the minimum number of connected atoms excluding terminal H atoms.
We use the minimum value rather than the actual value to allow for different levels of substitution at an atom.
nh is the number of attached terminal H atoms.
atom 2 is bonded to atom 3 by a single bond;
atom 2 is bonded to atom 4 by a double bond.
The 2 single bonds and 1 double bond at C result in a valency of 4 and therefore there is really no need to specify nh=0 in the atom property record of atom 2.

Ex.2 Find all entries in which the carboxylate group is bonded through both oxygen atoms to Cd or Zn:

The search question could be formulated as follows:
T1 *CONN ELDEF MM = Cd, ZN AT1 MM 2 AT2 O 2 AT3 C 2 AT4 O 2 BO 1 2 BO 1 4 BO 2 3 1,2,7 BO 3 4 1,2,7 END QUES T1
A new element symbol must be declared, in this case MM, which is then set equal to Cd or ZN.
No distinction is made between upper- and lower-case.
If you use an ELDEF instruction then it must appear ahead of the atom property record in which the new element symbol is used
These are single bonds and the default bond type in a bond property record is 1, ie. single.
Hence there is no need to specify it explicitly.
or 2 double
or 7 delocalised double.
This illustrates the use of a variable bond type which is important if you suspect that the database may not be consistent in the connectivity representation of a particular bonded fragment.

Ex.3 Find acyclic ethers represented by the substructure:

In this case three different connectivity search tests will be used to illustrate various features:
T1 *CONN T2 *CONN T3 *CONN AT1 C 1 AT1 C 1 E AT1 C 1 3 AT2 O 2 AT2 O 2 AT2 O 2 AT3 C 1 AT3 C 1 AT3 C T3 BO 1 2 A BO 1 2 A BO 1 2 A BO 2 3 A BO 2 3 A BO 2 3 A END END END QUES T1 QUES T2 QUES T3
This requires that the two bonds be acyclic.
The sub-keyword C can be used if you wish to specify that a bond must be cyclic.
Accordingly a search with test T1 would register hits for all acyclic ethers.
When E (for EXACT) is present the mca value is interpreted as the exact number of connected atoms, excluding terminal H atoms.

Thus the coding here has the same effect as the use of E in test T2.
This illustrates the fact that it is often possible to code a connectivity test in several different ways, each producing the same result.
T3 specifies that atom 3 has a total coordination number of 3, ie. the total number of atoms connected to atom 3, including terminal H atoms, is 3.
In other words, atom 3 has sp2 hybridisation.

Ex.4 Find entries containing the substructure:

Three different connectivity search tests will be used to illustrate new features:
T1 *CONN T2 *CONN T3 *CONN
AT1 S 2 AT1 S 2 AT1 S 2
AT2 C 2 AT2 C 2 AT2 C 2
AT3 N 2 AT3 N 2 AT3 N 2
AT4 C 2 AT4 C 2 AT4 C 2
AT5 C 1 AT5 C 1 AT5 C 1
BO 1 2 BO 1 2 BO 1 2
BO 2 3 BO 2 3 BO 2 3
BO 3 4 BO 3 4 BO 3 4
BO 4 1 BO 4 1 BO 4 1
BO 3 5 BO 3 5 BO 3 5
END NOCR NOLN
QUES T1 END END
QUES T2 QUES T3
A search with test T1 would register hits for all of the structures (i)-(iv) shown below.
NOCR requires that, for a hit to be registered, the atoms of the fragment cannot be connected to atoms outside the fragment by cyclic routes (bonds).
A search with test T2 would register a hit only for (i).
NOLN requires that, for a hit to be registered, the atoms of the fragment cannot be directly linked to any other atoms of the search fragment except by the bonds specified in the bond property records, ie. the bonds in the diagram.
A search with test T3 would register hits for (i)-(iii).

Back to Table of Contents
Volume 1 Chapter 6 Summary of Connectivity Search Test Instructions.