Substituent
In organic chemistry and biochemistry, a substituent is an atom or group of atoms which replaces one or more hydrogen atoms on the parent chain of a hydrocarbon, becoming a moiety of the resultant new molecule. The terms substituent and functional group, as well as other ones are used almost interchangeably to describe branches from a parent structure, though certain distinctions are made in the context of polymer chemistry. In polymers, side chains extend from a backbone structure. In proteins, side chains are attached to the alpha carbon atoms of the amino acid backbone.
The suffix -yl is used when naming organic compounds that contain a single bond replacing one hydrogen; -ylidene and -ylidyne are used with double bonds and triple bonds, respectively. In addition, when naming hydrocarbons that contain a substituent, positional numbers are used to indicate which carbon atom the substituent attaches to when such information is needed to distinguish between isomers. The polar effect exerted by a substituent is a combination of the inductive effect and the mesomeric effect. Additional steric effects result from the volume occupied by a substituent.
The phrases most-substituted and least-substituted are frequently used to describe or compare molecules that are products of a chemical reaction. In this terminology, methane is used as a reference of comparison. Using methane as a reference, for each hydrogen atom that is replaced or "substituted" by something else, the molecule can be said to be more highly substituted. For example:
- Markovnikov's rule predicts that the hydrogen added to the carbon of the alkene functional group that has the greater number of hydrogen atoms.
- Zaitsev's rule predicts that the major reaction product is the alkene with the more highly substituted double bond.
Nomenclature
The use of the suffix is determined by the number of hydrogen atoms that the substituent replaces on a parent compound. According to the 1993 IUPAC recommendations:
- -yl means that one hydrogen is replaced.
- -ylidene means that two hydrogens are replaced by a double bond between parent and substituent.
- -ylidyne means that three hydrogens are replaced by a triple bond between parent and substituent.
For multiple bonds of the same type, which link the substituent to the parent group, the prefixes di-, tri-, tetra-, etc., are used: -diyl, -triyl, -tetrayl, -diylidene.
For multiple bonds of different types, multiple suffixes are added: -ylylidene, -ylylidyne, -diylylidene.
The parent compound name can be altered in two ways:
- For many common compounds the substituent is linked at one end and historically not numbered in the name. The IUPAC 2013 Rules however do require an explicit locant for most substituents in a preferred IUPAC name. The substituent name is modified by stripping -ane and adding the appropriate suffix. This is "recommended only for saturated acyclic and monocyclic hydrocarbon substituent groups and for the mononuclear parent hydrides of silicon, germanium, tin, lead, and boron". Thus, if there is a carboxylic acid called "X-ic acid", an alcohol ending "X-anol", or an alkane called "X-ane", then "X-yl" typically denotes the same carbon chain lacking these groups but modified by attachment to some other parent molecule.
- The more general method omits only the terminal "e" of the substituent name, but requires explicit numbering of each yl prefix, even at position 1. Pentan-1-yl is an example of a name by this method, and is synonymous with pentyl from the previous guideline.
Methane substituents
According to the above rules, a carbon atom in a molecule, considered as a substituent, has the following names depending on the number of hydrogens bound to it, and the type of bonds formed with the remainder of the molecule:methane | no bonds | |
− | methyl group or methanyl | one single bond to a non-hydrogen atom |
= | methylene group or methanylidene or methylidene | one double bond |
−− | methylene bridge or methandiyl or methdiyl | two single bonds |
≡CH | methanylidyne group or methylidyne | one triple bond |
=CH− | methine group or methanylylidene or methylylidene | one single bond and one double bond |
>CH− | methantriyl group or methtriyl | three single bonds |
≡C− | methanylylidyne group or methylylidyne | one triple bond and one single bond |
=C= | methandiylidene group or methdiylidene | two double bonds |
>C= | methandiylylidene group or methdiylylidene | two single bonds and one double bond |
>C< | methantetrayl group or methtetrayl | four single bonds |
Structures
In a chemical structural formula, an organic substituent such as methyl, ethyl, or aryl can be written as R It is a generic placeholder, the R derived from radical or rest, which may replace any portion of the formula as the author finds convenient. The first to use this symbol was Charles Frédéric Gerhardt in 1844.The symbol X is often used to denote electronegative substituents such as the halides.