The prototypic integrin ligand, fibronectin, contains the amino acid sequence RGDS at the apex of a flexible loop connecting two β-strands in the 10th fibronectin type III repeat (Dickinson et al.
1994). The RGD sequence is also present in vitronectin, fibrinogen, and the LAP complex part of inactive transforming growth factor-β (TGF-β) and many other ECM proteins (Humphries et al.
2006). Elegant studies have shown that the epithelial αvβ6 and αvβ8, by binding RGD in LAP, are the major integrins that activate TGF-β in vivo either by allosteric changes in TGF-β-LAP (αvβ6) or by inducing matrix metalloproteinase-14 and causing proteolytic release of TGF-β (αvβ8; Aluwihare et al.
2009; Mu et al.
2002; Munger et al.
1999). More recently, a mechanical-strain-dependent contribution of myofibroblast β1 integrins to TGF-β activation has been demonstrated (Wipff et al.
2007).

In many biochemistry and cell biology books, the RGD sequence is presented as the major integrin-binding sequence, but only a third of the integrins are known to bind this sequence (Table ). RGD peptides can be used to assess the involvement of RGD-binding integrins in a certain event, but lack of effect of RGD peptides does not exclude integrin involvement in a process since two-thirds of integrins bind ligands RGD-independently.
Some isoforms of the basement membrane protein laminin contain an RGD sequence, but the sequence is not evolutionary conserved in these isoforms and is not recognized by the laminin-binding integrins. In laminin-111, the RGD site is masked in the native molecule (Aumailley et al.
1990; Schulze et al.
1996), whereas in laminin isoforms containing the α5 chain, it is exposed in the native molecule (Domogatskaya et al.
2008; Forsberg et al.
1994; Genersch et al.
2003; Sasaki and Timpl
2001). The full physiological importance of the αvβ1- and αvβ3-mediated binding to laminin-α5-containing laminins however remains unclear. As pointed out, the EHS-produced laminin-111 is a poor ligand for the laminin-binding integrins. Accumulating data suggest that a major integrin-binding site is present in laminin LG1-3 domains in the C-terminal part of the laminin heterotrimer (Ido et al.
2004; Kunneken et al.
2004). In addition to laminin α-chain-specifc sequences, C-terminal residues of both β- and γ-chains contribute/facilitate integrin binding (Ido et al.
2004; Kunneken et al.
2004). Splicing in the extracellular domain of integrin α7 affects it preference for certain laminin isoforms (Nishiuchi et al.
2003; Taniguchi et al.
2009; von der Mark et al.
2002; Table ).
RGD sequences present in triple helical fibrillar collagen sequences are normally not available for fibronectin receptors in native fibrillar collagen (Davis
1992; Gullberg et al.
1990), only becoming available in denatured collagen I. Instead, the collagen-binding integrins recognize the triple-helical GFOGER sequence (Knight et al.
1998), or variants thereof, in native collagens. An assembled database on collagen-I-binding sites and mutations has enabled the construction of a model in which the GFOGER site is present in a cell-interactive domain that is suggested to be exposed once per microfibril unit (each microfibrillar unit is composed of five collagen triple helix monomers), allowing the clustering of integrins (Sweeney et al.
2008). However, a number of components in the fibrillar matrix might influence the availability of these sites. Interestingly, collagen IX, which is present in cartilage, has been shown to be a good ligand for all of the collagen-binding integrins (Kapyla et al.
2004), but in vivo, not all of the collagen-binding integrins are expressed in chondrocytes. In the case of α11β1 integrin, if cartilaginous tissue is not dissected free of perichondrium, a signal from α11 will be derived from α11 expression in the perichondrium (Tiger et al.
2001), which lacks collagen IX. The identity of the integrins that might physiologically be relevant as collagen IX receptors in various regions of cartilage thus remains to be defined.
Collagen IX lacks the prototypic collagen-binding integrin recognition sequence GFOGER, and the exact sequence awaits mapping. Certain variants of the GFOGER sequence might show specificity for the different collagen-binding integrins, as indicated by a recent finding showing that the bacterial collagen-like peptide, sscl, with the active binding site GLPGER, is preferred by α11β1-expressing cells over α2β1-expressing cells (Caswell et al.
2008).
Other integrin-binding motifs include the α4-integrin-binding sequence LDV (Clements et al.
1994; Komoriya et al.
1991), variants of which are also present in the adhesion molecules ICAMs and MadCAM recognized by β2 integrins and the α4β7 heterodimer, respectively (Wang and Springer
1998).
In addition to physiological ligands, integrin ligands generated by proteolysis are receiving increasing recognition. Endostatin (derived from collagen XVIII), endorepellin (derived from perlecan), and tumstatin derived from collagen α3 (IV) are the best-known examples (Bix and Iozzo
2005; Suhr et al.
2009; Table ). In addition, integrins can bind snake toxins called disintegrins (Calvete et al.
2005; Swenson et al.
2007) and certain viruses (Stewart and Nemerow
2007) and bacteria (Hauck et al.
2006; Palumbo and Wang
2006). Some of these interactions occur outside the regular ligand-binding sites in the integrins and display distinct binding characteristics compared with the binding of physiological ligands.