The interactions of Eukaryotic Linear Motif (iELM) web server facilitates the exploration of short linear motif- (SLiM) mediated interfaces within protein–protein interaction (PPI) networks (
1). The importance of SLiMs in the regulatory and signalling mechanisms of the cell is becoming increasingly apparent, as highlighted by their use as molecular switches coordinating phase transitions in the cell (
2) and their increasing association with disease (
3–5). SLiMs are key components in a wide range of biological pathways and are known to act as sites for post-translational modifications such as phosphorylation or ubiquitination, as targeting signals for particular subcellular locations and as ligand-binding sites for protein recruitment (
6,
7). The majority of known motifs bind onto the surface of globular domains and exhibit specificity for a particular subgroup of a domain family (
1). SLiMs tend to be just 3–10 amino acids in length with only 2–5 residues responsible for the majority of the binding affinity and specificity (
6). This means that discriminating bioinformatically between a stochastic match and a result of biological relevance is fraught with difficulties (
8).
A number of resources have undertaken the task of annotating experimentally validated SLiM classes with the most notable examples being the Eukaryotic Linear Motif (ELM) (
3), MiniMotif (
9) and ScanSite (
10) databases. These resources also allow searching of protein sequences for novel instances of these annotated classes using regular expression patterns or position-specific scoring matrices. However, due to the high likelihood of motifs occurring in a stochastic manner, the use of pattern matching alone produces a large number of false positive hits (
6). Methods have, therefore, been developed to incorporate additional filters based on the attributes of SLiMs, including sequence conservation (
11–13), structural availability (
14–16), biophysical feasibility (
17) and biological keywords (
18). Recently, a number of
de novo motif prediction tools have also emerged, capable of predicting new classes of SLiMs (
19–22). However, difficulties arise in removing the experimental bias towards medically relevant proteins as well as biases due to evolutionary relationships (
12).
A number of resources have been developed using PPI data, to help predict the SLiM functional class associated with a particular protein-binding domain. Dilimot (
21) and SLiMFinder (
19) use the over-representation of sequence motifs in proteins, known to interact with a particular globular domain, to predict the regular expression of the binding SLiM; the ADAN database (
23) uses high-resolution structures to predict SLiM-mediated interactions for well-known modular protein domains (SH3, SH2, WW, etc). In contrast, NetworKIN (
24) employs interaction data to predict which kinase is responsible for a particular phosphorylation site. The identification of SLiM-mediated interactions within PPI data on the fly has however, to the best of our knowledge, not been investigated. To alleviate this, we introduce the iELM server that uses the annotated ELM regular expressions, especially trained Hidden Markov Models (HMMs) based on the manual annotation of SLiM-binding domains and PPI data to identify SLiM-mediated interactions. In addition, iELM takes into consideration many of the important attributes of SLiMs identified in some of the aforementioned studies, including the tendency of SLiMs to occur in regions of intrinsic disorder (
25) and the propensity of functional motifs to be evolutionary conserved (
6,
13). The iELM web server allows the identification of SLiM-mediated interactions associated with a protein of interest or within a users’ PPI network.