Intracellular transport and vesicle trafficking are fundamental processes that occur in nearly all eukaryotic cells. These processes are complex, requiring the activity of 5–10% of the proteome in
Saccharomyces cerevisiae and likely a similar commitment in most other systems. It has been suggested that the origin of the membrane trafficking machinery was a key innovation for creation of the eukaryotic cellular state [
1-
3], and the presence of this system clearly distinguishes modern eukaryotes from prokaryotic organisms. Regardless of the precise mode by which membrane transport occurs, for example vesicle mediated or compartmental maturation models, a crucial aspect of the membrane trafficking process is how specificity of membrane fusion is determined.
It was initially proposed that vesicle fusion compatibility was encoded exclusively by coiled-coil SNARE proteins, which interact directly with the general Sec18/NEM-sensitive factor fusion system [
4,
5]. This model required that each transport vesicle contain a specific vesicle (V or R)-SNARE protein that interacts with cognate target (T)-SNAREs including the Qa-SNAREs (or syntaxins) and additional Q-SNARE proteins, which are present in the membrane of the target organelle. In all fusion reactions combinatorial pairing of the SNAREs was thought to provide specificity, but recent work has shown that SNARE pairing can be promiscuous and that additional factors, e.g. Rab family small GTPases, the regulatory syntaxin-binding (SM) proteins, and the tethering complexes, must also be involved [
6-
8]. With the exception of the tether complexes, these various factors all share the characteristic of being part of multigene families, and having distinct members of the family localised to, and participating in, transport steps at discrete subcellular compartments [
9].
Analyses of SNARE [
10-
12] and Rab [
13] sequences are consistent with each family having evolved from an ancestral gene, which then gave rise to the major SNARE and Rab gene families present in extant eukaryotes. As individual SNARE/Rab subfamilies are associated with distinct subcellular organelles, the most likely interpretation is that development of each new organelle was concurrent with the emergence of a novel SNARE or Rab paralogue. The gene duplications required to define the major compartments of the endomembrane system appear to have taken place before the last common eukaryotic ancestor (LCEA) arose; a similar model is also likely for the vesicle coat machinery [
14].
The SM proteins bind syntaxins and regulate the
trans-SNARE-SNARE interaction. As with SNAREs and Rabs, the SM proteins can be divided into protein subfamilies, each of which performs a similar function but at a specific and unique location within the cell (Figure ). Surprisingly, although the SM proteins appear to all be derived
via gene duplication, the different SM families bind syntaxins by distinct mechanisms [
8]. Sly1p and Vps45p bind syntaxin 5 and syntaxin 16 respectively
via the N-terminal domain of the SNARE in the open confirmation. Sec1p also interacts directly with its cognate syntaxin protein, but with the N-terminal domain of the SNARE folded over the C-terminal domain in a closed confirmation. By contrast, Vps33p binds its cognate SNARE indirectly as a part of the HOPS complex. These distinct modes of SNARE regulation by SM proteins have prompted speculation about how different mechanisms can result in essentially equivalent function and how such a situation could have arisen during evolution [
15].
The tethering factors participate at the earliest stage in the approach of a vesicle towards a target membrane. The presence of such tethers has been suspected for some time [
16], but whilst earlier work focused on potential roles of extended coiled-coil proteins as tethers, e.g. GM130 and Uso1, more compelling evidence for a predominant role for multiprotein complexes has recently emerged (reviewed in [
7,
17]). Knockout studies in
S. cerevisiae [
7,
17,
18] demonstrate stronger phenotypes for the components of many of these complexes than for the putative coiled-coil elements. Further evidence, again mainly from studies in
S. cerevisiae, indicates interactions between these complexes and central components of the fusion apparatus, specifically SNAREs, Rab and ARF GTPases and coat proteins (Figures and ). In addition, the tethering factors may be isolated as stable complexes. Tethering complexes have differing combinations and numbers of subunits, with the individual complexes (COG, HOPS, TRAPPI, TRAPPII, DSL1, exocyst and GARP) acting at distinct subcellular locations (Figures and ).
Investigating membrane trafficking in organisms beyond the conventional experimental systems can identify conserved, presumably essential, membrane trafficking factors, as well as highlight the diversity of trafficking pathways amongst extant eukaryotes and uncover novel biology in specific lineages. From a combination of molecular and morphological data, six eukaryotic super-groups are now recognized [
19,
20], but molecular studies of intracellular transport systems have focused primarily on
S. cerevisiae and metazoan taxa, both members of the Opisthokonta super-group. The relatively poor experimental tractability of many of the organisms within the additional eukaryotic super-groups poses a considerable challenge, but the availability of genome sequences from some of these taxa can facilitate rapid identification of factors in such systems.
Evolutionary investigation of the membrane trafficking machinery has revealed several features. Firstly, genomic, phylogenetic and cell biological evidence suggests that the LCEA possessed a complex endomembrane system. The major protein families known to be required for vesicle formation and fusion [
9,
21] were present very early on in eukaryotic evolution [
22-
24] and additional data support the presence of a full complement of membrane trafficking organelles [
25,
26]. We previously investigated the distribution of key components of major endocytic trafficking pathways, and found excellent conservation of both Rab and syntaxin genes plus components of the multivesicular body ESCRT system, suggesting that the major endocytic transport pathways are likely ancient [
24,
27]. However, evidence for substantial secondary losses of certain factors, for example Rab4, was also obtained, implicating secondary loss as a driver in the evolution of taxon-specific trafficking features together with emergence of novel functions in specific lineages [
27]. Consistent with this pattern of loss or degeneration is a recent study of the N-glycosylation system, where multiple absences of genes responsible for the construction of the dolichol-PP-linked N-glycan precursor were detected [
28], plus the independent loss of Golgi complex cisternal stacking in, at least, four major eukaryotic lineages [
25].
An investigation in metazoa, yeast and streptophyte plants showed that the four SM protein families are separate and encompassed representatives of two eukaryotic super-groups, Opisthokonta and Viridiplantae [
29]. However, further sampling of additional taxa is needed to properly address the distribution of the SM families and when these families originated. Similarly, a limited number of potential evolutionary relationships between the COG, GARP and exocyst tethering complexes have been described [
17,
30], based nearly exclusively on the presence of shared domains in yeast homologues of the complexes. Here we investigated the relationships and distributions of tethering factors and SM proteins across multiple representative taxa. In terms of deeper evolutionary relationships, we considered whether there was evidence for a common ancestral complex that gave rise to all of the extant tethering complexes, or independent evolution of each complex. The first model predicts a degree of sequence relatedness between the proteins comprising the different complexes. Further, we also considered the possibility that the tethering complexes may have arisen after the LCEA and therefore would display restricted taxon distribution. We find weak evidence for relationships between the complexes, explainable as the product of functional sequence constraints, and consistent with independent evolutionary origins. Furthermore, we find that the presence of SM proteins and the tethering complexes is wide spread, suggesting that they are ancient features of the membrane-trafficking machinery.