Species have been first differentiated from morphological traits, and nowadays phenotypic criteria are still used to characterize them. Even for bacteria, phenotypic characteristics should agree with phylogenetic relatedness to constitute a species (
Wayne et al., 1987;
Stackebrandt et al., 2002). The underlying idea is that genetically distinct organisms should also be phenotypically distinct. Several cases, for which phylogeny, phenotype and ecological niche are related, support this view. For instance, in the group of asexual species of bdelloid rotifers, genetic and morphological clusters are the same and result from niche divergence (
Fontaneto et al., 2007). In bacteria of the genus
Bacillus, genetic groups and growth temperature are also linked as a consequence of the ecology of these species (
Guinebretière et al., 2008).
However, several studies have revealed that the genetic distances and the phenotypes can be poorly related, as it has been found in eukaryotes species such as
Zea mays (maize) (
Burstin & Charcosset, 1997) and
Lolium perenne (ryegrass) (
Roldán-Ruiz et al., 2001). Similar results have been observed for bacterial species such as members of the genus
Cronobacter (
Baldwin et al., 2009) and strains of
Staphylococcus aureus (
Morandi et al., 2010). A well-known phenomenon that can disrupt the link between genetic distance and phenotype is the phenotypic convergence resulting from similar ecological niches of distinct genetic groups. For instance, life-history strategies are associated with specific habitats in
Saccharomyces cerevisiae, and genetically distant strains sharing the same habitat have similar life-history strategies (
Spor et al., 2009).
The
E. coli species is of particular interest to study the relationships between phylogenetic relatedness and phenotypic variation. The evolutionary history of the species (
Lecointre et al., 1998) revealed that the strains are distributed among five main phylogroups: A, B1, B2, D and E (
Herzer et al., 1990;
Escobar-Páramo et al., 2004a). In addition, natural isolates of
E. coli are found in a variety of habitats, which can be either vertebrate hosts or water or soil (
Hartl & Dykhuizen, 1984) and can be commensals (
Tenaillon et al., 2010), intra-intestinal pathogens (Intestinal Pathogenic
E. coli or InPEC) or extra-intestinal pathogens (extra-intestinal pathogenic
E. coli or ExPEC) (
Kaper et al., 2004). We chose to call lifestyles the combinations of habitat and pathogenicity. The prevalence of the different phylogroups varies slightly between lifestyles. For instance, farm animals exhibit a higher proportion of A and B1 strains and a lower proportion of B2 and D strains than wild animals. Likewise, ExPEC strains belong mainly to the phylogroup B2 (
Picard et al., 1999). However, there is no clear-cut link between phylogroups and lifestyles, i.e. no lifestyle can be uniquely attributed to a given phylogroup (
Gordon & Cowling, 2003;
Escobar-Páramo et al., 2006).
E. coli genome, which encompasses approximately 4700 genes, is highly dynamic: the core-genome, the genes present in all the sequenced genomes, is about 2000 genes, whereas the pan genome, the full set of nonorthologous genes among all genomes, reaches 18 000 genes (
Rasko et al., 2008;
Touchon et al., 2009).
Based on this large genetic diversity and the various lifestyles, we expect to find a large phenotypic variation within the species. The nonrandom distribution of the phylogroups among different lifestyles may indicate that these groups differ in phenotypes. Besides, as anthropogenic factors such as domestication play a major role in the ecological structure and the level of antimicrobial resistance of
E. coli (
Escobar-Páramo et al., 2006;
Skurnik et al., 2006), the exposure of a strain animal host to humans could influence the phenotype of the bacterium. The prevalence of
E. coli and the relative abundance of the phylogroups depend on the host diet (
Gordon & Cowling, 2003), which might also have an impact on the strain phenotype. Finally, the strain phenotype could be globally linked to the pathogenic nature of the bacterium as this has been shown to be the case for a given metabolic phenotypic character. Indeed, the use of deoxyribose constitutes a fitness advantage for the competitiveness of extra-intestinal pathogenic
E. coli strains (
Bernier-Febreau et al., 2004;
Martinez-Jéhanne et al., 2009).
To assess the extent of intra-species diversity as well as the links between phylogeny, lifestyle and phenotype, we assayed the growth yield (carbon source utilization) of a panel of genetically diverse E. coli natural isolates. We included several phylogenetic outgroups in the study as well as one phenotypic outgroup to test whether our methodology gives a global and representative image of a strain phenotype. Metabolic capacities are conditioned by the occurrence of specific enzymatic reactions in the cell that can be inferred from the strain gene content. Therefore, to go further, we studied in a subset of strains the relationship between growth capacities and metabolic pathways reconstructed from complete genome data. Hence, we were able to analyse the correlations between phylogenetic distance, metabolic phenotypic distance and metabolic pathway presence. Overall, the strain growth yield seemed to present continuous variations around the species average, whereas the pattern of the presence/absence of the metabolic pathways was linked to the species phylogeny. Finally, we discussed the impact of the species life cycle on the metabolic phenotypic diversity and the molecular mechanisms that could account for discrepancies between growth and the presence of metabolic pathways.