The control of flowering time in
A. thaliana has been the focus of much study over the past decade. Yet, despite the wealth of resources at our disposal, a clear picture of the species-wide genetic architecture of flowering time has not yet emerged, since the simultaneous analysis of populations representing several parents has been the exception (
Simon et al. 2008;
Kover et al. 2009;
Brachi et al. 2010).
Most of the previous work mapping flowering-time QTL has used RILs. Because RILs represent immortalized, largely fixed recombinant genotypes that can be phenotyped many times, genotyping costs could be amortized over many phenotyping trials. In the past few years, expenses associated with genotyping have dropped considerably, and adoption of next-generation sequencing platforms promises to further lower costs while increasing the resolution of genotyping (
e.g.,
Baird et al. 2008;
Huang et al. 2009;
Xie et al. 2010). Apart from marker analysis, polymorphism discovery used to be a major bottleneck, before the advent of ultra-high-resolution microarrays and new sequencing methods (
Clark et al. 2007;
Ossowski et al. 2008). We have investigated the potential of F
2 populations as an alternative to immortal RILs, by making full use of our knowledge of hundreds of thousands of polymorphisms described for 20 accessions (
Clark et al. 2007).
The major QTL that we detected could explain on average about 40% of the overall variation, indicating that the remaining 60% of flowering-time variation must be associated with modest-effect QTL that lie below our significance threshold. That the unexplained variance does not hinder us from predicting the parental flowering times suggests that the remaining effects must be (1) very small, and therefore remain undetectable in our populations, and (2) equally distributed between negative and positive effects, thus canceling each other out. We also observed extensive variation in the onset of flowering in all F
2 populations, even when the parental accessions flowered at very similar times. Because hybridization of
A. thaliana accessions occurs regularly in the wild (
Abbott and Gomes 1989;
Bergelson et al. 1998;
Nordborg et al. 2005;
Picó et al. 2008), our results have important implications for the initial stages of adaptation via flowering time.
We also compared our results to a recently published species-wide study of flowering-time QTL in maize. In our populations, 54 of 55 QTL alleles altered flowering time by at least 1 day, while this was true for only 7 of 333 QTL in maize (
Figure S4,
Buckler et al. 2009). In
A. thaliana, an average of 3–4 QTL per F
2 population explained 3.1–22.7 days difference in flowering (mean 10.1 days), while the combined effects of 13–14 maize QTL per population in maize ranged from 1.5 to 13.0 (mean 6.2).
Also in contrast to maize, a small number of regions was overrepresented for flowering-time QTL. Two of them include
FRI and
FLC, although our F
2 populations were all exposed to prolonged cold in an attempt to identify vernalization-independent loci. Variation at the
FRI region strongly contributed to flowering-time variation in RILs, reaching values as high as 46%, and averaging 21.5% across all RILs. The relative importance of the
FRI genomic region in our 17 F
2 populations was not quite as strong, averaging only 12.1% of total variance, and was never higher than 19%, indicating that 6 weeks of vernalization was effective in limiting the contribution of FRI to flowering time. QTL mapping to the
FLC genomic region explained between 4 and 53.5% of the standing variation in our populations (), and between 2 and 37% in RILs (), confirming
FLC as a major gene for flowering time. Lov-5 carries a strong, vernalization-insensitive
FLC allele (
Shindo et al. 2006), which may skew the mean and range associated with
FLC: after removal of
FLCLov-5 from our list, mean variance dropped to 13.3 (range of 4–41.5) and was then more in line with results obtained with RILs. Two additional regions where QTL clustered overlapped with the locations of the remaining members of the
FLC clade,
FLM, and
MAF2-5, in both our F
2 and RIL populations. Mean variance and range were comparable in both sets of populations, suggesting that the observed allelic series at
FLM and
MAF2-5 between 14 of our 18 founding accessions might also apply to the RIL parental accessions as well. We detected QTL overlapping with the
FLM genomic regions twice as often as in RIL studies, possibly reflecting the partial bias in RIL parental accessions. Indeed, the common laboratory accessions Col and Ler were crossed, either to each other or to other accessions, to create 12 of the 19 RIL populations characterized for flowering-time QTL (
Table S5). In field experiments such as B
rachi and C
olleagues (2010), the decision to flower results from the integration of daily temperature cycles and gradual photoperiod changes. Only under these conditions—where daily temperatures often did not raise above 10°—were QTL in genes associated with the circadian clock detected, indicating that low temperatures may define a sensitized condition for variation in clock function in the specific context of flowering time. In all other studies, including this study, growth conditions included a constant temperature >16° and long and nonchanging photoperiods sufficient to saturate the photoperiodic pathway, thus allowing the emergence of effects caused by general mediators of flowering time and providing an explanation for the absence of clock-associated loci in our list of QTL.
Although we vernalized seedlings for 6 weeks before release at 16°, we still detected QTL mapping to the
FRI,
FLC, and
FLM/MAF2-5 genomic regions. Our growth chambers maintain very good control of temperature, light intensity, and air humidity, which likely greatly limited phenotypic variation due to microenvironmental noise and therefore enhanced our ability to detect QTL. In addition, the relatively low temperature of 16° generally delays flowering in long days compared to 23° (
Lempe et al. 2005). Responses to ambient temperature involve
SVP, as demonstrated by a similar flowering time at 16° and 23° for
svp mutants (
Lee et al. 2007). SVP function is dependent on FLM, as an
svp loss of function can suppress the late flowering caused by
FLM overexpression (
Scortecci et al. 2003). It is thus conceivable that
flm mutants might be similarly insensitive to changes in ambient temperature and that growing plants at 16° allowed us to measure differences in the strength of
FLM alleles that had escaped detection in several previous studies.
Although two of our major QTL clusters overlap with the locations of
FLM and
MAF2-5, initial genome-wide association studies failed to identify significant SNPs at either
FLM or
MAF2–MAF5 (
Atwell et al. 2010;
Brachi et al. 2010). Genome-wide association studies fail when they include too few accessions with functionally variant alleles, or if too many of the functionally variant alleles are distinct from each other. The evidence for allelic series at all our QTL is in support of the latter hypothesis. Only after an increase in sample size from 96 to 473 unique accessions did
MAF2 emerge as a possible flowering-time QTL candidate following association mapping (
Li et al. 2010). In all cases described in Arabidopsis, one constant feature remains: QTL for flowering time are few, but are associated with large effects.
The chromosomal location of most strong-effect QTL is in itself quite striking: aside from
ER, which is close to the centromere of chromosome 2, all other flowering-time QTL candidate genes (
FLC,
FLM,
MAF2-5, and
FRI) are located at the ends of their respective chromosomes. Following hybridization, parental genomes recombine and segregate to form novel combinations of alleles in the progeny. The low frequency of crossovers each generation means that large, intact fragments of parental chromosome will be transmitted to the progeny. The large-effect QTL that we detected in our populations would thus generate distinct pools of alleles in the F
2 and subsequent generations, which could have adaptive significance due to variation in flowering time. On the other hand, growth-related traits tend to display more complex genetic architectures than flowering time, with many small-effect QTL, and are often ripe with epistatic interactions (
Vlad et al. 2010). This delicate balance of alleles will be severely disrupted after hybridization and formation of pools of early and late-flowering plants; however, positioning flowering-time QTL to ends of chromosomes will limit the extent of genetic drag imposed on the rest of the chromosome.
In conclusion, we have identified a small number of genomic regions with strong effects on flowering time. Some of the same regions, and indeed candidate genes, are now coming to the forefront through genome-wide association mapping studies. That
FLM has yet to be described as being associated with flowering-time variation in association studies might mean only that the number of accessions remains too small, if many rare alleles contribute. The complete sequencing of hundreds, and soon thousands, of genomes from
A. thaliana accessions (
Weigel and Mott 2009) is a prerequisite for the genome-wide annotation of potential functional polymorphisms; apart from the direct analysis of QTL candidates, this will also improve the power of genome-wide association studies, since alleles that are the consequence of convergent changes in activity can be combined.