MS-based proteomics has proven to be an indispensable technology for the unbiased analysis of large numbers of proteins. It is routinely applied to study the composition and dynamics of subcellular organelles, protein complexes, interactions, modifications, and the mechanisms of cell signaling (
1–
5). Although most of these applications do not require the analysis of entire proteomes, even comprehensive “expression proteomics” is becoming a realistic proposition—at least in the sense of quantifying peptides for all the gene products expressed in a given cellular state (
6).
Notwithstanding these successes, an intrinsic challenge in MS-based proteomics remains the large “dynamic range” of protein abundance levels; at least four orders of magnitude in yeast (
7,
8) and even larger in human cells. In the standard “shotgun” proteomics strategy the enzymatic digestion of proteins to peptides followed by liquid chromatography tandem mass spectrometry (LC MS/MS)
1 further compounds the complexity and dynamic range challenges (
9,
10). For in-depth analysis of very complex mixtures such as those represented in total cell lysates, at least one step of protein or peptide fractionation is therefore always employed before LC MS/MS. However, each additional fractionation step is accompanied by corresponding increases in the required starting material and in the required measurement time. Furthermore, because of the very high sensitivity of modern mass spectrometers, peptides and proteins can easily be found in several adjacent biochemical fractions, diminishing the contribution of classical biochemical fractionation to achieving deep coverage of the proteome. In contrast, LC is in principle capable of very high separation power (
11). Joergensen and coworkers pioneered the use of small, one micrometer-sized chromatographic particles, which increase chromatographic resolution (
12). However, the backpressure in LC strongly depends on the size of these particles and these small particle sizes required ultrahigh pressure LC systems. Smith and coworkers similarly constructed very high pressure systems and coupled them to three-dimensional ion traps as well as toFourier transform-ion cyclotron resonance instruments with very high field strength (
13,
14). Using columns up to 2 m in length, they reported identification of about 2000 proteins of
Shewanella oneidensis in 12h gradients and demonstrated 15 attomole sensitivity for bovine serum albumin. Waters Corporation, along with several other companies, have commercially introduced high-pressure LC systems (UPLC for ultra high pressure chromatography). They reported that UPLC enabled the use of small (sub two-micrometer) beads and extended column lengths, which increased chromatographic resolution (
15). Yates and coworkers described an LC/LC peptide separation system with extended column length of 50 cm, which led to 30% increase in protein identification compared with the previous set-up (
16).
Monolithic columns offer a somewhat different approach to obtain high separation capacity, which does not necessitate as high a backpressure. Very recently, Ishihama and coworkers measured the
E. coli proteome in triplicate 41 h gradients on a 350 cm monolithic column and identified more than 2500 proteins (
17). Remarkably, this number slightly exceeded the transcriptome detected on microarrays in the same system, suggesting that very high coverage of the proteome had been achieved. Furthermore, these researchers reported a fivefold enhanced total peptide signal compared with standard columns typically used in shotgun proteomics, which they attributed to reduced peptide suppression in electrospray in their system.
Most of the above reports used very specialized equipment not routinely employed in proteomics. Furthermore, in just the last few years the resolution, mass accuracy and sequencing speed of modern mass spectrometers have increased dramatically (
18,
19) and there have been corresponding advances in computational proteomics. We therefore set out to investigate the combined capabilities of a high resolution chromatographic system with a state of the art MS and computational proteomics workflow. We employed small particles, long columns and long and shallow gradients using standard HPLC pumps to answer the conceptual question of whether or not extensive fractionation was necessary to characterize a large part of the proteome. We used the yeast model system as well as a human cell line to judge the depth and the usefulness of the achieved proteome coverage against the comprehensive yeast proteome (
6), a recent study designed to identify yeast proteins expressed at very low levels (
20) and, in a bioinformatic approach, by the coverage of cellular pathways and processes.