As described above, a single αHL ion channel can separate single molecules based on their
size, and easily resolves a mixture of PEGs to better than the size of a single monomer unit
(
i.e., < 44 g/mol)
16,18,22. This high resolution arises from
the interactions between the PEG polymer, the electrolytes (mobile cations), and amino acid
side chains that line the αHL channel's lumen
16. These interactions allow the
pore to be used as a nanometer-scale sensor that is specific to the size, charge and
chemical property of an analyte.
Here, such analysis is extended to PEGs with different chemical groups on either terminus.
The single channel ionic current recording in (top and lower
left) illustrates the blockades caused by the four different sized
coumarin-PEGn-NH2 molecules, one at a time. As with
unmodified PEG, each of the current blockades is unimodal (i.e., described well with
Gaussian distributions and well-defined mean values).
To accurately discriminate between the four bases (A, C, G and T) for strand nanopore
sequencing, one or more of the following strategies need to be adopted: (1) enhance and
differentiate the strength of the detection signals; (2) develop an effective method to
discern and process the electronic blockade signals generated; (3) control the translocation
rate of nucleic acids through the pore, e.g., by slowing down DNA movement; and (4)
design and make new and more effective synthetic nanopores. As we demonstrated here, the
Nano-SBS approach has transformed the problem of resolving the 4 individual bases to that of
discriminating among 4 large well-differentiated tags, which essentially solves the first
three problems.
DNA sequencing by synthesis is the dominant platform for genomics research and personalized
medicine
29,30,31,32,33. Kumar et al. first reported the modification of
nucleoside-5′-triphosphates, either by introducing more phosphate groups to produce tetra-
and penta-phosphates and introducing fluorophores directly to the terminal phosphate or
attaching a linker between the terminal phosphate and the fluorophore
24,25.
Tetra- and penta-phosphates were shown to be better DNA polymerase substrates, and
fluorophore-labeled phosphate nucleotides have been used widely for DNA sequencing
26,27,28,34. Here, we have demonstrated a novel approach to enhance
discrimination of the four nucleotides by modifying them at the terminal phosphate moiety
with distinct large chemical tags for single molecule electronic SBS. The physical and
chemical properties of the tag can be further adjusted to optimize the nanopore capture
efficiency and measurement accuracy. For instance, the insertion of a positively charged
linker consisting of four lysines or arginines between the polyphosphate and the PEG will
produce precursors with a neutral charge and released tags with a net positive charge. Using
the appropriate magnitude and sign of the potential
23, the released tags, but
not nucleotide substrates, will be transported through the pore.
The coumarin moiety on the tagged nucleotides can be replaced with other molecules of
larger size or different charge to further enhance nanopore discrimination. Clearly, it is
important that every tag released in a polymerase reaction be maintained in the proper order
for real-time single molecule Nano-SBS. Despite all these precautions, some unreacted
nucleotide analogs might enter the pore. Thus, the ability to discriminate between cleaved
tags and unreacted nucleotide analogs will be important; fortunately, these two groups of
tags should be easily differentiated by a nanopore due to their significant size and charge
differences. In addition, it has not escaped us that the tagged nucleotide Nano-SBS approach
can be implemented in a straightforward way by adding the four nucleotides (A, C, G and T)
labeled with identical tags on the 5′-phosphate in a stepwise fashion to reduce the overall
complexity of the system, analogous to pyrosequencing
30 and the Ion Torrent
approach
33. However, unlike those methods, the Nano-SBS approach has the
advantage of single molecule sensitivity without the requirement for DNA amplification, and
hence no issues with sequencing through homopolymeric regions, since tags released at each
position of the homopolymer are detected discretely by the nanopore at single-molecule
level.
The single molecule electronic Nano-SBS approach described here should be applicable to
either protein nanopores (
e.g., αHL;
Mycobacterium smegmatis porin A,
MspA)
35, or solid-state nanopores
36,37,38,39,40,41,42.
These options will provide nanopores with different properties that are appropriate for
detecting a library of tags. To implement this novel strategy for DNA sequencing, an array
of nanopores
43 can be constructed on a planar surface to facilitate massively
parallel DNA sequencing.
In conclusion, we have conducted proof-of-principle studies for a novel single molecule
electronic Nano-SBS platform that will measure the tags released from the nucleotide
substrates during the polymerase reaction, for sequence determination. In its full
implementation in the future, it should be capable of long, accurate reads, and potentially
offer very high throughput electronic single molecule DNA sequencing.