Helicobacter pylori is a Gram-negative spiral-shaped bacterium that persistently colonizes the human stomach [
1]. Persistent
H. pylori colonization of the human stomach is a risk factor for several diseases, including non-cardia gastric adenocarcinoma, gastric lymphoma, and peptic ulceration [
1,
2]. The incidence of these diseases varies considerably throughout the world. For example, the incidence of gastric adenocarcinoma is substantially higher in East Asia, Central America, and South America than in most other parts of the world [
3].
H. pylori isolates from unrelated humans exhibit a high level of genetic diversity [
4,
5]. Genetic variation is readily detectable by analyzing the nucleotide sequences of individual genes in different
H. pylori strains [
6].
H. pylori allelic diversity is probably the consequence of multiple factors, including a high rate of mutation, a high rate of intraspecies genetic recombination, and a long evolutionary history of the species [
4,
7]. Corresponding alleles in different
H. pylori strains typically are 92 to 99% identical in nucleotide sequences [
4,
6], but several
H. pylori genes exhibit a much higher level of genetic diversity [
8,
9].
Further analyses have shown that there is geographic variation among
H. pylori strains [
10-
16]. Based on multilocus sequence analysis of a panel of 370
H. pylori strains isolated from humans in different parts of the world, seven populations of strains with distinct geographic distributions have been identified [
17]. These
H. pylori populations reflect the migration of humans from Africa to other parts of the world over a time period estimated to be approximately 58,000 years [
12]. Geographic differences among
H. pylori strains could potentially be a factor that helps to explain the varying incidence of
H. pylori-associated diseases in various parts of the world.
In addition to variation among
H. pylori strains in the sequences of individual genes, there is considerable variation among strains in gene content. One study analyzed genomic DNA from 56 different
H. pylori strains using array hybridization methods and identified 1150 genes that were present in all of the strains tested (thus representing a "core" genome) [
18]. Among 1531 genes analyzed, 25% were absent from at least one of the 56
H. pylori strains. It was predicted that the
H. pylori core genome would consist of 1,111 genes if a much larger set of isolates were tested [
18]. Other studies have reported the existence of core genomes comprising 1091 or 1281 genes, based on DNA array analysis of 34 or 15
H. pylori strains, respectively [
19,
20]. One study reported that the phylogeny of
H. pylori strains based on MLST analysis was substantially different from the phylogeny of
H. pylori strains based on analysis of gene content [
18].
One of the most striking differences in gene content among
H. pylori strains is the presence or absence of a 40-kb region of chromosomal DNA known as the
cag pathogenicity island (PAI) [
8,
21-
24]. In the United States and Europe, about 50–60% of
H. pylori strains contain the
cag PAI and the remaining strains lack this region of the chromosome [
8,
21-
24]. In many other parts of the world, including East Asia, nearly all
H.
pylori strains contain the
cag PAI [
15,
25,
26]. The
H. pylori cag PAI encodes an effector protein, CagA, and a type IV secretion apparatus that translocates CagA into gastric epithelial cells [
27].
H. pylori strains harboring the
cag PAI are associated with an increased risk of non-cardia gastric cancer or peptic ulcer disease compared to strains that lack the
cag PAI [
21,
28]. The correlation between these diseases and presence of the
cag PAI provides an example of how the clinical outcome of
H. pylori infection is determined in part by genetic characteristics of the strains with which a person is infected.
In previous studies, the complete genomes of three
H. pylori strains have been analyzed [
29-
31]. These three
H. pylori strains were isolated from patients who had gastritis, atrophic gastritis, or duodenal ulcer disease. In the current study, we sought to analyze genetic features of
H. pylori strains isolated from patients with two different
H. pylori-associated diseases: gastric ulcer and gastric cancer. For this analysis, we selected a gastric ulcer strain (B128) that readily colonizes the stomachs of mice and Mongolian gerbils. This strain is of particular interest because an animal-passaged derivative of strain B128 (strain 7.13) causes gastric cancer in a Mongolian gerbil model [
32,
33]. For an analysis of a gastric cancer-associated
H. pylori strain, we selected strain 98-10, which was isolated from a gastric cancer patient in Japan [
34], a country with a very high incidence of gastric cancer [
3,
35].