Genetic affinities of Helicobacter pylori isolates from ethnic Arabs in Kuwait

Helicobacter pylori is one of the most genetically diverse of bacterial species, and since the 5'-end of cagA gene and the middle allele of vacA gene of H. pylori from different populations exhibit considerable polymorphisms, these sequence diversities were used to gain insights into the genetic affinities of this gastric pathogen from different populations. Because the genetic affinity of Arab strains from the Arabian Gulf is not known, we carried out genetic analysis based on sequence diversities of the cagA and the vacA genes of H. pylori from 9 ethnic Arabs in Kuwait. The analysis showed that the Kuwaiti isolates are closely related to the Indo-European group of strains, although some strains have a tendency to form a separate cluster close to the Indo- European group, but clearly distinct from East Asian strains. However, these results need to be confirmed by analyses of neutral markers (house-keeping genes in a multi-locus sequence typing [MLST]) platform. The profiling of virulence-associated genes may have resulted from ecologically distinct populations due to human migration and geographical separation over long periods of time.

used to deduce the genetic relationships among strains from East Asia, South Asia, South America, Africa, Europe and USA [8,9]. However, there are no data on the genetic affinity of Arab strains from the Arabian Gulf Region including those from Kuwait, although some data exist on their genotypes [10,11]. In the current study, we analysed the genetic relationship of Kuwaiti strains with strains from other ethnic world populations. In order to investigate the gene pool diversity of Kuwaiti isolates of H. pylori, we sequenced putative virulence genes cagA and vacA, whose allele frequency have been shown to vary by ethnic group [12].
H. pylori isolates were cultured from the gastric biopsies of dyspeptic ethnic Arab Kuwaiti patients in a previous study on antibiotic susceptibility conducted during 2003-2005 [13]. The isolates were stored in brain heart infusion broth (Difco Laboratories, Detroit, Michigan, USA) with 15% glycerol at -70°C. The isolates were grown microaerobically on brain heart infusion agar supplemented with sheep blood, IsoVitalex and Dent supplement as described previously [13]. Chromosomal DNA was extracted by the CTAB (hexadecyl-trimethyl ammonium bromide) method [13] from culture derived from a single colony. A 219 bp fragment from the 5'-end of the cagA was amplified by PCR with the primers, cagA5 and cagA2 [9]. PCR was carried out in a 50-μl volume using 10 ng DNA, 25 μl of HotStarTaq Master Mix (catalogue number 203445, Qiagen, Valencia, California, USA), 10 pmol of each primer, for 40 cycles using the following conditions: 95°C for 60 s, 55°C for 60 s, and 72°C for 90 s. A 700 bp middle region of the vacA gene was amplified with the primers, VAm-F and VAm-R [9]. The concentrations of various PCR components were the same as in the cagA PCR, but amplification was done in 32 cycles with the following conditions: 94°C for 40 s, 55°C for 40 s, and 72°C for 70 s. The m1 (567 bp) or m2 (642 bp) allele of the middle region of the vacA gene was determined by PCR with the primers, VAG-F and VAG-R [14]. The concentration of PCR ingredients were the same as the other two PCRs above, but amplification was done in 35 cycles with the following conditions: 94°C for 60 s, 55°C for 60 s, and 72°C for 60 s. The PCR products were visualised by agarose gel electrophoresis after staining with ethidium bromide and examination under ultraviolet light. The PCR products were purified by High Pure PCR product purification kit (Roche Diagnostics, Indianapolis, Indiana, USA) according to manufacturer's instructions and directly sequenced. DNA sequencing was performed by the dideoxynucleotide chain termination method with an ABI PRISM BigDye Termination Cycle Sequencing Ready Reaction kit (Perkin-Elmer-Applied Biosystems, Forster City, California, USA) in an ABI PRISM automated sequencer. The DNA sequences were manually edited and compared with the selected sequences in the public database http://www.ncbi.nlm.nih.gov/nucleotide/ with the PHYLIP programme of J. Felsenstein http://evolution .genetics.washington.edu/phylip.html [15]. The genetic trees were generated using DDBJ programme http:// www.ddbj.nig.ac.jp and visualised using Treeview programme (version1.61; http://taxonomy.zoology.gla.a c.uk/rod/treeview.html). The lengths of the cagA gene and the vacA gene used in the analyses were 209 bp and 620 bp (from the 700 bp amplified region) respectively.
In the previous study on the antibiotic susceptibility of H. pylori strains conducted in Al-Adan Hospital in Kuwait, cultures were available from 39 ethnic Arab Kuwaiti patients [13]. Of these cultures, 16 were amplified by the cagA gene primers, 21 by the vacA gene primers and 9 by primers for both the genes. Our study attempted to analyze the genomes of the 9 Kuwaiti isolates in defining the evolutionary history of these strains. The isolate numbers are: 7, 16, 21, 46, 202, 220, 43, 25, and 42675. The sequence obtained during this study have been assigned the following accession numbers [Gen-Bank:GU212658 to GenBank:GU212666] (cagA 5'-end), [GenBank:GU212667 to GenBank:GU212669 and Gen-Bank:GU441847] (vacA m1 allele) and [GenBank:GU441 844, GenBank:GU441846, GenBank:GU441848 to Gen-Bank:GU441850] (vacA m2 allele).
To assess the genetic relationship between the strains isolated from Kuwait and those from other regions, we focused first on a segment near the 5' end of the cagA gene that had been used to distinguish East Asian and European strains [16]. This segment was amplified by PCR from nine Kuwaiti strains and sequenced directly. The sequences obtained were closely related to one another (97-99% match) and also to those from Western strains (91-99% DNA sequence match), essentially the Indo-European cluster, but only 77-86% related to those from Chinese and Japanese (East Asian) strains (Fig. 1). It was of interest to note that among the nine strains analysed from Kuwait, four of them (strains 7, 16, 202 and 42675) formed a somewhat separate cluster and were closely related to one another (99-100% match). These 4 strains were 92-93% related to the rest five Kuwaiti strains. Formal classification of these strains would require analysis from a larger number of strains from this region. DNA sequencing analysis of a 0.7-kb PCR fragment containing the vacA middle region from various parts of the world revealed the existence of 3 distinct groups (Fig. 2). The sequence analysis result showed that Kuwaiti isolates were 86-96% related to one another. Among the three isolates, one isolate (strain 43) was closely related to (96-99% match) Indian cluster, while the other two isolates (strain 46 and strain 4675) were 93-95% related to ethnic European strains and were 85-86% related to the East Asian cluster. This shows that strains isolated from Kuwait have an mixed gene pool. However, genetic analysis of the other five strains with vacAm2 genotype showed that they were much more scattered and related to the strains from Germany and India, even though the strain 25 was somewhat distant in contrast to the East Asian strains, which were much more congruent (Fig. 3).
cagA and vacA genes are virulence genes under selective pressure and hence subjected to sequence variation. Therefore, the data presented in this study need to be confirmed by sequence analyses of neutral genes such as house-keeping genes used in multi-locus sequence typing (MLST). Such analyses have been carried out for H. pylori isolates from various ethnic groups in India [17] and Iran [18]. These studies showed that isolates in these countries essentially belonged to the population of hpEurope, though isolates from various ethnic groups belonged to distinct subpopulation groups. Also, our conclusion of mixed gene in Kuwaiti isolates should be confirmed by Bayesian statistics.
The Arab population in the Middle East is a distinct ethnic race. Although the region is geographically separate, it established contacts with other regions of the world through colonisation and trade hundreds of years ago. Our study has shown that the strains that infect ethnic Arab Kuwaiti population is closely related to the Indo-European strains although some strains have a tendency to form a separate cluster close to the Indo-Euro-  1-8 and 10) or reported by others. The test strains determined in this study are marked in black bold face. Sequences other than those for the test strains are from public database, as indicated. The tree was generated using DDBJ programme. Each number in this figure indicates the cagA gene sequence from a given strain, as follows (GenBank accession numbers in parentheses). The strains used are as follows.   strains 1-4) or reported by others. The test strains determined in this study are marked in black bold face. Sequences other than those for the test strains are from public database, as indicated. The tree was generated using DDBJ program. Each number in this figure indicates the vacAm1 sequence from a given strain, as follows (GenBank accession numbers in parentheses). The strains used are as follows 1   pean group but far away from East Asian strains. This suggests a common ancestry of strains circulating in Kuwait, India and the European countries. The ancestry of these strains is different from that of the strains circulating in the Far East since they formed a distinct cluster. Human migration and geographical separation over long periods of time may have resulted in ecologically distinct populations of H. pylori infecting individuals in different continents. The profiling of virulence-associated genes like cagA and vacA provide examples in which evolutionary dynamics might have been shaped by local differences in environmental conditions and host physiology and may have resulted in ecologically distinct populations due to human migration and geographical separation over long periods of time. The remarkable differences found to-date between various Asian and European H. pylori populations in at least a few loci along with the present data from Kuwaiti strains should encourage further analyses of strains from relatively understudied geographic areas and human ethnic groups like Middle Eastern regions where human migration and geographical separation over long periods of time are prominent. Such "geographic genomics" may unearth new genes that affect human infection, increase our understanding of bacterium-host interactions in colonisation and disease, and impart new insights into the evolution of this diverse and globally distributed human pathogen.