Skip to main content

Genomes of Vibrio metoecus co-isolated with Vibrio cholerae extend our understanding of differences between these closely related species



Vibrio cholerae, the causative agent of cholera, is a well-studied species, whereas Vibrio metoecus is a recently described close relative that is also associated with human infections. The availability of V. metoecus genomes provides further insight into its genetic differences from V. cholerae. Additionally, both species have been co-isolated from a cholera-free brackish coastal pond and have been suggested to interact with each other by horizontal gene transfer (HGT).


The genomes of 17 strains from each species were sequenced. All strains share a large core genome (2675 gene families) and very few genes are unique to each species (< 3% of the pan-genome of both species). This led to the identification of potential molecular markers—for nitrite reduction, as well as peptidase and rhodanese activities—to further distinguish V. metoecus from V. cholerae. Interspecies HGT events were inferred in 21% of the core genes and 45% of the accessory genes. A directional bias in gene transfer events was found in the core genome, where V. metoecus was a recipient of three times (75%) more genes from V. cholerae than it was a donor (25%).


V. metoecus was misclassified as an atypical variant of V. cholerae due to their resemblance in a majority of biochemical characteristics. More distinguishing phenotypic assays can be developed based on the discovery of potential gene markers to avoid any future misclassifications. Furthermore, differences in relative abundance or seasonality were observed between the species and could contribute to the bias in directionality of HGT.


Vibrio cholerae is the etiological agent of the potent diarrheal disease cholera, responsible for 1.2–4.3 million infections and 28,000–142,000 deaths worldwide every year [1]. All seven cholera pandemics throughout history were caused by either the Classical or El Tor biotype. Both biotypes were derived from a single genetic lineage, termed the pandemic generating lineage, within this extremely diverse species harboring more than 200 serogroups [2,3,4]. However, the majority of environmental V. cholerae isolates are actually non-toxigenic [5]. V. cholerae is a readily-culturable and genome-sequenced organism with close to 1700 whole-genome sequences available in the NCBI Microbial Genomes database [6] as of October 2022. Multiple comparative genomic studies have been performed with V. cholerae genomes to determine the population structure and genetic diversity of the species, with the focus mainly on clinical isolates [7,8,9,10,11,12,13,14]. Comparative genomics has been proven to be a useful tool in elucidating the tempo and mode of evolution of pathogenic V. cholerae, applied in the analyses of cholera outbreaks in Haiti in 2010 [7,8,9,10, 14] and Yemen in 2016 [11, 14], two of the largest cholera outbreaks in recent history.

By contrast, Vibrio metoecus is a recently described species and one of the closest known relatives of V. cholerae [15, 16] that remains poorly understood. It was initially described as an atypical variant of V. cholerae [17] as it appears as yellow V. cholerae-like cells on thiosulfate citrate bile sucrose (TCBS) agar; shares a high 16S rRNA gene sequence identity (> 98%) with V. cholerae reference genomes; and resembles V. cholerae in the majority of biochemical characteristics. However, it is negative for the production of acetoin (Voges-Proskauer assay), amylase, and lipase [15, 17, 18]. It was further determined that V. metoecus is able to grow using D-glucuronic acid or N-acetyl-D-galactosamine as sole carbon sources, both of which are distinguishing characteristics for the species [15]. So far, clinical and environmental strains had been isolated mostly in the USA [15, 18,19,20], and two environmental singletons had been isolated in Europe (Italy and Spain) [21, 22]. As of October 2022, genomes of only about 30 strains are available in the NCBI Microbial Genomes database [6], including the 12 sequenced in this study. Here, the availability of additional V. metoecus genomes provides more insight on the biology of the species and possible genetic differences from V. cholerae that can be useful in taxonomic identification to avoid misclassification.

Multiple strains of both V. cholerae and V. metoecus have been co-isolated from a cholera-free, brackish coastal pond in the US East Coast (Oyster Pond, Falmouth, Massachusetts, USA) [12, 13, 19]. The integron gene cassettes of geographically co-occurring V. cholerae and V. metoecus are more similar than geographically distinct V. cholerae (i.e., between isolates from the USA and Bangladesh) [19]. Thus, their co-isolation has led us to hypothesize that both species are likely in constant interaction with each other, providing opportunities for gene exchange by horizontal gene transfer (HGT). In this study, the genomes of 17 strains from each species originating from the same pond were screened for interspecies HGT events. Results suggest that up to three times as many genes were found to have moved from V. cholerae to V. metoecus than vice versa. From a previous study, it has been found that in this sampling site (Oyster Pond), V. cholerae is more abundant than V. metoecus, with an average of three times as many cells in a water sample [23]. V. metoecus environmental population has a strong seasonal bias in abundance, only being able to be significantly observed at the end of the summer (i.e., August to September) in Oyster Pond, while V. cholerae is always present throughout the entire summer (i.e., June to September) [23]. We hypothesize that these differences in relative abundance and/or seasonality could be contributing factors to the bias in HGT directionality from V. cholerae to V. metoecus.


Isolates of V. cholerae and V. metoecus used in this study

Environmental strains of V. cholerae and V. metoecus were isolated from Oyster Pond, Falmouth, Massachusetts, USA (41°32′31.8″N 70°38′19.9″W) on August 24 and September 19, 2009, as previously described [13]. In brief, three 100-L water samples were collected on each sampling date and sequentially filtered (63-, 5-, 1-, 0.22-μm filters). Filters were then placed on TCBS agar (Becton Dickinson) and incubated overnight at 37 °C. Yellow colonies (indicative of Vibrio species) were picked from the TCBS plates and streaked on tryptic soy agar (TSA) (Becton Dickinson) with 1.0% NaCl (BDH) and incubated overnight. Cultures in tryptic soy broth (TSB) (Becton Dickinson) were incubated overnight with shaking at 200 rpm. Preliminary identification of strains was subsequently performed by multilocus sequence analysis (MLSA) using seven housekeeping genes (gppA, intI, mdh, mutS, pgi, plsX, recA), as previously described [13].

Genomic DNA extraction and sequencing

Whole-genome sequences were obtained from 17 strains each of V. cholerae and V. metoecus (Table 1). The genomes of seven strains (two V. cholerae and five V. metoecus) were sequenced previously [15, 20]. For the remaining 27 strains, genomic DNA was extracted from overnight TSB cultures using the DNeasy Blood and Tissue Kit (QIAGEN) and quantified using the Quant-iT PicoGreen dsDNA Assay Kit (Molecular Probes) and the Synergy H1 microplate reader (BioTek). A paired-end library was constructed using the Nextera XT DNA Library Preparation Kit (Illumina). Whole-genome sequencing was performed on the Illumina NextSeq platform using NextSeq 500/550 High Output Kit v2 (for 300 cycles), generating 150-bp paired-end reads. The CLC Genomics Workbench 7.5.2 [24] was used for raw read quality control and de novo assembly. Raw reads were first filtered using the following parameters: quality score limit = 0.05, maximum number of ambiguous nucleotides = 0, discard reads below length = 15. De novo assembly was then performed using the following parameters: word size = 45, bubble size = 98, minimum contig length = 1000, mismatch cost = 2, insertion cost = 3, deletion cost = 3, length fraction = 0.9, similarity fraction = 0.96. The resulting draft genomes were annotated with the RAST Server 2.0 using the Classic RAST annotation scheme, RAST gene caller, and FIGfams release 70 [25].

Table 1 Genome sequences of V. cholerae and V. metoecus Oyster Pond isolates used in this study

Completeness of genomes was assessed by determining the presence or absence of a subset of core housekeeping genes using BLAST+ 2.5.0 [26]. Completeness in this context was reported as the percentage of 104 single-copy core housekeeping genes present in each genome (Additional file 1). The set of core genes used was modified from Luo and Moran [27] with the addition of genes used for MLSA of the genus Vibrio [15, 28,29,30,31]. The amino acid sequences of the 104 genes from V. cholerae N16961 (GenBank accession nos. NC_002505.1 and NC_002506.1) were used as reference [32]. Putative homologues were decided by calculating the BLAST score ratio (BSR) [33] between reference and query genes and considered if the BSR values were at least 0.3 [34].

Comparative genomic analyses

The 34 V. cholerae and V. metoecus genomes were aligned using Mugsy 1.2.3 [35]. Locally collinear blocks less than 500 bp were removed using Galaxy 16.04 [36], and alignment positions with at least one gap were then stripped using Geneious 8.1.2 [37]. The resulting alignment with a total length of 2,801,207 bp was used to reconstruct a maximum-likelihood phylogenetic tree using RAxML 8.0.19 [38] with the general time reversible nucleotide substitution model and gamma model of rate heterogeneity. Robustness of branches was assessed with 100 bootstrap replicates.

Orthologous protein-coding gene families were determined from the annotated genomes using the BPGA tool 1.3.0 [39] with a cutoff of 30% amino acid identity [34]. BPGA was also used to determine the core (100% present in all strains), accessory (present in some strains), and unique (present in one strain only) gene families. Furthermore, species-specific gene families were identified using Intella 1.7.0 [40]. The functions of the gene families were predicted using WebMGA [41] and eggNOG-mapper 2.1.8 [42], based on the Clusters of Orthologous Groups (COG) of proteins database [43]. Gene content was further analyzed using the STRING database 11.5 [44] to elucidate presence or absence of pathways.

To visualize genome comparisons, a BLAST atlas was constructed using GView 1.7 [45] with the V. cholerae N16961 genome as reference [32]. The annotated genomes were also scanned for the presence or absence of major genomic islands of V. cholerae N16961 [32, 46], as well as known virulence factors of the genus Vibrio, available from the Virulence Factor Database (VFDB) [47, 48]. Putative homologues were determined by calculating the BSR [33], as described above.

For every gene family, nucleotide sequences were aligned with ClustalW 2.1 [49]. A maximum-likelihood tree was then reconstructed for each gene alignment using RAxML 8.0.19 [38], as described above. Interspecies gene transfer events were determined and quantified by tree topology comparisons, as described previously [20, 50]. Briefly, the trees were partitioned into clades and visually inspected to determine whether the clades were perfect or not. Following the definition by Schliep and colleagues [50], a perfect clade is a partition that is both complete and homogeneous for a given taxonomic category (e.g., a clade with all V. cholerae members and only V. cholerae). At least one gene transfer event was hypothesized if a tree showed perfect clades for neither V. cholerae nor V. metoecus (i.e., in a rooted tree, V. cholerae and V. metoecus are both polyphyletic). The direction of transfer was then inferred if within a clade of one species (the donor) there was a strain from the other species (the recipient), provided the clustering has robust bootstrap support of at least 70% [51]. For example, a gene transfer from V. cholerae to V. metoecus is inferred if a strain of V. metoecus clusters within the V. cholerae clade in that gene tree.

Statistical analysis of HGT events

Statistical analyses of total HGT events between V. cholerae and V. metoecus were assessed separately for the core and accessory gene datasets using QI Macros 2018 [52]. An Anderson–Darling test for normality was performed for each grouping. Accordingly, the accessory gene dataset used t-test assuming equivalent variances, while the non-parametric Mann–Whitney–Wilcoxon test was used on the core gene dataset. Differences were acknowledged as statistically significant at p-values < 0.05.

Quality assurance

To obtain a pure culture, each strain was repeatedly subcultured (three times) on TSA while ensuring appropriate aseptic techniques. A single colony was then inoculated into TSB for overnight culturing and for use in genomic DNA extraction. Only high-quality genomes that were complete (100%) or near complete (99%) were used in this study (Table 1) based on our completeness assessment (see above). After preliminary identification of strains by MLSA (see above), species identities of the sequenced genomes were verified to be V. cholerae or V. metoecus by determining the average nucleotide identity (ANI) against known reference/type strain genomes, V. cholerae N16961 or V. metoecus OP3H (JJMN00000000), and against each other. Comparisons among V. cholerae genomes exhibited 98–100% ANI. Comparisons among V. metoecus genomes exhibited 97–100% ANI. These are well above the 95% ANI cutoff for genomes to belong to the same species [53]. On the other hand, comparisons between V. cholerae and V. metoecus genomes resulted in 86–87% ANI only.

Results and discussion

Genes exclusively present in V. cholerae or V. metoecus are potential phenotypic markers for species identification

To obtain a clearer picture of genetic differences between V. cholerae and V. metoecus, we sequenced genomes of strains from both species co-isolated from Oyster Pond, a cholera-free environment in the US East Coast [13, 19]. The rarity of V. metoecus limited the size of the dataset, because only 17 V. metoecus isolates with high quality genome sequences could be obtained (5 from previous studies [15, 20] and 12 in this study). An equal number of V. cholerae genomes of strains isolated from the same location were therefore selected (2 from a previous study [20] and 15 from this study), matching the genetic diversity of the V. metoecus genomes based on clonal complex or sequence type previously determined by MLSA [13] to avoid bias (Table 1). A whole-genome-based phylogenetic tree clearly clustered our 17 V. cholerae and 17 V. metoecus isolates in two distinct clades with robust bootstrap support (Fig. 1).

Fig. 1
figure 1

The phylogenetic relationship of V. cholerae and V. metoecus. The phylogeny was reconstructed from a core-genome alignment of ~ 2.8 Mb in length. Bootstrap values are indicated on the nodes. The scale bar represents 0.02 nucleotide substitutions per site. Parallel lines indicate shortened branch lengths, approximately 5× the scale bar. Dots after the strain names indicate genomes sequenced in this study

Subsequently, the pan-genome of the two species was categorized into either core or species-specific gene families (Fig. 2). With this larger dataset, metabolic differences between the species could be found, such as the ability of V. metoecus to use D-glucuronic acid and N-acetyl-D-galactosamine as sole carbon sources [15] (COG3250). These glycosaminoglycans are present in the extracellular matrix or attached to the membrane of eukaryotic cells. Pathogenic bacteria express various adhesins to bind to these structures prior to colonization and infection, as in pulmonary infections by Bordetella pertussis and Mycobacterium tuberculosis and urogenital infections by Chlamydia trachomatis and Neisseria gonorrhoeae [54].

Fig. 2
figure 2

Distribution of core genes and species-specific genes between V. cholerae and V. metoecus. The Venn diagram shows the number of genes shared between V. cholerae and V. metoecus or present within each species. Top numbers indicate total gene families in the pan-genome of both species. Bottom numbers (in bold and parentheses) indicate gene families exclusively present in all strains of V. cholerae, V. metoecus, or both

Another striking difference between the species is in nitrogen metabolism. Although both species display pathways for nitrate reduction, only V. metoecus harbors enzymes used for nitrite reduction to ammonia (COG3301, COG3303) (Additional file 2). Nitrite reductases have also been implicated in virulence [55]. For example, pathogens such as M. tuberculosis, N. gonorrhoeae, and Pseudomonas aeruginosa reduce nitrite—a byproduct produced by mammals in low oxygen conditions such as in the gut—to less toxic secondary components like nitrous oxide or ammonia using nitrite reductases, giving them selective advantage to survive in the harsh environment in the host [55, 56]. This suggests that V. metoecus could also have an advantage growing under anaerobic conditions if nitrite is available [57, 58]. For its part, V. cholerae has a multisubunit Na+/H+ antiporter complex (COG1006, COG1863, COG2212, COG1320) involved in salt and pH tolerance, which could have a significant effect on its niche (Additional file 3). V. cholerae also harbors a superoxide dismutase (COG0605) that is absent in V. metoecus, which could aid with resistance to phagocytosis by predatory protists or phagocytes [59].

Two other genes unique to V. metoecus potentially suited to phenotypic assays to differentiate it from V. cholerae were identified (Additional file 2). The first is a gene encoding a rhodanese-related sulfurtransferase (COG2897). Rhodanese is widely distributed in Bacteria, Archaea, and Eukarya, and studied mainly for its role in sulfur metabolism and cyanide detoxification [60,61,62]. The direct role of bacterial rhodanese in pathogenicity, if any, is yet to be investigated. Meanwhile, studies on other model systems show that rhodanese from a ruminant nematode parasite (Haemonchus contortus) can bind to peripheral blood mononuclear cells of its mammalian host (goat) and results in a poor host immune response [63]. This finding is interesting since a clinical strain of V. metoecus was extracted from blood [20], indicating that strains of this species can be opportunistic pathogens, where rhodanese may also play a role in its pathogenicity. A test for rhodanese activity in bacteria for phenotypic differentiation using the EDTA-lysozyme cell lysis method has been developed previously [62, 64].

The second gene encodes peptidase E (PepE) (COG3340), an enzyme involved in the degradation of proteins to free amino acids [65]. In addition to direct nutrient acquisition, peptidases also play a role in pathogenicity. During invasion of host cells and tissues, gut pathogens can secrete peptidases to degrade host proteins and competitively colonize the gut [66]. However, unlike many peptidases with broad substrate specificities, PepE is unique as it has a very specific target (N-terminal aspartic dipeptides). Interestingly, PepE is not as widespread as the broad-specificity peptidases. A survey of microbial genomes for PepE homologues revealed that its distribution is mainly limited to Gammaproteobacteria (Escherichia coli, Haemophilus influenzae, Salmonella enterica sv. Typhimurium, among others) [65]. V. metoecus PepE has the closest homologs to those found in other distant vibrios (Vibrio alginolyticus, Vibrio parahaemolyticus, Vibrio vulnificus, among others). Among the more closely related members of the Cholerae clade, which includes both V. metoecus and V. cholerae [16], PepE is only found in one other species (Vibrio mimicus). The fact that PepE is not so widespread suggests that it could serve a very specialized and possibly advantageous function to organisms that encode it [65]. PepE activity can be assessed using a previously established assay [67] that can be modified specifically for vibrios.

Pathogenic potential of V. cholerae and V. metoecus from Oyster Pond

The two major virulence factors of toxigenic V. cholerae are the cholera toxin (CTX) and toxin-coregulated pilus (TCP), both acquired horizontally by phage infections [68, 69]. The cholera toxin, encoded by the ctxAB genes, is a potent enterotoxin that is responsible for the profuse watery diarrhea in cholera patients [68]. The TCP, encoded by genes of the TCP cluster, is a crucial factor to establish colonization in the host intestine [70] and serves as a receptor for the CTX phage [69]. These virulence factors are not present in any of the Oyster Pond isolates (Additional file 4), suggesting that these environmental isolates cannot produce the cholera toxin and are not capable of acquiring the genomic island that encodes it.

Nonetheless, both V. cholerae and V. metoecus encode several virulence factors such as metalloprotease (hap/vvp), thermolabile hemolysin (tlh), and cytolysin (hlyA), implicated in extra-intestinal infections (Fig. 3). For example, metalloproteases have been demonstrated in Vibrio species to degrade biologically relevant host proteins such as collagen, mucin, and gelatin [71]. Hemolysin and cytolysin are able to permeabilize cell membranes through the formation of pores and induce apoptosis in various animal cell models [72]. Iron bioavailability is strictly regulated within the host as part of an innate immune response during an infection; hemolysins can lyse red blood cells with iron-chelating proteins to supersede the restriction [72, 73]. They also encode the type VI secretion system (T6SS) for the contact-dependent killing of nearby cells (Additional file 4) [74]. The T6SS in V. cholerae and V. metoecus Oyster Pond isolates possesses a diverse and complex arrangement of effector toxins that can be lethal to bacteria and/or eukaryotes [75, 76], although it has been demonstrated previously that V. metoecus is less effective at defending against eukaryotic predation [15]. V. metoecus RC341, an environmental isolate from Chesapeake Bay, Maryland, USA, contains similar virulence factors in its genome [18]. Additionally, both V. metoecus and V. cholerae have evidence of species-specific multidrug efflux pumps (Additional files 2 and 3, respectively), indicating the potential for antibiotic resistance.

Fig. 3
figure 3

Presence/absence map of selected virulence factors in V. cholerae and V. metoecus. The table of virulence factors for the genus Vibrio was obtained and modified from the VFDB [47, 48]. Colored squares represent BSRs of at least 0.3 against reference genes (i.e., homologues of genes are present); white squares represent absence of genes. The genes involved in iron uptake are further categorized into (A) periplasmic binding protein-dependent ABC transport systems, (B) vibriobactin biosynthesis, and (C) vibriobactin utilization. Locus tags of the genes in the V. cholerae N16961 reference genome are indicated after the gene names

Multiple virulence factors are present only in the 17 V. cholerae isolates but not in V. metoecus, including genes involved in iron uptake, colonization, and toxin production (Fig. 3). Iron is an essential nutrient but is a very limited resource in the host, and this has shaped the evolution of survival strategies and bactericidal defense mechanisms of pathogens [77]. The presence of various iron uptake genes in the environmental strains of V. cholerae but not V. metoecus could facilitate survival in the intestine after exposure by ingestion. In addition, 14 V. cholerae isolates have the acfD gene, part of a cluster of four genes (acfABCD) called the accessory colonization factor also involved in the colonization of the intestine but distinct from the TCP cluster [78]. Gene acfD was not found in other non-Oyster Pond V. metoecus strains, including clinical isolates [20]. However, none of the V. cholerae nor V. metoecus Oyster Pond isolates have the acfABC genes (Additional file 4), whereas known clinical and non-Oyster Pond environmental strains of V. cholerae have the complete cluster [20]. Lastly, the RTX gene cluster is only present in V. cholerae and absent in V. metoecus. In this cluster, RtxA causes the rounding and death of eukaryotic cells by catalyzing covalent cross-linking of cellular actin through its actin-crosslinking domain [79]. This gene cluster could provide a more robust anti-eukaryotic response to V. cholerae, which has been shown to survive predation by slime molds better than V. metoecus [15].

On the other hand, neuraminidase (nanH) is present in 15 V. metoecus isolates, but only present in 6 V. cholerae (Fig. 3). In toxin-producing organisms, neuraminidase cleaves sialic acid of gangliosides on host cell surfaces, exposing the inner core sugars that then serves as receptors for the toxin [80]. In vibrios, the neuraminidase cluster is part of the Vibrio pathogenicity island – 2, a major genomic island initially identified only in toxigenic V. cholerae [81]. However, fragments of this island have been detected in environmental V. cholerae, as well as in environmental and clinical V. metoecus [20]. Specifically, an almost complete neuraminidase cluster is present in all but one V. metoecus isolates from this study (Additional file 5). Since these isolates are not toxigenic (i.e., no ctxAB genes), neuraminidase is likely used for nutrient acquisition [82].

HGT inference suggests a directional bias in gene transfer from V. cholerae to V. metoecus

It has previously been suggested that V. cholerae and V. metoecus frequently exchange genes through HGT, although with a limited dataset sourced from various geographical locations and a mixture of clinical and environmental strains [19, 20]. To verify this hypothesis, the core genome of the two species was determined for our genome dataset (Fig. 2). A maximum-likelihood tree was reconstructed for each of the 2675 core gene families. Phylogenetic trees were also reconstructed for the accessory genes. We only examined accessory gene families present in at least 17 genomes (half of the number genomes, 621 gene families in total) (Additional file 6). A gene transfer event and its directionality were inferred in a tree, whether from V. cholerae to V. metoecus or vice versa (see Methods for details on HGT inference). Out of the 2675 core gene family trees, 554 (20.7%) could not be partitioned into perfect clades (i.e., has at least one HGT event) (Fig. 4A and Additional file 7). From the 554 trees exhibiting HGT, 1368 gene transfer events were inferred, where V. metoecus was a recipient of genes from V. cholerae in 1027 (75.1%) of those transfer events, whereas V. cholerae was a recipient in only 341 (24.9%). Additionally, from the 621 trees of accessory gene families, 228 were excluded from our analysis, as they consisted of members from one species only or did not have robust bootstrap support (< 70%) to confidently infer HGT. From the remaining 393 trees, 178 (45.3%) exhibited interspecies HGT, more than twice the proportion (21%) found in the core genome (Fig. 4B and Additional file 8). Among the gene families from the accessory genome that could be analyzed, V. metoecus was a recipient in 429 out of 783 (54.8%) transfer events, while V. cholerae was a recipient in 354 (45.2%). Considering both core and accessory genomes, there is a higher number of gene transfer events from V. cholerae to V. metoecus, indicating a bias in the direction of gene transfer. This is more prominent within the core genome, where V. metoecus was three times more often the recipient of genes from V. cholerae (a statistically significant difference, p < 0.001) (Additional file 7). In contrast, HGT directionality was only 1.2× higher in favour of V. metoecus as the recipient in the accessory genome (not statistically significant, p = 0.114) (Additional file 8). The accessory genes might encode supplementary functions that are not necessarily essential for growth but may offer selective advantages, such as niche adaptation, antibiotic resistance, or host colonization [83]. On the contrary, the core genome, which contains genes essential for growth, is expected to be under strong selective pressure, limiting the extent of sequence changes and preventing gene loss [84], which could explain the lesser frequency of transfers in the core genes than accessory genes. However, the more pronounced HGT directional bias observed for the core genes as opposed to accessory genes could be linked to the fact that the core genes are more abundant since they are present in all members of the population [83].

Fig. 4
figure 4

Inferred HGT events in the core and accessory genomes. The pie charts show the number of (A) core and (B) accessory gene families with and without HGT. The bars show the number of HGT events within the gene families, showing the frequency of V. cholerae or V. metoecus as recipients

A possible explanation for the observed bias in HGT towards V. metoecus is that V. cholerae could be more refractory to gene acquisition, containing more barriers to gene uptake or less efficient DNA uptake systems. Thus, we surveyed our annotated genomes for the presence or absence of genes that are involved in the regulation of competence or provide barriers to DNA uptake. Major regulators and genes of the DNA uptake system [85,86,87] are present in all of our isolates (Additional file 9). On the other hand, some isolates of V. cholerae and V. metoecus were missing some or all genes of the restriction-modification system, an immune system in bacteria that recognize self from non-self (foreign) DNA [88]. Nuclease activity can also inhibit natural transformation [89, 90], and the genes encoding the deoxyribonucleases Dns and Xds are present in all our isolates of both species. There is therefore no clear difference in genotype that could explain the HGT bias observed.

It is hypothesized that one of the reasons for the bias in HGT directionality could be due to a higher abundance of V. cholerae than V. metoecus in the environment [20]. In Oyster Pond, V. cholerae is three times more abundant than V. metoecus on average when both species are present [23]. This higher abundance would lead to having more DNA from V. cholerae readily available for acquisition by V. metoecus, making it a decisive factor in biasing HGT directionality. Additionally, seasonal abundance patterns have been found for V. cholerae, which is more abundant in the warmer months of the year (May to September) and rarely detectable in winter [91]. Thus, it is also hypothesized that seasonality could play a role in this bias. The seasonal abundance of V. metoecus was determined previously and is much more striking than V. cholerae. The former species is essentially limited to the last 2 months of summer in Oyster Pond (August and September) [23]. This bloom is remarkable, as V. metoecus would need to compete with established V. cholerae and several other species in the same environment. It is possible that strong competition from blooming V. metoecus would release V. cholerae genetic material upon lysis, promoting integration of new genetic material in surviving competitors. The more pronounced bottlenecks in the V. metoecus population could also lead to many genes acquired from V. cholerae during the blooms being lost from the population (and therefore resulting in the directional bias observed) [92]. It is unclear if the abundance and seasonality differences observed between V. cholerae and V. metoecus in the Oyster Pond ecosystem is also present in other environments. Intensive molecular and culture-based surveys of coastal and inland water regions of Bangladesh have failed to find any V. metoecus so far despite detecting abundant V. cholerae [23], suggesting that the two organisms have a different geographical distribution and that the latter might be far more ubiquitous.


Comparison of the gene repertoires of a larger number of V. metoecus genomes obtained in this study sheds further insight into the biology of the poorly understood species, a close relative of V. cholerae. Additional genes unique to V. metoecus were identified—for nitrite reduction, rhodanese-related sulfurtransferase, and peptidase E—which could potentially be used in phenotypic tests to differentiate it from V. cholerae, as well as from more distant vibrios. Additionally, since the discovery of V. metoecus [15], two more very close relatives of V. cholerae have been described (Vibrio paracholerae [93] and Vibrio tarriae [16]), both of which have also been associated with human infections. As more strains are being identified as potentially novel sister species to V. cholerae [16, 93,94,95,96], our understanding of these species is also evolving. Therefore, an update on the diagnostic tests for V. cholerae (and close relatives) is needed to incorporate new knowledge from recently discovered sister species, which could easily be misdiagnosed as V. cholerae.

Furthermore, the co-isolation of V. metoecus and V. cholerae has provided the opportunity to study the extent of HGT between closely related species in a natural environment. Here, the correlation between the rate of HGT and the abundance of donor and recipient species is suggested. Although the direct link between abundance and rate of HGT has not been experimentally demonstrated, there is an exact correlation between these two factors in a natural environment [23], therefore warranting further investigation. It should be noted that the majority of bacteria in nature (up to 80%) are estimated to exist in biofilms [97], which form multispecies communities that facilitate resource and gene exchange, competition, cell signaling, and resistance to environmental stressors such as desiccation [98,99,100,101]. Within these complex assemblies, and in environmental reservoirs in general, HGT is expected to occur outside of the instances we have suggested between V. cholerae and V. metoecus, which may have contributed to the gene acquisitions we have observed. Interestingly, we find potential instances of interspecific HGT in examples such as V. metoecus possessing nitrite reductase genes. As denitrification occurs in many aquatic environmental reservoirs, it can be posited that the acquisition of these genes may have occurred with a donor outside of V. cholerae or V. metoecus. However, this would need to be assessed with metagenomics analyses, which is outside of the scope of this work. Nonetheless, the majority of HGT has been demonstrated to happen intraspecifically or between closely related species [102,103,104]. As both Vibrio species are closely related, and a careful genetic screen also shows that there is no reason to suspect a difference in ability between the two species to uptake DNA, we hypothesize that differential abundance is the most likely explanation for the HGT bias reported in this study. Observation of this correlation in various geographic locations would strengthen this hypothesis. If a direct relationship between relative abundance of a species in a community and its intake of foreign DNA is confirmed for these vibrios and other bacterial species, it would have major implications for the dynamics of speciation and the spread of various fitness characteristics between species.

Availability of data and materials

The genome datasets supporting the conclusions of this article are available in the GenBank repository under BioProject accession number PRJNA281423. The individual GenBank accession numbers are listed in Table 1.



Average nucleotide identity


BLAST score ratio


Clusters of Orthologous Groups


Cholera toxin


Horizontal gene transfer


Multilocus sequence analysis


Sequence type


Thiosulfate citrate bile sucrose


Toxin-coregulated pilus


Tryptic soy agar


Tryptic soy broth


Type VI secretion system


Virulence Factor Database


  1. Ali M, Lopez AL, You YA, Kim YE, Sah B, Maskery B, Clemens J. The global burden of cholera. Bull World Health Organ. 2012;90:209-218A.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Boucher Y, Orata FD, Alam M. The out-of-the-delta hypothesis: dense human populations in low-lying river deltas served as agents for the evolution of a deadly pathogen. Front Microbiol. 2015;6:1120.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Boucher Y. Sustained local diversity of Vibrio cholerae O1 biotypes in a previously cholera-free country. mBio. 2016;7:e00570-16.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Islam MT, Alam M, Boucher Y. Emergence, ecology and dispersal of the pandemic generating Vibrio cholerae lineage. Int Microbiol. 2017;20:106–15.

    CAS  PubMed  Google Scholar 

  5. Faruque SM, Albert MJ, Mekalanos JJ. Epidemiology, genetics, and ecology of toxigenic Vibrio cholerae. Microbiol Mol Biol Rev. 1998;62:1301–14.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Microbial Genomes. National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, USA. Accessed 18 Oct 2022.

  7. Katz LS, Petkau A, Beaulaurier J, Tyler S, Antonova ES, Turnsek MA, Guo Y, Wang S, Paxinos EE, Orata F, et al. Evolutionary dynamics of Vibrio cholerae O1 following a single-source introduction to Haiti. mBio. 2013;4:e00398-13.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Hendriksen RS, Price LB, Schupp JM, Gillece JD, Kaas RS, Engelthaler DM, Bortolaia V, Pearson T, Waters AE, Upadhyay BP, et al. Population genetics of Vibrio cholerae from Nepal in 2010: evidence on the origin of the Haitian outbreak. mBio. 2011;2:e00157-11.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Orata FD, Keim PS, Boucher Y. The 2010 cholera outbreak in Haiti: how science solved a controversy. PLoS Pathog. 2014;10:e1003967.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Eppinger M, Pearson T, Koenig SS, Pearson O, Hicks N, Agrawal S, Sanjar F, Galens K, Daugherty S, Crabtree J, et al. Genomic epidemiology of the Haitian cholera outbreak: a single introduction followed by rapid, extensive, and continued spread characterized the onset of the epidemic. mBio. 2014;5:e01721-14.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Weill FX, Domman D, Njamkepo E, Almesbahi AA, Naji M, Nasher SS, Rakesh A, Assiri AM, Sharma NC, Kariuki S, et al. Genomic insights into the 2016–2017 cholera epidemic in Yemen. Nature. 2019;565:230–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Kirchberger PC, Orata FD, Nasreen T, Kauffman KM, Tarr CL, Case RJ, Polz MF, Boucher YF. Culture-independent tracking of Vibrio cholerae lineages reveals complex spatiotemporal dynamics in a natural population. Environ Microbiol. 2020;22:4244–56.

    Article  CAS  PubMed  Google Scholar 

  13. Kirchberger PC, Orata FD, Barlow EJ, Kauffman KM, Case RJ, Polz MF, Boucher Y. A small number of phylogenetically distinct clonal complexes dominate a coastal Vibrio cholerae population. Appl Environ Microbiol. 2016;82:5576–86.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Liang KYH, Orata FD, Islam MT, Nasreen T, Alam M, Tarr CL, Boucher YF. A Vibrio cholerae core genome multilocus sequence typing scheme to facilitate the epidemiological study of cholera. J Bacteriol. 2020;202:e00086-20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Kirchberger PC, Turnsek M, Hunt DE, Haley BJ, Colwell RR, Polz MF, Tarr CL, Boucher Y. Vibrio metoecus sp. nov., a close relative of Vibrio cholerae isolated from coastal brackish ponds and clinical specimens. Int J Syst Evol Microbiol. 2014;64:3208–14.

    Article  PubMed  Google Scholar 

  16. Islam MT, Liang K, Orata FD, Im MS, Alam M, Lee CC, Boucher YF. Vibrio tarriae sp. nov., a novel member of the Cholerae clade. Int J Syst Evol Microbiol. 2022;72:005571.

    Article  CAS  Google Scholar 

  17. Choopun N. The population structure of Vibrio cholerae in Chesapeake Bay. PhD thesis. College Park, Maryland, USA: University of Maryland; 2004.

  18. Haley BJ, Grim CJ, Hasan NA, Choi SY, Chun J, Brettin TS, Bruce DC, Challacombe JF, Detter JC, Han CS, et al. Comparative genomic analysis reveals evidence of two novel Vibrio species closely related to V. cholerae. BMC Microbiol. 2010;10:154.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Boucher Y, Cordero OX, Takemura A, Hunt DE, Schliep K, Bapteste E, Lopez P, Tarr CL, Polz MF. Local mobile gene pools rapidly cross species boundaries to create endemicity within global Vibrio cholerae populations. mBio. 2011;2:e00335-10.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Orata FD, Kirchberger PC, Méheust R, Barlow EJ, Tarr CL, Boucher Y. The dynamics of genetic interactions between Vibrio metoecus and Vibrio cholerae, two close relatives co-occurring in the environment. Genome Biol Evol. 2015;7:2941–54.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Vezzulli L, Stauder M, Grande C, Pezzati E, Verheye HM, Owens NJ, Pruzzo C. gbpA as a novel qPCR target for the species-specific detection of Vibrio cholerae O1, O139, non-O1/non-O139 in environmental, stool, and historical continuous plankton recorder samples. PLoS ONE. 2015;10:e0123983.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Carda-Diéguez M, Ghai R, Rodriguez-Valera F, Amaro C. Wild eel microbiome reveals that skin mucus of fish could be a natural niche for aquatic mucosal pathogen evolution. Microbiome. 2017;5:162.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Nasreen T, Hussain NAS, Islam MT, Orata FD, Kirchberger PC, Case RJ, Alam M, Yanow SK, Boucher YF. Simultaneous quantification of Vibrio metoecus and Vibrio cholerae with its O1 serogroup and toxigenic subpopulations in environmental reservoirs. Pathogens. 2020;9:1053.

    Article  CAS  PubMed Central  Google Scholar 

  24. QIAGEN CLC Genomics Workbench: NGS data analysis for any species, any platform, any workflow. QIAGEN, Aarhus, Denmark. Accessed 28 Sept 2022.

  25. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, et al. The RAST server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.

    Article  CAS  PubMed  Google Scholar 

  27. Luo H, Moran MA. Evolutionary ecology of the marine Roseobacter clade. Microbiol Mol Biol Rev. 2014;78:573–87.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Thompson FL, Gevers D, Thompson CC, Dawyndt P, Naser S, Hoste B, Munn CB, Swings J. Phylogeny and molecular identification of vibrios on the basis of multilocus sequence analysis. Appl Environ Microbiol. 2005;71:5107–15.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Tarr CL, Patel JS, Puhr ND, Sowers EG, Bopp CA, Strockbine NA. Identification of Vibrio isolates by a multiplex PCR assay and rpoB sequence determination. J Clin Microbiol. 2007;45:134–40.

    Article  CAS  PubMed  Google Scholar 

  30. Orata FD, Xu Y, Gladney LM, Rishishwar L, Case RJ, Boucher Y, Jordan IK, Tarr CL. Characterization of clinical and environmental isolates of Vibrio cidicii sp. nov., a close relative of Vibrio navarrensis. Int J Syst Evol Microbiol. 2016;66:4148–55.

    Article  CAS  PubMed  Google Scholar 

  31. Gladney LM, Tarr CL. Molecular and phenotypic characterization of Vibrio navarrensis isolates associated with human illness. J Clin Microbiol. 2014;52:4070–4.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Heidelberg JF, Eisen JA, Nelson WC, Clayton RA, Gwinn ML, Dodson RJ, Haft DH, Hickey EK, Peterson JD, Umayam L, et al. DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae. Nature. 2000;406:477–83.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Rasko DA, Myers GS, Ravel J. Visualization of comparative genomic analyses by BLAST score ratio. BMC Bioinformatics. 2005;6:2.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Rost B. Twilight zone of protein sequence alignments. Protein Eng. 1999;12:85–94.

    Article  CAS  PubMed  Google Scholar 

  35. Angiuoli SV, Salzberg SL. Mugsy: fast multiple alignment of closely related whole genomes. Bioinformatics. 2011;27:334–42.

    Article  CAS  PubMed  Google Scholar 

  36. Goecks J, Nekrutenko A, Taylor J, Galaxy Team. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010;11:R86

    Article  PubMed  PubMed Central  Google Scholar 

  37. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–9.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Chaudhari NM, Gupta VK, Dutta C. BPGA- an ultra-fast pan-genome analysis pipeline. Sci Rep. 2016;6:24373.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Intella. Vound LLC, Scottsdale, Arizona, USA. Accessed 28 Sept 2022.

  41. Wu S, Zhu Z, Fu L, Niu B, Li W. WebMGA: a customizable web server for fast metagenomic sequence analysis. BMC Genomics. 2011;12:444.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Cantalapiedra CP, Hernandez-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol Biol Evol. 2021;38:5825–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Galperin MY, Wolf YI, Makarova KS, Vera Alvarez R, Landsman D, Koonin EV. COG database update: focus on microbial diversity, model organisms, and widespread pathogens. Nucleic Acids Res. 2021;49:D274–81.

    Article  CAS  PubMed  Google Scholar 

  44. Szklarczyk D, Gable AL, Nastou KC, Lyon D, Kirsch R, Pyysalo S, Doncheva NT, Legeay M, Fang T, Bork P, et al. The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 2021;49:D605–12.

    Article  CAS  PubMed  Google Scholar 

  45. Petkau A, Stuart-Edwards M, Stothard P, Van Domselaar G. Interactive microbial genome visualization with GView. Bioinformatics. 2010;26:3125–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Chun J, Grim CJ, Hasan NA, Lee JH, Choi SY, Haley BJ, Taviani E, Jeon YS, Kim DW, Lee JH, et al. Comparative genomics reveals mechanism for short-term and long-term clonal transitions in pandemic Vibrio cholerae. Proc Natl Acad Sci USA. 2009;106:15442–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Liu B, Zheng D, Zhou S, Chen L, Yang J. VFDB 2022: a general classification scheme for bacterial virulence factors. Nucleic Acids Res. 2022;50:D912–7.

    Article  CAS  PubMed  Google Scholar 

  48. Liu B, Zheng D, Zhou S, Chen L, Yang J. VFDB – pathogenesis of Vibrio. Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China. Accessed 28 Sept 2022.

  49. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–8.

    Article  CAS  PubMed  Google Scholar 

  50. Schliep K, Lopez P, Lapointe FJ, Bapteste É. Harvesting evolutionary signals in a forest of prokaryotic gene trees. Mol Biol Evol. 2011;28:1393–405.

    Article  CAS  PubMed  Google Scholar 

  51. Hillis DM, Bull JJ. An empirical test of bootstrapping as a method for assessing confidence in phylogenetic analysis. Syst Biol. 1993;42:182–92.

    Article  Google Scholar 

  52. QI Macros. KnowWare International Inc., Denver, Colorado, USA. Accessed 28 Sept 2022.

  53. Chun J, Oren A, Ventosa A, Christensen H, Arahal DR, da Costa MS, Rooney AP, Yi H, Xu XW, De Meyer S, et al. Proposed minimal standards for the use of genome data for the taxonomy of prokaryotes. Int J Syst Evol Microbiol. 2018;68:461–6.

    Article  CAS  PubMed  Google Scholar 

  54. Menozzi FD, Pethe K, Bifani P, Soncin F, Brennan MJ, Locht C. Enhanced bacterial virulence through exploitation of host glycosaminoglycans. Mol Microbiol. 2002;43:1379–86.

    Article  CAS  PubMed  Google Scholar 

  55. Castiglione N, Rinaldo S, Giardina G, Stelitano V, Cutruzzolà F. Nitrite and nitrite reductases: from molecular mechanisms to significance in human health and disease. Antioxid Redox Signal. 2012;17:684–716.

    Article  CAS  PubMed  Google Scholar 

  56. Jourd’heuil D, Grisham MB, Granger DN. Nitric oxide and the gut. Curr Gastroenterol Rep. 1999;1:384–8.

    Article  CAS  PubMed  Google Scholar 

  57. Bueno E, Sit B, Waldor MK, Cava F. Genetic dissection of the fermentative and respiratory contributions supporting Vibrio cholerae hypoxic growth. J Bacteriol. 2020;202:e00243-20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Bueno E, Sit B, Waldor MK, Cava F. Anaerobic nitrate reduction divergently governs population expansion of the enteropathogen Vibrio cholerae. Nat Microbiol. 2018;3:1346–53.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Gabbianelli R, Signoretti C, Marta I, Battistoni A, Nicolini L. Vibrio cholerae periplasmic superoxide dismutase: isolation of the gene and overexpression of the protein. J Biotechnol. 2004;109:123–30.

    Article  CAS  PubMed  Google Scholar 

  60. Cipollone R, Ascenzi P, Visca P. Common themes and variations in the rhodanese superfamily. IUBMB Life. 2007;59:51–9.

    Article  CAS  PubMed  Google Scholar 

  61. Motl N, Skiba MA, Kabil O, Smith JL, Banerjee R. Structural and biochemical analyses indicate that a bacterial persulfide dioxygenase-rhodanese fusion protein functions in sulfur assimilation. J Biol Chem. 2017;292:14026–38.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Vandenbergh PA, Bawdon RE, Berk RS. Rapid test for determining the intracellullar rhodanese activity of various bacteria. Int J Syst Bacteriol. 1979;29:339–44.

    Article  CAS  Google Scholar 

  63. Wang Y, Ehsan M, Huang J, Aimulajiang K, Yan R, Song X, Xu L, Li X. Characterization of a rhodanese homologue from Haemonchus contortus and its immune-modulatory effects on goat immune cells in vitro. Parasit Vectors. 2020;13:454.

    Article  PubMed  PubMed Central  Google Scholar 

  64. Lányi B. Rhodanese activity: a simple and reliable taxonomic tool for gram-negative bacteria. J Med Microbiol. 1982;15:263–6.

    Article  PubMed  Google Scholar 

  65. Lassy RA, Miller CG. Peptidase E, a peptidase specific for N-terminal aspartic dipeptides, is a serine hydrolase. J Bacteriol. 2000;182:2536–43.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Nguyen TTH, Myrold DD, Mueller RS. Distributions of extracellular peptidases across prokaryotic genomes reflect phylogeny and habitat. Front Microbiol. 2019;10:413.

    Article  PubMed  PubMed Central  Google Scholar 

  67. Carter TH, Miller CG. Aspartate-specific peptidases in Salmonella typhimurium: mutants deficient in peptidase E. J Bacteriol. 1984;159:453–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Waldor MK, Mekalanos JJ. Lysogenic conversion by a filamentous phage encoding cholera toxin. Science. 1996;272:1910–4.

    Article  CAS  PubMed  Google Scholar 

  69. Karaolis DK, Somara S, Maneval DR Jr, Johnson JA, Kaper JB. A bacteriophage encoding a pathogenicity island, a type-IV pilus and a phage receptor in cholera bacteria. Nature. 1999;399:375–9.

    Article  CAS  PubMed  Google Scholar 

  70. Taylor RK, Miller VL, Furlong DB, Mekalanos JJ. Use of phoA gene fusions to identify a pilus colonization factor coordinately regulated with cholera toxin. Proc Natl Acad Sci USA. 1987;84:2833–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  71. Lee JH, Ahn SH, Lee EM, Kim YO, Lee SJ, Kong IS. Characterization of the enzyme activity of an extracellular metalloprotease (VMC) from Vibrio mimicus and its C-terminal deletions. FEMS Microbiol Lett. 2003;223:293–300.

    Article  CAS  PubMed  Google Scholar 

  72. Wong SK, Zhang X-H, Woo NY. Vibrio alginolyticus thermolabile hemolysin (TLH) induces apoptosis, membrane vesiculation and necrosis in sea bream erythrocytes. Aquaculture. 2012;330:29–36.

    Article  Google Scholar 

  73. Doherty CP. Host-pathogen interactions: the role of iron. J Nutr. 2007;137:1341–4.

    Article  CAS  PubMed  Google Scholar 

  74. Joshi A, Kostiuk B, Rogers A, Teschler J, Pukatzki S, Yildiz FH. Rules of engagement: the type VI secretion system in Vibrio cholerae. Trends Microbiol. 2017;25:267–79.

    Article  CAS  PubMed  Google Scholar 

  75. Kirchberger PC, Unterweger D, Provenzano D, Pukatzki S, Boucher Y. Sequential displacement of type VI secretion system effector genes leads to evolution of diverse immunity gene arrays in Vibrio cholerae. Sci Rep. 2017;7:45133.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  76. Hussain NAS, Kirchberger PC, Case RJ, Boucher YF. Modular molecular weaponry plays a key role in competition within an environmental Vibrio cholerae population. Front Microbiol. 2021;12:671092.

    Article  PubMed  PubMed Central  Google Scholar 

  77. Schaible UE, Kaufmann SH. Iron and microbial infection. Nat Rev Microbiol. 2004;2:946–53.

    Article  CAS  PubMed  Google Scholar 

  78. Peterson KM, Mekalanos JJ. Characterization of the Vibrio cholerae ToxR regulon: identification of novel genes involved in intestinal colonization. Infect Immun. 1988;56:2822–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  79. Fullner KJ, Mekalanos JJ. In vivo covalent cross-linking of cellular actin by the Vibrio cholerae RTX toxin. EMBO J. 2000;19:5315–23.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  80. Zuverink M, Barbieri JT. Protein toxins that utilize gangliosides as host receptors. Prog Mol Biol Transl Sci. 2018;156:325–54.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  81. Jermyn WS, Boyd EF. Characterization of a novel Vibrio pathogenicity island (VPI-2) encoding neuraminidase (nanH) among toxigenic Vibrio cholerae isolates. Microbiology. 2002;148:3681–93.

    Article  CAS  PubMed  Google Scholar 

  82. Coker JK, Moyne O, Rodionov DA, Zengler K. Carbohydrates great and small, from dietary fiber to sialic acids: how glycans influence the gut microbiome and affect human health. Gut Microbes. 2021;13:1869502.

    Article  PubMed Central  Google Scholar 

  83. Vernikos G, Medini D, Riley DR, Tettelin H. Ten years of pan-genome analyses. Curr Opin Microbiol. 2015;23:148–54.

    Article  CAS  PubMed  Google Scholar 

  84. Lapierre P, Gogarten JP. Estimating the size of the bacterial pan-genome. Trends Genet. 2009;25:107–10.

    Article  CAS  PubMed  Google Scholar 

  85. Watve SS, Thomas J, Hammer BK. CytR is a global positive regulator of competence, type VI secretion, and chitinases in Vibrio cholerae. PLoS ONE. 2015;10:e0138834.

    Article  PubMed  PubMed Central  Google Scholar 

  86. Borgeaud S, Metzger LC, Scrignari T, Blokesch M. The type VI secretion system of Vibrio cholerae fosters horizontal gene transfer. Science. 2015;347:63–7.

    Article  CAS  PubMed  Google Scholar 

  87. Seitz P, Blokesch M. DNA-uptake machinery of naturally competent Vibrio cholerae. Proc Natl Acad Sci USA. 2013;110:17987–92.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  88. Vasu K, Nagaraja V. Diverse functions of restriction-modification systems in addition to cellular defense. Microbiol Mol Biol Rev. 2013;77:53–72.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  89. Blokesch M, Schoolnik GK. The extracellular nuclease Dns and its role in natural transformation of Vibrio cholerae. J Bacteriol. 2008;190:7232–40.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  90. Dalia AB, Seed KD, Calderwood SB, Camilli A. A globally distributed mobile genetic element inhibits natural transformation of Vibrio cholerae. Proc Natl Acad Sci USA. 2015;112:10485–90.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  91. Jiang SC, Fu W. Seasonal abundance and distribution of Vibrio cholerae in coastal waters quantified by a 16S–23S intergenic spacer probe. Microb Ecol. 2001;42:540–8.

    Article  CAS  PubMed  Google Scholar 

  92. Wein T, Dagan T. The effect of population bottleneck size and selective regime on genetic diversity and evolvability in bacteria. Genome Biol Evol. 2019;11:3283–90.

    PubMed  PubMed Central  Google Scholar 

  93. Islam MT, Nasreen T, Kirchberger PC, Liang KYH, Orata FD, Johura FT, Hussain NAS, Im MS, Tarr CL, Alam M, et al. Population analysis of Vibrio cholerae in aquatic reservoirs reveals a novel sister species (Vibrio paracholerae sp. nov.) with a history of association with humans. Appl Environ Microbiol. 2021;87:e00422-21.

    Article  CAS  PubMed Central  Google Scholar 

  94. Islam MT, Liang K, Im MS, Winkjer J, Busby S, Tarr CL, Boucher Y. Draft genome sequences of nine Vibrio sp. isolates from across the United States closely related to Vibrio cholerae. Microbiol Resour Announc. 2018;7:e00965-18.

    Article  PubMed  PubMed Central  Google Scholar 

  95. Liang K, Islam MT, Hussain N, Winkjer NS, Im MS, Rowe LA, Tarr CL, Boucher Y. Draft genome sequences of eight Vibrio sp. clinical isolates from across the United States that form a basal sister clade to Vibrio cholerae. Microbiol Resour Announc. 2019;8:e01473-18.

    Article  PubMed  PubMed Central  Google Scholar 

  96. Liang K, Orata FD, Winkjer NS, Rowe LA, Tarr CL, Boucher Y. Complete genome sequence of Vibrio sp. strain 2521-89, a close relative of Vibrio cholerae isolated from lake water in New Mexico, USA. Genome Announc. 2017;5:e00905-17.

    Article  PubMed  PubMed Central  Google Scholar 

  97. Flemming HC, Wuertz S. Bacteria and archaea on Earth and their abundance in biofilms. Nat Rev Microbiol. 2019;17:247–60.

    Article  CAS  PubMed  Google Scholar 

  98. Abe K, Nomura N, Suzuki S. Biofilms: hot spots of horizontal gene transfer (HGT) in aquatic environments, with a focus on a new HGT mechanism. FEMS Microbiol Ecol. 2020;96:fiaa031.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  99. Aminov RI. Horizontal gene exchange in environmental microbiota. Front Microbiol. 2011;2:158.

    Article  PubMed  PubMed Central  Google Scholar 

  100. Madsen JS, Roder HL, Russel J, Sorensen H, Burmolle M, Sorensen SJ. Coexistence facilitates interspecific biofilm formation in complex microbial communities. Environ Microbiol. 2016;18:2565–74.

    Article  CAS  PubMed  Google Scholar 

  101. Flemming HC. The perfect slime. Colloids Surf B Biointerfaces. 2011;86:251–9.

    Article  CAS  PubMed  Google Scholar 

  102. Keymer DP, Boehm AB. Recombination shapes the structure of an environmental Vibrio cholerae population. Appl Environ Microbiol. 2011;77:537–44.

    Article  CAS  PubMed  Google Scholar 

  103. Pretzer C, Druzhinina IS, Amaro C, Benediktsdóttir E, Hedenstrom I, Hervio-Heath D, Huhulescu S, Schets FM, Farnleitner AH, Kirschner AK. High genetic diversity of Vibrio cholerae in the European lake Neusiedler See is associated with intensive recombination in the reed habitat and the long-distance transfer of strains. Environ Microbiol. 2017;19:328–44.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  104. Esteves K, Mosser T, Aujoulat F, Hervio-Heath D, Monfort P, Jumas-Bilak E. Highly diverse recombining populations of Vibrio cholerae and Vibrio parahaemolyticus in French Mediterranean coastal lagoons. Front Microbiol. 2015;6:708.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


We thank Sophie Dang (Molecular Biology Service Unit, University of Alberta) for her assistance with genome library preparation and sequencing.


This work was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Integrated Microbial Biodiversity program of the Canadian Institute for Advanced Research (to YFB). We also acknowledge additional support through graduate student scholarships from Alberta Innovates—Technology Futures (to FDO and KYHL), the Bank of Montréal Financial Group (to FDO), NSERC (to KYHL), the Department of Biological Sciences of the University of Alberta (to NASH), and Alberta Student Aid (to NASH).

Author information

Authors and Affiliations



FDO and YFB conceptualized and designed the study. FDO performed genomic DNA extraction, sequencing, and analysis. FDO, NASH, and KYHL performed bioinformatic analysis. NASH performed statistical analysis. All authors contributed to the interpretation of data. FDO, NASH, and YFB drafted the manuscript. FDO and NASH prepared the figures and tables. KYHL and DH provided feedback and substantial revisions to the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Yann F. Boucher.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that this study received funding from the Bank of Montréal Financial Group. The funder was not involved in the study design; collection, analysis, and interpretation of data; the writing of this article; and the decision to submit it for publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

: Core housekeeping genes used to determine genome completeness.

Additional file 2

: Predicted functions of genes found in V. metoecus but not in V. cholerae.

Additional file 3

: Predicted functions of genes found in V. cholerae but not in V. metoecus.

Additional file 4

: Presence/absence map of virulence factors in V. cholerae and V. metoecus.

Additional file 5

: Presence/absence map of genes of the Vibrio pathogenicity island – 2 in V. cholerae and V. metoecus.

Additional file 6

: The pan-genome of V. cholerae and V. metoecus.

Additional file 7

: Count of horizontal gene transfer events within the core genes of V. cholerae and V. metoecus.

Additional file 8

: Count of horizontal gene transfer events within the accessory genes of V. cholerae and V. metoecus.

Additional file 9

: Presence/absence map of genes involved in DNA uptake and restriction-modification, as well as endo- and exonucleases in V. cholerae and V. metoecus.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Orata, F.D., Hussain, N.A.S., Liang, K.Y.H. et al. Genomes of Vibrio metoecus co-isolated with Vibrio cholerae extend our understanding of differences between these closely related species. Gut Pathog 14, 42 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Vibrio cholerae
  • Vibrio metoecus
  • Comparative genomics
  • Horizontal gene transfer
  • Evolution