- Genome Report
- Open Access
Genomic characterization of Escherichia coli LCT-EC001, an extremely multidrug-resistant strain with an amazing number of resistance genes
Gut Pathogens volume 11, Article number: 25 (2019)
Multidrug resistance is a growing global public health threat with far more serious consequences than generally anticipated. In this study, we investigated the antibiotic resistance and genomic traits of a clinical strain of Escherichia coli LCT-EC001.
LCT-EC001 was resistant to 16 kinds of widely used antibiotics, including fourth-generation cephalosporins and carbapenems. In total, 68 determinants associated with antibiotic resistance were identified, including 8 beta-lactamase genes (notably producing ESBLs and KPCs), 31 multidrug efflux system genes, 6 outer membrane transport system genes, 4 aminoglycoside-modifying enzyme genes, 10 two-component regulatory system genes, and 9 other enzyme or transcriptional regulator genes, covering nearly all known drug-resistance mechanisms in E. coli. More than half of the resistance genes were located close to mobile genetic elements, such as plasmids, transposons, genomic islands, and insertion sequences. Phylogenetic analysis revealed that this strain may have evolved from E. coli K-12 but is a completely new MLST type.
Antibiotic resistance was extremely severe in E. coli LCT-EC001, mainly due to mobile genetic elements that allowed the gain of a large number of resistance genes. The antibiotic resistance genes of E. coli LCT-EC001 can probably be transferred to other bacteria. To the best of our knowledge, this is the first report of a strain of E. coli carrying such a large number of antibiotic resistance genes. Apart from providing an E. coli reference genome with an extremely high multidrug-resistant background for future analyses, this work also offers a strategy for investigating the complement and characteristics of genes contributing to drug resistance at the whole-genome level.
According to the World Health Organization (WHO) report ‘Antimicrobial resistance: global report on surveillance 2014’, multidrug resistance is a growing global public health threat with far more serious consequences than generally anticipated. Of the WHO member states, 50% reported that E. coli isolated within their borders was resistant to third-generation cephalosporins and fluoroquinolones, the best antibiotics available for treating multidrug-resistant bacteria. In February 2017, the WHO published its first ever list of antibiotic-resistant “priority pathogens”, a catalogue of 12 families of bacteria that pose the greatest threat to human health. E. coli was defined as one of the most critical multidrug-resistant bacteria, which were considered to have built-in abilities to find new ways to resist treatment and to pass along genetic material that allows other bacteria to become drug-resistant as well. It is widely accepted that infections caused by antibiotic-resistant bacteria burden healthcare resources and increase the risk of poor clinical outcomes for patients. Global estimates suggest that more than 700,000 people per year die from drug-resistant infections. It is predicted that antibiotic-resistant infections will kill ~10 million people per year by 2050, costing the global economy ~$100 trillion. The seriousness of this situation was summarized in the WHO report: ‘A post antibiotic era, in which common infections and minor injuries can kill, is instead a very real possibility for the 21st century’.
Revealing the mechanisms underlying drug resistance in bacterial pathogens is crucial for infectious disease control and management. With significant progress in high-throughput sequencing and bioinformatics analysis of pathogens, whole-genome sequencing has become more accessible for the identification and tracking of multidrug-resistant (MDR) microorganisms in hospitals and communities. In this study, we isolated E. coli strain LCT-EC001 from a 78-year-old male patient with several health issues, including diabetes, hypertension and chronic obstructive pulmonary disease, who had received long-term therapy with multiple drugs. The drug resistance of E. coli strain LCT-EC001 was tested, and whole-genome sequencing was conducted to understand the genetic elements contributing to antibiotic resistance. This work contributes a clinically isolated drug-resistant E. coli strain as a valuable reference for future studies and presents a strategy for the comprehensive analysis of drug resistance at the whole-genome level.
Bacterial isolation and culture conditions
An E. coli isolate (designated LCT-EC001) was obtained from the sputum of a 78-year-old male patient who had several health issues (diabetes, hypertension and chronic obstructive pulmonary disease) and had received multidrug therapy over a long time period. The bacterium was inoculated in Brain Heart Infusion (Oxoid, UK) medium at 37 °C.
Antibiotic susceptibility test
The antibiotic susceptibility profile was tested using a VITEK 2 Compact System (bioMerieux Inc., USA) according to the manufacturer’s instructions, as previously reported. The 17 antibiotics tested were as follows: ampicillin, cefazolin, ampicillin/sulbactam, cefotetan, ceftriaxone, cefepime, ceftazidime, aztreonam, ertapenem, imipenem, amikacin, gentamicin, tobramycin, levofloxacin, ciprofloxacin, trimethoprim/sulfa, and nitrofurantoin.
High-throughput sequencing and assembly
Genomic DNA was isolated using the cetyltrimethylammonium bromide (CTAB) method. The total DNA obtained was subjected to quality control by agarose gel electrophoresis and quantified by Qubit. The genome of E. coli strain LCT-EC001 was sequenced with Illumina massively parallel sequencing (MPS) technology. Two DNA libraries were constructed: a paired-end library with an insert size of 500 bp and a paired-end library with an insert size of 5 kb. Both libraries were sequenced using an Illumina HiSeq 2000 platform (Illumina, USA). Quality control of the reads from the two libraries was performed using the readfq program (version 10) with the following steps: (1) reads in which the number of low-quality bases (Q-value ≤ 38) exceeded the threshold (40 bp by default) were eliminated; (2) reads in which the number of Ns exceeded the threshold (10 bases by default) were eliminated; (3) reads whose overlap with the adapter exceeded the threshold (15 bp by default) were eliminated; and (4) duplicates were filtered so that only one copy of identical reads was kept. For the 500 bp library, 6.19% of reads were filtered out, and for the 5 kb library, 8.48% of reads were filtered out. The filtered reads were assembled by SOAPdenovo to generate scaffolds. The parameters used for assembly were as follows: SOAPdenovo all -F -K 107 -k 107. All reads were then used for gap closure with GapCloser (version 1.12) with default parameters.
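The four read-filtering steps described above can be sketched in a few lines of Python. This is only an illustration of the stated rules, not the readfq implementation: the thresholds are those quoted in the text, while the function names, the exact-match adapter check restricted to the 3′ end, and the treatment of each read on its own (rather than as a read pair) are simplifying assumptions made here.

```python
from typing import Iterable, Iterator, List, Tuple

Read = Tuple[str, List[int]]  # (sequence, per-base quality scores as integers)

# Thresholds quoted in the text.
LOW_Q = 38                # a base is "low quality" when Q <= 38
MAX_LOW_Q_BASES = 40      # step 1: reject if more than 40 low-quality bases
MAX_N_BASES = 10          # step 2: reject if more than 10 Ns
MAX_ADAPTER_OVERLAP = 15  # step 3: reject if adapter overlap exceeds 15 bp


def adapter_overlap(seq: str, adapter: str) -> int:
    """Length of the longest read suffix that equals an adapter prefix.

    Assumption: exact matching at the 3' end only; real trimmers
    usually tolerate mismatches.
    """
    for k in range(min(len(seq), len(adapter)), 0, -1):
        if seq.endswith(adapter[:k]):
            return k
    return 0


def keep_read(seq: str, quals: List[int], adapter: str) -> bool:
    """Apply steps 1-3 to a single read."""
    if sum(1 for q in quals if q <= LOW_Q) > MAX_LOW_Q_BASES:
        return False
    if seq.upper().count("N") > MAX_N_BASES:
        return False
    if adapter_overlap(seq, adapter) > MAX_ADAPTER_OVERLAP:
        return False
    return True


def filter_reads(reads: Iterable[Read], adapter: str) -> Iterator[Read]:
    """Apply steps 1-3, then step 4: keep one copy of identical reads."""
    seen = set()
    for seq, quals in reads:
        if not keep_read(seq, quals, adapter):
            continue
        if seq in seen:  # duplicate of a read that was already kept
            continue
        seen.add(seq)
        yield seq, quals
```

In a paired-end setting the duplicate check would normally use both mates as the key; the single-read key above is kept only for brevity.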
Gene prediction, annotation and protein classification
Gene prediction was performed on the LCT-EC001 genome assembly by GeneMarkS with an integrated model that combined the GeneMarkS-generated (native) and heuristic model parameters. Gene annotation was performed with a BLASTp search (E-value less than 1e−5, minimal alignment length percentage larger than 40%) against 4 databases in a standalone environment. The databases were KEGG (Kyoto Encyclopedia of Genes and Genomes, v2016.4), COG (Clusters of Orthologous Groups, v2015.12), GO (Gene Ontology, v2014.10), and ncRNA (noncoding RNA database, tRNA: v1.3.1, rRNA: v1.2, and sRNA: v2013.8) [14,15,16]. A genome overview was created with Circos to show annotation information. In addition, genomic islands (GIs), prophages, repeat regions, transfer elements, plasmids, and insertion sequence elements (IS elements) in LCT-EC001 were analyzed. Repetitive sequences were predicted using RepeatMasker. Tandem repeats were analyzed using Tandem Repeat Finder (TRF). PHAST was used for prophage prediction. IslandPath-DIMOB was used to predict genomic islands and horizontal gene transfer by examining features such as dinucleotide sequence composition bias and the presence of mobility genes.
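The two BLASTp cut-offs stated above (E-value below 1e−5 and alignment length percentage above 40%) can be applied to tabular BLAST output as sketched below. The sketch assumes the standard 12-column tabular layout (qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart, send, evalue, bitscore) and assumes that "alignment length percentage" means alignment length divided by query length; the text does not say which length is used as the denominator, so that choice and all names in the code are illustrative.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List

MAX_EVALUE = 1e-5            # hits must have an E-value below this
MIN_ALIGNED_FRACTION = 0.40  # aligned fraction must exceed this


@dataclass(frozen=True)
class Hit:
    query: str
    subject: str
    identity: float
    aln_length: int
    evalue: float
    bitscore: float


def parse_hit(line: str) -> Hit:
    """Parse one line of 12-column tabular BLAST output."""
    f = line.rstrip("\n").split("\t")
    return Hit(f[0], f[1], float(f[2]), int(f[3]), float(f[10]), float(f[11]))


def filter_hits(lines: Iterable[str], query_lengths: Dict[str, int]) -> List[Hit]:
    """Keep hits passing both cut-offs.

    query_lengths maps each query ID to its protein length; it must be
    supplied separately because the default table does not include it.
    """
    kept = []
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        hit = parse_hit(line)
        aligned_fraction = hit.aln_length / query_lengths[hit.query]
        if hit.evalue < MAX_EVALUE and aligned_fraction > MIN_ALIGNED_FRACTION:
            kept.append(hit)
    return kept
```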
Phylogenetic analysis and multilocus sequence typing (MLST)
The genomes of 62 other E. coli strains were compared with the genome of LCT-EC001 for SNP detection by using MUMmer (version 3.22) with default settings. Then, the repeat regions of LCT-EC001 were detected by self-BLAST (BLASTn run with blastall, BLAST v2.2.23), TRF and RepeatMasker. After that, SNPs located in the repeat regions were filtered out. Based on the positions of the remaining SNPs, a phylogenetic tree was generated using the neighbor-joining method with 1000 bootstrap replicates via MEGA6. MLST was performed with the web tool at http://cge.cbs.dtu.dk/services/MLST/, using the assembled genome. The MLST type was determined by comparing the sequences of seven housekeeping genes (adk, fumC, gyrB, icd, mdh, purA, and recA) in LCT-EC001 with those in the database.
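The step of discarding SNPs that fall inside repeat regions can be illustrated as follows. This is a minimal sketch, assuming 1-based inclusive coordinates on a single reference sequence; the function names are hypothetical and the actual pipeline may have handled coordinates per scaffold.

```python
from bisect import bisect_right
from typing import Iterable, List, Tuple

Interval = Tuple[int, int]  # (start, end), 1-based and inclusive


def merge_intervals(intervals: Iterable[Interval]) -> List[Interval]:
    """Merge overlapping or directly adjacent repeat intervals."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def drop_snps_in_repeats(snp_positions: Iterable[int],
                         repeats: Iterable[Interval]) -> List[int]:
    """Return the SNP positions that lie outside every repeat interval."""
    merged = merge_intervals(repeats)
    starts = [s for s, _ in merged]
    kept = []
    for pos in snp_positions:
        i = bisect_right(starts, pos) - 1  # last interval starting at or before pos
        inside = i >= 0 and pos <= merged[i][1]
        if not inside:
            kept.append(pos)
    return kept
```

Merging the repeat calls from the three tools first means each SNP needs only one binary search, regardless of how many overlapping repeat annotations cover it.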
Analysis of antibiotic resistance genes
A BLASTp search (E-value less than 1e−5, minimal alignment length percentage larger than 40%) was performed against 3 databases for drug resistance analysis. The databases were ARDB (Antibiotic Resistance Genes Database), CARD and ARG-ANNOT (Antibiotic Resistance Gene-ANNOTation). Then, all identified sequences were searched online with BLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi) to match genes in NCBI. The identified resistance genes were further verified by PCR and Sanger sequencing. The positions of these genes relative to genomic islands, prophages, repeat regions, transfer elements, plasmids, and IS elements were then analyzed.
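One way to relate resistance genes to mobile genetic elements is to measure the gap between their coordinates on the same scaffold. The sketch below is an assumption about how such a comparison could be done; the text does not state the distance used to call a gene "near" an element, so the `window` parameter, the feature names, and the coordinates in the usage example are all hypothetical.

```python
from typing import Iterable, List, NamedTuple


class Feature(NamedTuple):
    name: str
    scaffold: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def gap_between(a: Feature, b: Feature) -> int:
    """Number of bases separating two features (0 if they overlap or touch)."""
    return max(0, max(a.start, b.start) - min(a.end, b.end) - 1)


def nearby_elements(gene: Feature, elements: Iterable[Feature],
                    window: int) -> List[str]:
    """Names of elements on the gene's scaffold within `window` bases of it."""
    return [
        e.name
        for e in elements
        if e.scaffold == gene.scaffold and gap_between(gene, e) <= window
    ]
```

For example, with a hypothetical window of 500 bp:

```python
gene = Feature("resistance_gene", "scaffold1", 1000, 1881)
elements = [
    Feature("IS_a", "scaffold1", 2000, 2800),
    Feature("GI_1", "scaffold1", 500, 1500),
    Feature("IS_b", "scaffold2", 1000, 1800),
    Feature("Tn_1", "scaffold1", 10000, 12000),
]
print(nearby_elements(gene, elements, window=500))
```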
Results and discussion
Strain LCT-EC001 is resistant to most clinical antibiotics
We tested the susceptibility of E. coli strain LCT-EC001 to 17 kinds of widely used antibiotics with the VITEK 2 Compact System in triplicate. Our findings showed that E. coli strain LCT-EC001 was resistant to 16 kinds of antibiotics, including fourth-generation cephalosporins (cefepime) and carbapenems (ertapenem and imipenem), and was only sensitive to amikacin, indicating that it is a severely multidrug-resistant bacterium. However, the test for extended-spectrum β-lactamases (ESBLs) was negative. The results are shown in Table 1.
Normally, E. coli colonizes the intestines of humans and other animals. However, under certain circumstances it is a frequent cause of community- and hospital-acquired infections, such as those of the urinary tract, bloodstream, abdomen, skin and soft tissues. This bacterium also causes pneumonia, neonatal meningitis and food-borne infections on a global scale. It is well accepted that antimicrobial resistance is related to widespread antibiotic use, especially inappropriate use in humans and other animals, as well as in the food industry. With the increasing incidence of multidrug-resistant organisms, antibiotic resistance has now become a serious global public health problem.
Genomic features of the strain LCT-EC001
An overview of the genome of E. coli strain LCT-EC001 is shown in Fig. 1. The final assembled genome consisted of 17 scaffolds with a total length of 5,198,242 bp and a mean GC content of 50.79%. The gene annotation included 5013 protein-coding sequences (CDSs) accounting for 86.61% of the genome (Table 2), 84 tRNA (transfer RNA) fragments, 65 sRNA (small RNA) genes, 7 copies of 5S rRNA (ribosomal RNA), 6 copies of 16S rRNA, 6 copies of 23S rRNA (Additional file 1: Table S1), 17,031 bp of interspersed repeat sequences and 31,219 bp of tandem repeat sequences (Additional file 2: Table S2). Of the predicted genes, 69.18% were assigned to the GO database (Additional file 3: Table S3), 78.04% to the COG database (Additional file 4: Table S4), and 65.93% to the KEGG database (Additional file 5: Table S5).
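Summary figures such as the number of scaffolds, the total assembly length and the mean GC content can be recomputed from a FASTA file of the scaffolds. The following is a minimal sketch, not the procedure used in this study; it assumes that GC content is calculated over unambiguous bases only, so that Ns in scaffold gaps do not lower the percentage.

```python
from typing import Dict, Iterable, List


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA text into {sequence name: upper-case sequence}."""
    parts: Dict[str, List[str]] = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]  # ID is the first word of the header
            parts[name] = []
        elif name is not None:
            parts[name].append(line.upper())
    return {n: "".join(chunks) for n, chunks in parts.items()}


def assembly_stats(seqs: Dict[str, str]) -> Dict[str, float]:
    """Scaffold count, total length, and GC percentage over A/C/G/T bases."""
    total = sum(len(s) for s in seqs.values())
    gc = sum(s.count("G") + s.count("C") for s in seqs.values())
    acgt = sum(s.count(base) for s in seqs.values() for base in "ACGT")
    return {
        "scaffolds": len(seqs),
        "total_length": total,
        "gc_percent": 100.0 * gc / acgt if acgt else 0.0,
    }
```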
Phylogenetic tree and MLST analysis of LCT-EC001
To interpret the evolution of such an extremely multidrug-resistant Escherichia coli isolate, a selection of 62 complete E. coli genomes (1 chromosome each) downloaded from NCBI was used to construct a phylogenetic tree by the neighbor-joining method. All samples except LCT-EC001 were named as E. coli plus the NCBI uid. The results showed that LCT-EC001 was most closely related to E. coli K-12, a strain widely used in laboratories (Fig. 2), indicating that LCT-EC001 may have evolved from E. coli K-12. MLST analysis showed that the alleles of the seven housekeeping genes in LCT-EC001 were adk10, fumC11, gyrB4, icd8, mdh8, purA13, and recA2. However, no available MLST type matched this profile, revealing that this strain represents a completely new type.
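Assigning an MLST type amounts to looking up the seven-locus allele profile in a table of known sequence types. The sketch below uses the LCT-EC001 profile reported above; the table of known profiles contains made-up placeholder entries rather than real database records, and the function names are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

LOCI = ("adk", "fumC", "gyrB", "icd", "mdh", "purA", "recA")
Profile = Tuple[int, ...]


def profile_from_alleles(alleles: Dict[str, int]) -> Profile:
    """Order the allele numbers by the fixed locus order in LOCI."""
    return tuple(alleles[locus] for locus in LOCI)


def lookup_st(profile: Profile, known: Dict[Profile, str]) -> Optional[str]:
    """Return the matching sequence type, or None for a novel profile."""
    return known.get(profile)


def loci_differing(a: Profile, b: Profile) -> List[str]:
    """Loci at which two profiles carry different alleles."""
    return [locus for locus, x, y in zip(LOCI, a, b) if x != y]


# Allele numbers reported for LCT-EC001 in the text.
lct_ec001 = profile_from_alleles(
    {"adk": 10, "fumC": 11, "gyrB": 4, "icd": 8, "mdh": 8, "purA": 13, "recA": 2}
)

# Placeholder table (NOT real database entries), for illustration only.
known_profiles: Dict[Profile, str] = {
    (1, 2, 3, 4, 5, 6, 7): "ST-placeholder-A",
    (10, 11, 4, 8, 8, 1, 2): "ST-placeholder-B",
}

st = lookup_st(lct_ec001, known_profiles)
print("novel profile" if st is None else st)
```

A profile that returns no match, as here, corresponds to the situation described above in which no available MLST type fits the isolate.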
Analysis of the complement of antibiotic resistance genes
To understand the basis of antibiotic resistance in E. coli strain LCT-EC001, we carried out sequence alignments against the ARDB, CARD and ARG-ANNOT databases. A total of 68 determinants associated with antibiotic resistance were identified, with lengths ranging from 348 to 3594 bp and a mean length of 1305 bp (Additional file 6: Table S6). All of these determinants matched genes in NCBI with at least 99% similarity and were then named and classified according to the matched gene information, giving 8 beta-lactamase genes, 31 multidrug efflux system genes, 6 outer membrane transport system genes, 4 aminoglycoside-modifying enzyme genes, 10 two-component regulatory system genes, and 9 other enzyme or transcriptional regulator genes (Fig. 3). PCR and Sanger sequencing confirmed that all of these genes were present in E. coli strain LCT-EC001.
Beta-lactamases are bacterial enzymes that confer resistance to β-lactam antibiotics such as penicillins, cephalosporins, and cephamycins by breaking the four-atom β-lactam ring common to these antibiotics. Among the 8 beta-lactamase genes, 2 were the extended-spectrum β-lactamase (ESBL) genes TEM-1 and CTX-M-14, and 1 was the Klebsiella pneumoniae carbapenemase (KPC) gene KPC-2. ESBLs can hydrolyze extended-spectrum cephalosporins, including cefotaxime, ceftriaxone, and ceftazidime, as well as the oxyimino-monobactam aztreonam. ESBLs thus confer resistance to these antibiotics and related oxyimino-β-lactams and play an important role in antibiotic resistance in E. coli. KPC is another key enzyme in MDR because it can hydrolyze a broad variety of β-lactams, including carbapenems, cephalosporins and penicillins. Interestingly, the VITEK 2 Compact System did not detect ESBL production even though ESBL genes were present, highlighting a limitation of this system in the clinical setting.
The drug resistance genes in LCT-EC001 covered nearly all known drug-resistance mechanisms in E. coli. Of these genes, 34 were detected with the ARDB database, 61 with the CARD database, and 19 with the ARG-ANNOT database (Additional file 7: Table S7). In addition, 6 genes were located in genomic islands, 11 in plasmids, 3 near transposons, and 14 near insertion sequences, whereas no genes were associated with prophages or repeat regions (Additional file 7: Table S7).
A more concerning problem is that antibiotic resistance traits can transfer between bacteria, regardless of genus, via mobile genetic elements (MGEs) such as plasmids, insertion sequences, integrons/transposons, and chromosomal fragments (including resistance islands). A plasmid is an extrachromosomal DNA molecule that replicates autonomously. A plasmid can harbor genes conferring resistance to β-lactams (including genes for carbapenemases and extended-spectrum β-lactamases) and aminoglycosides, as well as genes encoding antibiotic-target-protecting proteins, antibiotic-modifying enzymes or multidrug efflux pumps. Plasmids can also acquire mobile genetic elements by encoding endonuclease/methylase restriction systems. Furthermore, plasmids can move from one bacterial cell to another by conjugal transfer, playing a vital role in the spread of resistance determinants among bacteria. An insertion sequence (IS) is an important MGE that is widespread in bacterial genomes and is usually 0.6–2.0 kb long. IS elements can help resistance genes to transfer between and within bacteria and can upregulate downstream resistance genes. Integrons are another class of MGE responsible for the emergence and spread of antibiotic resistance genes, including those conferring resistance to β-lactams, aminoglycosides, and fluoroquinolones. Transposons, like plasmids, can transfer horizontally or vertically among pathogens, driving the development of antibiotic resistance. A genomic island (GI) is a large continuous genomic region, usually 4.5–600 kb in size, that is generated by lateral gene transfer (LGT). GIs can carry tens to hundreds of genes, often ones important for bacterial evolution, such as antibiotic resistance genes.
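The gene counts reported in this section can be cross-checked with simple arithmetic. The short script below uses only the numbers given in the text; the one assumption, flagged in the comments, is that no gene is counted under more than one class of mobile genetic element.

```python
# Counts taken from the text.
CATEGORY_COUNTS = {
    "beta-lactamase": 8,
    "multidrug efflux system": 31,
    "outer membrane transport system": 6,
    "aminoglycoside-modifying enzyme": 4,
    "two-component regulatory system": 10,
    "other enzyme or transcriptional regulator": 9,
}
MGE_COUNTS = {
    "genomic island": 6,
    "plasmid": 11,
    "transposon": 3,
    "insertion sequence": 14,
}
DATABASE_COUNTS = {"ARDB": 34, "CARD": 61, "ARG-ANNOT": 19}

total_genes = sum(CATEGORY_COUNTS.values())

# Assumption: each gene is listed under a single MGE class, so the
# per-class counts can be added without double counting.
mge_associated = sum(MGE_COUNTS.values())
mge_fraction = mge_associated / total_genes

# Each gene can be found in several databases, so the per-database
# counts add up to more than the number of distinct genes.
database_detections = sum(DATABASE_COUNTS.values())
shared_detections = database_detections - total_genes

print(f"total resistance determinants: {total_genes}")
print(f"MGE-associated: {mge_associated} ({mge_fraction:.0%})")
print(f"detections shared between databases: at least {shared_detections}")
```

The six functional classes add up to the 68 determinants reported, and the three databases together give 114 detections, so many genes were found in more than one database.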
It is worth mentioning that our genome is a draft genome comprising 18 contigs, which leaves 17 sequence gaps; additional drug-resistance genes lying in these gaps may not have been identified.
Ashiru-Oredope D, Hopkins S. Antimicrobial resistance: moving from professional engagement to public action. J Antimicrob Chemother. 2015;70:2927–30.
Silva ON, de la Fuente-Núñez C, Haney EF, Fensterseifer IC, Ribeiro SM, Porto WF, et al. An anti-infective synthetic peptide with dual antimicrobial and immunomodulatory activities. Sci Rep. 2016;6:35465.
Quainoo S, Coolen JPM, van Hijum SAFT, Huynen MA, Melchers WJG, van Schaik W, et al. Whole-genome sequencing of bacterial pathogens: the future of nosocomial outbreak analysis. Clin Microbiol Rev. 2017;30:1015–63.
Li J, Liu F, Wang Q, Ge P, Woo PC, Yan J, et al. Genomic and transcriptomic analysis of NDM-1 Klebsiella pneumoniae in spaceflight reveal mechanisms underlying environmental adaptability. Sci Rep. 2014;4:6216.
Casaril AE, de Oliveira LP, Alonso DP, de Oliveira EF, Gomes Barrios SP, de Oliveira Moura Infran J, et al. Standardization of DNA extraction from sand flies: application to genotyping by next generation sequencing. Exp Parasitol. 2017;177:66–72.
Li H. Fast multi-line fasta/q reader in several programming languages. 2013. https://github.com/lh3/readfq.
Li R, Zhu H, Ruan J, Qian W, Fang X, Shi Z, et al. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 2010;20(2):265–72.
Liu T, Zhu L, Zhang Z, Jiang L, Huang H. Draft genome sequence of Bacillus sp. (2017) M13, a multidrug-resistant subclass B1 blaNDM-producing, spore-forming bacterium isolated from China. J Glob Antimicrob Resist. 2018;14:152–3.
Besemer J, Lomsadze A, Borodovsky M. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 2001;29(12):2607–18.
Rost B. Twilight zone of protein sequence alignments. Protein Eng. 1999;12(2):85–94.
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, et al. From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006;34(Database issue):D354–7.
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, et al. The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003;4:41.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25(1):25–9.
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–64.
Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35(9):3100–8.
Gardner PP, Daub J, Tate JG, Nawrocki EP, Kolbe DL, Lindgreen S, et al. Rfam: updates to the RNA families database. Nucleic Acids Res. 2009;37(Database issue):D136–40.
Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19:1639–45.
Saha S, Bridges S, Magbanua ZV, Peterson DG. Empirical comparison of ab initio repeat finding programs. Nucleic Acids Res. 2008;36(7):2284–94.
Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27(2):573.
Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. PHAST: a fast phage search tool. Nucleic Acids Res. 2011;39(Web Server issue):W347–52.
Bertelli C, Brinkman FSL. Improved genomic island predictions with IslandPath-DIMOB. Bioinformatics. 2018;34(13):2161–7.
Qin J, Li R, Raes J, Arumugam M, Burgdorf KS, Manichanh C, et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464:59–65.
Pennington TH. E. coli O157 outbreaks in the United Kingdom: past, present, and future. Infect Drug Resist. 2014;7:211–22.
Rohde H, Qin J, Cui Y, Li D, Loman NJ, Hentschke M, et al. Open-source genomic analysis of Shiga-toxin-producing E. coli O104:H4. N Engl J Med. 2011;365:718–24.
Hawkey PM, Jones AM. The changing epidemiology of resistance. J Antimicrob Chemother. 2009;64(Suppl 1):i3–10.
Galdadas I, Lovera S, Pérez-Hernández G, Barnes MD, Healy J, Afsharikho H, et al. Defining the architecture of KPC-2 Carbapenemase: identifying allosteric networks to fight antibiotics resistance. Sci Rep. 2018;8:12916.
Tacconelli E, Sifakis F, Harbarth S, Schrijver R, van Mourik M, Voss A, et al. Surveillance for control of antimicrobial resistance. Lancet Infect Dis. 2017. https://doi.org/10.1016/S1473-3099(17)30485-1.
Masud MR, Afroz H, Fakruddin M. Prevalence of extended-spectrum β-lactamase positive bacteria in radiologically positive urinary tract infection. Springerplus. 2014;3:216.
Zhong LL, Phan HTT, Shen C, Doris-Vihta K, Sheppard AE, Huang X, et al. High rates of human fecal carriage of mcr-1-positive multi-drug resistant Enterobacteriaceae isolates emerge in China in association with successful plasmid families. Clin Infect Dis. 2018;66:676–85.
Subedi D, Vijay AK, Willcox M. Overview of mechanisms of antibiotic resistance in Pseudomonas aeruginosa: an ocular perspective. Clin Exp Optom. 2018;101:162–71.
Martínez JL, Baquero F. Emergence and spread of antibiotic resistance: setting a parameter space. Ups J Med Sci. 2014;119:68–77.
Carattoli A. Plasmids and the spread of resistance. Int J Med Microbiol. 2013;303:298–304.
Correia S, Poeta P, Hébraud M, Capelo JL, Igrejas G. Mechanisms of quinolone action and resistance: where do we stand? J Med Microbiol. 2017;66:551–9.
Challacombe JF, Pillai S, Kuske CR. Shared features of cryptic plasmids from environmental and pathogenic Francisella species. PLoS ONE. 2017;12:e0183554.
Schmitz-Esser S, Penz T, Spang A, Horn M. A bacterial genome in transition–an exceptional enrichment of IS elements but lack of evidence for recent transposition in the symbiont Amoebophilus asiaticus. BMC Evol Biol. 2011;11:270.
Bennett PM. Plasmid encoded antibiotic resistance: acquisition and transfer of antibiotic resistance genes in bacteria. Br J Pharmacol. 2008;153(Suppl 1):S347–57.
Figueiredo S, Poirel L, Papa A, Koulourida V, Nordmann P. Overexpression of the naturally occurring blaOXA-51 gene in Acinetobacter baumannii mediated by novel insertion sequence ISAba9. Antimicrob Agents Chemother. 2009;53:4045–7.
Chen DQ, Jiang YT, Feng DH, Wen SX, Su DH, Yang L. Integron mediated bacterial resistance and virulence on clinical pathogens. Microb Pathog. 2018;114:453–7.
Rangasamy K, Athiappan M, Devarajan N, Samykannu G, Parray JA, Aruljothi KN, et al. Pesticide degrading natural multidrug resistance bacterial flora. Microb Pathog. 2017;114:304–10.
Lu B, Leong HW. Computational methods for predicting genomic islands in microbial genomes. Comput Struct Biotechnol J. 2016;14:200–6.
XZ, SX, XJ and CL designed the study and wrote the manuscript; YL and ZF analyzed the data; XZ, YY, PW, DL and XZ carried out the experiments. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Availability of data and materials
This Whole Genome Shotgun project of E. coli strain LCT-EC001 has been deposited at DDBJ/ENA/GenBank under the accession JMIC00000000. The version described in this paper is version JMIC02000000.
Consent for publication
Ethics approval and consent to participate
This work was supported by the National Natural Science Foundation of China (No: 81600011) and the China Postdoctoral Science Foundation (Grant No: 2016M592928).
Gene annotation of LCT-EC001.
Repeat sequences of LCT-EC001.
Gene distribution of LCT-EC001 in the GO database.
Gene distribution of LCT-EC001 in the COG database.
Gene distribution of LCT-EC001 in the KEGG database.
Gene length of 68 antibiotic resistance determinants in LCT-EC001.
Location and database source of 68 antibiotic resistance determinants in LCT-EC001.
About this article
Cite this article
Zhang, X., Xiao, S., Jiang, X. et al. Genomic characterization of Escherichia coli LCT-EC001, an extremely multidrug-resistant strain with an amazing number of resistance genes. Gut Pathog 11, 25 (2019). https://doi.org/10.1186/s13099-019-0298-5
- Escherichia coli
- Antibiotic resistance
- High-throughput sequencing