Whole genome sequence and comparative genomic analysis of multidrug-resistant Staphylococcus capitis subsp. urealyticus strain LNZR-1

Background Staphylococcus capitis is an emerging opportunistic pathogen of humans, and found as a colonizer of the human gut. Here, we report a case of S. capitis subsp. urealyticus infection. The strain LNZR-1 was isolated from the blood culture of a patient with sigmoid colon cancer. It was found to be resistant to some important antibiotics, such as linezolid, a highly effective antimicrobial against clinically important Staphylococci pathogens. However, data on the genetic resistance mechanisms in S. capitis subsp. urealyticus are only sparsely available. Results The draft genome of S. capitis subsp. urealyticus strain LNZR-1 was sequenced by using next-generation sequencing technologies. Sequence data assembly revealed a genome size of 2,595,865 bp with a G + C content of 32.67%. Genome annotation revealed the presence of antibiotic resistance genes conferring resistance against some of the tested antibiotics as well as non-tested antibiotics. The genome also possesses a lot of genes that may be related to multidrug resistance. Whole genome comparison of the LNZR-1 with five other S. capitis strains showed that some functional regions are highly homologous between the six assemblies made herein. The LNZR-1 genome has high similarity with the genomes of the strains VCU116 and CR01, although some short stretches present in the genomes of strains VCU116 and CR01 were absent in the strain LNZR-1. Conclusions The presence of a plethora of genes responsible for antibiotic resistance suggests that strain LNZR-1 could present a potential threat to human health. The comparative genomic analysis of S. capitis strains presented in this study is important for better understanding of multidrug resistance in S. capitis. Electronic supplementary material The online version of this article (doi:10.1186/s13099-014-0045-x) contains supplementary material, which is available to authorized users.


Background
Staphylococcus capitis are Gram positive cocci belonging to the Coagulase-Negative Staphylococci group (CoNS) that is frequently found on the human skin and mucosa [1,2] and even in the human gut [3]. Although infection caused by this species is rare compared with S. aureus, infection cases associated with S. capitis increase gradually [4]. Recent reports indicate its emergence as a significant pathogen causing nosocomial and bloodstream infections, meningitis, prosthetic valve endocarditis, and late-onset sepsis [4][5][6][7]. This bacterium is a subtype of CoNS and the pathogenesis of S. capitis is mainly due to its ability to produce a slimy biofilm, enabling it to adhere to the medical devices such as prosthetic valves and catheters; this makes them difficult to be controlled or cleared by immune responses or antibiotic therapy [7].
The S. capitis subsp. urealyticus strain LNZR-1 described herein was isolated from the blood culture of a patient with sigmoid colon cancer. Antimicrobial susceptibility assay revealed that it was resistant to some important antibiotics, such as linezolid. In order to elucidate the molecular mechanisms behind the multidrug resistance of S. capitis subsp. urealyticus LNZR-1 clone, here, we report the sequencing and annotation of its genome, together with a functional level genomic comparison with other important S. capitis strains, namely QN1 [8], CR01 [9], VCU116, C87 and SK14.

Strain information and growth conditions
The blood samples were collected from a patient with sigmoid colon cancer from the Fourth Affiliated Hospital of Harbin Medical University, in March 2013. S. capitis subsp. urealyticus strain LNZR-1 was isolated after cultivation. It is a Gram positive, coccus-shaped bacterium growing on 5% sheep blood enriched Columbia agar (BioMérieux, Marcyl'Etoile, France) at 37°C. Cell morphology, motility and sporulation were examined by using scanning electron microscopy.
Genomic DNA extraction and 16S rRNA gene PCR Late log-phase cells were harvested and lysed with EDTA and lysozyme, followed by proteinase K and RNase digestion. Genomic DNA was extracted using a DNeasy Blood & Tissue Kit (Qiagen, Germany) according to the manufacturer's recommended protocol. Agarose gel (0.7%) electrophoresis was used to evaluate the genomic DNA purity and the concentration was measured using a NanoDrop 1000 Spectrophotometer (Thermo Fisher Scientific, USA). The genomic DNA was stored at -20°C. Strain LNZR-1 was identified by 16S rRNA gene sequencing as described earlier [6]. PCR amplification was performed by using primers 27 F (5′-AGAGTTTGATCCTG GCTCAG-3′) and 1500R (5′-AGAAAGGAGGTGATCCAGGC-3′). Agarose gel (1%) electrophoresis was used to separate amplified PCR fragments which were subjected to sequencing of the 16 s rRNA gene. Phylogenetic analysis was conducted based on the 16S rRNA nucleotide sequence. The representative 16S rRNA nucleotide sequence of strain LNZR-1 was compared against the most recent release of the EzTaxon-e database [10]. Phylogenetic inferences were made using Neighbor-joining method based on Tamura-Nei model within the MEGA 6.06 [11].

Antimicrobial susceptibility testing
Antimicrobial susceptibility was determined by the disk diffusion method on Mueller-Hinton agar recommended by the Clinical and Laboratory Standards Institute guidelines [12]. The following antimicrobial agents were tested: sulfamethoxazole, gentamicin, oxacillin, tetracycline, linezolid, clindamycin, ciprofloxacin, cefoxitin, cefazolin, cefuroxime and vancomycin. The other reference strains used for this study were Escherichia coli ATCC 25922 and Klebsiella pneumoniae ATCC 700603.

Genome sequencing, assembly and annotation
The genome of S. capitis subsp. urealyticus strain LNZR-1 was sequenced using a standard run of Illumina HiSeq 2000 sequencing technology which generated paired-end libraries (500-bp insert size) according to the manufacturer's instructions. Clean reads were assembled into scaffolds using Velvet version 1.2.07 [13], and Post-Assembly Genome Improvement Toolkit (PAGIT) was used to extend the initial contiguous sequences (contigs) and to correct sequencing errors [14]. Open reading frames (ORFs) were identified using Glimmer version 3.0 [15]. Transfer RNAs and ribosomal RNA genes rRNAs were detected by tRNAscan-SE [16] and RNAmmer 1.2 software [17], respectively. The genome was annotated using the RAST (Rapid Annotation using Subsystem Technology) server [18]. The classification of some predicted genes and pathways was analyzed using the Clusters of Orthologous Groups of proteins (COGs) [19] and Kyoto Encyclopedia of Genes and Genomes (KEGG) [20] databases. Functional annotation was also performed by using public database of National Centre for Biotechnology Information (NCBI).
Orthology identification was performed by using NCBI blastp 2.2.25+ with default parameters. Then, bidirectional best hits (BBHs) among proteins from different strains were identified. Furthermore, an identity threshold over a given alignment length to define orthologous genes was applied. Score Ratio Values (SRVs) of BBHs between two genes was calculated with the following formula: Bits (AB) means bits score when using gene A as query while B as database; Bits (BA) means bits score when using gene B as query while A as database; Bits (AA) or Bits (BB) are bits score when using gene A or B to align with itself, respectively. BBHs with SRVs no less than 0.3 was considered as one candidate orthology group (SRVs Table in Additional file 1: Table S1).

Identification of strain LNZR-1
Cells of strain LNZR-1 are cocci, 0.7 to 1.2 μm in diameter, occurring predominantly singly or in pairs (Additional file 2: Figure S1). To assess the purity of strain LNZR-1, the 16S rRNA gene sequence of strain LNZR-1 was aligned with sequences of other members of the genus Staphylococcus retrieved from the EzTaxon database. Phylogenetic tree indicated the taxonomic status of strain LNZR-1 clearly classified into the same branch with species S. capitis subsp. urealyticus GTC 727 T (Figure 1).

Genomic features of S. capitis subsp. urealyticus LNZR-1
A total of 2,214,438 available reads were filtered from 5,619,015 raw reads. Quality control was performed with following criteria: reads that contained more than one N bases were removed. Reads that contained more than 50 bases with low quality (Q30) were removed. Reads with more than 10 bases with low quality (Q30) or N bases in the tail of the reads were trimmed. For sequences which lost their mated reads were considered as single reads and were not used in the downstream analysis (Additional file 1: Table S1). 177 initial contigs (best kmer length 57) was assembled by Velvet; then 90 prolonged contigs were assembled based on PAGIT flow (PAGIT is just a flow and it actually does not provide any practically available scripts or programs). The assembled genome of S. capitis subsp. urealyticus revealed a genome size of 2,595,865 bp and a G + C content of 32.67% (90 scaffolds). The largest contig consisted of 319,806 bp and the length of N50 contig was 66,677 bp. These scaffolds contain 2430 coding sequences (CDSs), 8 tRNAs (excluding 1 Pseudo tRNA) and 2 incomplete rRNA operons (1 small subunit rRNA and 1 large subunit rRNA). The properties and the statistics of the genome are summarized in Table 1. RAST server based annotation of the whole genome describes the subsystem distribution of strain LNZR-1 (Figure 2). Genes responsible for amino acids and derivatives (264 ORFs), carbohydrates (209 ORFs), and protein metabolism (175 ORFs) were abundant among the subsystem categories.

Antibiotic resistance profile
The in vitro antibiotic sensitivity tests demonstrated that this strain is susceptible to vancomycin and sulfamethoxazole, and resistant to tetracycline, gentamicin, ampicillin, methicillin, linezolid, clindamycin, cefoxitin, cefazolin, cefuroxime and ciprofloxacin. To gain insights into the genomic basis for the observed antibiotic resistance traits, the genome was searched for specific genes known to confer antibiotic resistance. The results revealed the presence of antibiotic resistance genes conferring resistance against some of the tested antibiotics as well as nontested antibiotics ( Table 2). MDR-type ABC transporters, multidrug and toxin extrusion (MATE) family efflux pumps and multidrug major facilitator superfamily (MFS) transporters were also detected in the genome. Furthermore, 10 putative MarR family transcriptional regulators were found in the genome, which are recognized as a widely conserved group of multiple antibiotic resistance regulators that respond to diverse antibiotics [22].

Comparative analysis with other S. capitis strains
Whole genome comparison of the LNZR-1 with S. capitis C87, S. capitis SK14, S. capitis CR01, S. capitis VCU116 and S. capitis QN1 showed that some functional regions are highly homologous between the six assemblies ( Figure 4). The LNZR-1 genome has high similarity with VCU116 and CR01, although some short stretches present in the genome of VCU116 and CR01 were absent in LNZR-1. Furthermore, LNZR-1, C87, CR01, VCU116 and QN1 revealed a large number of orthologous genes ( Figure 5). Venn diagram indicates the presence of a large Genes with transmembrane helices core-genome. These five S. capitis strains shared 2042 CDS in the genome. A particular overlap between C87 and QN1 became evident; these two chromosomes shared 121 orthologous CDS exclusively. The chromosome of LNZR-1 overlapped less with the C87 and QN1, which shared 5 and 4 exclusively orthologous CDS, respectively. In addition, 244 CDS from the LNZR-1 genome were classified as unique.

Future directions
In recent times, the alarming spread of antibiotic resistance has severely limited the treatment options for nosocomial infections. Currently, the antibiotics linezolid, vancomycin and daptomycin form an empirical therapy towards the control of serious infections caused by Staphylococci. However, the overuse of these antibiotics could have resulted in the emergence of multidrug resistant  bacteria. Despite untiring efforts directed at the analyses of intermediately resistant clinical isolates, the explicit mode of the development of resistance to these antimicrobials remains obscure. Further studies by using high-throughput mRNA sequencing (RNA-Seq) experiments to explore differential RNA expression levels under selective antibiotic pressures are warranted. Further studies are required to elucidate the resistance mechanisms, and information on these mechanisms could potentially aid in antibiotic development.

Availability of supporting data
This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession JGYJ00000000. The version described in this paper is version JGYJ01000000.  Submit your manuscript at www.biomedcentral.com/submit