Skip to main content

First complete genome sequence and comparative analysis of Salmonella enterica subsp. diarizonae serovar 61:k:1,5,(7) indicates host adaptation traits to sheep



The Salmonella enterica subsp. diarizonae serovar 61:k:1,5,(7) (SASd) has been found to be host-adapted to sheep, with a high prevalence in sheep herds worldwide. Infections are usually sub-clinical, however the serovar has the potential to cause diarrhea, abortions and chronic proliferative rhinitis. Although occurrence and significance of SASd infections in sheep have been extensively studied, the genetic mechanism underlying this unusual host-adaptation have remained unknown, due to a lack of (a) available high-quality genome sequence(s).


We utilized Nanopore and Illumina sequencing technologies to generate a de novo assembly of the 4.88-Mbp complete genome sequence of the SASd strain 16-SA00356, isolated from the organs of a deceased sheep in 2016. We annotated and analyzed the genome sequence with the aim to gain a deeper understanding of the genome characteristics associated with its pathogenicity and host adaptation to sheep. Overall, we found a number of interesting genomic features such as several prophage regions, a VirB4/D4 plasmid and novel genomic islands. By comparing the genome of 16-SA00356 to other S. enterica serovars we found that SASd features an increased number of pseudogenes as well as a high level of genomic rearrangements, both known indicators of host-adaptation.


With this sequence, we provide the first complete and closed genome sequence of a SASd strain. With this study, we provide an important basis for an understanding of the genetic mechanism that underlie pathogenicity and host adaptation of SASd to sheep.


Salmonella enterica subsp. diarizonae serovar 61:k:1,5,(7) (also designated as SASd) is a Gram-negative bacterium of the genus Salmonella. SASd is considered host-adapted to sheep, based on its wide distribution and high prevalence in sheep flocks worldwide [1,2,3,4,5,6,7]. SASd colonizes the intestines and tonsils of sheep and can be isolated from the faeces and nasal discharge of the animals [6]. Colonization might be chronic, with faecal shedding of the pathogen allowing transmission between individuals [8]. Although the serovar does not usually induce diseases [8,9,10], it has the potential to cause diarrhea [11], abortions [2] and chronic proliferative rhinitis [3, 6]. Over the last years, occurrence, distribution and impact of SASd infections in sheep have been extensively studied. However the potential genetic features underlying this unusual host-adaptation have remained unknown, due to a lack of available high-quality genome sequences. Here, we announce the first, complete and closed genome sequence of S. enterica subsp. diarizonae serovar 61:k:1,5,(7). Through genome analysis and a genome comparison study we identified numerous genetic features indicating host adaptation traits of SASd to sheep.


Strain isolation and characterization

Strain 16-SA00356 was isolated from an enriched pooled organ sample of an adult sheep that was found dead in Northern Germany in 2016. The sheep was postmortem diagnosed with liver cell necrosis, liver abscesses and serofibrinous peritonitis, most likely resulting from an infection with Fasciola hepatica. Detection of Salmonella spp. in the enriched culture was considered an incidental finding. Enrichment and isolation of Salmonella spp. was achieved by pooling small pieces of different organs (lung, liver, kidney, spleen and small intestine) in tetrathionate broth of Preuss, a selective medium for enrichment of Salmonella spp., followed by incubation for 12 h at 37 °C. After incubation, the broth was spread on three selective solid agar plates (Rambach agar, XLD agar, BSB agar), followed by another incubation cycle. Salmonella spp. colonies were confirmed through MALDI-TOF, before further subcultivation on Lysine Iron Agar (LIA) slants. The Salmonella isolate was serotyped by slide agglutination with the antigenic formula 61:k:1,5,(7). Antimicrobial susceptibility testing was performed by broth microdilution following CLSI guidelines (CLSI M07-A9) and EUCAST epidemiological cut-off values (ECOFFs; The isolate was found to be sensitive to all tested antibiotics (ampicillin, chloramphenicol, ciprofloxacin, colistin, cefotaxime, gentamicin, nalidixic acid, sulfamethoxazole, ceftazidime, tetracycline and trimethoprim).

Genome sequencing and de novo assembly

Genomic DNA for both sequencing techniques was isolated from an overnight liquid culture using the PureLink® Genomic DNA Mini Kit (Invitrogen, Carlsbad, CA, USA). Sequencing libraries for Illumina sequencing were prepared with the Nextera XT DNA Sample Preparation Kit (Illumina, San Diego, CA, USA) according to the manufacturer’s protocol. Sequencing was performed in 2 × 251 cycles on the Illumina MiSeq benchtop using the MiSeq Reagent v3 600-cycle Kit (Illumina). A total number of 1,433,866 reads was generated, with 86% of bases above a quality score of 30 (> Q30) and an overall coverage of 60×. The paired Illumina reads were trimmed using Trimmomatic v0.36 [12] with option sliding window 4:20 and minlen 50 yielding 1,295,014 read pairs. To generate long reads for scaffolding, Oxford Nanopore MinION technology (ONT) was applied. MinION libraries were prepared using the Rapid barcoding kit (Oxford Nanopore Technologies, Oxford, UK), following the manufacturer’s instructions, and sequenced for approximately 16 hours using a FLO-MIN106 R9 flow cell generating 82,147 reads. The genome was assembled with Unicycler v0.4.4 [13], including Pilon v1.23 [14], providing the trimmed Illumina reads as paired short reads and the ONT reads as long reads with default parameters.

Genome annotation

Antibiotic resistance genes were identified with ResFinder v3.1 [15]. Salmonella pathogenicity islands (SPI) were detected with SPIFinder v1.0 with default parameters [16] and by BLAST against known Salmonella pathogenicity islands. Prophage regions were identified with PHASTER [15]. Pseudogenes were determined with Pseudofinder v0.10 [17] with standard parameters and length 0.8. Genomic rearrangements were detected with progressive Mauve v2.4.0, with standard parameters [18].

Comparative genomic analysis

The genome of the SASd isolate 16-SA00356 was compared to a set of well annotated S. enterica serovars which were chosen to represent different host ranges. All serovars together with their NCBI accession numbers and information regarding the size of their genomes, number of ORFs and GC content, are listed in Additional file 1: Table S1. Plasmid sequences were excluded from the comparative analysis.

Phylogenetic analysis

Phylogeny was inferred through alignment free genome comparison with feature frequency profiles (FFP v3.19) and through comparison of 107 essential single-copy core genes following the bcgTree pipeline (v1.1.0). The bcgTree pipeline was applied with default parameters as as described by Ankenbrand and Keller [19]. FPP was performed with default parameters and l-mer length 24 as described by Wang and Ash [20].

Quality assurance

A single colony of 16-SA00356 was transferred to fresh LB medium to obtain a pure culture for genomic DNA extraction. After the genome sequence was obtained subspecies and serovar assignments were confirmed by the in silico typing tool SISTR v1.0.2 [21].

Results and discussion

General features

The genome of SASd isolate 16-SA00356 is composed of a circular chromosome of 4,832,672 bp (GC 51.49%) and a circular plasmid of 42,663 bp (GC 41.34%). A graphical representation of the annotated chromosome and plasmid is shown in Fig. 1. A total of 4687 CDSs, 80 tRNAs, 1 tmRNA and 22 rRNAs regions were predicted within the chromosome by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP).

Fig. 1

Genome (a) and plasmid (b) map of S. enterica subsp. diarizonae serovar 61:k:1,5,(7), isolate 16-SA00356, displayed in Circos. The tracks from inside to outside represent the Nanopore sequencing coverage, GC content, reverse-strand CDSs, forward-strand CDSs and labeled genetic regions of interest such as important operons, Salmonella pathogenicity islands (SPI), genomic islands (GI) and prophage regions

Antibiotic resistance and pathogenicity

Although the strain was susceptible to eleven tested antibiotics, two antibiotic resistance genes homologous to aac(6′)-Iaa (Accession: NC_003197), an aminoglycoside acetyltransferase, and mdfA (Accession: Y08743), a macrolide–lincosamide–streptogramin B (MLS) resistance gene were identified. We found that the strain possesses the major pathogenicity islands SPI-1, SPI-2, SPI-3, SPI-4, SPI-5, SPI-9, SPI-12, SPI-13, SPI-18 and GI-2 [22]. Furthermore, by comparing the sequence based similarity of the 16-SA00356 genome to five other S. enterica subsp. diarizonae serovars (see Additional file 2: Figure S1) we identified five novel genomic islands: GI-6 - GI-10. These novel genomic islands contain mainly proteins of unknown function and no major virulence genes could be attributed to them. In addition, we identified four incomplete prophage regions, with high similarity to the prophages Entero mEp237, Gifsy 1 and Gifsy 2. Overall, the resistance and virulence potential of SASd appears to be low. The absence of major antibiotic resistance genes can be attributed to the generally low antibiotic usage/low intensity farming practice of sheep.


SASd isolate16-SA00356 was found to carry an IncX1/ColRNAI type plasmid of 42,663 bp which we named pSE16-SA00356. We found pSE16-SA00356 to harbour an almost complete conjugative Type IV secretion system (missing virB7), which has been linked to persistent infections in numerous pathogens [23]. The plasmid furthermore carries the RelE/StbE toxin/antitoxin system and a small Haemolysin expression-modulating protein Hha, although the complementary tomB antitoxin gene was not detected in our analysis. The addiction module RelE/StbE probably increases the stability and therefore the persistence of the plasmid in the microbial population.

Phylogenetic analysis

Phylogeny of the different Salmonella species was inferred through alignment-free genome comparison with feature frequency profiles (FFP) and through comparison of 107 essential single-copy core genes with bcgTree. The resulting phylogenetic trees are shown in Fig. 2 and both indicate that 16-SA00356 clusters within the group of the S. enterica subsp. diarizonae serovars. Bootstrap values attribute greater certainty to the pyhlogenetic tree obtained through FFP.

Fig. 2

Phylogenetic analysis of Salmonella species with Salmonella bongori N268-08 (CP006608) as outgroup. a NJ tree based on the comparison of complete genome sequences with the alignment-free feature frequency profiles (FFP) method. Numbers at nodes designate bootstrap support values generated using 100 permutations. b Best-scoring maximum-likelihood tree based on the comparison of the amino acid sequences of 107 essential single-copy core genes with bcgTree. Numbers at nodes designate bootstrap support values resulting from 100 bootstrap replicates. Only bootstrapping values greater than 50 are displayed


Recent intracellular pathogens have a higher number of pseudogenes that results from the fact that adaptation to an intracellularly lifestyle causes bacteria to gradually loses genes no longer needed in their new environment. Nuccio and Bäumler [24] propose that Salmonella serovars could be divided into a group with a low number of pseudogenes and those with a high number of pseudogenes, with the later group referred as the extraintestinal pathovars. When comparing the percentage of pseudogenes normalized to the total number of ORFs among different Salmonella serovars we found SASd isolate16-SA00356 to possess a medium number of pseudogenes. An overview of the results of our analysis is shown in Table 1. Overall, host-adapted and host-restricted serovars such as S. enterica subsp. enterica serovar Choleraesuis (pigs), S. enterica subsp. enterica serovar Typhi (humans), S. enterica subsp. enterica serovars Gallinarum and Pullorum (birds) feature a higher percentage of pseudogenes (6.5–7.6%), than those reported to have a broad host range i.e. S. enterica subsp. enterica serovar Typhimurium (4.9%). Interestingly, the genome of the SASd isolate 16-SA00356 features a comparable number of pseudogenes (6.0%), to the cattle-adapted S. enterica subsp. enterica serovars Dublin (5.7%) and Kentucky (5.5%). Together with the fact that among the investigated S. enterica subsp. diarizonae serovars, SASd possesses the highest number of pseudogenes, these results further indicate a host-adaptation to sheep.

Table 1 Correlation between number of pseudogenes and host range of the respective organism

Genome rearrangements

Host restricted pathogens often exhibit genomic rearrangements [25] and by comparing the genome of SASd strain16-SA00356 to other S. enterica subsp. diarizonae serovars, we were able to detect large scale genome rearrangements with many inversions as shown in Additional file 3: Figure S2.


Overall, this study found a number of interesting genomic features linked to pathogenicity and host specificity of SASd to sheep. Among these, we detected increased pseudogene formation, large scale genomic rearrangements, a VirB4/D4 plasmid and novel genomic islands. The complete genome sequence generated in this study forms an important basis for further understanding of the pathogenicity and host adaptation of SASd, as well as a high-quality reference for future genome comparison studies.

Availability of data and materials

Nucleotide sequences were deposited in GenBank under the accession numbers CP034074 (chromosome) and CP034075 (plasmid). The datasets supporting the conclusions of this article are included within the article and its additional files.


  1. 1.

    Amagliani G, Petruzzelli A, Carloni E, Tonucci F, Foglini M, Micci E, et al. Presence of Escherichia coli O157, Salmonella spp., and Listeria monocytogenes in raw ovine milk destined for cheese production and evaluation of the equivalence between the analytical methods applied. Foodborne Pathog Dis. 2016;13:626–32.

    CAS  Article  Google Scholar 

  2. 2.

    Davies RH, Evans SJ, Chappell S, Kidd S, Jones YE, Preece BE. Increase in Salmonella enterica subspecies diarizonae serovar 61:k:l,5,(7) in sheep. Vet Rec. 2001;149:555–7.

    CAS  Article  Google Scholar 

  3. 3.

    Lacasta D, Ferrer LM, Ramos JJ, Bueso JP, Borobia M, Ruiz de Arcaute M, et al. Chronic proliferative rhinitis associated with Salmonella enterica subspecies diarizonae serovar 61:k:1,5,(7) in sheep in Spain. J Comp Pathol. 2012;147:406–9.

    CAS  Article  Google Scholar 

  4. 4.

    Methner U, Moog U. Occurrence and characterisation of Salmonella enterica subspecies diarizonae serovar 61:k:1,5,(7) in sheep in the federal state of Thuringia, Germany. BMC Vet Res. 2018;14:401.

    CAS  Article  Google Scholar 

  5. 5.

    Sörén K, Lindblad M, Jernberg C, Eriksson E, Melin L, Wahlström H, et al. Changes in the risk management of Salmonella enterica subspecies diarizonae serovar 61:(k):1,5,(7) in Swedish sheep herds and sheep meat due to the results of a prevalence study 2012. Acta Vet Scand. 2015;57:6.

    Article  Google Scholar 

  6. 6.

    Stokar-Regenscheit N, Overesch G, Giezendanner R, Roos S, Gurtner C. Salmonella enterica subspecies diarizonae serotype 61:k:1,5,(7) associated with chronic proliferative rhinitis and high nasal colonization rates in a flock of Texel sheep in Switzerland. Prev Vet Med. 2017;145:78–82.

    Article  Google Scholar 

  7. 7.

    Dargatz DA, Marshall KL, Fedorka-Cray PJ, Erdman MM, Kopral CA. Salmonella prevalence and antimicrobial susceptibility from the National Animal Health Monitoring System Sheep 2011 study. Foodborne Pathog Dis. 2015;12:953–7.

    Article  Google Scholar 

  8. 8.

    Lacasta D, Figueras L, Bueso JP, De las Heras M, Ramos JJ, Ferrer LM, et al. Experimental infection with Salmonella enterica subspecies diarizonae serotype 61:k:1,5,(7) in sheep: study of cell mediated immune response. Small Rumin Res. 2017;149:28–33.

    Article  Google Scholar 

  9. 9.

    Bonke R, Wacheck S, Bumann C, Thum C, Stüber E, König M, et al. High prevalence of Salmonella enterica subsp. diarizonae in tonsils of sheep at slaughter. Food Res Int. 2012;45:880–4.

    Article  Google Scholar 

  10. 10.

    Sandberg M, Alvseike O, Skjerve E. The prevalence and dynamics of Salmonella enterica IIIb 61:k:1,5,(7) in sheep flocks in Norway. Prev Vet Med. 2002;52:267–75.

    Article  Google Scholar 

  11. 11.

    Wray C, Wray A. Salmonella in domestic animals. Wallingford: CABI Pub; 2000.

    Google Scholar 

  12. 12.

    Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.

    CAS  Article  Google Scholar 

  13. 13.

    Wick RR, Judd LM, Gorrie CL, Holt KE. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol. 2017;13:e1005595.

    Article  Google Scholar 

  14. 14.

    Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9:e112963.

    Article  Google Scholar 

  15. 15.

    Arndt D, Grant JR, Marcu A, Sajed T, Pon A, Liang Y, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016;44:W16–21.

    CAS  Article  Google Scholar 

  16. 16.

    SPIFinder 1.0. Accessed 23 Apr 2019.

  17. 17.

    Pseudofinder. Accessed 15 May 2019.

  18. 18.

    Darling AE, Mau B, Perna NT. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS ONE. 2010;5:e11147.

    Article  Google Scholar 

  19. 19.

    Ankenbrand MJ, Keller A. bcgTree: automatized phylogenetic tree building from bacterial core genomes. Genome. 2016;59:783–91.

    CAS  Article  Google Scholar 

  20. 20.

    Wang A, Ash GJ. Whole genome phylogeny of Bacillus by feature frequency profiles (FFP). Sci Rep. 2015;5:13644.

    Article  Google Scholar 

  21. 21.

    Yoshida CE, Kruczkiewicz P, Laing CR, Lingohr EJ, Gannon VPJ, Nash JHE, et al. The Salmonella In Silico Typing Resource (SISTR): an open web-accessible tool for rapidly typing and subtyping draft Salmonella genome assemblies. PLoS ONE. 2016;11:e0147101.

    Article  Google Scholar 

  22. 22.

    Gerlach RG, Walter S, McClelland M, Schmidt C, Steglich M, Prager R, et al. Comparative whole genome analysis of three consecutive Salmonella diarizonae isolates. Int J Med Microbiol IJMM. 2017;307:542–51.

    CAS  Article  Google Scholar 

  23. 23.

    Voth DE, Broederdorf LJ, Graham JG. Bacterial Type IV secretion systems: versatile virulence machines. Fut Microbiol. 2012;7:241–57.

    CAS  Article  Google Scholar 

  24. 24.

    Nuccio S-P, Bäumler AJ. Comparative analysis of Salmonella genomes identifies a metabolic network for escalating growth in the inflamed gut. mBio. 2014;5:e00929-14.

    Article  Google Scholar 

  25. 25.

    Matthews TD, Schmieder R, Silva GGZ, Busch J, Cassman N, Dutilh BE, et al. Genomic comparison of the closely-related Salmonella enterica serovars Enteritidis, Dublin and Gallinarum. PLoS ONE. 2015;10:e0126883.

    Article  Google Scholar 

  26. 26.

    Hindermann D, Gopinath G, Chase H, Negrete F, Althaus D, Zurfluh K, et al. Salmonella enterica serovar Infantis from food and human infections, Switzerland, 2010–2015: poultry-related multidrug resistant clones and an emerging ESBL producing clonal lineage. Front Microbiol. 2017;8:1322.

    Article  Google Scholar 

  27. 27.

    Jacobsen A, Hendriksen RS, Aaresturp FM, Ussery DW, Friis C. The Salmonella enterica pan-genome. Microb Ecol. 2011;62:487–504.

    Article  Google Scholar 

  28. 28.

    Tanner JR, Kingsley RA. Evolution of Salmonella within hosts. Trends Microbiol. 2018;26:986–98.

    CAS  Article  Google Scholar 

  29. 29.

    Uzzau S, Brown DJ, Wallis T, Rubino S, Leori G, Bernard S, et al. Host adapted serotypes of Salmonella enterica. Epidemiol Infect. 2000;125:229–55.

    CAS  Article  Google Scholar 

  30. 30.

    Li Q, Hu Y, Chen J, Liu Z, Han J, Sun L, et al. Identification of Salmonella enterica serovar Pullorum antigenic determinants expressed in vivo. Infect Immun. 2013;81:3119–27.

    CAS  Article  Google Scholar 

Download references


We thank Ernst Junker for his excellent laboratory assistance.


This work was supported by the German Federal Institute for Risk Assessment (BfR) (Research Project Number 1322-716).

Author information




LU and BM designed the study. CJ and IS provided the isolate and information to the isolate and host organism. MB performed the sequencing. LU, CD and SHT conducted the bioinformatic analysis. BM supervised the project. LU wrote the manuscript and created the figures. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Burkhard Malorny.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

List of S. enterica serovars analyzed in this study.

Additional file 2: Figure S1.

Sequence based similarity of five S. enterica subsp. diarizonae serovars to S. enterica subsp. diarizonae serovar 61:k:1,5,(7), isolate 16-SA00356. The sequence similarity is shown by color-coded tracks which from inside to outside represent (i) S. enterica subsp. diarizonae SA20044251, (ii) S. enterica subsp. diarizonae NCTC10381, (iii) S. enterica subsp. diarizonae MZ0080 and (iv) S. enterica subsp. diarizonae HZS154 and (v) S. enterica subsp. diarizonae 11-01853. The location of genetic regions of interest such as Salmonella pathogenicity islands (SPI), genomic islands (GI) and prophage regions are indicated.

Additional file 3: Figure S2.

Mauve alignment of 16-SA00356, SA20044251, NCTC10381, MZ0080, HZS154 and 11-01853. Colored blocks indicate individual locally collinear blocks (LCB). Homologous LCBs are connected with lines. 16-SA00356 is set as the reference genome.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Uelze, L., Borowiak, M., Deneke, C. et al. First complete genome sequence and comparative analysis of Salmonella enterica subsp. diarizonae serovar 61:k:1,5,(7) indicates host adaptation traits to sheep. Gut Pathog 11, 48 (2019).

Download citation


  • Salmonella enterica subsp. diarizonae
  • Host-adaptation
  • Pseudogenes
  • Sheep