- Current Issue
Volume 10, Issue 5, 2024
- Research Articles
-
- Genomic Methodologies
-
-
Hybracter: enabling scalable, automated, complete and accurate bacterial genome assemblies
Improvements in the accuracy and availability of long-read sequencing mean that complete bacterial genomes are now routinely reconstructed using hybrid (i.e. short- and long-reads) assembly approaches. Complete genomes allow a deeper understanding of bacterial evolution and genomic variation beyond single nucleotide variants. They are also crucial for identifying plasmids, which often carry medically significant antimicrobial resistance genes. However, small plasmids are often missed or misassembled by long-read assembly algorithms. Here, we present Hybracter which allows for the fast, automatic and scalable recovery of near-perfect complete bacterial genomes using a long-read first assembly approach. Hybracter can be run either as a hybrid assembler or as a long-read only assembler. We compared Hybracter to existing automated hybrid and long-read only assembly tools using a diverse panel of samples of varying levels of long-read accuracy with manually curated ground truth reference genomes. We demonstrate that Hybracter as a hybrid assembler is more accurate and faster than the existing gold standard automated hybrid assembler Unicycler. We also show that Hybracter with long-reads only is the most accurate long-read only assembler and is comparable to hybrid methods in accurately recovering small plasmids.
-
- Pathogens and Epidemiology
-
-
Discovery and genetic characterization of novel paramyxoviruses from small mammals in Hubei Province, Central China
Paramyxoviruses are a group of single-stranded, negative-sense RNA viruses, some of which are responsible for acute human disease, including parainfluenza virus, measles virus, Nipah virus and Hendra virus. In recent years, a large number of novel paramyxoviruses, particularly members of the genus Jeilongvirus, have been discovered in wild mammals, suggesting that the diversity of paramyxoviruses may be underestimated. Here we used hemi-nested reverse transcription PCR to obtain 190 paramyxovirus sequences from 969 small mammals in Hubei Province, Central China. These newly identified paramyxoviruses were classified into four clades: genera Jeilongvirus, Morbillivirus, Henipavirus and Narmovirus, with most of them belonging to the genus Jeilongvirus. Using Illumina sequencing and Sanger sequencing, we successfully recovered six near-full-length genomes with different genomic organizations, revealing the more complex genome content of paramyxoviruses. Co-divergence analysis of jeilongviruses and their known hosts indicates that host-switching occurred more frequently in the evolutionary histories of the genus Jeilongvirus. Together, our findings demonstrate the high prevalence of paramyxoviruses in small mammals, especially jeilongviruses, and highlight the diversity of paramyxoviruses and their genome content, as well as the evolution of jeilongviruses.
-
-
-
Defining the phylogenetics and resistome of the major Clostridioides difficile ribotypes circulating in Australia
Keeley O’Grady, Stacey Hong, Papanin Putsathit, Narelle George, Christine Hemphill, Peter G. Huntington, Tony M. Korman, Despina Kotsanas, Monica Lahra, Rodney McDougall, Andrew McGlinchey, Avram Levy, Casey V. Moore, Graeme Nimmo, Louise Prendergast, Jennifer Robson, David J. Speers, Lynette Waring, Michael C. Wehrhahn, Gerhard F. Weldhagen, Richard M. Wilson, Thomas V. Riley and Daniel R. KnightClostridioides difficile infection (CDI) remains a significant public health threat globally. New interventions to treat CDI rely on an understanding of the evolution and epidemiology of circulating strains. Here we provide longitudinal genomic data on strain diversity, transmission dynamics and antimicrobial resistance (AMR) of C. difficile ribotypes (RTs) 014/020 (n=169), 002 (n=77) and 056 (n=36), the three most prominent C. difficile strains causing CDI in Australia. Genome scrutiny showed that AMR was uncommon in these lineages, with resistance-conferring alleles present in only 15/169 RT014/020 strains (8.9 %), 1/36 RT056 strains (2.78 %) and none of 77 RT002 strains. Notably, ~90 % of strains were resistant to MLSB agents in vitro, but only ~5.9 % harboured known resistance alleles, highlighting an incongruence between AMR genotype and phenotype. Core genome analyses revealed all three RTs contained genetically heterogeneous strain populations with limited evidence of clonal transmission between CDI cases. The average number of pairwise core genome SNP (cgSNP) differences within each RT group ranged from 23.3 (RT056, ST34, n=36) to 115.6 (RT002, ST8, n=77) and 315.9 (RT014/020, STs 2, 13, 14, 49, n=169). Just 19 clonal groups (encompassing 40 isolates), defined as isolates differing by ≤2 cgSNPs, were identified across all three RTs (RT014/020, n=14; RT002, n=3; RT056, n=2). Of these clonal groups, 63 % (12/19) comprised isolates from the same Australian State and 37 % (7/19) comprised isolates from different States. The low number of plausible transmission events found for these major RTs (and previously documented populations in animal and environmental sources/reservoirs) points to widespread and persistent community sources of diverse C. difficile strains as opposed to ongoing nationwide healthcare outbreaks dominated by a single clone. Together, these data provide new insights into the evolution of major lineages causing CDI in Australia and highlight the urgent need for enhanced surveillance, and for public health interventions to move beyond the healthcare setting and into a One Health paradigm to effectively combat this complex pathogen.
-
-
-
Deep population structure linked to host vernalization requirement in the barley net blotch fungal pathogen
Invasive fungal pathogens pose a substantial threat to widely cultivated crop species, owing to their capacity to adapt to new hosts and new environmental conditions. Gaining insights into the demographic history of these pathogens and unravelling the mechanisms driving coevolutionary processes are crucial for developing durably effective disease management programmes. Pyrenophora teres is a significant fungal pathogen of barley, consisting of two lineages, Ptt and Ptm, with global distributions and demographic histories reflecting barley domestication and spread. However, the factors influencing the population structure of P. teres remain poorly understood, despite the varietal and environmental heterogeneity of barley agrosystems. Here, we report on the population genomic structure of P. teres in France and globally. We used genotyping-by-sequencing to show that Ptt and Ptm can coexist in the same area in France, with Ptt predominating. Furthermore, we showed that differences in the vernalization requirement of barley varieties were associated with population differentiation within Ptt in France and at a global scale, with one population cluster found on spring barley and another population cluster found on winter barley. Our results demonstrate how cultivation conditions, possibly associated with genetic differences between host populations, can be associated with the maintenance of divergent invasive pathogen populations coexisting over large geographic areas. This study not only advances our understanding of the coevolutionary dynamics of the Pt-barley pathosystem but also prompts further research on the relative contributions of adaptation to the host versus adaptation to abiotic conditions in shaping Ptt populations.
-
-
-
Presence of phage-plasmids in multiple serovars of Salmonella enterica
Evidence is accumulating in the literature that the horizontal spread of antimicrobial resistance (AMR) genes mediated by bacteriophages and bacteriophage-like plasmid (phage-plasmid) elements is much more common than previously envisioned. For instance, we recently identified and characterized a circular P1-like phage-plasmid harbouring a bla CTX-M-15 gene conferring extended-spectrum beta-lactamase (ESBL) resistance in Salmonella enterica serovar Typhi. As the prevalence and epidemiological relevance of such mechanisms has never been systematically assessed in Enterobacterales, in this study we carried out a follow-up retrospective analysis of UK Salmonella isolates previously sequenced as part of routine surveillance protocols between 2016 and 2021. Using a high-throughput bioinformatics pipeline we screened 47 784 isolates for the presence of the P1 lytic replication gene repL, identifying 226 positive isolates from 25 serovars and demonstrating that phage-plasmid elements are more frequent than previously thought. The affinity for phage-plasmids appears highly serovar-dependent, with several serovars being more likely hosts than others; most of the positive isolates (170/226) belonged to S. Typhimurium ST34 and ST19. The phage-plasmids ranged between 85.8 and 98.2 kb in size, with an average length of 92.1 kb; detailed analysis indicated a high amount of diversity in gene content and genomic architecture. In total, 132 phage-plasmids had the p0111 plasmid replication type, and 94 the IncY type; phylogenetic analysis indicated that both horizontal and vertical gene transmission mechanisms are likely to be involved in phage-plasmid propagation. Finally, phage-plasmids were present in isolates that were resistant and non-resistant to antimicrobials. In addition to providing a first comprehensive view of the presence of phage-plasmids in Salmonella, our work highlights the need for a better surveillance and understanding of phage-plasmids as AMR carriers, especially through their characterization with long-read sequencing.
-
- Short Communications
-
- Genomic Methodologies
-
-
Evaluation of the accuracy of bacterial genome reconstruction with Oxford Nanopore R10.4.1 long-read-only sequencing
Whole-genome reconstruction of bacterial pathogens has become an important tool for tracking transmission and antimicrobial resistance gene spread, but highly accurate and complete assemblies have largely only historically been achievable using hybrid long- and short-read sequencing. We previously found the Oxford Nanopore Technologies (ONT) R10.4/kit12 flowcell/chemistry produced improved assemblies over the R9.4.1/kit10 combination, however long-read only assemblies contained more errors compared to Illumina-ONT hybrid assemblies. ONT have since released an R10.4.1/kit14 flowcell/chemistry upgrade and recommended the use of Bovine Serum Albumin (BSA) during library preparation, both of which reportedly increase accuracy and yield. They have also released updated basecallers trained using native bacterial DNA containing methylation sites intended to fix systematic basecalling errors, including common adenosine (A) to guanine (G) and cytosine (C) to thymine (T) substitutions. To evaluate these improvements, we successfully sequenced four bacterial reference strains, namely Escherichia coli, Klebsiella pneumoniae, Pseudomonas aeruginosa and Staphylococcus aureus, and nine genetically diverse E. coli bloodstream infection-associated isolates from different phylogroups and sequence types, both with and without BSA. These sequences were de novo assembled and compared against Illumina-corrected reference genomes. In this small evaluation of 13 isolates we found that nanopore long-read-only R10.4.1/kit 14 assemblies with updated basecallers trained using bacterial methylated DNA produce accurate assemblies with ≥40×depth, sufficient to be cost-effective compared with hybrid ONT/Illumina sequencing in our setting.
-