INSaFLU-TELE-Vir: an open web-based bioinformatics suite for influenza and SARS-CoV-2 genome-based surveillance

2021
A new era of virus surveillance is emerging based on the real-time monitoring of virus evolution at whole-genome scale (World Health Organization 2021). Although national and international health authorities have strongly recommended this technological transition, especially for influenza and SARS-CoV-2 (World Health Organization 2021, Revez et al. 2017), the implementation of genomic surveillance can be particularly challenging due to the lack of bioinformatics infrastructures and/or expertise to process and interpret next-generation sequencing (NGS) data (Oakeson et al. 2017).We developed and implemented INSaFLU-TELE-Vir platform (https://insaflu.insa.pt/) (Borges et al. 2018), which is an influenza- and SARS-CoV-2-oriented bioinformatics free web-based suite that handles primary NGS data (reads) towards the automatic generation of the main “genetic requests'' for effective and timely laboratory surveillance. By handling NGS data collected from any amplicon-based schema (making it applicable for other pathogens), INSaFLU-TELE-Vir enables any laboratory to perform multi-step and intensive bioinformatics analyses in a user-oriented manner without requiring advanced training.INSaFLU-TELE-Vir handles NGS data collected from distinct sequencing technologies (Illumina, Ion Torrent and Oxford Nanopore Technologies), with the possibility of constructing comparative analyses using different technologies. It gives access to user-restricted sample databases and project management, being a transparent and flexible tool specifically designed to automatically update project outputs as more samples are uploaded. Data integration is thus cumulative and scalable, fitting the need for both routine surveillance and outbreak investigation activities.The bioinformatics pipeline consists of six core steps:read quality analysis and improvement,human betacoronaviruses (including SARS-CoV-2 Pango lineages) and influenza type/subtype classification,mutation detection and consensus generation,coverage analysis,alignment/phylogeny,intra-host minor variant detection (and automatic detection of putative mixed infections).The multiple outputs are provided in nomenclature-stable and standardized formats that can be visualized and explored in situ or through multiple compatible downstream applications for fine-tuned data analysis.Novel features are being implemented into the INSaFLU-TELE-Vir bioinformatics toolkit as part of the OHEJP TELE-Vir (https://onehealthejp.eu/jrp-tele-vir/) project, including rapid detection of selected genotype-phenotype associations, and enhanced geotemporal data visualization.All the code is available in github (https://github.com/INSaFLU) with the possibility of a local docker installation (https://github.com/INSaFLU/docker). A detailed documentation and tutorial is also available (https://insaflu.readthedocs.io/en/latest/).In summary, INSaFLU supplies public health laboratories and researchers with an open and user-friendly framework, potentiating a strengthened and timely multi-country genome-based virus surveillance.
    • Correction
    • Source
    • Cite
    • Save
    3
    References
    0
    Citations
    NaN
    KQI
    []
    Baidu
    map