Welcome to PhageScope

PhageScope is an online bacteriophage database, including 873,718 phage sequences with comprehensive annotations. PhageScope incorporates automatic analysis and interactive visualization.

Database

We performed an exhaustive search for phage sequences across multiple public repositories (such as RefSeq, GenBank, EMBL, and DDBJ) and published datasets (such as PhagesDB, GOV2, GVD, GPD, MGV, CHVD, STV, TemPhD, IGVD, and IMG/VR). The resulting dataset comprises 873,718 phage sequences, only a tiny fraction of which come with annotation information such as host taxonomy, lifestyle, and genetic features.

To provide comprehensive and accurate annotations for the collected phage sequences, we applied fifteen state-of-the-art tools covering completeness assessment, phenotype annotation (host and lifestyle), structural annotation (ORFs, proteins, and terminators), taxonomic annotation, functional annotation (tRNA & tmRNA, anti-CRISPR proteins, CRISPR arrays, virulence factors, antimicrobial resistance genes, and transmembrane proteins), and sequence comparison (genome clustering, sequence alignment, and comparative tree). The 873,718 phage sequences, along with their annotations, are available in PhageScope.

Analysis

We also provide annotation pipelines for users to analyze their own data. Users can upload one or more phage sequences in FASTA format and run either the complete workflow or selected annotation steps. The complete workflow includes completeness assessment, phenotype annotation, structural annotation, taxonomic annotation, and functional annotation, as described above. For multiple sequences, genome comparison pipelines are also provided, including sequence clustering, sequence alignment, and comparative tree construction. The PhageScope platform performs the analysis automatically and returns results that can be visualized and downloaded.
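
Before uploading, it can help to check that a file is well-formed FASTA and to see which steps apply to it. The following is a minimal local sketch, not part of PhageScope: the function names `parse_fasta` and `applicable_steps` are hypothetical, the step labels simply repeat the wording above, and the rule that comparison steps need more than one sequence is an assumption drawn from the description of multi-sequence analysis.

```python
from typing import Dict, Iterable, List

# Labels copied from the README text; they are not PhageScope identifiers.
ANNOTATION_STEPS = [
    "completeness assessment",
    "phenotype annotation",
    "structural annotation",
    "taxonomic annotation",
    "functional annotation",
]
COMPARISON_STEPS = [
    "sequence clustering",
    "sequence alignment",
    "comparative tree construction",
]

# IUPAC nucleotide codes (upper case); anything else is rejected.
VALID_BASES = set("ACGTUNRYKMSWBDHV")


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA lines into {record_id: sequence}, raising ValueError on bad input."""
    chunks: Dict[str, List[str]] = {}
    name = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # tolerate blank lines
        if line.startswith(">"):
            parts = line[1:].split()
            if not parts:
                raise ValueError("FASTA header with no identifier")
            name = parts[0]  # identifier is the first token of the header
            if name in chunks:
                raise ValueError(f"duplicate record id: {name}")
            chunks[name] = []
        else:
            if name is None:
                raise ValueError("sequence data before the first header")
            seq = line.upper()
            bad = set(seq) - VALID_BASES
            if bad:
                raise ValueError(f"invalid characters in {name}: {sorted(bad)}")
            chunks[name].append(seq)
    records = {n: "".join(parts) for n, parts in chunks.items()}
    empty = [n for n, s in records.items() if not s]
    if empty:
        raise ValueError(f"records without sequence: {empty}")
    return records


def applicable_steps(records: Dict[str, str]) -> List[str]:
    """Annotation steps always apply; comparison steps assume two or more sequences."""
    if not records:
        raise ValueError("no sequences found")
    steps = list(ANNOTATION_STEPS)
    if len(records) > 1:
        steps += COMPARISON_STEPS
    return steps
```

For a file on disk, `parse_fasta(open("my_phages.fasta"))` would return the records (the file name is a placeholder), and `applicable_steps` then lists the annotation steps plus, for multi-sequence input, the comparison steps.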

Visualization

PhageScope also supports interactive visualization of both the curated database and the results of user-submitted analyses. Specifically, it generates completeness and phenotype distribution charts, graphical annotations, multiple sequence alignment views, and comparative tree views. All visualizations can be downloaded in a high-quality, publication-ready format.