Domain-centric database to uncover structure of minimally characterized viral genomes

Resource Type
Authors: William Buchser; Aaron DiAntonio; John C. Bramley; Alex L. Yenkin; Jeffrey Milbrandt; Mark A. Zaydman
Source: Scientific Data, Vol 7, Iss 1, Pp 1-11 (2020)
Subject: Statistics and Probability
Data Descriptor
Computer science
Protein domain
Genome, Viral
Computational biology
Library and Information Sciences
Genome
Education
Domain (software engineering)
03 medical and health sciences
0302 clinical medicine
Protein Domains
Databases, Protein
Hidden Markov model
lcsh:Science
Gene
030304 developmental biology
Sequence (medicine)
Structure (mathematical logic)
0303 health sciences
Comparative genomics
Markov Chains
Computer Science Applications
Metadata
lcsh:Q
Statistics, Probability and Uncertainty
Genetic databases
030217 neurology & neurosurgery
Information Systems
Language: English
ISSN: 2052-4463

Abstract

Protein domain-based approaches to analyzing sequence data are valuable tools for examining and exploring genomic architecture across genomes of different organisms. Here, we present a complete dataset of domains from the publicly available sequence data of 9,051 reference viral genomes. The data provided contain information such as sequence position and neighboring domains from 30,947 pHMM-identified domains from each reference viral genome. Domains were identified from viral whole-genome sequence using automated profile Hidden Markov Models (pHMM). This study also describes the framework for constructing “domain neighborhoods”, as well as the dataset representing it. These data can be used to examine shared and differing domain architectures across viral genomes, to elucidate potential functional properties of genes, and potentially to classify viruses.
Measurement(s): Protein Domain • RNA viral genome • DNA viral genome • protein domain neighborhoods • protein domain cluster
Technology Type(s): digital curation • bioinformatics method • Cluster Analysis
Factor Type(s): Viral Genome
Sample Characteristic - Organism: Viruses
Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.12319631
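
The abstract mentions "domain neighborhoods" built from the sequence position and neighboring domains of each pHMM hit, but this record does not spell out the construction. The Python sketch below shows one plausible reading, under the assumption that a neighborhood is the up-to-k domains immediately upstream and downstream of a hit when the hits on a genome are ordered by start coordinate. The `DomainHit` type, the `domain_neighborhoods` function, the parameter `k`, and the genome and domain names are all hypothetical and are not taken from the published dataset.

```python
from collections import defaultdict
from typing import NamedTuple


class DomainHit(NamedTuple):
    genome: str  # genome identifier (hypothetical values used below)
    domain: str  # name of the pHMM-identified domain
    start: int   # start coordinate of the hit on the genome


def domain_neighborhoods(hits, k=1):
    """Map (genome, domain, start) to (upstream, downstream) domain lists.

    Assumption: a neighborhood is the up-to-k nearest domains on each
    side of a hit, with hits ordered by start position per genome.
    """
    by_genome = defaultdict(list)
    for hit in hits:
        by_genome[hit.genome].append(hit)

    neighborhoods = {}
    for genome, genome_hits in by_genome.items():
        ordered = sorted(genome_hits, key=lambda h: h.start)
        for i, hit in enumerate(ordered):
            upstream = [h.domain for h in ordered[max(0, i - k):i]]
            downstream = [h.domain for h in ordered[i + 1:i + 1 + k]]
            neighborhoods[(genome, hit.domain, hit.start)] = (upstream, downstream)
    return neighborhoods


# Made-up hits, for illustration only.
hits = [
    DomainHit("genome_A", "Helicase", 1200),
    DomainHit("genome_A", "Capsid", 300),
    DomainHit("genome_A", "RdRp", 2500),
    DomainHit("genome_B", "Capsid", 100),
]
result = domain_neighborhoods(hits, k=1)
```

Comparing the neighbor lists of the same domain across genomes is one simple way to look for shared or differing domain architectures of the kind the abstract describes.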
