Articles

What is UniGene in bioinformatics?

June 28, 2019 by Rhyley Bryan

What is UniGene in bioinformatics?

UniGene is an experimental system for automatically partitioning GenBank sequences into a non-redundant set of gene-oriented clusters. Each UniGene cluster contains sequences that represent a unique gene, as well as related information such as the tissue types in which the gene has been expressed and map location.

What is Entrez in bioinformatics?

Entrez is a molecular biology database system that provides integrated access to nucleotide and protein sequence data, gene-centered and genomic mapping information, 3D structure data, PubMed MEDLINE, and more.

How many human clusters in UniGene?

In total, the human UniGene database contains 279 different tissues; the mouse database, 90.

What is NCBI stand for?

National Center for Biotechnology Information
U.S. National Library of Medicine. NCBI National Center for Biotechnology Information.

What is the difference between RefSeq and GenBank?

What is the difference between RefSeq and GenBank? GenBank sequence records are owned by the original submitter and cannot be altered by a third party. RefSeq sequences are not part of the INSDC but are derived from INSDC sequences to provide non-redundant curated data representing our current knowledge of known genes.

What is the goal of transcriptomics study?

The transcriptome is the complete set of transcripts in a specific type of cell or tissue. Generally, the goal of transcriptome analysis is to identify genes differentially expressed among different conditions, leading to a new understanding of the genes or pathways associated with the conditions.

What can a transcriptome tell us?

What can a transcriptome tell us? Consequently, by analyzing the entire collection of RNA sequences in a cell (the transcriptome) researchers can determine when and where each gene is turned on or off in the cells and tissues of an organism.

Which type of tool is Entrez?

Entrez is NCBI’s primary text search and retrieval system that integrates the PubMed database of biomedical literature with 38 other literature and molecular databases including DNA and protein sequence, structure, gene, genome, genetic variation and gene expression.

Why is NCBI useful?

The NCBI houses a series of databases relevant to biotechnology and biomedicine and is an important resource for bioinformatics tools and services. Major databases include GenBank for DNA sequences and PubMed, a bibliographic database for biomedical literature.

What does the term’unigene’mean in biology?

But unigene refers to cluster of genes that perform a particular function. Broadly we can tell, clusters ESTs and other mRNA sequences, along with coding sequences (CDSs) annotated on genomic DNA, into subsets of related sequences. You are correct; UniGene is a database and not a biological concept.

Is the UniGene database primarily a gene database?

UniGene is an NCBI database of the transcriptome and thus, despite the name, not primarily a database for genes. Each entry is a set of transcripts that appear to stem from the same transcription locus (i.e. gene or expressed pseudogene).

What is the purpose of the UniGene resource?

The UniGene resource, developed at NCBI, clusters ESTs and other mRNA sequences, along with coding sequences (CDSs) annotated on genomic DNA, into subsets of related sequences. In most cases, each cluster is made up of sequences produced by a single gene, including alternatively spliced transcripts.

Which is the longest sequence in a UniGene cluster?

UniGene clusters can also be accessed by GenBank accession numbers of sequences in the cluster. Unlike other gene indices (such as TIGR gene indices), no consensus sequences are available in UniGene. However, the longest sequence in a cluster is selected and separately documented (in file called Hs.seq.uniq.Z).