Bioinformatics
From Augix' Wiki
Database
Genome Browser
Length information for human chromosomes
http://genome.ucsc.edu/cgi-bin/hgTracks?hgsid=1230&chromInfoPage=
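Chromosome lengths are often handled as a plain two-column table (sequence name, length in bases). The sketch below parses such a table into a dictionary. It assumes a whitespace-separated "name length" layout like the chrom.sizes files UCSC distributes; the function name `parse_chrom_sizes` is hypothetical, and the sample names and lengths are placeholders, not real chromosome sizes.

```python
def parse_chrom_sizes(text):
    """Parse 'name<whitespace>length' lines into a {name: length} dict.

    Assumption: one sequence per line, name in column 1, length in column 2.
    Blank lines and lines starting with '#' are skipped.
    """
    sizes = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split()
        sizes[fields[0]] = int(fields[1])
    return sizes


# Placeholder input: made-up names and lengths, for illustration only.
example = "chrA\t1000\nchrB\t250\n"
sizes = parse_chrom_sizes(example)
total = sum(sizes.values())
print(sizes, total)
```

Keeping the lengths in a dictionary makes it easy to sum them or to check that a coordinate falls inside its chromosome.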
Ensembl
genebuild
Ensembl transcripts displayed on the Ensembl website are products of the Ensembl automatic pipeline, termed the Ensembl genebuild.
All Ensembl transcripts are based on experimental evidence, so the automated pipeline relies on the mRNA and protein sequences deposited in public databases by the scientific community.
Exonerate is a tool for pairwise sequence comparison, used in the Ensembl genebuild to align mRNA to the assembly.
Regulatory Build
Promoters are not available for most genes in Ensembl.
Information on genome-wide studies of regulatory regions can be found in the Ensembl Regulatory Build.
VEGA database
The Vertebrate Genome Annotation (VEGA) database is a central repository for high-quality manual annotation of finished vertebrate genome sequence.
Human, mouse and zebrafish are in the process of being completely annotated.
In other species, the annotation is only of specific genomic regions of particular biological interest.
Consensus CDS (CCDS) project
The CCDS project is a collaborative effort to identify a core set of protein coding regions that are consistently annotated and of high quality.
Initial results from the Consensus CDS (CCDS) project are now available through the appropriate Ensembl gene pages and from the CCDS project page at NCBI.
Bioinformatics tools
- BioMart
- Galaxy
- GenomeTools
The GenomeTools genome analysis system is a free collection of bioinformatics tools: http://genometools.org/index.html
- tallymer, a collection of flexible and memory-efficient programs for k-mer counting and indexing of large sequence sets.
- uniquesub, a program for computing minimum unique substrings.
- AnnotationSketch, a library for drawing genome annotations.
- MetaGenomeThreader, software for predicting genes, such as PCSs (predicted coding sequences), in sequences from metagenome projects.
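
To make the k-mer counting mentioned for tallymer concrete, here is a minimal Python sketch of the basic idea: slide a window of length k over each sequence and tally every overlapping substring. This is a naive in-memory illustration, not Tallymer's algorithm or interface, which is built on memory-efficient indexing; the function name `count_kmers` and the toy sequences are assumptions made for the example.

```python
from collections import Counter


def count_kmers(sequences, k):
    """Count every overlapping k-mer across an iterable of sequences.

    Naive approach: holds all distinct k-mers in memory, so it only
    suits small inputs.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()  # treat lower- and upper-case bases alike
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


# Toy sequences, for illustration only.
counts = count_kmers(["ACGTACGT", "TTACG"], k=3)
print(counts.most_common(3))
```

Sequences shorter than k contribute nothing, because the window range is empty.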

