Home >> Links
In this page:
Date:
|
External Link:
-
Nucleotide Sequence Databases核酸序列数据库
-
NCBI - National Center for Biotechnology
Information
-
EBI - European Bioinformatics Institute
-
DDBJ - DNA Data Bank of Japan
-
Protein Sequence Databases蛋白质序列数据库
-
SWISS-PROT & TrEMBL - Protein sequence
database and computer annotated supplement
-
UniProt - UniProt (Universal
Protein Resource) is the world's most
comprehensive catalog of information on
proteins. It is a central repository of
protein sequence and function created by
joining the information contained in
Swiss-Prot, TrEMBL, and PIR.
-
PIR - Protein Information Resource
-
MIPS - Munich Information centre for
Protein Sequences
-
HUPO - HUman Proteome Organization
-
Database Searching by Sequence Similarity序列相似性搜索
-
Sequence Alignment序列比对
-
USC Sequence Alignment Server - align 2
sequences with all possible varieties of
dynamic programming
-
T-COFFEE - multiple sequence alignment
-
ClustalW @ EBI - multiple sequence
alignment
-
MSA 2.1 - optimal multiple sequence
alignment using the Carrillo-Lipman method
-
BOXSHADE - pretty printing and shading
of multiple alignments
-
Splign - Splign is a utility for
computing cDNA-to-Genomic, or spliced
sequence alignments. At the heart of the
program is a global alignment algorithm that
specifically accounts for introns and splice
signals.
-
Spidey - an mRNA-to-genomic alignment
program
-
SIM4 - a program to align cDNA
and genomic DNA
-
Wise2 - align a protein or profile HMM
against genomic sequence to predict a gene
structure, and related tools
-
PipMaker - computes alignments of
similar regions in two (long) DNA sequences
-
VISTA - align + detect conserved regions
in long genomic sequences
-
myGodzilla - align a sequence to its
ortholog in the human genome
-
Human Genome Databases人类基因组数据库
[Top]
-
Next Generation Sequencing下一代测序
-
Journal Bioinformatics for Next Generation
Sequencing
- Integrated solutions
*
CLCbio Genomics Workbench - de novo
and reference assembly of Sanger, Roche FLX,
Illumina, Helicos, and SOLiD data.
Commercial next-gen-seq software that
extends the CLCbio Main Workbench software.
Includes SNP detection, CHiP-seq, browser
and other features. Commercial. Windows, Mac
OS X and Linux.
*
Galaxy - Galaxy = interactive and
reproducible genomics. A job webportal.
*
Genomatix - Integrated Solutions for
Next Generation Sequencing data analysis.
*
JMP Genomics - Next gen visualization
and statistics tool from SAS. They are
working with NCGR to refine this tool
and produce others.
*
NextGENe - de novo and reference
assembly of Illumina, SOLiD and Roche FLX
data. Uses a novel Condensation Assembly
Tool approach where reads are joined via
"anchors" into mini-contigs before assembly.
Includes SNP detection, CHiP-seq, browser
and other features. Commercial. Win or MacOS.
*
SeqMan Genome Analyser - Software for
Next Generation sequence assembly of
Illumina, Roche FLX and Sanger data
integrating with Lasergene Sequence Analysis
software for additional analysis and
visualization capabilities. Can use a hybrid
templated/de novo approach. Commercial. Win
or Mac OS X.
*
SHORE - SHORE, for Short Read, is a
mapping and analysis pipeline for short DNA
sequences produced on a Illumina Genome
Analyzer. A suite created by the 1001
Genomes project. Source for POSIX.
*
SlimSearch - Fledgling commercial
product.
Align/Assemble to a reference
*
BFAST - Blat-like Fast Accurate Search
Tool. Written by Nils Homer, Stanley F.
Nelson and Barry Merriman at UCLA.
*
Bowtie - Ultrafast, memory-efficient
short read aligner. It aligns short DNA
sequences (reads) to the human genome at a
rate of 25 million reads per hour on a
typical workstation with 2 gigabytes of
memory. Uses a Burrows-Wheeler-Transformed (BWT)
index.
Link to discussion thread here. Written
by Ben Langmead and Cole Trapnell. Linux,
Windows, and Mac OS X.
*
BWA - Heng Lee's BWT Alignment program -
a progression from Maq. BWA is a fast
light-weighted tool that aligns short
sequences to a sequence database, such as
the human reference genome. By default, BWA
finds an alignment within edit distance 2 to
the query sequence. C++ source.
*
ELAND - Efficient Large-Scale Alignment
of Nucleotide Databases. Whole genome
alignments to a reference genome. Written by
Illumina author Anthony J. Cox for the
Solexa 1G machine.
*
Exonerate - Various forms of pairwise
alignment (including Smith-Waterman-Gotoh)
of DNA/protein against a reference. Authors
are Guy St C Slater and Ewan Birney from
EMBL. C for POSIX.
*
GenomeMapper - GenomeMapper is a short
read mapping tool designed for accurate read
alignments. It quickly aligns millions of
reads either with ungapped or gapped
alignments. A tool created by the 1001
Genomes project. Source for POSIX.
*
GMAP - GMAP (Genomic Mapping and
Alignment Program) for mRNA and EST
Sequences. Developed by Thomas Wu and Colin
Watanabe at Genentec. C/Perl for Unix.
*
gnumap - The Genomic Next-generation
Universal MAPper (gnumap) is a program
designed to accurately map sequence data
obtained from next-generation sequencing
machines (specifically that of Solexa/Illumina)
back to a genome of any size. It seeks to
align reads from nonunique repeats using
statistics. From authors at Brigham Young
University. C source/Unix.
*
MAQ - Mapping and Assembly with
Qualities (renamed from MAPASS2).
Particularly designed for Illumina with
preliminary functions to handle ABI SOLiD
data. Written by Heng Li from the Sanger
Centre. Features extensive supporting tools
for DIP/SNP detection, etc. C++ source
*
MOSAIK - MOSAIK produces gapped
alignments using the Smith-Waterman
algorithm. Features a number of support
tools. Support for Roche FLX, Illumina,
SOLiD, and Helicos. Written by Michael
Strömberg at Boston College. Win/Linux/MacOSX
*
MrFAST and MrsFAST - mrFAST & mrsFAST
are designed to map short reads generated
with the Illumina platform to reference
genome assemblies; in a fast and
memory-efficient manner. Robust to INDELs
and MrsFAST has a bisulphite mode. Authors
are from the University of Washington. C as
source.
*
MUMmer - MUMmer is a modular system for
the rapid whole genome alignment of finished
or draft sequence. Released as a package
providing an efficient suffix tree library,
seed-and-extend alignment, SNP detection,
repeat detection, and visualization tools.
Version 3.0 was developed by Stefan Kurtz,
Adam Phillippy, Arthur L Delcher, Michael
Smoot, Martin Shumway, Corina Antonescu and
Steven L Salzberg - most of whom are at The
Institute for Genomic Research in Maryland,
USA. POSIX OS required.
*
Novocraft - Tools for reference
alignment of paired-end and single-end
Illumina reads. Uses a Needleman-Wunsch
algorithm. Can support Bis-Seq. Commercial.
Available free for evaluation, educational
use and for use on open not-for-profit
projects. Requires Linux or Mac OS X.
*
PASS - It supports Illumina, SOLiD and
Roche-FLX data formats and allows the user
to modulate very finely the sensitivity of
the alignments. Spaced seed intial filter,
then NW dynamic algorithm to a SW(like)
local alignment. Authors are from CRIBI in
Italy. Win/Linux.
*
RMAP - Assembles 20 - 64 bp Illumina
reads to a FASTA reference genome. By Andrew
D. Smith and Zhenyu Xuan at CSHL. (published
in BMC Bioinformatics). POSIX OS required.
*
SeqMap - Supports up to 5 or more bp
mismatches/INDELs. Highly tunable. Written
by Hui Jiang from the Wong lab at Stanford.
Builds available for most OS's.
*
SHRiMP - Assembles to a reference
sequence. Developed with Applied Biosystem's
colourspace genomic representation in mind.
Authors are Michael Brudno and Stephen
Rumble at the University of Toronto. POSIX.
*
Slider- An application for the
Illumina Sequence Analyzer output that uses
the probability files instead of the
sequence files as an input for alignment to
a reference sequence or a set of reference
sequences. Authors are from BCGSC. Paper is
here.
*
SOAP - SOAP (Short Oligonucleotide
Alignment Program). A program for efficient
gapped and ungapped alignment of short
oligonucleotides onto reference sequences.
The updated version uses a BWT. Can call
SNPs and INDELs. Author is Ruiqiang Li at
the Beijing Genomics Institute. C++, POSIX.
*
SSAHA - SSAHA (Sequence Search and
Alignment by Hashing Algorithm) is a tool
for rapidly finding near exact matches in
DNA or protein databases using a hash table.
Developed at the Sanger Centre by Zemin
Ning, Anthony Cox and James Mullikin. C++
for Linux/Alpha.
*
SOCS - Aligns SOLiD data. SOCS is built
on an iterative variation of the Rabin-Karp
string search algorithm, which uses hashing
to reduce the set of possible matches,
drastically increasing search speed. Authors
are Ondov B, Varadarajan A, Passalacqua KD
and Bergman NH.
*
SWIFT - The SWIFT suit is a software
collection for fast index-based sequence
comparison. It contains: SWIFT — fast local
alignment search, guaranteeing to find
epsilon-matches between two sequences. SWIFT
BALSAM — a very fast program to find
semiglobal non-gapped alignments based on
k-mer seeds. Authors are Kim Rasmussen
(SWIFT) and Wolfgang Gerlach (SWIFT BALSAM)
*
SXOligoSearch - SXOligoSearch is a
commercial platform offered by the Malaysian
based
Synamatix. Will align Illumina reads
against a range of Refseq RNA or NCBI genome
builds for a number of organisms. Web
Portal. OS independent.
*
Vmatch - A versatile software tool for
efficiently solving large scale sequence
matching tasks. Vmatch subsumes the software
tool REPuter, but is much more general, with
a very flexible user interface, and improved
space and time requirements. Essentially a
large string matching toolbox. POSIX.
*
Zoom - ZOOM (Zillions Of Oligos Mapped)
is designed to map millions of short reads,
emerged by next-generation sequencing
technology, back to the reference genomes,
and carry out post-analysis. ZOOM is
developed to be highly accurate, flexible,
and user-friendly with speed being a
critical priority. Commercial. Supports
Illumina and SOLiD data.
De novo Align/Assemble
*
ABySS - Assembly By Short Sequences.
ABySS is a de novo sequence assembler that
is designed for very short reads. The
single-processor version is useful for
assembling genomes up to 40-50 Mbases in
size. The parallel version is implemented
using MPI and is capable of assembling
larger genomes. By Simpson JT and others at
the Canada's Michael Smith Genome Sciences
Centre. C++ as source.
*
ALLPATHS - ALLPATHS: De novo assembly of
whole-genome shotgun microreads. ALLPATHS is
a whole genome shotgun assembler that can
generate high quality assemblies from short
reads. Assemblies are presented in a graph
form that retains ambiguities, such as those
arising from polymorphism, thereby providing
information that has been absent from
previous genome assemblies. Broad Institute.
*
Edena - Edena (Exact DE Novo Assembler)
is an assembler dedicated to process the
millions of very short reads produced by the
Illumina Genome Analyzer. Edena is based on
the traditional overlap layout paradigm. By
D. Hernandez, P. François, L. Farinelli, M.
Osteras, and J. Schrenzel. Linux/Win.
*
EULER-SR - Short read de novo
assembly. By Mark J. Chaisson and Pavel A.
Pevzner from UCSD (published in Genome
Research). Uses a de Bruijn graph approach.
*
MIRA2 - MIRA (Mimicking Intelligent Read
Assembly) is able to perform true hybrid
de-novo assemblies using reads gathered
through 454 sequencing technology (GS20 or
GS FLX). Compatible with 454, Solexa and
Sanger data. Linux OS required.
*
SEQAN - A Consistency-based Consensus
Algorithm for De Novo and Reference-guided
Sequence Assembly of Short Reads. By Tobias
Rausch and others. C++, Linux/Win.
*
SHARCGS - De novo assembly of short
reads. Authors are Dohm JC, Lottaz C,
Borodina T and Himmelbauer H. from the
Max-Planck-Institute for Molecular Genetics.
*
SSAKE - The Short Sequence Assembly by
K-mer search and 3' read Extension (SSAKE)
is a genomics application for aggressively
assembling millions of short nucleotide
sequences by progressively searching for
perfect 3'-most k-mers using a DNA prefix
tree. Authors are René Warren, Granger
Sutton, Steven Jones and Robert Holt from
the Canada's Michael Smith Genome Sciences
Centre. Perl/Linux.
*
SOAPdenovo - Part of the SOAP suite. See
above.
*
VCAKE - De novo assembly of short reads
with robust error correction. An improvement
on early versions of SSAKE.
*
Velvet - Velvet is a de novo genomic
assembler specially designed for short read
sequencing technologies, such as Solexa or
454. Need about 20-25X coverage and paired
reads. Developed by Daniel Zerbino and Ewan
Birney at the European Bioinformatics
Institute (EMBL-EBI).
SNP/Indel Discovery
*
ssahaSNP - ssahaSNP is a polymorphism
detection tool. It detects homozygous SNPs
and indels by aligning shotgun reads to the
finished genome sequence. Highly repetitive
elements are filtered out by ignoring those
kmer words with high occurrence numbers.
More tuned for ABI Sanger reads. Developers
are Adam Spargo and Zemin Ning from the
Sanger Centre. Compaq Alpha, Linux-64,
Linux-32, Solaris and Mac
*
PolyBayesShort - A re-incarnation of the
PolyBayes SNP discovery tool developed by
Gabor Marth at Washington University. This
version is specifically optimized for the
analysis of large numbers (millions) of
high-throughput next-generation sequencer
reads, aligned to whole chromosomes of model
organism or mammalian genomes. Developers at
Boston College. Linux-64 and Linux-32.
*
PyroBayes - PyroBayes is a novel base
caller for pyrosequences from the 454 Life
Sciences sequencing machines. It was
designed to assign more accurate base
quality estimates to the 454 pyrosequences.
Developers at Boston College.
Genome Annotation/Genome
Browser/Alignment Viewer/Assembly Database
*
EagleView - An information-rich genome
assembler viewer. EagleView can display a
dozen different types of information
including base quality and flowgram signal.
Developers at Boston College.
*
LookSeq - LookSeq is a web-based
application for alignment visualization,
browsing and analysis of genome sequence
data. LookSeq supports multiple sequencing
technologies, alignment sources, and viewing
modes; low or high-depth read pileups; and
easy visualization of putative single
nucleotide and structural variation. From
the Sanger Centre.
*
MapView - MapView: visualization of
short reads alignment on desktop computer.
From the Evolutionary Genomics Lab at
Sun-Yat Sen University, China. Linux.
*
SAM - Sequence Assembly Manager. Whole
Genome Assembly (WGA) Management and
Visualization Tool. It provides a generic
platform for manipulating, analyzing and
viewing WGA data, regardless of input type.
Developers are Rene Warren, Yaron
Butterfield, Asim Siddiqui and Steven Jones
at Canada's Michael Smith Genome Sciences
Centre. MySQL backend and Perl-CGI web-based
frontend/Linux.
*
STADEN - Includes GAP4. GAP5 once
completed will handle next-gen sequencing
data. A partially implemented test version
is available
here
*
XMatchView - A visual tool for analyzing
cross_match alignments. Developed by Rene
Warren and Steven Jones at Canada's Michael
Smith Genome Sciences Centre. Python/Win or
Linux.
Counting e.g. CHiP-Seq, Bis-Seq, CNV-Seq
*
BS-Seq - The source code and data for
the "Shotgun Bisulphite Sequencing of the
Arabidopsis Genome Reveals DNA Methylation
Patterning" Nature paper by
Cokus et al. (Steve Jacobsen's lab at
UCLA). POSIX.
*
CHiPSeq - Program used by Johnson et al.
(2007) in their Science publication
*
CNV-Seq - CNV-seq, a new method to
detect copy number variation using
high-throughput sequencing. Chao Xie and
Martti T Tammi at the National University of
Singapore. Perl/R.
*
FindPeaks - perform analysis of ChIP-Seq
experiments. It uses a naive algorithm for
identifying regions of high coverage, which
represent Chromatin Immunoprecipitation
enrichment of sequence fragments, indicating
the location of a bound protein of interest.
Original algorithm by Matthew Bainbridge, in
collaboration with Gordon Robertson. Current
code and implementation by Anthony Fejes.
Authors are from the Canada's Michael Smith
Genome Sciences Centre. JAVA/OS independent.
Latest versions available as part of the
Vancouver Short Read Analysis Package
*
MACS - Model-based Analysis for
ChIP-Seq. MACS empirically models the length
of the sequenced ChIP fragments, which tends
to be shorter than sonication or library
construction size estimates, and uses it to
improve the spatial resolution of predicted
binding sites. MACS also uses a dynamic
Poisson distribution to effectively capture
local biases in the genome sequence,
allowing for more sensitive and robust
prediction. Written by Yong Zhang and Tao
Liu from Xiaole Shirley Liu's Lab.
*
PeakSeq - PeakSeq: Systematic Scoring of
ChIP-Seq Experiments Relative to Controls. a
two-pass approach for scoring ChIP-Seq data
relative to controls. The first pass
identifies putative binding sites and
compensates for variation in the mappability
of sequences across the genome. The second
pass filters out sites that are not
significantly enriched compared to the
normalized input DNA and computes a precise
enrichment and significance. By Rozowsky J
et al. C/Perl.
*
QuEST - Quantitative Enrichment of
Sequence Tags. Sidow and Myers Labs at
Stanford. From the 2008 publication
Genome-wide analysis of transcription factor
binding sites based on ChIP-Seq data.
(C++)
*
SISSRs - Site Identification from Short
Sequence Reads. BED file input. Raja Jothi @
NIH. Perl.
**See also
this thread for ChIP-Seq, until I get
time to update this list.
Alternate Base Calling
*
Rolexa - R-based framework for base
calling of Solexa data. Project
publication
*
Alta-cyclic - "a novel Illumina
Genome-Analyzer (Solexa) base caller"
Transcriptomics
*
ERANGE - Mapping and Quantifying
Mammalian Transcriptomes by RNA-Seq.
Supports Bowtie, BLAT and ELAND. From the
Wold lab.
*
G-Mo.R-Se - G-Mo.R-Se is a method aimed
at using RNA-Seq short reads to build de
novo gene models. First, candidate exons are
built directly from the positions of the
reads mapped on the genome (without any ab
initio assembly of the reads), and all the
possible splice junctions between those
exons are tested against unmapped reads.
From CNS in France.
*
MapNext - MapNext: A software tool for
spliced and unspliced alignments and SNP
detection of short sequence reads. From the
Evolutionary Genomics Lab at Sun-Yat Sen
University, China.
*
QPalma - Optimal Spliced Alignments of
Short Sequence Reads. Authors are Fabio De
Bona, Stephan Ossowski, Korbinian
Schneeberger, and Gunnar Rätsch. A paper is
available.
*
RSAT - RSAT: RNA-Seq Analysis Tools.
RNASAT is developed and maintained by Hui
Jiang at Stanford University.
*
TopHat - TopHat is a fast splice
junction mapper for RNA-Seq reads. It aligns
RNA-Seq reads to mammalian-sized genomes
using the ultra high-throughput short read
aligner Bowtie, and then analyzes the
mapping results to identify splice junctions
between exons. TopHat is a collaborative
effort between the University of Maryland
and the University of California, Berkeley
[Top]
- Rice databases
BGI-RISe: Beijing Genomics Institute Rice Information System
RGP: Rice Genome Research Program
IRIS: International Rice Information System
MOsDB: MIPS Oryza sativa database
Oryzabase: Rice genetics and genomics
Oryza Tag Line database: T-DNA insertion mutants of rice
[Top]
- Plant Promoter and Regulatory Element Resources
AGRIS Currently contains two databases, AtcisDB (Arabidopsis thaliana cis-regulatory database) and AtTFDB (Arabidopsis thaliana transcriptionfactor database).
AthaMap A genome-wide map of putative transcription factor binding sites in Arabidopsis thaliana.
AtProbe The Arabidopsis thaliana promoter binding element database, an aid to find binding elements and check data against the primary literature.
DATF: Database of Arabidopsis Transcription Factors (DATF) contains known and predicted Arabidopsis transcription factors with sequences and many other features including 3D structure templates, EST expression information, transcription factor binding sites and Nuclear Location Signals.
DoOP: Databases of Orthologous Promoters, a database containing orthologous clusters of promoters from Homo sapiens, Arabidopsis thaliana and other organisms.
GRASSIUS A public web resource composed by a collection of databases, computational and experimental resources that relate to the control of gene expression in the grasses, and their relationship with agronomic traits. GRASSIUS currently contains regulatory information on maize, rice, sorghum and sugarcane.
PlantCare Database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences.
PlantProm DB Database with annotated, non-redundant collection of proximal promoter sequences for RNA polymerase II with experimentally determined transcription start sites (TSS) from various plant species.
PlantTFDB: Plant Transcription Factor Database, an integrative plant transcription factor database that provides a web interface to access large (close to complete) sets of transcription factors of several plant species, currently encompassing Arabidopsis thaliana (thale cress), Populus trichocarpa (poplar), Oryza sativa(rice), Chlamydomonas reinhardtii and Ostreococcus tauri.
PPDB (Plant Promoter DB) Database that provides transcription start sites (TSS) and other structural information for Arabidopsis and rice promoters.
Transfac Database on eukaryotic transcription factors, their genomic binding sites and DNA-binding profiles. Commercial site.
[Top]
-
Databases of other Organisms其他物种数据库
[Top]
-
Genome-wide Analysis基因组分析
-
MBGD - comparative analysis of
completely sequenced microbial genomes
-
COGs - phylogenetic classification of
orthologous proteins from complete genomes
-
STRING - detect whether a given query
gene occurs repeatedly with certain other
genes in potential operons
-
Pedant - automatic whole genome
annotation
-
GeneCensus - various whole genome
comparisons
[Top]
-
Protein Domains: Databases and Search Tools蛋白质域
-
InterPro - integration of Pfam, PRINTS,
PROSITE, SWISS-PROT + TrEMBL
-
PROSITE - database of protein families
and domains
-
Pfam - alignments and hidden Markov
models covering many common protein domains
-
SMART - analysis of domains in proteins
-
ProDom - protein domain database
-
PRINTS Database - groups of conserved
motifs used to characterise protein families
-
Blocks - multiply aligned ungapped
segments corresponding to the most highly
conserved regions of proteins
-
Protein Domain Profile Analysis @ BMERC
- search a library of profiles with a
protein sequence
-
TIGRFAMs - yet more protein families
based on Hidden Markov Models
[Top]
-
Motif and Pattern Search in Sequences序列模体和样式搜索
-
Gibbs Motif Sampler - identification of
conserved motifs in DNA or protein sequences
-
AlignACE Homepage - gene regulatory
motif finding
-
MEME - motif discovery and search in
protein and DNA sequences
-
SAM - tools for creating and using
Hidden Markov Models
-
Pratt - discover patterns in unaligned
protein sequences
-
Motivated Proteins - a web facility for
exploring small hydrogen-bonded motifs
-
[Top]
-
Protein 3D Structure蛋白质三维结构
-
PDB - protein 3D structure database
-
RasMol / Protein Explorer - molecule 3D
structure viewers
-
UCL BSM CATH classification
-
The DALI Domain Database
-
FSSP - fold classification based on
structure-structure alignment of proteins
-
SWISS-MODEL - homology modeling server
-
Structure Prediction Meta-server
-
K2 - protein structure alignment
-
DALI - 3D structure alignment server
-
DSSP - defines secondary structure and
solvent exposure from 3D coordinates
-
HSSP Database - Homology-derived
Secondary Structure of Proteins
-
PredictProtein & PHD - predict secondary
structure, solvent accessibility,
transmembrane helices, and other stuff
-
Jpred2 - protein secondary structure
prediction
-
PSIpred (& MEMSAT & GenTHREADER) -
protein secondary structure prediction (&
transmembrane helix prediction & tertiary
structure prediction by threading)
- Structural analysis
Pfam: A collection of protein families
SMART: Simple Modular Architecture Research Tool
SCOP: Structural Classification of Proteins
MEME: Motif-based sequence analysis tools
Jpred3: A Secondary Structure Prediction Server
PSIPRED: A highly accurate method for protein secondary structure prediction
DiANNA: Cysteine state and Disulfide Bond partner prediction
Robetta: Full-chain protein structure prediction server
PhyML: Phylogenetic Tree by Maximum Likelihood
[Top]
-
Phylogeny & Taxonomy进化与分类
[Top]
-
Gene Prediction基因预测
[Top]
-
Gene Expression Databases基因表达数据库
[Top]
-
Gene Regulation基因调控
-
TRAFAC - For identifying
conserved and shared cis regulatory elements
between a pair of genes.
-
CisMols - For identifying
conserved and shared cis regulatory elements
between a set of co-expressed genes.
-
TRANSFAC - database of eukaryotic cis-acting
regulatory DNA elements and trans-acting
factors
-
EPD - eukaryotic promoter database
-
DBTSS - DataBase of Transcriptional
Start Sites (human)
-
SCPD - Saccharomyces cerevisiae promoter
database
-
DCPD - Drosophila Core Promoter Database
-
RegulonDB - a database on
transcriptional regulation in E. coli
-
DPInteract - protein binding sites on E.
coli DNA
-
PromoterInspector - prediction of
promoter regions in mammalian genomic
sequences
-
MatInspector - search for transcription
factor binding sites
-
Cister - cis-element cluster finder
-
Gene regulatory Tools
[Top]
- Small RNA/MicroRNA小分子RNA
[Top]
-
Metabolic, Gene Regulatory & Signal
Transduction Network Databases代谢、基因调控和信号转导网络数据库
[Top]
-
Systems Biology系统生物学
-
Other Databases (Annotations, Ontologies,
Consortia, etc.)其他数据库(注释、本体、协会)
-
Entrez Gene - Gene provides a
unified query environment for genes defined
by sequence and/or in NCBI's Map Viewer. You
can query on names, symbols, accessions,
publications, GO terms, chromosome numbers,
E.C. numbers, and many other attributes
associated with genes and the products they
encode. Replaces LocusLink.
-
Cancer Genome Anatomy Project
-
HUGO's Human Gene Nomenclature
-
Gene Ontology Consortium - a controlled
vocabulary of eukaryotic gene roles
-
Open Biological Ontologies an umbrella
web address for well-structured controlled
vocabularies for shared use across different
biological domains.
-
ACUTS - compilation of Ancient Conserved
UnTranslated Sequences
-
UTR database
-
ENZYME - enzyme nomenclature database
-
BRENDA - enzyme database
-
TC-DB - comprehensive classification of
membrane transport proteins
-
The SNP Consortium
-
HGBASE - database of sequence variations
in the human genome
-
MethDB - DNA methylation database
-
SpliceDB - canonical and non-canonical
splice site sequences in mammalian genes
-
SpliceOme - database of intron-exon
boundaries
-
InBase - intein database
-
The I.M.A.G.E. Consortium
-
The Kabat Database of Sequences of Proteins
of Immunological Interest
-
Nelson Lab: Cytochrome C
-
REBASE - restriction enzyme database
-
Chemfinder.com - molecule database
-
Genomics Institute of the Novartis Research
Foundation
-
Mouse SNPs Database- 670,000+ SNP
records, 8.0+ million allele calls. Allele
tables are provided by investigators or
retrieved from public sources. All SNPs are
mapped to NCBI Mouse Genome build 33
(C57BL/6J assembly). Most are linked to NCBI
dbSNP build 123.
-
MetaBase is a user contributed database
of databases, listing all the biological
databases currently available on the
internet.
- Bio-computing.org Bioinformatics,
Databases and Software for Medicine.
[Top]
-
Miscellaneous Tools其他工具
[Top]
-
Computational Resources计算资源
[Top]
-
Bioinformatics on-line course materials and
tutorials (not an exhaustive collection)在线生物信息学课程教材
Intro to bioinformatics and computational
biology:
Algorithms:
[Top]
Miscellaneous:
[Top]
-
Web Sites for Background Information & News背景信息与新闻
[Top]
-
Other Collections of Bioinformatics Resources其他资源
Up to Top
|