A Brief Tour of PlantGDB!

Click on a topic to view a brief description and site navigation links and tips.

PlantGDB is a genomics database encompassing green plant (Viridiplantae) sequence data.

  • Users can search, BLAST, or download nucleotide or protein sequences from over 70,000 species as well as access custom transcript assemblies for over 200 species.
  • Sixteen plant genome browsers (xGDB) are available, with a focus on accurate spliced alignment of transcripts and proteins.
  • PlantGDB provides web tools for sequence analysis and a unique bioinformatics workflow environment (BioExtract).
  • PlantGDB also hosts a plant genomics research outreach portal (PGROP) that facilitates access to a large number of resources for research and training.
  • PlantGDB downloads all plant sequences from GenBank approximately every 4 months, parses them by species, and makes them available for download, search, and BLAST analysis. Advanced sequence search capabilities are provided using TableMaker (see below).
    • To get there: Sequence > Public Plant Seq > Species Search or Keyword/ID Search
  • PlantGDB also provides custom transcript assemblies (PUTs, or PlantGDB-derived Unique Transcripts) for all plant species with >10,000 ESTs, or by special request. Users can download PUT datasets, perform batch BLAST, or do keyword searches based on GO annotations and top UniProt BLAST hits.
    • To get there: Sequence > EST Assembly
  • Genome Survey Sequence (GSS) assemblies are provided for maize and sorghum.
    • To get there: Sequence > GSS Assembly
  • Sequence Overview
  • PlantGDB provides genome browsers for sixteen plant species (chromosome, scaffold, or BAC-based). Each genome assembly is splice-aligned to transcripts as well as proteins from similar species and presented in a simple graphical interface (the xGDB platform). An important feature of xGDB is the ability to view spliced alignment data for all aligned sequences.
  • Powerful search tools are provided for finding sequences and retrieving sequence data adjacent to coding regions, as well as BLAST and GeneSeqer. DAS (Distributed Annotation Service) is available for both viewing (DAS client) and exporting (DAS server) genome-aligned sequence. Finally, each genome browser has tools for community annotation of genes, as a way to improve the quality of gene models.
    • To get there: Genomes > (select GDB)
  • Genome Browser Overview | Help Page
  • Accurate and complete gene models (annotations) are essential to understanding genome biology. PlantGDB's yrGATE tool harnesses the power of the community to create gene annotations right in an xGDB genome browser. The tool shows all splice junctions revealed by transcript evidence and allows the user to easily create and validate gene models with just a few mouse clicks. Registered users can submit annotations for curation at PlantGDB.
  • After a submitted annotation is approved by the PlantGDB administrator, it is incorporated into the genome browser and is publicly viewable on the yrGATE track. To help identify genes most in need of annotation, the GAEVAL tool classifies potentially mis-annotated genes and displays them in tabular format.
    • To get there: Genomes > (select genome) > Login or Register
  • Annotation Overview | Help Page
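
The annotation notes above mention splice junctions revealed by transcript evidence. As a rough illustration only (not yrGATE's or xGDB's actual code), the sketch below derives the introns implied by each transcript's exon coordinates and counts how many transcripts support each one. It assumes 1-based, inclusive, forward-strand coordinates; the transcript IDs and positions are hypothetical.

```python
from collections import Counter


def introns_from_exons(exons):
    """Return the intron (start, end) pairs implied by one transcript's exons.

    Assumption: exons are 1-based, inclusive (start, end) pairs on the
    forward strand of the genome sequence.
    """
    ordered = sorted(exons)
    introns = []
    for (_, left_end), (right_start, _) in zip(ordered, ordered[1:]):
        if right_start - left_end > 1:  # abutting exons leave no intron
            introns.append((left_end + 1, right_start - 1))
    return introns


def junction_support(alignments):
    """Count how many aligned transcripts support each intron."""
    support = Counter()
    for exons in alignments.values():
        # set() so a transcript counts at most once per intron
        support.update(set(introns_from_exons(exons)))
    return support


# Hypothetical spliced alignments: transcript ID -> exon coordinates
alignments = {
    "est_1": [(100, 200), (301, 400)],
    "est_2": [(150, 200), (301, 380), (501, 600)],
    "cdna_1": [(100, 200), (351, 400)],
}

support = junction_support(alignments)
for intron, count in sorted(support.items()):
    print(intron, count)
```

In this toy data two transcripts agree on one intron while a third suggests an alternative acceptor site, which is the kind of conflict a curator would want to see before settling on a gene model.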

PlantGDB provides special datasets for search and download:

  • SRGD, a Splice-Related Gene Database, in which splicing-related genes are compiled for model system species (animal and plant). To get there: Datasets > SRGD
  • ASIP, Alternative Splicing in Plants, a database encompassing alternative splicing in Arabidopsis, rice, Medicago truncatula and Lotus japonicus. To get there: Datasets > ASIP
  • Ac/Ds Transposons in maize, a project to tag maize genes using Ds. View tagged loci using BLAST or the ZmGDB genome browser; order seed from tagged accessions. To get there: Datasets > Ac/Ds Tagging
  • RescueMu and Uniform Mu datasets, cataloging tagged GSS sequences. To get there: Datasets > RescueMu or UniformMu

Datasets Overview

  • PlantGDB provides a variety of tools for sequence analysis including BLAST, GeneSeqer Spliced Alignment, GenomeThreader Spliced Alignment, MuSeqBox, PatternSearch, Tracembler, and TE nest.
    • To get there: Tools > Overview or (select tool)
  • See below for more details on two additional tools at PlantGDB: TableMaker, a sequence database query tool, and BioExtract, a website for creating and executing custom informatics queries and workflows.
  • Tools Overview
  • TableMaker is an online search tool that queries GenBank tables housed at PlantGDB using MySQL queries constructed in an easy-to-use GUI environment.
    • To get there: Tools > TableMaker
  • TableMaker | Help Page
  • The BioExtract Server provides a Web interface for automating bioinformatics workflows. In a single environment, users can query sequence databases, analyze data with Web-based or local bioinformatics tools, save results, and create and manage workflows.
  • As a simple example, a user could develop a workflow that performs a BLAST search, retrieves peptide sequences from query results, eliminates redundant sequences, and produces a multiple sequence alignment output. BioExtract workflows can be paused, modified, saved, shared with an online workgroup or the world, and documented electronically for future reference.
    • To get there: Tools > BioExtract
  • BioExtract Server
  • PlantGDB's PGROP (Plant Genome Research Outreach Portal) site provides a centralized access point for locating Plant Genome Research "Outreach" activities, programs and resources.
    • To get there: Outreach > PGROP
  • PGROP
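
The BioExtract example above includes a step that eliminates redundant sequences between retrieving peptides and aligning them. A minimal sketch of that one step follows, assuming FASTA-formatted input and treating "redundant" as an exact, case-insensitive sequence match. BioExtract's own criteria may differ, and the function names and sample records here are hypothetical.

```python
def parse_fasta(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def drop_redundant(records):
    """Keep the first record seen for each distinct sequence.

    Assumption: two records are redundant only if their sequences are
    identical after upper-casing.
    """
    seen = set()
    unique = []
    for header, seq in records:
        key = seq.upper()
        if key not in seen:
            seen.add(key)
            unique.append((header, seq))
    return unique


# Hypothetical peptide hits retrieved from a search result
hits = """>pep1
MKTAYIAK
QR
>pep2
mktayiakqr
>pep3
MSTNPKPQ
"""

unique = drop_redundant(parse_fasta(hits))
for header, seq in unique:
    print(f">{header}\n{seq}")
```

The surviving records could then be passed to a multiple sequence alignment tool as the final workflow step.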


New & Noteworthy

Click below or view all news | Twitter


New flanking Ds sequences May 8
85 new Ds-flanking sequences (fDs) have been placed in the maize reference genome. Visit the Ac/Ds Tagging project pages to search by region or gene to identify transposon insertions close to a gene of interest. (5-8-2011).
New Transcript Assemblies May 3
New and refreshed transcript assemblies (PUTs) based on GenBank Release 183.0 are now available from our EST Cluster page (May 3, 2011).

New/refreshed PUT assemblies: View all PUT assemblies on our EST Assembly Data Page. (5-3-2011)
GenBank Release 183 Apr. 28
GenBank Release 183.0 sequence data (close date 4-11-2011) have been downloaded and processed at PlantGDB; new BLAST indices will be in place by early May. New PUT assemblies based on Release 183 are also in progress. (4-28-2011)
"All GDB" Table Improved Apr. 20
The "All GDB" page listing genome browsers available at PlantGDB has been expanded to include links to most features and tools available for each genome. (4-20-2011).
SbGDB (Sorghum bicolor) genome update Apr. 20
SbGDB (Sorghum bicolor; sorghum), a chromosome-based genome database at PlantGDB, has been updated with new splice-aligned transcripts current as of GenBank Release 181.0. Genome assembly and annotation remain the same. (4-20-2011)
"All Loci/Annotations" Feature Release Apr. 14
The "All Loci/Annotations" page presents an ordered list of published gene loci and their EST/cDNA coverage/quality, together with any community annotations at that locus. Users can search by quality, keyword or region to find loci to re-annotate. (See example for GmGDB.) From the Top Menu choose Genomes → [XxGDB], Left Menu → All Loci/Annotations. Also available are custom "project" filters to facilitate annotation of the most interesting genes. Contact us for more info on adding projects. (4-14-2011).
PeGDB - Prunus (peach) new genome browser Apr. 14
PeGDB, a new genome database for peach (Prunus persica), is now available at PlantGDB (Genomes->Dicots->PeGDB). Based on the JGI draft genome, PeGDB includes 27,864 protein-coding loci and 28,702 protein-coding transcripts on 202 scaffolds. Other data displayed include splice-aligned cDNAs, ESTs, and PUTs, and splice-aligned related-species proteins. (4-14-2011)
CpGDB (Carica papaya) genome update Apr. 14
CpGDB (Carica papaya; papaya), a scaffold-based genome database at PlantGDB, has been updated with new gene model annotations from JGI and new splice-aligned transcripts current as of GenBank Release 181.0. It now displays a total of 27,796 transcripts and 27,332 protein-coding loci. The genome assembly remains the same. (4-14-2011)
GmGDB - Glycine max (soybean) genome update Apr. 14
GmGDB, a genome database for soybean (Glycine max), has been updated to display new EST and cDNA alignments (GenBank Release 181). GmGDB is based on the JGI-released draft genome and displays 46,367 protein-coding loci and 55,787 protein-coding transcripts on 20 pseudochromosomes (plus unlinked scaffolds, which are concatenated and displayed as "chr21"). (4-14-2011)
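
For readers wondering how positions on unlinked scaffolds might relate to a concatenated sequence such as "chr21", here is a minimal sketch of one way to map between the two. It is an illustration under stated assumptions, not a description of how GmGDB was built: the scaffold names, lengths, ordering, and spacer length are all hypothetical.

```python
def build_offsets(scaffold_lengths, spacer=0):
    """Map each scaffold name to its 0-based offset in the concatenation.

    Assumption: scaffolds are joined in the order given, with `spacer`
    filler bases (for example, a run of Ns) after each one.
    """
    offsets = {}
    position = 0
    for name, length in scaffold_lengths:
        offsets[name] = position
        position += length + spacer
    return offsets


def to_concatenated(offsets, scaffold, pos):
    """Convert a 1-based scaffold position to a 1-based concatenated position."""
    return offsets[scaffold] + pos


# Hypothetical unlinked scaffolds as (name, length in bases)
scaffolds = [("scaffold_21", 5000), ("scaffold_22", 3000), ("scaffold_23", 1200)]
offsets = build_offsets(scaffolds, spacer=100)

print(to_concatenated(offsets, "scaffold_22", 1))
```

With these made-up lengths and a 100-base spacer, the first base of the second scaffold lands just past the first scaffold plus its spacer.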

What's Coming?

Selaginella and Solanum genomes
Two additional genome databases are slated for release soon at PlantGDB:
  • SmGDB: Selaginella moellendorffii (spike moss), a scaffold-based genome assembly from JGI
  • SlGDB: Solanum lycopersicum (tomato), a scaffold-based genome assembly from solgenomics.net
(4-20-2011)
