|
For a tutorial purpose, the MuSeqBox online uses a local BLAST output
file, called my_blastx_output created
by executing a BLASTX search of the entries in the
query against mypept database of 28
protein sequences, to demonstrate the usages of MuSeqBox online.
We assume that following the instruction on BLAST usage as
follows, the users may produce their own BLAST outputs and save as local
files on their machine to replace our provided
my_blastx_output .
BLAST USAGE:
1. Formatting and searching mypept database
using BLASTX:
formatdb -i mypept -p T -o T
blastall -p blastx -d mypept -i query -o my_blastx_output -I
Notes:
The blastall command executes a BLASTX search of the entries in
query against mypept;
the results are saved in my_blastx_output
. The "-I" option selects to display the matching peptide sequences
with the NCBI identifiers (GIs).
2. Searching NCBI non-redundant nucleic acid database using net-client BLASTN:
blastcl3 -p blastn -d nr -i query -e 1e-10 -o my_blastx_output -I
MUSEQBOX ONLINE USAGE:
The MuSeqBox online provides for users input file choices. Users may use
our server provided MuSeqBox application outputs (e.g., maize EST BLASTX output,
maize contig BLASTX output, and soybean contig BLASTX output) to search for queries
of interests or supply their own BLAST outputs. If BLAST search produces a large
output file, we strongly suggest that users download the MuSeqBox stand-alone
version to post-process the BLAST results. Following is the tutorial showing
the usage of MuSeqBox online using a local file
my_blastx_output provided by our MuSeqBox distribution package.
1. Creating default tabulated output to your Browser from a BLAST input file:
- Clicking the checkbox left to "Supply your own BLAST output file"
- Clicking the Browse to locate your local BLAST output, e.g.,
my_blastx_output
- Clicking submit botton (see output derived
from my_blastx_output)
Notes: By default, the MuSeqBox online uses options "-n 3" and "-p 4" to create
the output (for more detailed information, see the manual document in the
distributed package). Those options are corresponding to online parameters
Display Hits and Print format, respectively. Users can change
those parameters by pulling down the selection menu and choose their desirable
ones. For example, to retain only the top two BLAST hits for each query in
condensed print format (pstyle=3), users may follow the first two steps above and do:
- Clicking the Display hits pull-down menu and select "Top 2 if any" option
- Clicking the Print format pull-down menu and select "Condesned" option
- Clicking submit botton
Note: The users may request the MuSeqBox output be sent via their email address. To
do so, the users may need to check the checkbox left to "Send the (text format)
output to this email address:" and fill in the blank with their email addresses.
2. Selecting queries satisfying complex criteria specified by the users and output
to your browser:
- Following steps in 1 to provide basic online settings (i.e., Display hits
and Print format) and to select a local BLAST output
file for the MuSeqBox
- Clicking the checkbox left to "Select queries based on the following
criteria:"
- Clicking the checkboxs left to those criteria you wanted and fill in the
blanks with the values. For example, to select the queries with query sequence
length large than 600 and with expection value less than 1e-10, first check
both checkboxs left to QLen and Eval, and then fill
in the blanks with 600 and 1e-10, resectively
- Clicking submit botton
3. Identification of potential retained introns in EST queries on the basis
of matching peptide BLAST hits:
- Following steps in 1 to provide basic online settings (i.e., Display hits
and Print format) and to select a local BLAST output
file (e.g., my_blastx_output) for the MuSeqBox
- Clicking the ckeckbox left to "Select queries that represent potential
alternatively spliced transcripts:"
- Filling in the blank correspoding to indel parameter with the
minimal insertion segment size value, for example, indel >=40 nt
- Clicking the type pull-down menu and choose "Insertion in query
relative to subject"
- Clicking the submit botton (see the output)
4. Identification of potential skipped exons in the EST queries on the basis
of matching peptide BLAST hits:
- Following the first two steps in 3
- Filling in the blank with the minimal deletion segment size value, for
example, indel=90 nt
- Clicking the type pull-down memu and choose "Deletion in query
relative to subject"
- clicking the submit botton
5. Identification of potential (near) full-length transcripts among EST
queries on the basis of matching peptide BLAST hits:
- Following steps in 1 to provide basic online settings (i.e., Display hits
and Print format) and to select a local BLAST output
file for the MuSeqBox
- Clicking the checkbox left to "Select queries that potentially encode
full-length coding sequences:"
- Fill in the parameter requirement blanks (clicking
help to see detailed information on those defined parameters). For
example, option "-F 10 10 95.0 40.0" is corresponding to d5p <=10, d3p <=10,
scv >=95.0%, and qsc >=40.0%, respectively.
- Clicking submit botton (see output)
Note: This example selects those hits for which the
matching peptide sequence has HSPs covering at least 95% of the peptide sequence,
with the terminal HSPs starting from within the first 10 amino acids of the peptide
and extending into the last 10 amino acids, respectively. Moreover, a query
sequence coverage of at least 40% is also required. In the example, it is clear
that the maize EST AW065755 encodes the entire
homolog of the Arabidopsis 60S ribosomal protein L18A.
|