GeneSeqer.   Version of February 10, 2003.
Date run: Mon Feb 24 10:29:27 2003

(Bayesian) Splice site model (species):	Zea mays

________________________________________________________________________________
Sequence    1:   21326110, from 7800 to 11800, both strands analyzed.





********************************************************************************
Query protein sequence   13 (File: 18496651)

     1  DTSTRAAKIP SLPQQTEINW DNLDMTKLYV VGAGMFSCVT VALYPVSVIK TRMQVASGEA
    61  MRRNALATFK NILKVDGVPG LYRGFGTVIT GAIPARIIFL TALEKTKATS LKLVEPLQLS
   121  ESMEAALANG LGGLTASLCS QAVFVPIDVV SQKLMVQGYS GHVRYKGGID VVQKIMKSDG
   181  PRGLYRGFGL SVMTYAPSSA VWWASYGFSQ RMIWSALGHL NDKEDAPSQL KIVGVQATGG
   241  MIAGAVTSCV STPLDTIKTR LQVNQNKPKA SEVVRRLIAE DGWKGFYRGL GPRFFSSSAW
   301  GTSMIVCYEY LKRVCAKVEE A-

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1   7969   7800 ( 170 n);  Protein     1    54 (  54 aa); score: 0.123

MATCH	21326110-	18496651	0.123	170	0.176	P
PGS_21326110-_18496651	(7969  7800)

Alignment:

TCCACCAATC CGCGACTCCC GAGGTGAGAA CCGAAGGGGG CGCAGGAGGC GTGGCGTGGA     7910
 S  T  N   P  R  L  P   R  *  E   P  K  G   A  Q  E  A   W  R  G  
 .  |  +      |         +         |  .         |  +  .         .  
 D  T  S   T  R  A  A   K  -  I   P  S  L   P  Q  Q  T   E  I  N        19


CTCGATTCGG CAAGAATGGC GGGCGGGCGG GCTGCTGTGG TTGGCTTCCC CCCGTTTTCC     7850
 L  D  S   A  R  M  A   G  G  R   A  A  V   V  G  F  P   P  F  S  
    |  +         |  .         +         |   |  |            |  |  
 W  D  N   L  D  M  T   -  -  K   L  Y  V   V  G  A  G   M  F  S        37


CTCCCCACTT TCGTGCTGTT TT-TGTCCAC AACAACTCAG CGAGAATGGC G     7800
 L  P  T   F  V  L  F      V  H   N  N  S   A  R  M  A  
       |      .  |  +      |            .   .  |  |     
 C  V  T   V  A  L  Y   P  V  S   V  I  K   T  R  M  Q         54





********************************************************************************
Query protein sequence   14 (File: 12278522)

     1  DTSTRAAKIP SLPQQTEINW DNLDMTKLYV VGAGMFSCVT VALYPVSVIK TRMQVASGEA
    61  MRRNALATFK NILKVDGVPG LYRGFGTVIT GAIPARIIFL TALEKTKATS LKLVEPLQLS
   121  ESMEAALANG LGGLTASLCS QAVFVPIDVV SQKLMVQGYS GHVRYKGGID VVQKIMKADG
   181  PRGLYRGFGL SVMTYAPSSA VWWASYGFSQ RVIWSALGRL DDKEDTPSQL KIVGVQATGG
   241  MVAGAVTSCV STPLDTIKTR LQVNINKPKA SEVVRRLIAE DGWKGFYRGL GPRFFSSSAW
   301  GTSMIVCYEY LKRVCAKVEE A-

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1   7969   7800 ( 170 n);  Protein     1    54 (  54 aa); score: 0.123

MATCH	21326110-	12278522	0.123	170	0.176	P
PGS_21326110-_12278522	(7969  7800)

Alignment:

TCCACCAATC CGCGACTCCC GAGGTGAGAA CCGAAGGGGG CGCAGGAGGC GTGGCGTGGA     7910
 S  T  N   P  R  L  P   R  *  E   P  K  G   A  Q  E  A   W  R  G  
 .  |  +      |         +         |  .         |  +  .         .  
 D  T  S   T  R  A  A   K  -  I   P  S  L   P  Q  Q  T   E  I  N        19


CTCGATTCGG CAAGAATGGC GGGCGGGCGG GCTGCTGTGG TTGGCTTCCC CCCGTTTTCC     7850
 L  D  S   A  R  M  A   G  G  R   A  A  V   V  G  F  P   P  F  S  
    |  +         |  .         +         |   |  |            |  |  
 W  D  N   L  D  M  T   -  -  K   L  Y  V   V  G  A  G   M  F  S        37


CTCCCCACTT TCGTGCTGTT TT-TGTCCAC AACAACTCAG CGAGAATGGC G     7800
 L  P  T   F  V  L  F      V  H   N  N  S   A  R  M  A  
       |      .  |  +      |            .   .  |  |     
 C  V  T   V  A  L  Y   P  V  S   V  I  K   T  R  M  Q         54





********************************************************************************
Query protein sequence   18 (File: 13365793)

     1  AAAAAAETSE ASTAGLALAE ANINWQRRIL RSDGIPGAFR GFGTSAVGAL PGRVFALTSL
    61  EVSKEMAFKY SEHFDMSEAS RIAVANGIAG LVSSIFSSAY FVPLDVICQR LMAQGLPGMA
   121  TYRGPFDVIS KVVRTEGLRG LYRGFGITML TQSPASALWW SSYGGAQHAI WRSLGYGIDS
   181  QKKPSQSELV VVQATAGTIA GACSSIITTP IDTIKTRLQV MDNYGRGRPS VMKTTRVLLE
   241  EDGWRGFYRG FGPRFLNMSL WGTSMIVTYE LIKRLSVKPE -

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1   8399   8307 (  93 n);  Protein     1    32 (  32 aa); score: 0.174
  Intron  1   8306   7807 ( 500 n);  Pd: 0.647   Pa: 0.000
 Exon  2   7806   7801 (   6 n);  Protein    33    34 (   2 aa); score: 0.583

MATCH	21326110-	13365793	0.174	99	0.117	P
PGS_21326110-_13365793	(8399  8307,7806  7801)

Alignment:

CGCCACGGTG ACGCCGCTGA ACATGCCTGC GCCCACCACG TAGAGCTTGG TCTTGTCGAG     8340
 R  H  G   D  A  A  E   H  A  C   A  H  H   V  E  L  G   L  V  E  
       .      |  |  |      +      |         .     |  .   |  .  |  
 A  A  A   A  A  A  E   T  S  E   A  S  T   A  G  L  A   L  A  E        20


GCT---GCAA CATTTCAGCG CCACGATTTG AGAACAGTAA AAGAATCGGA AACAACGGAA     8283
 A     A   T  F  Q  R   H  D  L   R  T                            
 |         .  +  |  |   .     |   |  +                            
 A  N  I   N  W  Q  R   R  I  L   R  S .... .......... ..........       32


TAAGCTACGA ATCCCCAAAT TGCGCTTCGT CCAGGAGCAT GAACAGCCAT GTCAAATCCT     8223
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       32


CAAACAACAG TTCCTAATCC TAAGGCGGTA AAGCCAATCC GGAATAGGGC GGGGGACACA     8163
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       32


GTTCAACGAT CGGAGCTAAA TTTCTAGAAG ACTAGGACCG CGAGAAAGGC GGAAATCGGC     8103
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       32


ACGAAATGGG AACCTAAGGA TTACAAGACC GGTGGGTTGC TTACTTGTCC CAGTTGATCT     8043
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       32


CCGTCTGGTG GAGCGACGGG ATCTTGGCGG CCCTAGAGGT TGTATCCATG GCCGCGCCGC     7983
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       32


TCAAATCCTC CCCTCCACCA ATCCGCGACT CCCGAGGTGA GAACCGAAGG GGGCGCAGGA     7923
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       32


GGCGTGGCGT GGACTCGATT CGGCAAGAAT GGCGGGCGGG CGGGCTGCTG TGGTTGGCTT     7863
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       32


CCCCCCGTTT TCCCTCCCCA CTTTCGTGCT GTTTTTGTCC ACAACAACTC AGCGAGAATG     7803
                                                              N   
                                                              +   
.......... .......... .......... .......... .......... ...... D         33


GC     7801
G 
| 
G        34





********************************************************************************
Query protein sequence   10 (File: 21594326)

     1  SLGALMEEKR RATTSSSSSQ VHMSNDIDWQ MLDKSRFFFL GAALFSGVST ALYPIVVLKT
    61  RQQVSPTRVS CANISLAIAR LEGLKGFYKG FGTSLLGTIP ARALYMTALE ITKSSVGQAT
   121  VRLGLSDTTS LAVANGAAGL TSAVAAQTVW TPIDIVSQRL MVQGDVSLSK HLPGVMNSCR
   181  YRNGFDAFRK ILYTDGPRGF YRGFGISILT YAPSNAVWWA SYSLAQKSIW SRYKHSYNHK
   241  EDAGGSVVVQ ALSSATASGC SALVTMPVDT IKTRLQVLDA EENGRRRAMT VMQSVKSLMK
   301  EGGVGACYRG LGPRWVAMSM SATTMITTYE FLKRLATKKQ K-

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   7964   8058 (  95 n);  Protein     1    31 (  31 aa); score: 0.110
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8749 ( 411 n);  Protein    32   165 ( 134 aa); score: 0.385
  Intron  2   8750   8848 (  99 n);  Pd: 0.000   Pa: 0.243
 Exon  3   8849   9045 ( 197 n);  Protein   166   231 (  66 aa); score: 0.553
  Intron  3   9046   9697 ( 652 n);  Pd: 0.462   Pa: 0.863
 Exon  4   9698   9844 ( 147 n);  Protein   232   278 (  47 aa); score: 0.349
  Intron  4   9845  10200 ( 356 n);  Pd: 0.446   Pa: 0.975
 Exon  5  10201  10207 (   7 n);  Protein   279   281 (   3 aa); score: -0.467
  Intron  5  10208  10515 ( 308 n);  Pd: 0.994   Pa: 0.987
 Exon  6  10516  10669 ( 154 n);  Protein   282   332 (  51 aa); score: 0.295
  Intron  6  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  7  11330  11361 (  32 n);  Protein   333   341 (   9 aa); score: 0.377

MATCH	21326110+	21594326	0.374	1043	1.017	P
PGS_21326110+_21594326	(7964  8058,8339  8749,8849  9045,9698  9844,10201  10207,10516  10669,11330  11361)

Alignment:

GGTGGAGGGG AGGATTTGAG CGGCGCGGCC ATGGATACAA CCTCTAGGGC CGCCAAGATC     8023
 G  G  G   E  D  L  S   G  A  A   M  D  T   T  S  R  A   A  K  I  
 .     |         +  .                   |   |  |     +   +  .     
 S  L  G   A  L  M  E   E  K  R   R  A  T   T  S  S  S   S  S  Q        20


CCGTCGCTCC ACCAGACGGA GATCAACTGG GACAAGTAAG CAACCCACCG GTCTTGTAAT     8083
 P  S  L   H  Q  T  E   I  N  W   D  N                            
       +      .  .  +   |  +  |   .                               
 V  H  M   -  S  N  D   I  D  W   Q  M..... .......... ..........       31


CCTTAGGTTC CCATTTCGTG CCGATTTCCG CCTTTCTCGC GGTCCTAGTC TTCTAGAAAT     8143
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       31


TTAGCTCCGA TCGTTGAACT GTGTCCCCCG CCCTATTCCG GATTGGCTTT ACCGCCTTAG     8203
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       31


GATTAGGAAC TGTTGTTTGA GGATTTGACA TGGCTGTTCA TGCTCCTGGA CGAAGCGCAA     8263
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       31


TTTGGGGATT CGTAGCTTAT TCCGTTGTTT CCGATTCTTT TACTGTTCTC AAATCGTGGC     8323
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       31


GCTGAAATGT TGCAGCCTCG ACAAGACCAA GCTCTACGTG GTGGGCGCAG GCATGTTCAG     8383
                  L   D  K  T  K   L  Y  V   V  G  A   G  M  F  S 
                  |   |  |  +  +   .  +      +  |  |   .  +  |  | 
.......... .....  L   D  K  S  R   F  F  F   L  G  A   A  L  F  S       46


CGGCGTCACC GTGGCGCTGT ATCCTGTCTC GGTGGTCAAG ACCCGGATGC AGGTTGCCTC     8443
  G  V  T   V  A  L   Y  P  V  S   V  V  K   T  R  M   Q  V  A  S 
  |  |  +   .  |  |   |  |  +      |  +  |   |  |  .   |  |  +    
  G  V  S   T  A  L   Y  P  I  V   V  L  K   T  R  Q   Q  V  S  P       66


TGGGGACGCC ATGAGGAGGA ACGCGCTGGC TACCTTCAAG AACATCCTCA AGATGGACGG     8503
  G  D  A   M  R  R   N  A  L  A   T  F  K   N  I  L   K  M  D  G 
        .             |     +      +  .         |      +  +  +  | 
  T  R  V   S  C  A   N  -  I  -   S  L  A   -  I  A   R  L  E  G       83


CGTGCCAGGG CTGTACCGGG GGTTTGCTAC CGTTATCATT GGGGCTGTAC CAACTAGGAT     8563
  V  P  G   L  Y  R   G  F  A  T   V  I  I   G  A  V   P  T  R  I 
  +     |   .  |  +   |  |  .  |      +  +   |  .  +   |  .  |    
  L  K  G   F  Y  K   G  F  G  T   S  L  L   G  T  I   P  A  R  A      103


CATCTTCCTC ACAGCGCTTG AGACAACCAA AGCAGCCTCG CTTAAGCTTG TTGAGCCCTT     8623
  I  F  L   T  A  L   E  T  T  K   A  A  S   L  K  L   V  E  P  F 
  +  +  +   |  |  |   |     |  |   +  +         +      .        . 
  L  Y  M   T  A  L   E  I  T  K   S  S  V   G  Q  A   T  V  R  L      123


CAAGCTGTCA GAGCCGGTGC GGGCTGCCTT TGCCAATGGC CTTGCTGGTC TGTCAGCGTC     8683
  K  L  S   E  P  V   R  A  A  F   A  N  G   L  A  G   L  S  A  S 
     |  |   +     .         |      |  |  |      |  |   |  +  +  + 
  G  L  S   D  T  T   S  L  A  V   A  N  G   A  A  G   L  T  S  A      143


TACATGTTCG CAGGCTATTT TTGTTCCAAT TGATGTGGTA T-GCCTCTCA TGTGCCTTCT     8742
  T  C  S   Q  A  I   F  V  P  I   D  V  V      P  L   M  C  L  L 
  .  .  +   |  .  +   +  .  |  |   |  +  |             +     +    
  V  A  A   Q  T  V   W  T  P  I   D  I  V   S  Q  R   L  M  V  Q      163


ATGTGATGTT GTATAGAGAA AAAATATCTT ACAATATGTT GATGTTAAAT GCTAATTACA     8802
  C  D                                                            
     |                                                            
  G  D ... .......... .......... .......... .......... ..........      165


ATACTAGACT ACTGTTTTCA TTCTGTTGTG CATTGGAATG TTTCAGATTA GCCAGAAATT     8862
                                                   I   S  Q  K  L 
                                                   +   |     .    
.......... .......... .......... .......... ...... V   S  L  S  K      170


GATGGTTCAA GGATATTCTG GTAATGCCAG ATACAAAGGT GGATTAGATG TTGCTCGAAA     8922
  M  V  Q   G  Y  S   G  N  A  R   Y  K  G   G  L  D   V  A  R  K 
     +      |         .  +  .  |   |  +  .   |  .  |   .     |  | 
  H  L  P   G  V  M   N  S  C  R   Y  R  N   G  F  D   A  F  R  K      190


GGTCATAAAG GCTGATGGCA TTAGGGGGCT GTACAGAGGA TTTGGACTGT CTGTTATGAC     8982
  V  I  K   A  D  G   I  R  G  L   Y  R  G   F  G  L   S  V  M  T 
  +  +      .  |  |      |  |  .   |  |  |   |  |  +   |  +  +  | 
  I  L  Y   T  D  G   P  R  G  F   Y  R  G   F  G  I   S  I  L  T      210


CTATGCTCCA TCCAGTGCTG TGTGGTGGGC AAGTTATGGT TCCAGCCAGC GCATAATTTG     9042
  Y  A  P   S  S  A   V  W  W  A   S  Y  G   S  S  Q   R  I  I  W 
  |  |  |   |  +  |   |  |  |  |   |  |  .      +  |   +     |  | 
  Y  A  P   S  N  A   V  W  W  A   S  Y  S   L  A  Q   K  S  I  W      230


GAGGTTAGCT TATCTGATTG GTTCATCGTT ATGTTCCTCT CAGCCCTGTG TACTATGTAA     9102
  S                                                               
  |                                                               
  S....... .......... .......... .......... .......... ..........      231


TATTTACGAG AAAAAGACCA GTAATACATT TCTACTTAAT AGTTATTTGA ATTGGTACTT     9162
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


TCCATCTGTC CAAAACCTTT TCAAACTTCC CCTCTTGATG CTCAAACTGC AGCTATAATT     9222
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


GCAATTTTGT TTTCTGATGC TTGTTCTTCC ATGTCAATAT GTACATATCT TTTTTAGAAA     9282
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


ACAAGAATGC ATCTCAATGC ATGTGCTGTA TTGTTTTGAT TAGATTTATC ATAGCGATCA     9342
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


ATCACATTTT CTTTACAGAT AAAAATAGTC GGAAGGATAA GTTGGATAAC TGACCAAAGT     9402
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


GGAAATATGA TCTTACATAT TTTTATCTCT GGCAGCTTAG AGAACTTAAT TACCAACCTG     9462
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


AAACAATGTG ATGAAGTAAC TACACAAAAC CACATATAGT TTCATGCACT CTGCAAAACT     9522
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


AAATTGAAAC TCTTAGTGTG CTCTTAATGC TGTTAAGAGG GTGTATGCAA GTTTACTGGA     9582
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


ATCAGTACCT TTTGTTAGTT TATTTCTTTG TGGTTGATGG TTGAAAGATT ATATTTCTTG     9642
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      231


TCTTGATAAC TTAGCCAAAA TAGTTAACTA TTGTGCTTTT TACATATTGG AACAGTGCTC     9702
                                                              A   
                                                                  
.......... .......... .......... .......... .......... .....  R        232


TTGGCCATTT GCATGACAAA GAAGAGGCTC CTAGCCAATT GAAACTAGTT GGTGTTCAAG     9762
L  G  H  L   H  D  K   E  E  A   P  S  Q  L   K  L  V   G  V  Q   
      |      +  +      +  |         .         .  +  |      |  |   
Y  K  H  S   Y  N  H   K  E  D   A  G  G  -   S  V  V   -  V  Q        250


CATCAGGGGG GGTTTTTGCC GGTGCCGTGA CCTCTTTTGT TACGACTCCC ATAGATACAA     9822
A  S  G  G   V  F  A   G  A  V   T  S  F  V   T  T  P   I  D  T   
|     .  .   .     |   .  .      +  +  .  |   |     |   +  |  |   
A  L  S  S   A  T  A   S  G  C   S  A  L  V   T  M  P   V  D  T        270


TAAAGACCAG GCTGCAGGTA CTGTGTGACA TTCTGTTTGC TGATTACTCT TGTAATTTGA     9882
I  K  T  R   L  Q  V   L                                          
|  |  |  |   |  |  |   |                                          
I  K  T  R   L  Q  V   L........ .......... .......... ..........      278


TTTGTGTGGG TATATTTTGT GAGGCTTACC CTTGTGACTT AATGATTCTT GTCTTTACAT     9942
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      278


TTATGCTGCT CATTTGCAAT AATTTGATTC CTTATCAATG CAATGCCACT AAGTTTAGGG    10002
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      278


GAATGGATAT TTTGTTTTGG AAGTATATTT GATGTCAGAC TTGAAGACCT AAATGTTCTT    10062
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      278


TTATACTGAT ATTTCCTCCA ATGGCGGGCT ATTGAGGTGC TGGACTGGAA TGCTGTCTAT    10122
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      278


ATTAAACAAT ATATACTTCT ATGTTTACAG CTGTTTGTTT TCTGCTGACA TACCATGACC    10182
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      278


AATTTGTCAT GGTTTCAGT- --TATGAGGT CAGAAAAAAA GAAACTTCCA TTGGGAAAAC    10239
                         Y  E                                     
                            |                                     
.......... ........   D  A  E .. .......... .......... ..........      281


TTGATATCTA TTACTTCATT ATTTATAGTG AGTAACAAAA GTTAGCACTT TCAAACTGAC    10299
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      281


TAAAGTATGC CAGGGACGTA TCATGCATTT TACAACATGC TCCACATATC TCCAAATATC    10359
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      281


ACATATTACG CTTGTAGTGG TAAACTGATA ATACATCTAC CAACACTGAA AGTTCTCACA    10419
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      281


AGTCAGAACC CTATATTTGA CAGTTGTGGT CTCCCTCCTT CCCTCTGCAT TTGTTGCTAC    10479
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      281


AGATGATTAC ACTGAGTTTT GTTTCTTGTC ATTTAGGTTA TGGATAATGA AAATAAGCCA    10539
                                        V   M  D  N  E   N  K  P  
                                                  .  .   .        
.......... .......... .......... ...... E   N  G  R  R   R  A  M       289


AAAGCCAGGG AAGTTGTCAA AAGATTGATT GCTGAAGATG GATGGAAAGG TTTGTACAGA    10599
 K  A  R   E  V  V  K   R  L  I   A  E  D   G  W  K  G   L  Y  R  
    .      +     |  |      |  +      |      |        .      |  |  
 T  V  M   Q  S  V  K   S  L  M   K  E  G   G  V  G  A   C  Y  R       309


GGGTTGGGTC CAAGATTTTT CAGCTCATCA GCTTGGGGAA CCTCAATGAT AGTATGCTAC    10659
 G  L  G   P  R  F  F   S  S  S   A  W  G   T  S  M  I   V  C  Y  
 |  |  |   |  |  +      +     |         .   |  +  |  |   .     |  
 G  L  G   P  R  W  V   A  M  S   M  S  A   T  T  M  I   T  T  Y       329


GAGTACCTGA GTATGTTTCG TCTTCCCTTG TCAAATGTAC ACATGCATAT GTAGTGTTAT    10719
 E  Y  L                                                          
 |  +  |                                                          
 E  F  L   .......... .......... .......... .......... ..........      332


ATATCACTGC ATCCCATGCA GGTTAATTTT AAGTACCCAG ATACTTCTTC TCATTTAGAA    10779
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


TTTAGTTAAA ATGACATCAT TCAGGTCAGT TGGCATCTCC AGTACACTGC TTTTGTAAGT    10839
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


TGTATCATAA ATCCCATTTG CAATGAAATT TTTGACTCAA GTTGCAGCCT GTAACTTTTC    10899
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


TATATTTTTC GAATAAAGCT ATCACCGTAC ATGAAACCTG CTTCTGTTAA TGCCAAGGAG    10959
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


CGCACATTAT TTCCTGTAGA CCGGCTTGGA TGTTGAACAA TTGGCACATG CAAGTAGCAA    11019
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


AGAGCAGCCT TGTGCTTGCA ACAATCTGGT CCACCTGTGG ATATGTTCGC TGTGAAAGAA    11079
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


ACCAATTAGT CCTTGTATGA AACATGGTAT TAGCGCTTCA TGAATAAAAC CACTGATTCT    11139
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


GATTTCTTAT TTTCAATGAA TGGATGGGCA TTACCAAAGT TATCATGATT AAAGATCTAT    11199
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


TTCATATAAG TTTATTTTTA TACATTAGAG TTTATTTAGA GAACAAGGTA TATTTAGTTT    11259
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


TGGTAATTTT GTGAACTGCA CTCAGACGAC TTTGGTATTC TTACTGTAAT TTTGTTTTGT    11319
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      332


TTTCCTACAG AGCGCTTGTG TGCTAAAGTT GAAGAGGTCT GA    11361
           K  R  L  C   A  K  V   E  E  V   * 
           |  |  |  .   .  |      +  +        
.......... K  R  L  A   T  K  -   K  Q  K   *       342





********************************************************************************
Query protein sequence    9 (File: 23308305)

     1  NLGAAEEESA QEIHLPADIN WEMLDKSKFF VLGAALFSGV SGALYPAVLM KTRQQVCHSQ
    61  GSCIKTAFTL VRHEGLRGLY RGFGTSLMGT IPARALYMTA LEVTKSNVGS AAVSLGLTEA
   121  KAAAVANAVG GLSAAMAAQL VWTPVDVVSQ RLMVQGSAGL VNASRCNYVN GFDAFRKIVR
   181  ADGPKGLYRG FGISILTYAP SNAVWWASYS VAQRMVWGGI GCYVCKKDEE SGNNSTTMKP
   241  DSKTIMAVQG VSAAIAGSVS ALITMPLDTI KTRLQVLDGE DSSNNGKRGP SIGQTVRNLV
   301  REGGWTACYR GLGPRCASMS MSATTMITTY EFLKRLSAKN HDGFYSKS-

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   7994   8058 (  65 n);  Protein     1    23 (  23 aa); score: 0.076
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8720 ( 382 n);  Protein    24   147 ( 124 aa); score: 0.419
  Intron  2   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  3   8849   9045 ( 197 n);  Protein   148   218 (  71 aa); score: 0.536
  Intron  3   9046   9437 ( 392 n);  Pd: 0.462   Pa: 0.346
 Exon  4   9438   9477 (  40 n);  Protein   219   229 (  11 aa); score: 0.084
  Intron  4   9478   9697 ( 220 n);  Pd: 0.924   Pa: 0.863
 Exon  5   9698   9844 ( 147 n);  Protein   230   277 (  48 aa); score: 0.386
  Intron  5   9845  10200 ( 356 n);  Pd: 0.446   Pa: 0.975
 Exon  6  10201  10207 (   7 n);  Protein   278   280 (   3 aa); score: -0.471
  Intron  6  10208  10515 ( 308 n);  Pd: 0.994   Pa: 0.987
 Exon  7  10516  10669 ( 154 n);  Protein   281   333 (  53 aa); score: 0.330
  Intron  7  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  8  11330  11378 (  49 n);  Protein   334   348 (  15 aa); score: 0.207

MATCH	21326110+	23308305	0.400	1041	0.994	P
PGS_21326110+_23308305	(7994  8058,8339  8720,8849  9045,9438  9477,9698  9844,10201  10207,10516  10669,11330  11378)

Alignment:

ATGGATACAA CCTCTAGGGC CGCCAAGATC CCGTCGCTCC AC---CAGAC GGAGATCAAC     8050
 M  D  T   T  S  R  A   A  K  I   P  S  L   H     Q  T   E  I  N  
           .  +  .         .         .  +   |        .   +  |  |  
 N  L  G   A  A  E  E   E  S  A   Q  E  I   H  L  P  A   D  I  N        20


TGGGACAAGT AAGCAACCCA CCGGTCTTGT AATCCTTAGG TTCCCATTTC GTGCCGATTT     8110
 W  D  N                                                          
 |  +                                                             
 W  E  M.. .......... .......... .......... .......... ..........       23


CCGCCTTTCT CGCGGTCCTA GTCTTCTAGA AATTTAGCTC CGATCGTTGA ACTGTGTCCC     8170
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       23


CCGCCCTATT CCGGATTGGC TTTACCGCCT TAGGATTAGG AACTGTTGTT TGAGGATTTG     8230
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       23


ACATGGCTGT TCATGCTCCT GGACGAAGCG CAATTTGGGG ATTCGTAGCT TATTCCGTTG     8290
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       23


TTTCCGATTC TTTTACTGTT CTCAAATCGT GGCGCTGAAA TGTTGCAGCC TCGACAAGAC     8350
                                                       L  D  K  T 
                                                       |  |  |  + 
.......... .......... .......... .......... ........   L  D  K  S       27


CAAGCTCTAC GTGGTGGGCG CAGGCATGTT CAGCGGCGTC ACCGTGGCGC TGTATCCTGT     8410
  K  L  Y   V  V  G   A  G  M  F   S  G  V   T  V  A   L  Y  P  V 
  |  .  +   |  +  |   |  .  +  |   |  |  |   +     |   |  |  |  . 
  K  F  F   V  L  G   A  A  L  F   S  G  V   S  G  A   L  Y  P  A       47


CTCGGTGGTC AAGACCCGGA TGCAGGTTGC CTCTGGGGAC GCCATGAGGA GGAACGCGCT     8470
  S  V  V   K  T  R   M  Q  V  A   S  G  D   A  M  R   R  N  A  L 
     +  +   |  |  |   .  |  |  .      .  .   .            .  .    
  V  L  M   K  T  R   Q  Q  V  C   H  S  Q   G  S  C   I  K  T  -       66


GGCTACCTTC AAGAACATCC TCAAGATGGA CGGCGTGCCA GGGCTGTACC GGGGGTTTGC     8530
  A  T  F   K  N  I   L  K  M  D   G  V  P   G  L  Y   R  G  F  A 
  |     |      .  +   +  +     +   |  +      |  |  |   |  |  |  . 
  A  -  F   -  T  L   V  R  H  E   G  L  R   G  L  Y   R  G  F  G       84


TACCGTTATC ATTGGGGCTG TACCAACTAG GATCATCTTC CTCACAGCGC TTGAGACAAC     8590
  T  V  I   I  G  A   V  P  T  R   I  I  F   L  T  A   L  E  T  T 
  |     +   +  |  .   +  |  .  |      +  +   +  |  |   |  |  .  | 
  T  S  L   M  G  T   I  P  A  R   A  L  Y   M  T  A   L  E  V  T      104


CAAAGCAGCC TCGCTTAAGC TTGTTGAGCC CTTCAAGCTG TCAGAGCCGG TGCGGGCTGC     8650
  K  A  A   S  L  K   L  V  E  P   F  K  L   S  E  P   V  R  A  A 
  |  +            .      .         .     |   +  |            |  | 
  K  S  N   V  G  S   A  A  V  S   L  G  L   T  E  A   K  A  A  A      124


CTTTGCCAAT GGCCTTGCTG GTCTGTCAGC GTCTACATGT TCGCAGGCTA TTTTTGTTCC     8710
  F  A  N   G  L  A   G  L  S  A   S  T  C   S  Q  A   I  F  V  P 
     |  |   .  +  .   |  |  |  |   +     .   +  |      +  +  .  | 
  V  A  N   A  V  G   G  L  S  A   A  M  A   A  Q  L   V  W  T  P      144


AATTGATGTG GTATGCCTCT CATGTGCCTT CTATGTGATG TTGTATAGAG AAAAAATATC     8770
  I  D  V                                                         
  +  |  |                                                         
  V  D  V  .......... .......... .......... .......... ..........      147


TTACAATATG TTGATGTTAA ATGCTAATTA CAATACTAGA CTACTGTTTT CATTCTGTTG     8830
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      147


TGCATTGGAA TGTTTCAGAT TAGCCAGAAA TTGATGGTTC AAGGATATTC TGGT------     8884
                    I   S  Q  K   L  M  V   Q  G  Y  S   G        
                    +   |  |  +   |  |  |   |  |     +   |        
.......... ........ V   S  Q  R   L  M  V   Q  G  S  A   G  L  V       161


AATGCC---A GA------TA CAAAGGTGGA TTAGATGTTG CTCGAAAGGT CATAAAGGCT     8935
 N  A      R        Y   K  G  G   L  D  V   A  R  K  V   I  K  A  
 |  |      |        |      .  |   .  |  .      |  |  +   +  +  |  
 N  A  S   R  C  N  Y   V  N  G   F  D  A   F  R  K  I   V  R  A       181


GATGGCATTA GGGGGCTGTA CAGAGGATTT GGACTGTCTG TTATGACCTA TGCTCCATCC     8995
 D  G  I   R  G  L  Y   R  G  F   G  L  S   V  M  T  Y   A  P  S  
 |  |      +  |  |  |   |  |  |   |  +  |   +  +  |  |   |  |  |  
 D  G  P   K  G  L  Y   R  G  F   G  I  S   I  L  T  Y   A  P  S       201


AGTGCTGTGT GGTGGGCAAG TTATGGTTCC AGCCAGCGCA TAATTTGGAG GTTAGCTTAT     9055
 S  A  V   W  W  A  S   Y  G  S   S  Q  R   I  I  W  S            
 +  |  |   |  |  |  |   |  .      +  |  |   +  +  |  .            
 N  A  V   W  W  A  S   Y  S  V   A  Q  R   M  V  W  G ..........      218


CTGATTGGTT CATCGTTATG TTCCTCTCAG CCCTGTGTAC TATGTAATAT TTACGAGAAA     9115
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      218


AAGACCAGTA ATACATTTCT ACTTAATAGT TATTTGAATT GGTACTTTCC ATCTGTCCAA     9175
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      218


AACCTTTTCA AACTTCCCCT CTTGATGCTC AAACTGCAGC TATAATTGCA ATTTTGTTTT     9235
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      218


CTGATGCTTG TTCTTCCATG TCAATATGTA CATATCTTTT TTAGAAAACA AGAATGCATC     9295
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      218


TCAATGCATG TGCTGTATTG TTTTGATTAG ATTTATCATA GCGATCAATC ACATTTTCTT     9355
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      218


TACAGATAAA AATAGTCGGA AGGATAAGTT GGATAACTGA CCAAAGTGGA AATATGATCT     9415
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      218


TACATATTTT TATCTCTGGC AGCTTAGAGA ACTTAATTAC CAACCTGAAA CAATGTGATG     9475
                          L  E   N  L  I  T   N  L  K   Q  C  D   
                                 .        .         |   +     |   
.......... .......... ..  G  I   G  C  Y  V   -  C  K   K  -  D        228


AAGTAACTAC ACAAAACCAC ATATAGTTTC ATGCACTCTG CAAAACTAAA TTGAAACTCT     9535
E                                                                 
|                                                                 
E ........ .......... .......... .......... .......... ..........      229


TAGTGTGCTC TTAATGCTGT TAAGAGGGTG TATGCAAGTT TACTGGAATC AGTACCTTTT     9595
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      229


GTTAGTTTAT TTCTTTGTGG TTGATGGTTG AAAGATTATA TTTCTTGTCT TGATAACTTA     9655
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      229


GCCAAAATAG TTAACTATTG TGCTTTTTAC ATATTGGAAC AGTGCTCT-T -GGCCATTTG     9713
                                               C  S      G  H  L  
                                                  |      |  +     
.......... .......... .......... .......... .. E  S  -   G  N  -       233


CATGACAAAG AAGAGGCTCC TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG     9773
 H  D  K   E  E  A  P   S  Q  L   K  L  V   G  V  Q  A   S  G  G  
 +  .               |   .  .         +  +   .  |  |  .      .  .  
 N  S  T   T  M  K  P   D  S  K   T  I  M   A  V  Q  G   V  S  A       253


GTTTTTGCCG GTGCCGTGAC CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG     9833
 V  F  A   G  A  V  T   S  F  V   T  T  P   I  D  T  I   K  T  R  
 .  .  |   |  +  |  +   +  .  +   |     |   +  |  |  |   |  |  |  
 A  I  A   G  S  V  S   A  L  I   T  M  P   L  D  T  I   K  T  R       273


CTGCAGGTAC TGTGTGACAT TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT     9893
 L  Q  V   L                                                      
 |  |  |   |                                                      
 L  Q  V   L......... .......... .......... .......... ..........      277


ATATTTTGTG AGGCTTACCC TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC     9953
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      277


ATTTGCAATA ATTTGATTCC TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT    10013
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      277


TTGTTTTGGA AGTATATTTG ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA    10073
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      277


TTTCCTCCAA TGGCGGGCTA TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA    10133
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      277


TATACTTCTA TGTTTACAGC TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG    10193
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      277


GTTTCAGT-- -TATGAGGTC AGAAAAAAAG AAACTTCCAT TGGGAAAACT TGATATCTAT    10250
             Y  E                                                 
                |                                                 
.......  D   G  E ... .......... .......... .......... ..........      280


TACTTCATTA TTTATAGTGA GTAACAAAAG TTAGCACTTT CAAACTGACT AAAGTATGCC    10310
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      280


AGGGACGTAT CATGCATTTT ACAACATGCT CCACATATCT CCAAATATCA CATATTACGC    10370
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      280


TTGTAGTGGT AAACTGATAA TACATCTACC AACACTGAAA GTTCTCACAA GTCAGAACCC    10430
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      280


TATATTTGAC AGTTGTGGTC TCCCTCCTTC CCTCTGCATT TGTTGCTACA GATGATTACA    10490
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      280


CTGAGTTTTG TTTCTTGTCA TTTAGGTTAT GGATAATGAA AATAAG---- --CCAAAAGC    10544
                            V  M   D  N  E   N  K         P  K  A 
                                   .  |  .   .  |         |  .    
.......... .......... ..... D  S   S  N  N   G  K  R   G  P  S  I      292


CAGGGAAGTT GTCAAAAGAT TGATTGCTGA AGATGGATGG AAAGGTTTGT ACAGAGGGTT    10604
  R  E  V   V  K  R   L  I  A  E   D  G  W   K  G  L   Y  R  G  L 
     +  .   |  +  .   |  +     |      |  |      .      |  |  |  | 
  G  Q  T   V  R  N   L  V  R  E   G  G  W   T  A  C   Y  R  G  L      312


GGGTCCAAGA TTTTTCAGCT CATCAGCTTG GGGAACCTCA ATGATAGTAT GCTACGAGTA    10664
  G  P  R   F  F  S   S  S  A  W   G  T  S   M  I  V   C  Y  E  Y 
  |  |  |         |      |         .  |  +   |  |  .      |  |  + 
  G  P  R   C  A  S   M  S  M  S   A  T  T   M  I  T   T  Y  E  F      332


CCTGAGTATG TTTCGTCTTC CCTTGTCAAA TGTACACATG CATATGTAGT GTTATATATC    10724
  L                                                               
  |                                                               
  L  ..... .......... .......... .......... .......... ..........      333


ACTGCATCCC ATGCAGGTTA ATTTTAAGTA CCCAGATACT TCTTCTCATT TAGAATTTAG    10784
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


TTAAAATGAC ATCATTCAGG TCAGTTGGCA TCTCCAGTAC ACTGCTTTTG TAAGTTGTAT    10844
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


CATAAATCCC ATTTGCAATG AAATTTTTGA CTCAAGTTGC AGCCTGTAAC TTTTCTATAT    10904
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


TTTTCGAATA AAGCTATCAC CGTACATGAA ACCTGCTTCT GTTAATGCCA AGGAGCGCAC    10964
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


ATTATTTCCT GTAGACCGGC TTGGATGTTG AACAATTGGC ACATGCAAGT AGCAAAGAGC    11024
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


AGCCTTGTGC TTGCAACAAT CTGGTCCACC TGTGGATATG TTCGCTGTGA AAGAAACCAA    11084
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


TTAGTCCTTG TATGAAACAT GGTATTAGCG CTTCATGAAT AAAACCACTG ATTCTGATTT    11144
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


CTTATTTTCA ATGAATGGAT GGGCATTACC AAAGTTATCA TGATTAAAGA TCTATTTCAT    11204
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


ATAAGTTTAT TTTTATACAT TAGAGTTTAT TTAGAGAACA AGGTATATTT AGTTTTGGTA    11264
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


ATTTTGTGAA CTGCACTCAG ACGACTTTGG TATTCTTACT GTAATTTTGT TTTGTTTTCC    11324
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      333


TACAGAGCGC TTGTGTGCTA AAGTTGAAG- AGGTCTGATT TCTGAGCTGC CTTAA    11378
     K  R   L  C  A   K  V  E      G  L  I   S  E  L   P  * 
     |  |   |     |   |     .      |  .      |  +           
.....K  R   L  S  A   K  N  H  D   G  F  Y   S  K  -   S  *       349





********************************************************************************
Query protein sequence    1 (File: 21326111)

     1  DTTSRAAKIP SLHQTEINWD NLDKTKLYVV GAGMFSGVTV ALYPVSVVKT RMQVASGDAM
    61  RRNALATFKN ILKMDGVPGL YRGFATVIIG AVPTRIIFLT ALETTKAASL KLVEPFKLSE
   121  PVRAAFANGL AGLSASTCSQ AIFVPIDVIS QKLMVQGYSG NARYKGGLDV ARKVIKADGI
   181  RGLYRGFGLS VMTYAPSSAV WWASYGSSQR IIWSALGHLH DKEEAPSQLK LVGVQASGGV
   241  FAGAVTSFVT TPIDTIKTRL QVMDNENKPK AREVVKRLIA EDGWKGLYRG LGPRFFSSSA
   301  WGTSMIVCYE YLKRLCAKVE EV-

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   7997   8058 (  62 n);  Protein     1    21 (  21 aa); score: 1.000
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8720 ( 382 n);  Protein    22   148 ( 127 aa); score: 1.000
  Intron  2   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  3   8849   9045 ( 197 n);  Protein   149   214 (  66 aa); score: 1.000
  Intron  3   9046   9697 ( 652 n);  Pd: 0.462   Pa: 0.863
 Exon  4   9698   9839 ( 142 n);  Protein   215   261 (  47 aa); score: 1.000
  Intron  4   9840  10515 ( 676 n);  Pd: 0.947   Pa: 0.987
 Exon  5  10516  10669 ( 154 n);  Protein   262   312 (  51 aa); score: 1.000
  Intron  5  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  6  11330  11361 (  32 n);  Protein   313   322 (  10 aa); score: 1.000

MATCH	21326110+	21326111	1.000	969	1.000	P
PGS_21326110+_21326111	(7997  8058,8339  8720,8849  9045,9698  9839,10516  10669,11330  11361)

Alignment:

GATACAACCT CTAGGGCCGC CAAGATCCCG TCGCTCCACC AGACGGAGAT CAACTGGGAC     8056
 D  T  T   S  R  A  A   K  I  P   S  L  H   Q  T  E  I   N  W  D  
 |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  |   |  |  |  
 D  T  T   S  R  A  A   K  I  P   S  L  H   Q  T  E  I   N  W  D        20


AAGTAAGCAA CCCACCGGTC TTGTAATCCT TAGGTTCCCA TTTCGTGCCG ATTTCCGCCT     8116
 N                                                                
 |                                                                
 N........ .......... .......... .......... .......... ..........       21


TTCTCGCGGT CCTAGTCTTC TAGAAATTTA GCTCCGATCG TTGAACTGTG TCCCCCGCCC     8176
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       21


TATTCCGGAT TGGCTTTACC GCCTTAGGAT TAGGAACTGT TGTTTGAGGA TTTGACATGG     8236
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       21


CTGTTCATGC TCCTGGACGA AGCGCAATTT GGGGATTCGT AGCTTATTCC GTTGTTTCCG     8296
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       21


ATTCTTTTAC TGTTCTCAAA TCGTGGCGCT GAAATGTTGC AGCCTCGACA AGACCAAGCT     8356
                                                L  D   K  T  K  L 
                                                |  |   |  |  |  | 
.......... .......... .......... .......... ..  L  D   K  T  K  L       27


CTACGTGGTG GGCGCAGGCA TGTTCAGCGG CGTCACCGTG GCGCTGTATC CTGTCTCGGT     8416
  Y  V  V   G  A  G   M  F  S  G   V  T  V   A  L  Y   P  V  S  V 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  | 
  Y  V  V   G  A  G   M  F  S  G   V  T  V   A  L  Y   P  V  S  V       47


GGTCAAGACC CGGATGCAGG TTGCCTCTGG GGACGCCATG AGGAGGAACG CGCTGGCTAC     8476
  V  K  T   R  M  Q   V  A  S  G   D  A  M   R  R  N   A  L  A  T 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  | 
  V  K  T   R  M  Q   V  A  S  G   D  A  M   R  R  N   A  L  A  T       67


CTTCAAGAAC ATCCTCAAGA TGGACGGCGT GCCAGGGCTG TACCGGGGGT TTGCTACCGT     8536
  F  K  N   I  L  K   M  D  G  V   P  G  L   Y  R  G   F  A  T  V 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  | 
  F  K  N   I  L  K   M  D  G  V   P  G  L   Y  R  G   F  A  T  V       87


TATCATTGGG GCTGTACCAA CTAGGATCAT CTTCCTCACA GCGCTTGAGA CAACCAAAGC     8596
  I  I  G   A  V  P   T  R  I  I   F  L  T   A  L  E   T  T  K  A 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  | 
  I  I  G   A  V  P   T  R  I  I   F  L  T   A  L  E   T  T  K  A      107


AGCCTCGCTT AAGCTTGTTG AGCCCTTCAA GCTGTCAGAG CCGGTGCGGG CTGCCTTTGC     8656
  A  S  L   K  L  V   E  P  F  K   L  S  E   P  V  R   A  A  F  A 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  | 
  A  S  L   K  L  V   E  P  F  K   L  S  E   P  V  R   A  A  F  A      127


CAATGGCCTT GCTGGTCTGT CAGCGTCTAC ATGTTCGCAG GCTATTTTTG TTCCAATTGA     8716
  N  G  L   A  G  L   S  A  S  T   C  S  Q   A  I  F   V  P  I  D 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  | 
  N  G  L   A  G  L   S  A  S  T   C  S  Q   A  I  F   V  P  I  D      147


TGTGGTATGC CTCTCATGTG CCTTCTATGT GATGTTGTAT AGAGAAAAAA TATCTTACAA     8776
  V                                                               
  |                                                               
  V ...... .......... .......... .......... .......... ..........      148


TATGTTGATG TTAAATGCTA ATTACAATAC TAGACTACTG TTTTCATTCT GTTGTGCATT     8836
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      148


GGAATGTTTC AGATTAGCCA GAAATTGATG GTTCAAGGAT ATTCTGGTAA TGCCAGATAC     8896
              I  S  Q   K  L  M   V  Q  G   Y  S  G  N   A  R  Y  
              |  |  |   |  |  |   |  |  |   |  |  |  |   |  |  |  
.......... .. I  S  Q   K  L  M   V  Q  G   Y  S  G  N   A  R  Y       164


AAAGGTGGAT TAGATGTTGC TCGAAAGGTC ATAAAGGCTG ATGGCATTAG GGGGCTGTAC     8956
 K  G  G   L  D  V  A   R  K  V   I  K  A   D  G  I  R   G  L  Y  
 |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  |   |  |  |  
 K  G  G   L  D  V  A   R  K  V   I  K  A   D  G  I  R   G  L  Y       184


AGAGGATTTG GACTGTCTGT TATGACCTAT GCTCCATCCA GTGCTGTGTG GTGGGCAAGT     9016
 R  G  F   G  L  S  V   M  T  Y   A  P  S   S  A  V  W   W  A  S  
 |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  |   |  |  |  
 R  G  F   G  L  S  V   M  T  Y   A  P  S   S  A  V  W   W  A  S       204


TATGGTTCCA GCCAGCGCAT AATTTGGAGG TTAGCTTATC TGATTGGTTC ATCGTTATGT     9076
 Y  G  S   S  Q  R  I   I  W  S                                   
 |  |  |   |  |  |  |   |  |  |                                   
 Y  G  S   S  Q  R  I   I  W  S. .......... .......... ..........      214


TCCTCTCAGC CCTGTGTACT ATGTAATATT TACGAGAAAA AGACCAGTAA TACATTTCTA     9136
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


CTTAATAGTT ATTTGAATTG GTACTTTCCA TCTGTCCAAA ACCTTTTCAA ACTTCCCCTC     9196
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


TTGATGCTCA AACTGCAGCT ATAATTGCAA TTTTGTTTTC TGATGCTTGT TCTTCCATGT     9256
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


CAATATGTAC ATATCTTTTT TAGAAAACAA GAATGCATCT CAATGCATGT GCTGTATTGT     9316
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


TTTGATTAGA TTTATCATAG CGATCAATCA CATTTTCTTT ACAGATAAAA ATAGTCGGAA     9376
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


GGATAAGTTG GATAACTGAC CAAAGTGGAA ATATGATCTT ACATATTTTT ATCTCTGGCA     9436
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


GCTTAGAGAA CTTAATTACC AACCTGAAAC AATGTGATGA AGTAACTACA CAAAACCACA     9496
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


TATAGTTTCA TGCACTCTGC AAAACTAAAT TGAAACTCTT AGTGTGCTCT TAATGCTGTT     9556
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


AAGAGGGTGT ATGCAAGTTT ACTGGAATCA GTACCTTTTG TTAGTTTATT TCTTTGTGGT     9616
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


TGATGGTTGA AAGATTATAT TTCTTGTCTT GATAACTTAG CCAAAATAGT TAACTATTGT     9676
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


GCTTTTTACA TATTGGAACA GTGCTCTTGG CCATTTGCAT GACAAAGAAG AGGCTCCTAG     9736
                         A  L  G   H  L  H   D  K  E   E  A  P  S 
                         |  |  |   |  |  |   |  |  |   |  |  |  | 
.......... .......... .  A  L  G   H  L  H   D  K  E   E  A  P  S      227


CCAATTGAAA CTAGTTGGTG TTCAAGCATC AGGGGGGGTT TTTGCCGGTG CCGTGACCTC     9796
  Q  L  K   L  V  G   V  Q  A  S   G  G  V   F  A  G   A  V  T  S 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  | 
  Q  L  K   L  V  G   V  Q  A  S   G  G  V   F  A  G   A  V  T  S      247


TTTTGTTACG ACTCCCATAG ATACAATAAA GACCAGGCTG CAGGTACTGT GTGACATTCT     9856
  F  V  T   T  P  I   D  T  I  K   T  R  L   Q                    
  |  |  |   |  |  |   |  |  |  |   |  |  |   |                    
  F  V  T   T  P  I   D  T  I  K   T  R  L   Q ....... ..........      261


GTTTGCTGAT TACTCTTGTA ATTTGATTTG TGTGGGTATA TTTTGTGAGG CTTACCCTTG     9916
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TGACTTAATG ATTCTTGTCT TTACATTTAT GCTGCTCATT TGCAATAATT TGATTCCTTA     9976
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TCAATGCAAT GCCACTAAGT TTAGGGGAAT GGATATTTTG TTTTGGAAGT ATATTTGATG    10036
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TCAGACTTGA AGACCTAAAT GTTCTTTTAT ACTGATATTT CCTCCAATGG CGGGCTATTG    10096
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


AGGTGCTGGA CTGGAATGCT GTCTATATTA AACAATATAT ACTTCTATGT TTACAGCTGT    10156
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TTGTTTTCTG CTGACATACC ATGACCAATT TGTCATGGTT TCAGTTATGA GGTCAGAAAA    10216
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


AAAGAAACTT CCATTGGGAA AACTTGATAT CTATTACTTC ATTATTTATA GTGAGTAACA    10276
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


AAAGTTAGCA CTTTCAAACT GACTAAAGTA TGCCAGGGAC GTATCATGCA TTTTACAACA    10336
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TGCTCCACAT ATCTCCAAAT ATCACATATT ACGCTTGTAG TGGTAAACTG ATAATACATC    10396
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TACCAACACT GAAAGTTCTC ACAAGTCAGA ACCCTATATT TGACAGTTGT GGTCTCCCTC    10456
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


CTTCCCTCTG CATTTGTTGC TACAGATGAT TACACTGAGT TTTGTTTCTT GTCATTTAGG    10516
                                                                  
                                                                  
.......... .......... .......... .......... .......... .........       261


TTATGGATAA TGAAAATAAG CCAAAAGCCA GGGAAGTTGT CAAAAGATTG ATTGCTGAAG    10576
V  M  D  N   E  N  K   P  K  A   R  E  V  V   K  R  L   I  A  E   
|  |  |  |   |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   
V  M  D  N   E  N  K   P  K  A   R  E  V  V   K  R  L   I  A  E        281


ATGGATGGAA AGGTTTGTAC AGAGGGTTGG GTCCAAGATT TTTCAGCTCA TCAGCTTGGG    10636
D  G  W  K   G  L  Y   R  G  L   G  P  R  F   F  S  S   S  A  W   
|  |  |  |   |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   
D  G  W  K   G  L  Y   R  G  L   G  P  R  F   F  S  S   S  A  W        301


GAACCTCAAT GATAGTATGC TACGAGTACC TGAGTATGTT TCGTCTTCCC TTGTCAAATG    10696
G  T  S  M   I  V  C   Y  E  Y   L                                
|  |  |  |   |  |  |   |  |  |   |                                
G  T  S  M   I  V  C   Y  E  Y   L  ....... .......... ..........      312


TACACATGCA TATGTAGTGT TATATATCAC TGCATCCCAT GCAGGTTAAT TTTAAGTACC    10756
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


CAGATACTTC TTCTCATTTA GAATTTAGTT AAAATGACAT CATTCAGGTC AGTTGGCATC    10816
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


TCCAGTACAC TGCTTTTGTA AGTTGTATCA TAAATCCCAT TTGCAATGAA ATTTTTGACT    10876
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


CAAGTTGCAG CCTGTAACTT TTCTATATTT TTCGAATAAA GCTATCACCG TACATGAAAC    10936
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


CTGCTTCTGT TAATGCCAAG GAGCGCACAT TATTTCCTGT AGACCGGCTT GGATGTTGAA    10996
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


CAATTGGCAC ATGCAAGTAG CAAAGAGCAG CCTTGTGCTT GCAACAATCT GGTCCACCTG    11056
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


TGGATATGTT CGCTGTGAAA GAAACCAATT AGTCCTTGTA TGAAACATGG TATTAGCGCT    11116
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


TCATGAATAA AACCACTGAT TCTGATTTCT TATTTTCAAT GAATGGATGG GCATTACCAA    11176
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


AGTTATCATG ATTAAAGATC TATTTCATAT AAGTTTATTT TTATACATTA GAGTTTATTT    11236
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


AGAGAACAAG GTATATTTAG TTTTGGTAAT TTTGTGAACT GCACTCAGAC GACTTTGGTA    11296
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      312


TTCTTACTGT AATTTTGTTT TGTTTTCCTA CAGAGCGCTT GTGTGCTAAA GTTGAAGAGG    11356
                                    K  R  L   C  A  K   V  E  E   
                                    |  |  |   |  |  |   |  |  |   
.......... .......... .......... ...K  R  L   C  A  K   V  E  E        321


TCTGA    11361
V  * 
|    
V  *       323





********************************************************************************
Query protein sequence    2 (File: 12061241)

     1  DTTTRAKIPS LHHQTEINWD NLDKTKLYVV GAGMFSGVTV ALYPVSVIKT RMQVATGEAV
    61  RRNAAATFRN ILKVDGVPGL YRGFGTVITG AIPARIIFLT ALETTKAASL KLVEPFKLSE
   121  PVQAAFANGL GGLSASLCSQ AVFVPIDVVS QKLMVQGYSG HVRYKGGLDV AQQIIKADGI
   181  RGLYRGFGLS VMTYSPSSAV WWASYGSSQR IIWSAFDRWN DKESSPSQLT IVGVQATGGI
   241  IAGAVTSCVT TPIDTIKTRL QVNQNKPKAM EVVRRLIAED GWKGFYRGLG PRFFSSSAWG
   301  TSMIVCYEYL KRLCAKVEEV -

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   7997   8058 (  62 n);  Protein     1    21 (  21 aa); score: 0.750
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8720 ( 382 n);  Protein    22   148 ( 127 aa); score: 0.910
  Intron  2   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  3   8849   9045 ( 197 n);  Protein   149   214 (  66 aa); score: 0.932
  Intron  3   9046   9697 ( 652 n);  Pd: 0.462   Pa: 0.863
 Exon  4   9698   9839 ( 142 n);  Protein   215   261 (  47 aa); score: 0.706
  Intron  4   9840  10515 ( 676 n);  Pd: 0.947   Pa: 0.987
 Exon  5  10516  10669 ( 154 n);  Protein   262   310 (  49 aa); score: 0.863
  Intron  5  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  6  11330  11361 (  32 n);  Protein   311   320 (  10 aa); score: 1.000

MATCH	21326110+	12061241	0.864	969	1.006	P
PGS_21326110+_12061241	(7997  8058,8339  8720,8849  9045,9698  9839,10516  10669,11330  11361)

Alignment:

GATACAACCT CTAGGGCCGC CAAGATCCCG TCGCTC---C ACCAGACGGA GATCAACTGG     8053
 D  T  T   S  R  A  A   K  I  P   S  L      H  Q  T  E   I  N  W  
 |  |  |   +  |  |      |  |  |   |  |      |  |  |  |   |  |  |  
 D  T  T   T  R  A  -   K  I  P   S  L  H   H  Q  T  E   I  N  W        19


GACAAGTAAG CAACCCACCG GTCTTGTAAT CCTTAGGTTC CCATTTCGTG CCGATTTCCG     8113
 D  N                                                             
 |  |                                                             
 D  N..... .......... .......... .......... .......... ..........       21


CCTTTCTCGC GGTCCTAGTC TTCTAGAAAT TTAGCTCCGA TCGTTGAACT GTGTCCCCCG     8173
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       21


CCCTATTCCG GATTGGCTTT ACCGCCTTAG GATTAGGAAC TGTTGTTTGA GGATTTGACA     8233
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       21


TGGCTGTTCA TGCTCCTGGA CGAAGCGCAA TTTGGGGATT CGTAGCTTAT TCCGTTGTTT     8293
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       21


CCGATTCTTT TACTGTTCTC AAATCGTGGC GCTGAAATGT TGCAGCCTCG ACAAGACCAA     8353
                                                   L   D  K  T  K 
                                                   |   |  |  |  | 
.......... .......... .......... .......... .....  L   D  K  T  K       26


GCTCTACGTG GTGGGCGCAG GCATGTTCAG CGGCGTCACC GTGGCGCTGT ATCCTGTCTC     8413
  L  Y  V   V  G  A   G  M  F  S   G  V  T   V  A  L   Y  P  V  S 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  | 
  L  Y  V   V  G  A   G  M  F  S   G  V  T   V  A  L   Y  P  V  S       46


GGTGGTCAAG ACCCGGATGC AGGTTGCCTC TGGGGACGCC ATGAGGAGGA ACGCGCTGGC     8473
  V  V  K   T  R  M   Q  V  A  S   G  D  A   M  R  R   N  A  L  A 
  |  +  |   |  |  |   |  |  |  +   |  +  |   +  |  |   |  |     | 
  V  I  K   T  R  M   Q  V  A  T   G  E  A   V  R  R   N  A  A  A       66


TACCTTCAAG AACATCCTCA AGATGGACGG CGTGCCAGGG CTGTACCGGG GGTTTGCTAC     8533
  T  F  K   N  I  L   K  M  D  G   V  P  G   L  Y  R   G  F  A  T 
  |  |  +   |  |  |   |  +  |  |   |  |  |   |  |  |   |  |  .  | 
  T  F  R   N  I  L   K  V  D  G   V  P  G   L  Y  R   G  F  G  T       86


CGTTATCATT GGGGCTGTAC CAACTAGGAT CATCTTCCTC ACAGCGCTTG AGACAACCAA     8593
  V  I  I   G  A  V   P  T  R  I   I  F  L   T  A  L   E  T  T  K 
  |  |      |  |  +   |  .  |  |   |  |  |   |  |  |   |  |  |  | 
  V  I  T   G  A  I   P  A  R  I   I  F  L   T  A  L   E  T  T  K      106


AGCAGCCTCG CTTAAGCTTG TTGAGCCCTT CAAGCTGTCA GAGCCGGTGC GGGCTGCCTT     8653
  A  A  S   L  K  L   V  E  P  F   K  L  S   E  P  V   R  A  A  F 
  |  |  |   |  |  |   |  |  |  |   |  |  |   |  |  |   +  |  |  | 
  A  A  S   L  K  L   V  E  P  F   K  L  S   E  P  V   Q  A  A  F      126


TGCCAATGGC CTTGCTGGTC TGTCAGCGTC TACATGTTCG CAGGCTATTT TTGTTCCAAT     8713
  A  N  G   L  A  G   L  S  A  S   T  C  S   Q  A  I   F  V  P  I 
  |  |  |   |  .  |   |  |  |  |      |  |   |  |  +   |  |  |  | 
  A  N  G   L  G  G   L  S  A  S   L  C  S   Q  A  V   F  V  P  I      146


TGATGTGGTA TGCCTCTCAT GTGCCTTCTA TGTGATGTTG TATAGAGAAA AAATATCTTA     8773
  D  V                                                            
  |  |                                                            
  D  V ... .......... .......... .......... .......... ..........      148


CAATATGTTG ATGTTAAATG CTAATTACAA TACTAGACTA CTGTTTTCAT TCTGTTGTGC     8833
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      148


ATTGGAATGT TTCAGATTAG CCAGAAATTG ATGGTTCAAG GATATTCTGG TAATGCCAGA     8893
                 I  S   Q  K  L   M  V  Q   G  Y  S  G   N  A  R  
                 +  |   |  |  |   |  |  |   |  |  |  |   +  .  |  
.......... ..... V  S   Q  K  L   M  V  Q   G  Y  S  G   H  V  R       163


TACAAAGGTG GATTAGATGT TGCTCGAAAG GTCATAAAGG CTGATGGCAT TAGGGGGCTG     8953
 Y  K  G   G  L  D  V   A  R  K   V  I  K   A  D  G  I   R  G  L  
 |  |  |   |  |  |  |   |  +  +   +  |  |   |  |  |  |   |  |  |  
 Y  K  G   G  L  D  V   A  Q  Q   I  I  K   A  D  G  I   R  G  L       183


TACAGAGGAT TTGGACTGTC TGTTATGACC TATGCTCCAT CCAGTGCTGT GTGGTGGGCA     9013
 Y  R  G   F  G  L  S   V  M  T   Y  A  P   S  S  A  V   W  W  A  
 |  |  |   |  |  |  |   |  |  |   |  +  |   |  |  |  |   |  |  |  
 Y  R  G   F  G  L  S   V  M  T   Y  S  P   S  S  A  V   W  W  A       203


AGTTATGGTT CCAGCCAGCG CATAATTTGG AGGTTAGCTT ATCTGATTGG TTCATCGTTA     9073
 S  Y  G   S  S  Q  R   I  I  W   S                               
 |  |  |   |  |  |  |   |  |  |   |                               
 S  Y  G   S  S  Q  R   I  I  W   S........ .......... ..........      214


TGTTCCTCTC AGCCCTGTGT ACTATGTAAT ATTTACGAGA AAAAGACCAG TAATACATTT     9133
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


CTACTTAATA GTTATTTGAA TTGGTACTTT CCATCTGTCC AAAACCTTTT CAAACTTCCC     9193
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


CTCTTGATGC TCAAACTGCA GCTATAATTG CAATTTTGTT TTCTGATGCT TGTTCTTCCA     9253
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


TGTCAATATG TACATATCTT TTTTAGAAAA CAAGAATGCA TCTCAATGCA TGTGCTGTAT     9313
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


TGTTTTGATT AGATTTATCA TAGCGATCAA TCACATTTTC TTTACAGATA AAAATAGTCG     9373
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


GAAGGATAAG TTGGATAACT GACCAAAGTG GAAATATGAT CTTACATATT TTTATCTCTG     9433
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


GCAGCTTAGA GAACTTAATT ACCAACCTGA AACAATGTGA TGAAGTAACT ACACAAAACC     9493
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


ACATATAGTT TCATGCACTC TGCAAAACTA AATTGAAACT CTTAGTGTGC TCTTAATGCT     9553
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


GTTAAGAGGG TGTATGCAAG TTTACTGGAA TCAGTACCTT TTGTTAGTTT ATTTCTTTGT     9613
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


GGTTGATGGT TGAAAGATTA TATTTCTTGT CTTGATAACT TAGCCAAAAT AGTTAACTAT     9673
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      214


TGTGCTTTTT ACATATTGGA ACAGTGCTCT TGGCCATTTG CATGACAAAG AAGAGGCTCC     9733
                            A  L   G  H  L   H  D  K   E  E  A  P 
                            |  .      .      +  |  |   |  .  +  | 
.......... .......... ....  A  F   D  R  W   N  D  K   E  S  S  P      226


TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC     9793
  S  Q  L   K  L  V   G  V  Q  A   S  G  G   V  F  A   G  A  V  T 
  |  |  |      +  |   |  |  |  |   +  |  |   +  .  |   |  |  |  | 
  S  Q  L   T  I  V   G  V  Q  A   T  G  G   I  I  A   G  A  V  T      246


CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT     9853
  S  F  V   T  T  P   I  D  T  I   K  T  R   L  Q                 
  |     |   |  |  |   |  |  |  |   |  |  |   |  |                 
  S  C  V   T  T  P   I  D  T  I   K  T  R   L  Q .... ..........      261


TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC     9913
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC     9973
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG    10033
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA    10093
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC    10153
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA    10213
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA    10273
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA    10333
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC    10393
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC    10453
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT    10513
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      261


AGGTTATGGA TAATGAAAAT AAGCCAAAAG CCAGGGAAGT TGTCAAAAGA TTGATTGCTG    10573
   V  M  D   N  E  N   K  P  K   A  R  E  V   V  K  R   L  I  A   
   |         |  +  |   |  |  |   |     |  |   |  +  |   |  |  |   
.. V  -  -   N  Q  N   K  P  K   A  M  E  V   V  R  R   L  I  A        278


AAGATGGATG GAAAGGTTTG TACAGAGGGT TGGGTCCAAG ATTTTTCAGC TCATCAGCTT    10633
E  D  G  W   K  G  L   Y  R  G   L  G  P  R   F  F  S   S  S  A   
|  |  |  |   |  |  .   |  |  |   |  |  |  |   |  |  |   |  |  |   
E  D  G  W   K  G  F   Y  R  G   L  G  P  R   F  F  S   S  S  A        298


GGGGAACCTC AATGATAGTA TGCTACGAGT ACCTGAGTAT GTTTCGTCTT CCCTTGTCAA    10693
W  G  T  S   M  I  V   C  Y  E   Y  L                             
|  |  |  |   |  |  |   |  |  |   |  |                             
W  G  T  S   M  I  V   C  Y  E   Y  L  .... .......... ..........      310


ATGTACACAT GCATATGTAG TGTTATATAT CACTGCATCC CATGCAGGTT AATTTTAAGT    10753
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


ACCCAGATAC TTCTTCTCAT TTAGAATTTA GTTAAAATGA CATCATTCAG GTCAGTTGGC    10813
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


ATCTCCAGTA CACTGCTTTT GTAAGTTGTA TCATAAATCC CATTTGCAAT GAAATTTTTG    10873
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


ACTCAAGTTG CAGCCTGTAA CTTTTCTATA TTTTTCGAAT AAAGCTATCA CCGTACATGA    10933
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


AACCTGCTTC TGTTAATGCC AAGGAGCGCA CATTATTTCC TGTAGACCGG CTTGGATGTT    10993
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


GAACAATTGG CACATGCAAG TAGCAAAGAG CAGCCTTGTG CTTGCAACAA TCTGGTCCAC    11053
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


CTGTGGATAT GTTCGCTGTG AAAGAAACCA ATTAGTCCTT GTATGAAACA TGGTATTAGC    11113
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


GCTTCATGAA TAAAACCACT GATTCTGATT TCTTATTTTC AATGAATGGA TGGGCATTAC    11173
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


CAAAGTTATC ATGATTAAAG ATCTATTTCA TATAAGTTTA TTTTTATACA TTAGAGTTTA    11233
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


TTTAGAGAAC AAGGTATATT TAGTTTTGGT AATTTTGTGA ACTGCACTCA GACGACTTTG    11293
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      310


GTATTCTTAC TGTAATTTTG TTTTGTTTTC CTACAGAGCG CTTGTGTGCT AAAGTTGAAG    11353
                                       K  R   L  C  A   K  V  E   
                                       |  |   |  |  |   |  |  |   
.......... .......... .......... ......K  R   L  C  A   K  V  E        318


AGGTCTGA    11361
E  V  * 
|  |    
E  V  *       321





********************************************************************************
Query protein sequence    3 (File: 18496651)

     1  DTSTRAAKIP SLPQQTEINW DNLDMTKLYV VGAGMFSCVT VALYPVSVIK TRMQVASGEA
    61  MRRNALATFK NILKVDGVPG LYRGFGTVIT GAIPARIIFL TALEKTKATS LKLVEPLQLS
   121  ESMEAALANG LGGLTASLCS QAVFVPIDVV SQKLMVQGYS GHVRYKGGID VVQKIMKSDG
   181  PRGLYRGFGL SVMTYAPSSA VWWASYGFSQ RMIWSALGHL NDKEDAPSQL KIVGVQATGG
   241  MIAGAVTSCV STPLDTIKTR LQVNQNKPKA SEVVRRLIAE DGWKGFYRGL GPRFFSSSAW
   301  GTSMIVCYEY LKRVCAKVEE A-

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   7997   8058 (  62 n);  Protein     1    22 (  22 aa); score: 0.754
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8720 ( 382 n);  Protein    23   149 ( 127 aa); score: 0.846
  Intron  2   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  3   8849   9045 ( 197 n);  Protein   150   215 (  66 aa); score: 0.854
  Intron  3   9046   9697 ( 652 n);  Pd: 0.462   Pa: 0.863
 Exon  4   9698   9839 ( 142 n);  Protein   216   262 (  47 aa); score: 0.835
  Intron  4   9840  10515 ( 676 n);  Pd: 0.947   Pa: 0.987
 Exon  5  10516  10669 ( 154 n);  Protein   263   311 (  49 aa); score: 0.866
  Intron  5  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  6  11330  11361 (  32 n);  Protein   312   321 (  10 aa); score: 0.857

MATCH	21326110+	18496651	0.843	969	1.003	P
PGS_21326110+_18496651	(7997  8058,8339  8720,8849  9045,9698  9839,10516  10669,11330  11361)

Alignment:

GATACAACCT CTAGGGCCGC CAAGATCCCG TCGCTC---C ACCAGACGGA GATCAACTGG     8053
 D  T  T   S  R  A  A   K  I  P   S  L      H  Q  T  E   I  N  W  
 |  |  +   +  |  |  |   |  |  |   |  |      .  |  |  |   |  |  |  
 D  T  S   T  R  A  A   K  I  P   S  L  P   Q  Q  T  E   I  N  W        20


GACAAGTAAG CAACCCACCG GTCTTGTAAT CCTTAGGTTC CCATTTCGTG CCGATTTCCG     8113
 D  N                                                             
 |  |                                                             
 D  N..... .......... .......... .......... .......... ..........       22


CCTTTCTCGC GGTCCTAGTC TTCTAGAAAT TTAGCTCCGA TCGTTGAACT GTGTCCCCCG     8173
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       22


CCCTATTCCG GATTGGCTTT ACCGCCTTAG GATTAGGAAC TGTTGTTTGA GGATTTGACA     8233
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       22


TGGCTGTTCA TGCTCCTGGA CGAAGCGCAA TTTGGGGATT CGTAGCTTAT TCCGTTGTTT     8293
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       22


CCGATTCTTT TACTGTTCTC AAATCGTGGC GCTGAAATGT TGCAGCCTCG ACAAGACCAA     8353
                                                   L   D  K  T  K 
                                                   |   |     |  | 
.......... .......... .......... .......... .....  L   D  M  T  K       27


GCTCTACGTG GTGGGCGCAG GCATGTTCAG CGGCGTCACC GTGGCGCTGT ATCCTGTCTC     8413
  L  Y  V   V  G  A   G  M  F  S   G  V  T   V  A  L   Y  P  V  S 
  |  |  |   |  |  |   |  |  |  |      |  |   |  |  |   |  |  |  | 
  L  Y  V   V  G  A   G  M  F  S   C  V  T   V  A  L   Y  P  V  S       47


GGTGGTCAAG ACCCGGATGC AGGTTGCCTC TGGGGACGCC ATGAGGAGGA ACGCGCTGGC     8473
  V  V  K   T  R  M   Q  V  A  S   G  D  A   M  R  R   N  A  L  A 
  |  +  |   |  |  |   |  |  |  |   |  +  |   |  |  |   |  |  |  | 
  V  I  K   T  R  M   Q  V  A  S   G  E  A   M  R  R   N  A  L  A       67


TACCTTCAAG AACATCCTCA AGATGGACGG CGTGCCAGGG CTGTACCGGG GGTTTGCTAC     8533
  T  F  K   N  I  L   K  M  D  G   V  P  G   L  Y  R   G  F  A  T 
  |  |  |   |  |  |   |  +  |  |   |  |  |   |  |  |   |  |  .  | 
  T  F  K   N  I  L   K  V  D  G   V  P  G   L  Y  R   G  F  G  T       87


CGTTATCATT GGGGCTGTAC CAACTAGGAT CATCTTCCTC ACAGCGCTTG AGACAACCAA     8593
  V  I  I   G  A  V   P  T  R  I   I  F  L   T  A  L   E  T  T  K 
  |  |      |  |  +   |  .  |  |   |  |  |   |  |  |   |     |  | 
  V  I  T   G  A  I   P  A  R  I   I  F  L   T  A  L   E  K  T  K      107


AGCAGCCTCG CTTAAGCTTG TTGAGCCCTT CAAGCTGTCA GAGCCGGTGC GGGCTGCCTT     8653
  A  A  S   L  K  L   V  E  P  F   K  L  S   E  P  V   R  A  A  F 
  |  .  |   |  |  |   |  |  |  .   +  |  |   |     +   .  |  |  . 
  A  T  S   L  K  L   V  E  P  L   Q  L  S   E  S  M   E  A  A  L      127


TGCCAATGGC CTTGCTGGTC TGTCAGCGTC TACATGTTCG CAGGCTATTT TTGTTCCAAT     8713
  A  N  G   L  A  G   L  S  A  S   T  C  S   Q  A  I   F  V  P  I 
  |  |  |   |  .  |   |  +  |  |      |  |   |  |  +   |  |  |  | 
  A  N  G   L  G  G   L  T  A  S   L  C  S   Q  A  V   F  V  P  I      147


TGATGTGGTA TGCCTCTCAT GTGCCTTCTA TGTGATGTTG TATAGAGAAA AAATATCTTA     8773
  D  V                                                            
  |  |                                                            
  D  V ... .......... .......... .......... .......... ..........      149


CAATATGTTG ATGTTAAATG CTAATTACAA TACTAGACTA CTGTTTTCAT TCTGTTGTGC     8833
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      149


ATTGGAATGT TTCAGATTAG CCAGAAATTG ATGGTTCAAG GATATTCTGG TAATGCCAGA     8893
                 I  S   Q  K  L   M  V  Q   G  Y  S  G   N  A  R  
                 +  |   |  |  |   |  |  |   |  |  |  |   +  .  |  
.......... ..... V  S   Q  K  L   M  V  Q   G  Y  S  G   H  V  R       164


TACAAAGGTG GATTAGATGT TGCTCGAAAG GTCATAAAGG CTGATGGCAT TAGGGGGCTG     8953
 Y  K  G   G  L  D  V   A  R  K   V  I  K   A  D  G  I   R  G  L  
 |  |  |   |  +  |  |   .  +  |   +  +  |   +  |  |      |  |  |  
 Y  K  G   G  I  D  V   V  Q  K   I  M  K   S  D  G  P   R  G  L       184


TACAGAGGAT TTGGACTGTC TGTTATGACC TATGCTCCAT CCAGTGCTGT GTGGTGGGCA     9013
 Y  R  G   F  G  L  S   V  M  T   Y  A  P   S  S  A  V   W  W  A  
 |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  |   |  |  |  
 Y  R  G   F  G  L  S   V  M  T   Y  A  P   S  S  A  V   W  W  A       204


AGTTATGGTT CCAGCCAGCG CATAATTTGG AGGTTAGCTT ATCTGATTGG TTCATCGTTA     9073
 S  Y  G   S  S  Q  R   I  I  W   S                               
 |  |  |      |  |  |   +  |  |   |                               
 S  Y  G   F  S  Q  R   M  I  W   S........ .......... ..........      215


TGTTCCTCTC AGCCCTGTGT ACTATGTAAT ATTTACGAGA AAAAGACCAG TAATACATTT     9133
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


CTACTTAATA GTTATTTGAA TTGGTACTTT CCATCTGTCC AAAACCTTTT CAAACTTCCC     9193
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


CTCTTGATGC TCAAACTGCA GCTATAATTG CAATTTTGTT TTCTGATGCT TGTTCTTCCA     9253
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


TGTCAATATG TACATATCTT TTTTAGAAAA CAAGAATGCA TCTCAATGCA TGTGCTGTAT     9313
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


TGTTTTGATT AGATTTATCA TAGCGATCAA TCACATTTTC TTTACAGATA AAAATAGTCG     9373
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


GAAGGATAAG TTGGATAACT GACCAAAGTG GAAATATGAT CTTACATATT TTTATCTCTG     9433
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


GCAGCTTAGA GAACTTAATT ACCAACCTGA AACAATGTGA TGAAGTAACT ACACAAAACC     9493
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


ACATATAGTT TCATGCACTC TGCAAAACTA AATTGAAACT CTTAGTGTGC TCTTAATGCT     9553
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


GTTAAGAGGG TGTATGCAAG TTTACTGGAA TCAGTACCTT TTGTTAGTTT ATTTCTTTGT     9613
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


GGTTGATGGT TGAAAGATTA TATTTCTTGT CTTGATAACT TAGCCAAAAT AGTTAACTAT     9673
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


TGTGCTTTTT ACATATTGGA ACAGTGCTCT TGGCCATTTG CATGACAAAG AAGAGGCTCC     9733
                            A  L   G  H  L   H  D  K   E  E  A  P 
                            |  |   |  |  |   +  |  |   |  +  |  | 
.......... .......... ....  A  L   G  H  L   N  D  K   E  D  A  P      227


TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC     9793
  S  Q  L   K  L  V   G  V  Q  A   S  G  G   V  F  A   G  A  V  T 
  |  |  |   |  +  |   |  |  |  |   +  |  |   +  .  |   |  |  |  | 
  S  Q  L   K  I  V   G  V  Q  A   T  G  G   M  I  A   G  A  V  T      247


CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT     9853
  S  F  V   T  T  P   I  D  T  I   K  T  R   L  Q                 
  |     |   +  |  |   +  |  |  |   |  |  |   |  |                 
  S  C  V   S  T  P   L  D  T  I   K  T  R   L  Q .... ..........      262


TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC     9913
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC     9973
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG    10033
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA    10093
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC    10153
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA    10213
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA    10273
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA    10333
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC    10393
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC    10453
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT    10513
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


AGGTTATGGA TAATGAAAAT AAGCCAAAAG CCAGGGAAGT TGTCAAAAGA TTGATTGCTG    10573
   V  M  D   N  E  N   K  P  K   A  R  E  V   V  K  R   L  I  A   
   |         |  +  |   |  |  |   |     |  |   |  +  |   |  |  |   
.. V  -  -   N  Q  N   K  P  K   A  S  E  V   V  R  R   L  I  A        279


AAGATGGATG GAAAGGTTTG TACAGAGGGT TGGGTCCAAG ATTTTTCAGC TCATCAGCTT    10633
E  D  G  W   K  G  L   Y  R  G   L  G  P  R   F  F  S   S  S  A   
|  |  |  |   |  |  .   |  |  |   |  |  |  |   |  |  |   |  |  |   
E  D  G  W   K  G  F   Y  R  G   L  G  P  R   F  F  S   S  S  A        299


GGGGAACCTC AATGATAGTA TGCTACGAGT ACCTGAGTAT GTTTCGTCTT CCCTTGTCAA    10693
W  G  T  S   M  I  V   C  Y  E   Y  L                             
|  |  |  |   |  |  |   |  |  |   |  |                             
W  G  T  S   M  I  V   C  Y  E   Y  L  .... .......... ..........      311


ATGTACACAT GCATATGTAG TGTTATATAT CACTGCATCC CATGCAGGTT AATTTTAAGT    10753
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


ACCCAGATAC TTCTTCTCAT TTAGAATTTA GTTAAAATGA CATCATTCAG GTCAGTTGGC    10813
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


ATCTCCAGTA CACTGCTTTT GTAAGTTGTA TCATAAATCC CATTTGCAAT GAAATTTTTG    10873
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


ACTCAAGTTG CAGCCTGTAA CTTTTCTATA TTTTTCGAAT AAAGCTATCA CCGTACATGA    10933
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


AACCTGCTTC TGTTAATGCC AAGGAGCGCA CATTATTTCC TGTAGACCGG CTTGGATGTT    10993
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


GAACAATTGG CACATGCAAG TAGCAAAGAG CAGCCTTGTG CTTGCAACAA TCTGGTCCAC    11053
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


CTGTGGATAT GTTCGCTGTG AAAGAAACCA ATTAGTCCTT GTATGAAACA TGGTATTAGC    11113
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


GCTTCATGAA TAAAACCACT GATTCTGATT TCTTATTTTC AATGAATGGA TGGGCATTAC    11173
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


CAAAGTTATC ATGATTAAAG ATCTATTTCA TATAAGTTTA TTTTTATACA TTAGAGTTTA    11233
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


TTTAGAGAAC AAGGTATATT TAGTTTTGGT AATTTTGTGA ACTGCACTCA GACGACTTTG    11293
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


GTATTCTTAC TGTAATTTTG TTTTGTTTTC CTACAGAGCG CTTGTGTGCT AAAGTTGAAG    11353
                                       K  R   L  C  A   K  V  E   
                                       |  |   +  |  |   |  |  |   
.......... .......... .......... ......K  R   V  C  A   K  V  E        319


AGGTCTGA    11361
E  V  * 
|  .    
E  A  *       322





********************************************************************************
Query protein sequence    4 (File: 12278522)

     1  DTSTRAAKIP SLPQQTEINW DNLDMTKLYV VGAGMFSCVT VALYPVSVIK TRMQVASGEA
    61  MRRNALATFK NILKVDGVPG LYRGFGTVIT GAIPARIIFL TALEKTKATS LKLVEPLQLS
   121  ESMEAALANG LGGLTASLCS QAVFVPIDVV SQKLMVQGYS GHVRYKGGID VVQKIMKADG
   181  PRGLYRGFGL SVMTYAPSSA VWWASYGFSQ RVIWSALGRL DDKEDTPSQL KIVGVQATGG
   241  MVAGAVTSCV STPLDTIKTR LQVNINKPKA SEVVRRLIAE DGWKGFYRGL GPRFFSSSAW
   301  GTSMIVCYEY LKRVCAKVEE A-

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   7997   8058 (  62 n);  Protein     1    22 (  22 aa); score: 0.754
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8720 ( 382 n);  Protein    23   149 ( 127 aa); score: 0.846
  Intron  2   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  3   8849   9045 ( 197 n);  Protein   150   215 (  66 aa); score: 0.871
  Intron  3   9046   9697 ( 652 n);  Pd: 0.462   Pa: 0.863
 Exon  4   9698   9839 ( 142 n);  Protein   216   262 (  47 aa); score: 0.778
  Intron  4   9840  10515 ( 676 n);  Pd: 0.947   Pa: 0.987
 Exon  5  10516  10669 ( 154 n);  Protein   263   311 (  49 aa); score: 0.851
  Intron  5  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  6  11330  11361 (  32 n);  Protein   312   321 (  10 aa); score: 0.857

MATCH	21326110+	12278522	0.836	969	1.003	P
PGS_21326110+_12278522	(7997  8058,8339  8720,8849  9045,9698  9839,10516  10669,11330  11361)

Alignment:

GATACAACCT CTAGGGCCGC CAAGATCCCG TCGCTC---C ACCAGACGGA GATCAACTGG     8053
 D  T  T   S  R  A  A   K  I  P   S  L      H  Q  T  E   I  N  W  
 |  |  +   +  |  |  |   |  |  |   |  |      .  |  |  |   |  |  |  
 D  T  S   T  R  A  A   K  I  P   S  L  P   Q  Q  T  E   I  N  W        20


GACAAGTAAG CAACCCACCG GTCTTGTAAT CCTTAGGTTC CCATTTCGTG CCGATTTCCG     8113
 D  N                                                             
 |  |                                                             
 D  N..... .......... .......... .......... .......... ..........       22


CCTTTCTCGC GGTCCTAGTC TTCTAGAAAT TTAGCTCCGA TCGTTGAACT GTGTCCCCCG     8173
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       22


CCCTATTCCG GATTGGCTTT ACCGCCTTAG GATTAGGAAC TGTTGTTTGA GGATTTGACA     8233
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       22


TGGCTGTTCA TGCTCCTGGA CGAAGCGCAA TTTGGGGATT CGTAGCTTAT TCCGTTGTTT     8293
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       22


CCGATTCTTT TACTGTTCTC AAATCGTGGC GCTGAAATGT TGCAGCCTCG ACAAGACCAA     8353
                                                   L   D  K  T  K 
                                                   |   |     |  | 
.......... .......... .......... .......... .....  L   D  M  T  K       27


GCTCTACGTG GTGGGCGCAG GCATGTTCAG CGGCGTCACC GTGGCGCTGT ATCCTGTCTC     8413
  L  Y  V   V  G  A   G  M  F  S   G  V  T   V  A  L   Y  P  V  S 
  |  |  |   |  |  |   |  |  |  |      |  |   |  |  |   |  |  |  | 
  L  Y  V   V  G  A   G  M  F  S   C  V  T   V  A  L   Y  P  V  S       47


GGTGGTCAAG ACCCGGATGC AGGTTGCCTC TGGGGACGCC ATGAGGAGGA ACGCGCTGGC     8473
  V  V  K   T  R  M   Q  V  A  S   G  D  A   M  R  R   N  A  L  A 
  |  +  |   |  |  |   |  |  |  |   |  +  |   |  |  |   |  |  |  | 
  V  I  K   T  R  M   Q  V  A  S   G  E  A   M  R  R   N  A  L  A       67


TACCTTCAAG AACATCCTCA AGATGGACGG CGTGCCAGGG CTGTACCGGG GGTTTGCTAC     8533
  T  F  K   N  I  L   K  M  D  G   V  P  G   L  Y  R   G  F  A  T 
  |  |  |   |  |  |   |  +  |  |   |  |  |   |  |  |   |  |  .  | 
  T  F  K   N  I  L   K  V  D  G   V  P  G   L  Y  R   G  F  G  T       87


CGTTATCATT GGGGCTGTAC CAACTAGGAT CATCTTCCTC ACAGCGCTTG AGACAACCAA     8593
  V  I  I   G  A  V   P  T  R  I   I  F  L   T  A  L   E  T  T  K 
  |  |      |  |  +   |  .  |  |   |  |  |   |  |  |   |     |  | 
  V  I  T   G  A  I   P  A  R  I   I  F  L   T  A  L   E  K  T  K      107


AGCAGCCTCG CTTAAGCTTG TTGAGCCCTT CAAGCTGTCA GAGCCGGTGC GGGCTGCCTT     8653
  A  A  S   L  K  L   V  E  P  F   K  L  S   E  P  V   R  A  A  F 
  |  .  |   |  |  |   |  |  |  .   +  |  |   |     +   .  |  |  . 
  A  T  S   L  K  L   V  E  P  L   Q  L  S   E  S  M   E  A  A  L      127


TGCCAATGGC CTTGCTGGTC TGTCAGCGTC TACATGTTCG CAGGCTATTT TTGTTCCAAT     8713
  A  N  G   L  A  G   L  S  A  S   T  C  S   Q  A  I   F  V  P  I 
  |  |  |   |  .  |   |  +  |  |      |  |   |  |  +   |  |  |  | 
  A  N  G   L  G  G   L  T  A  S   L  C  S   Q  A  V   F  V  P  I      147


TGATGTGGTA TGCCTCTCAT GTGCCTTCTA TGTGATGTTG TATAGAGAAA AAATATCTTA     8773
  D  V                                                            
  |  |                                                            
  D  V ... .......... .......... .......... .......... ..........      149


CAATATGTTG ATGTTAAATG CTAATTACAA TACTAGACTA CTGTTTTCAT TCTGTTGTGC     8833
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      149


ATTGGAATGT TTCAGATTAG CCAGAAATTG ATGGTTCAAG GATATTCTGG TAATGCCAGA     8893
                 I  S   Q  K  L   M  V  Q   G  Y  S  G   N  A  R  
                 +  |   |  |  |   |  |  |   |  |  |  |   +  .  |  
.......... ..... V  S   Q  K  L   M  V  Q   G  Y  S  G   H  V  R       164


TACAAAGGTG GATTAGATGT TGCTCGAAAG GTCATAAAGG CTGATGGCAT TAGGGGGCTG     8953
 Y  K  G   G  L  D  V   A  R  K   V  I  K   A  D  G  I   R  G  L  
 |  |  |   |  +  |  |   .  +  |   +  +  |   |  |  |      |  |  |  
 Y  K  G   G  I  D  V   V  Q  K   I  M  K   A  D  G  P   R  G  L       184


TACAGAGGAT TTGGACTGTC TGTTATGACC TATGCTCCAT CCAGTGCTGT GTGGTGGGCA     9013
 Y  R  G   F  G  L  S   V  M  T   Y  A  P   S  S  A  V   W  W  A  
 |  |  |   |  |  |  |   |  |  |   |  |  |   |  |  |  |   |  |  |  
 Y  R  G   F  G  L  S   V  M  T   Y  A  P   S  S  A  V   W  W  A       204


AGTTATGGTT CCAGCCAGCG CATAATTTGG AGGTTAGCTT ATCTGATTGG TTCATCGTTA     9073
 S  Y  G   S  S  Q  R   I  I  W   S                               
 |  |  |      |  |  |   +  |  |   |                               
 S  Y  G   F  S  Q  R   V  I  W   S........ .......... ..........      215


TGTTCCTCTC AGCCCTGTGT ACTATGTAAT ATTTACGAGA AAAAGACCAG TAATACATTT     9133
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


CTACTTAATA GTTATTTGAA TTGGTACTTT CCATCTGTCC AAAACCTTTT CAAACTTCCC     9193
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


CTCTTGATGC TCAAACTGCA GCTATAATTG CAATTTTGTT TTCTGATGCT TGTTCTTCCA     9253
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


TGTCAATATG TACATATCTT TTTTAGAAAA CAAGAATGCA TCTCAATGCA TGTGCTGTAT     9313
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


TGTTTTGATT AGATTTATCA TAGCGATCAA TCACATTTTC TTTACAGATA AAAATAGTCG     9373
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


GAAGGATAAG TTGGATAACT GACCAAAGTG GAAATATGAT CTTACATATT TTTATCTCTG     9433
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


GCAGCTTAGA GAACTTAATT ACCAACCTGA AACAATGTGA TGAAGTAACT ACACAAAACC     9493
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


ACATATAGTT TCATGCACTC TGCAAAACTA AATTGAAACT CTTAGTGTGC TCTTAATGCT     9553
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


GTTAAGAGGG TGTATGCAAG TTTACTGGAA TCAGTACCTT TTGTTAGTTT ATTTCTTTGT     9613
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


GGTTGATGGT TGAAAGATTA TATTTCTTGT CTTGATAACT TAGCCAAAAT AGTTAACTAT     9673
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      215


TGTGCTTTTT ACATATTGGA ACAGTGCTCT TGGCCATTTG CATGACAAAG AAGAGGCTCC     9733
                            A  L   G  H  L   H  D  K   E  E  A  P 
                            |  |   |  .  |      |  |   |  +  .  | 
.......... .......... ....  A  L   G  R  L   D  D  K   E  D  T  P      227


TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC     9793
  S  Q  L   K  L  V   G  V  Q  A   S  G  G   V  F  A   G  A  V  T 
  |  |  |   |  +  |   |  |  |  |   +  |  |   +     |   |  |  |  | 
  S  Q  L   K  I  V   G  V  Q  A   T  G  G   M  V  A   G  A  V  T      247


CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT     9853
  S  F  V   T  T  P   I  D  T  I   K  T  R   L  Q                 
  |     |   +  |  |   +  |  |  |   |  |  |   |  |                 
  S  C  V   S  T  P   L  D  T  I   K  T  R   L  Q .... ..........      262


TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC     9913
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC     9973
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG    10033
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA    10093
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC    10153
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA    10213
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA    10273
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA    10333
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC    10393
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC    10453
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT    10513
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


AGGTTATGGA TAATGAAAAT AAGCCAAAAG CCAGGGAAGT TGTCAAAAGA TTGATTGCTG    10573
   V  M  D   N  E  N   K  P  K   A  R  E  V   V  K  R   L  I  A   
   |         |     |   |  |  |   |     |  |   |  +  |   |  |  |   
.. V  -  -   N  I  N   K  P  K   A  S  E  V   V  R  R   L  I  A        279


AAGATGGATG GAAAGGTTTG TACAGAGGGT TGGGTCCAAG ATTTTTCAGC TCATCAGCTT    10633
E  D  G  W   K  G  L   Y  R  G   L  G  P  R   F  F  S   S  S  A   
|  |  |  |   |  |  .   |  |  |   |  |  |  |   |  |  |   |  |  |   
E  D  G  W   K  G  F   Y  R  G   L  G  P  R   F  F  S   S  S  A        299


GGGGAACCTC AATGATAGTA TGCTACGAGT ACCTGAGTAT GTTTCGTCTT CCCTTGTCAA    10693
W  G  T  S   M  I  V   C  Y  E   Y  L                             
|  |  |  |   |  |  |   |  |  |   |  |                             
W  G  T  S   M  I  V   C  Y  E   Y  L  .... .......... ..........      311


ATGTACACAT GCATATGTAG TGTTATATAT CACTGCATCC CATGCAGGTT AATTTTAAGT    10753
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


ACCCAGATAC TTCTTCTCAT TTAGAATTTA GTTAAAATGA CATCATTCAG GTCAGTTGGC    10813
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


ATCTCCAGTA CACTGCTTTT GTAAGTTGTA TCATAAATCC CATTTGCAAT GAAATTTTTG    10873
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


ACTCAAGTTG CAGCCTGTAA CTTTTCTATA TTTTTCGAAT AAAGCTATCA CCGTACATGA    10933
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


AACCTGCTTC TGTTAATGCC AAGGAGCGCA CATTATTTCC TGTAGACCGG CTTGGATGTT    10993
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


GAACAATTGG CACATGCAAG TAGCAAAGAG CAGCCTTGTG CTTGCAACAA TCTGGTCCAC    11053
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


CTGTGGATAT GTTCGCTGTG AAAGAAACCA ATTAGTCCTT GTATGAAACA TGGTATTAGC    11113
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


GCTTCATGAA TAAAACCACT GATTCTGATT TCTTATTTTC AATGAATGGA TGGGCATTAC    11173
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


CAAAGTTATC ATGATTAAAG ATCTATTTCA TATAAGTTTA TTTTTATACA TTAGAGTTTA    11233
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


TTTAGAGAAC AAGGTATATT TAGTTTTGGT AATTTTGTGA ACTGCACTCA GACGACTTTG    11293
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      311


GTATTCTTAC TGTAATTTTG TTTTGTTTTC CTACAGAGCG CTTGTGTGCT AAAGTTGAAG    11353
                                       K  R   L  C  A   K  V  E   
                                       |  |   +  |  |   |  |  |   
.......... .......... .......... ......K  R   V  C  A   K  V  E        319


AGGTCTGA    11361
E  V  * 
|  .    
E  A  *       322





********************************************************************************
Query protein sequence    5 (File: 21553961)

     1  DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
    61  RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
   121  TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
   181  GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRFLGYGGD SDATTAPSKS KIVMVQAAGG
   241  IIAGATASSI TTPLDTIKTR LQVMGHQENR PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM
   301  SAWGTSMILT YEYLKRLCAI ED-

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   8000   8058 (  59 n);  Protein     1    20 (  20 aa); score: 0.467
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8720 ( 382 n);  Protein    21   147 ( 127 aa); score: 0.712
  Intron  2   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  3   8849   9045 ( 197 n);  Protein   148   213 (  66 aa); score: 0.837
  Intron  3   9046   9697 ( 652 n);  Pd: 0.462   Pa: 0.863
 Exon  4   9698   9839 ( 142 n);  Protein   214   262 (  49 aa); score: 0.496
  Intron  4   9840  10515 ( 676 n);  Pd: 0.947   Pa: 0.987
 Exon  5  10516  10669 ( 154 n);  Protein   263   314 (  52 aa); score: 0.754
  Intron  5  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  6  11330  11361 (  32 n);  Protein   315   322 (   8 aa); score: 0.590

MATCH	21326110+	21553961	0.697	966	0.997	P
PGS_21326110+_21553961	(8000  8058,8339  8720,8849  9045,9698  9839,10516  10669,11330  11361)

Alignment:

ACAACCTCTA GGGCCGCCAA GATCCCGTCG CTCCACCAGA CGGAGATCAA CTGGGACAAG     8059
 T  T  S   R  A  A  K   I  P  S   L  H  Q   T  E  I  N   W  D  N  
    |         .  +  +   |     |   .     |   |  |  |  |   |  |  .  
 D  T  P   P  T  S  R   I  A  S   F  G  Q   T  E  I  N   W  D  K.       20


TAAGCAACCC ACCGGTCTTG TAATCCTTAG GTTCCCATTT CGTGCCGATT TCCGCCTTTC     8119
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TCGCGGTCCT AGTCTTCTAG AAATTTAGCT CCGATCGTTG AACTGTGTCC CCCGCCCTAT     8179
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TCCGGATTGG CTTTACCGCC TTAGGATTAG GAACTGTTGT TTGAGGATTT GACATGGCTG     8239
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TTCATGCTCC TGGACGAAGC GCAATTTGGG GATTCGTAGC TTATTCCGTT GTTTCCGATT     8299
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


CTTTTACTGT TCTCAAATCG TGGCGCTGAA ATGTTGCAGC CTCGACAAGA CCAAGCTCTA     8359
                                             L  D  K   T  K  L  Y 
                                             |  |  |      +  .  | 
.......... .......... .......... .........   L  D  K   R  R  F  Y       27


CGTGGTGGGC GCAGGCATGT TCAGCGGCGT CACCGTGGCG CTGTATCCTG TCTCGGTGGT     8419
  V  V  G   A  G  M   F  S  G  V   T  V  A   L  Y  P   V  S  V  V 
  +     |   |  |  +   |  +  |  |   |  |  |   |  |  |   |  |  |  | 
  I  N  G   A  G  L   F  T  G  V   T  V  A   L  Y  P   V  S  V  V       47


CAAGACCCGG ATGCAGGTTG CCTCTGGGGA CGCCATGAGG AGGAACGCGC TGGCTACCTT     8479
  K  T  R   M  Q  V   A  S  G  D   A  M  R   R  N  A   L  A  T  F 
  |  |  |   +  |  |   |  |     +         .   |  +  |   .  +  .    
  K  T  R   L  Q  V   A  S  K  E   I  A  E   R  S  A   F  S  V  V       67


CAAGAACATC CTCAAGATGG ACGGCGTGCC AGGGCTGTAC CGGGGGTTTG CTACCGTTAT     8539
  K  N  I   L  K  M   D  G  V  P   G  L  Y   R  G  F   A  T  V  I 
  |  .  |   |  |      |  |  |  |   |  |  |   |  |  |   .  |  |  | 
  K  G  I   L  K  N   D  G  V  P   G  L  Y   R  G  F   G  T  V  I       87


CATTGGGGCT GTACCAACTA GGATCATCTT CCTCACAGCG CTTGAGACAA CCAAAGCAGC     8599
  I  G  A   V  P  T   R  I  I  F   L  T  A   L  E  T   T  K  A  A 
     |  |   |  |  .   |  |  |  |   |  |  |   |  |  |   |  |     + 
  T  G  A   V  P  A   R  I  I  F   L  T  A   L  E  T   T  K  I  S      107


CTCGCTTAAG CTTGTTGAGC CCTTCAAGCT GTCAGAGCCG GTGCGGGCTG CCTTTGCCAA     8659
  S  L  K   L  V  E   P  F  K  L   S  E  P   V  R  A   A  F  A  N 
  +  .  |   |  |      |  .  +  |   |  |  |   .  +  |   |  .  |  | 
  A  F  K   L  V  A   P  L  E  L   S  E  P   T  Q  A   A  I  A  N      127


TGGCCTTGCT GGTCTGTCAG CGTCTACATG TTCGCAGGCT ATTTTTGTTC CAATTGATGT     8719
  G  L  A   G  L  S   A  S  T  C   S  Q  A   I  F  V   P  I  D  V 
  |  +  |   |  +  +   |  |         |  |  |   +  |  |   |  |  |  | 
  G  I  A   G  M  T   A  S  L  F   S  Q  A   V  F  V   P  I  D  V      147


GGTATGCCTC TCATGTGCCT TCTATGTGAT GTTGTATAGA GAAAAAATAT CTTACAATAT     8779
                                                                  
                                                                  
 ......... .......... .......... .......... .......... ..........      147


GTTGATGTTA AATGCTAATT ACAATACTAG ACTACTGTTT TCATTCTGTT GTGCATTGGA     8839
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      147


ATGTTTCAGA TTAGCCAGAA ATTGATGGTT CAAGGATATT CTGGTAATGC CAGATACAAA     8899
           I  S  Q  K   L  M  V   Q  G  Y   S  G  N  A   R  Y  K  
           +  |  |  |   |  |  |   |  |  |   |  |  +  |      |     
.........  V  S  Q  K   L  M  V   Q  G  Y   S  G  H  A   T  Y  T       164


GGTGGATTAG ATGTTGCTCG AAAGGTCATA AAGGCTGATG GCATTAGGGG GCTGTACAGA     8959
 G  G  L   D  V  A  R   K  V  I   K  A  D   G  I  R  G   L  Y  R  
 |  |  +   |  |  |      |  +  |   |  +      |  +  |  |   |  |  |  
 G  G  I   D  V  A  T   K  I  I   K  S  Y   G  V  R  G   L  Y  R       184


GGATTTGGAC TGTCTGTTAT GACCTATGCT CCATCCAGTG CTGTGTGGTG GGCAAGTTAT     9019
 G  F  G   L  S  V  M   T  Y  A   P  S  S   A  V  W  W   A  S  Y  
 |  |  |   |  |  |  |   |  |  +   |  |  |   |  .  |  |   |  |  |  
 G  F  G   L  S  V  M   T  Y  S   P  S  S   A  A  W  W   A  S  Y       204


GGTTCCAGCC AGCGCATAAT TTGGAGGTTA GCTTATCTGA TTGGTTCATC GTTATGTTCC     9079
 G  S  S   Q  R  I  I   W  S                                      
 |  |  |   |  |  +  |   |                                         
 G  S  S   Q  R  V  I   W  R.... .......... .......... ..........      213


TCTCAGCCCT GTGTACTATG TAATATTTAC GAGAAAAAGA CCAGTAATAC ATTTCTACTT     9139
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


AATAGTTATT TGAATTGGTA CTTTCCATCT GTCCAAAACC TTTTCAAACT TCCCCTCTTG     9199
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


ATGCTCAAAC TGCAGCTATA ATTGCAATTT TGTTTTCTGA TGCTTGTTCT TCCATGTCAA     9259
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TATGTACATA TCTTTTTTAG AAAACAAGAA TGCATCTCAA TGCATGTGCT GTATTGTTTT     9319
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


GATTAGATTT ATCATAGCGA TCAATCACAT TTTCTTTACA GATAAAAATA GTCGGAAGGA     9379
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TAAGTTGGAT AACTGACCAA AGTGGAAATA TGATCTTACA TATTTTTATC TCTGGCAGCT     9439
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TAGAGAACTT AATTACCAAC CTGAAACAAT GTGATGAAGT AACTACACAA AACCACATAT     9499
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


AGTTTCATGC ACTCTGCAAA ACTAAATTGA AACTCTTAGT GTGCTCTTAA TGCTGTTAAG     9559
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


AGGGTGTATG CAAGTTTACT GGAATCAGTA CCTTTTGTTA GTTTATTTCT TTGTGGTTGA     9619
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TGGTTGAAAG ATTATATTTC TTGTCTTGAT AACTTAGCCA AAATAGTTAA CTATTGTGCT     9679
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TTTTACATAT TGGAACAGTG CTCTTGGCCA TTTGCATGAC AAAGAA---- --GAGGCTCC     9733
                      A  L  G  H   L  H  D   K  E         E  A  P 
                         |  |  +         |   .  +            |  | 
.......... ........   F  L  G  Y   G  G  D   S  D  A   T  T  A  P      227


TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC     9793
  S  Q  L   K  L  V   G  V  Q  A   S  G  G   V  F  A   G  A  V  T 
  |  +      |  +  |      |  |  |   +  |  |   +  .  |   |  |  .  . 
  S  K  S   K  I  V   M  V  Q  A   A  G  G   I  I  A   G  A  T  A      247


CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT     9853
  S  F  V   T  T  P   I  D  T  I   K  T  R   L  Q                 
  |     +   |  |  |   +  |  |  |   |  |  |   |  |                 
  S  S  I   T  T  P   L  D  T  I   K  T  R   L  Q .... ..........      262


TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC     9913
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC     9973
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG    10033
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA    10093
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC    10153
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA    10213
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA    10273
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA    10333
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC    10393
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC    10453
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT    10513
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


AGGTTATGGA TAAT---GAA AATAAGCCAA AAGCCAGGGA AGTTGTCAAA AGATTGATTG    10570
   V  M  D   N     E   N  K  P   K  A  R  E   V  V  K   R  L  I   
   |  |      +     |   |  +  |   .  |  +  +   |  |  |   +  |  +   
.. V  M  G   H  Q  E   N  R  P   S  A  K  Q   V  V  K   K  L  L        281


CTGAAGATGG ATGGAAAGGT TTGTACAGAG GGTTGGGTCC AAGATTTTTC AGCTCATCAG    10630
A  E  D  G   W  K  G   L  Y  R   G  L  G  P   R  F  F   S  S  S   
|  |  |  |   |  |  |   .  |  |   |  |  |  |   |  |  |   |     |   
A  E  D  G   W  K  G   F  Y  R   G  L  G  P   R  F  F   S  M  S        301


CTTGGGGAAC CTCAATGATA GTATGCTACG AGTACCTGAG TATGTTTCGT CTTCCCTTGT    10690
A  W  G  T   S  M  I   V  C  Y   E  Y  L                          
|  |  |  |   |  |  |   +     |   |  |  |                          
A  W  G  T   S  M  I   L  T  Y   E  Y  L  . .......... ..........      314


CAAATGTACA CATGCATATG TAGTGTTATA TATCACTGCA TCCCATGCAG GTTAATTTTA    10750
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


AGTACCCAGA TACTTCTTCT CATTTAGAAT TTAGTTAAAA TGACATCATT CAGGTCAGTT    10810
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


GGCATCTCCA GTACACTGCT TTTGTAAGTT GTATCATAAA TCCCATTTGC AATGAAATTT    10870
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TTGACTCAAG TTGCAGCCTG TAACTTTTCT ATATTTTTCG AATAAAGCTA TCACCGTACA    10930
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TGAAACCTGC TTCTGTTAAT GCCAAGGAGC GCACATTATT TCCTGTAGAC CGGCTTGGAT    10990
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


GTTGAACAAT TGGCACATGC AAGTAGCAAA GAGCAGCCTT GTGCTTGCAA CAATCTGGTC    11050
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


CACCTGTGGA TATGTTCGCT GTGAAAGAAA CCAATTAGTC CTTGTATGAA ACATGGTATT    11110
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


AGCGCTTCAT GAATAAAACC ACTGATTCTG ATTTCTTATT TTCAATGAAT GGATGGGCAT    11170
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TACCAAAGTT ATCATGATTA AAGATCTATT TCATATAAGT TTATTTTTAT ACATTAGAGT    11230
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TTATTTAGAG AACAAGGTAT ATTTAGTTTT GGTAATTTTG TGAACTGCAC TCAGACGACT    11290
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TTGGTATTCT TACTGTAATT TTGTTTTGTT TTCCTACAGA GCGCTTGTGT GCTAAAGTTG    11350
                                          K   R  L  C   A  K  V   
                                          |   |  |  |   |     +   
.......... .......... .......... .........K   R  L  C   A  -  I        320


AAGAGGTCTG A    11361
E  E  V  *  
|  +        
E  D  -  *        323





********************************************************************************
Query protein sequence    6 (File: 15292889)

     1  DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
    61  RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
   121  TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
   181  GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRFLGYGGD SDATAAPSKS KIVMVQAAGG
   241  IIAGATASSI TTPLDTIKTR LQVMGHQENR PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM
   301  SAWGTSMILT YEYLKRLCAI ED-

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   8000   8058 (  59 n);  Protein     1    20 (  20 aa); score: 0.467
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8720 ( 382 n);  Protein    21   147 ( 127 aa); score: 0.712
  Intron  2   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  3   8849   9045 ( 197 n);  Protein   148   213 (  66 aa); score: 0.837
  Intron  3   9046   9697 ( 652 n);  Pd: 0.462   Pa: 0.863
 Exon  4   9698   9839 ( 142 n);  Protein   214   262 (  49 aa); score: 0.498
  Intron  4   9840  10515 ( 676 n);  Pd: 0.947   Pa: 0.987
 Exon  5  10516  10669 ( 154 n);  Protein   263   314 (  52 aa); score: 0.754
  Intron  5  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  6  11330  11361 (  32 n);  Protein   315   322 (   8 aa); score: 0.590

MATCH	21326110+	15292889	0.697	966	0.997	P
PGS_21326110+_15292889	(8000  8058,8339  8720,8849  9045,9698  9839,10516  10669,11330  11361)

Alignment:

ACAACCTCTA GGGCCGCCAA GATCCCGTCG CTCCACCAGA CGGAGATCAA CTGGGACAAG     8059
 T  T  S   R  A  A  K   I  P  S   L  H  Q   T  E  I  N   W  D  N  
    |         .  +  +   |     |   .     |   |  |  |  |   |  |  .  
 D  T  P   P  T  S  R   I  A  S   F  G  Q   T  E  I  N   W  D  K.       20


TAAGCAACCC ACCGGTCTTG TAATCCTTAG GTTCCCATTT CGTGCCGATT TCCGCCTTTC     8119
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TCGCGGTCCT AGTCTTCTAG AAATTTAGCT CCGATCGTTG AACTGTGTCC CCCGCCCTAT     8179
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TCCGGATTGG CTTTACCGCC TTAGGATTAG GAACTGTTGT TTGAGGATTT GACATGGCTG     8239
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TTCATGCTCC TGGACGAAGC GCAATTTGGG GATTCGTAGC TTATTCCGTT GTTTCCGATT     8299
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


CTTTTACTGT TCTCAAATCG TGGCGCTGAA ATGTTGCAGC CTCGACAAGA CCAAGCTCTA     8359
                                             L  D  K   T  K  L  Y 
                                             |  |  |      +  .  | 
.......... .......... .......... .........   L  D  K   R  R  F  Y       27


CGTGGTGGGC GCAGGCATGT TCAGCGGCGT CACCGTGGCG CTGTATCCTG TCTCGGTGGT     8419
  V  V  G   A  G  M   F  S  G  V   T  V  A   L  Y  P   V  S  V  V 
  +     |   |  |  +   |  +  |  |   |  |  |   |  |  |   |  |  |  | 
  I  N  G   A  G  L   F  T  G  V   T  V  A   L  Y  P   V  S  V  V       47


CAAGACCCGG ATGCAGGTTG CCTCTGGGGA CGCCATGAGG AGGAACGCGC TGGCTACCTT     8479
  K  T  R   M  Q  V   A  S  G  D   A  M  R   R  N  A   L  A  T  F 
  |  |  |   +  |  |   |  |     +         .   |  +  |   .  +  .    
  K  T  R   L  Q  V   A  S  K  E   I  A  E   R  S  A   F  S  V  V       67


CAAGAACATC CTCAAGATGG ACGGCGTGCC AGGGCTGTAC CGGGGGTTTG CTACCGTTAT     8539
  K  N  I   L  K  M   D  G  V  P   G  L  Y   R  G  F   A  T  V  I 
  |  .  |   |  |      |  |  |  |   |  |  |   |  |  |   .  |  |  | 
  K  G  I   L  K  N   D  G  V  P   G  L  Y   R  G  F   G  T  V  I       87


CATTGGGGCT GTACCAACTA GGATCATCTT CCTCACAGCG CTTGAGACAA CCAAAGCAGC     8599
  I  G  A   V  P  T   R  I  I  F   L  T  A   L  E  T   T  K  A  A 
     |  |   |  |  .   |  |  |  |   |  |  |   |  |  |   |  |     + 
  T  G  A   V  P  A   R  I  I  F   L  T  A   L  E  T   T  K  I  S      107


CTCGCTTAAG CTTGTTGAGC CCTTCAAGCT GTCAGAGCCG GTGCGGGCTG CCTTTGCCAA     8659
  S  L  K   L  V  E   P  F  K  L   S  E  P   V  R  A   A  F  A  N 
  +  .  |   |  |      |  .  +  |   |  |  |   .  +  |   |  .  |  | 
  A  F  K   L  V  A   P  L  E  L   S  E  P   T  Q  A   A  I  A  N      127


TGGCCTTGCT GGTCTGTCAG CGTCTACATG TTCGCAGGCT ATTTTTGTTC CAATTGATGT     8719
  G  L  A   G  L  S   A  S  T  C   S  Q  A   I  F  V   P  I  D  V 
  |  +  |   |  +  +   |  |         |  |  |   +  |  |   |  |  |  | 
  G  I  A   G  M  T   A  S  L  F   S  Q  A   V  F  V   P  I  D  V      147


GGTATGCCTC TCATGTGCCT TCTATGTGAT GTTGTATAGA GAAAAAATAT CTTACAATAT     8779
                                                                  
                                                                  
 ......... .......... .......... .......... .......... ..........      147


GTTGATGTTA AATGCTAATT ACAATACTAG ACTACTGTTT TCATTCTGTT GTGCATTGGA     8839
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      147


ATGTTTCAGA TTAGCCAGAA ATTGATGGTT CAAGGATATT CTGGTAATGC CAGATACAAA     8899
           I  S  Q  K   L  M  V   Q  G  Y   S  G  N  A   R  Y  K  
           +  |  |  |   |  |  |   |  |  |   |  |  +  |      |     
.........  V  S  Q  K   L  M  V   Q  G  Y   S  G  H  A   T  Y  T       164


GGTGGATTAG ATGTTGCTCG AAAGGTCATA AAGGCTGATG GCATTAGGGG GCTGTACAGA     8959
 G  G  L   D  V  A  R   K  V  I   K  A  D   G  I  R  G   L  Y  R  
 |  |  +   |  |  |      |  +  |   |  +      |  +  |  |   |  |  |  
 G  G  I   D  V  A  T   K  I  I   K  S  Y   G  V  R  G   L  Y  R       184


GGATTTGGAC TGTCTGTTAT GACCTATGCT CCATCCAGTG CTGTGTGGTG GGCAAGTTAT     9019
 G  F  G   L  S  V  M   T  Y  A   P  S  S   A  V  W  W   A  S  Y  
 |  |  |   |  |  |  |   |  |  +   |  |  |   |  .  |  |   |  |  |  
 G  F  G   L  S  V  M   T  Y  S   P  S  S   A  A  W  W   A  S  Y       204


GGTTCCAGCC AGCGCATAAT TTGGAGGTTA GCTTATCTGA TTGGTTCATC GTTATGTTCC     9079
 G  S  S   Q  R  I  I   W  S                                      
 |  |  |   |  |  +  |   |                                         
 G  S  S   Q  R  V  I   W  R.... .......... .......... ..........      213


TCTCAGCCCT GTGTACTATG TAATATTTAC GAGAAAAAGA CCAGTAATAC ATTTCTACTT     9139
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


AATAGTTATT TGAATTGGTA CTTTCCATCT GTCCAAAACC TTTTCAAACT TCCCCTCTTG     9199
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


ATGCTCAAAC TGCAGCTATA ATTGCAATTT TGTTTTCTGA TGCTTGTTCT TCCATGTCAA     9259
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TATGTACATA TCTTTTTTAG AAAACAAGAA TGCATCTCAA TGCATGTGCT GTATTGTTTT     9319
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


GATTAGATTT ATCATAGCGA TCAATCACAT TTTCTTTACA GATAAAAATA GTCGGAAGGA     9379
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TAAGTTGGAT AACTGACCAA AGTGGAAATA TGATCTTACA TATTTTTATC TCTGGCAGCT     9439
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TAGAGAACTT AATTACCAAC CTGAAACAAT GTGATGAAGT AACTACACAA AACCACATAT     9499
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


AGTTTCATGC ACTCTGCAAA ACTAAATTGA AACTCTTAGT GTGCTCTTAA TGCTGTTAAG     9559
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


AGGGTGTATG CAAGTTTACT GGAATCAGTA CCTTTTGTTA GTTTATTTCT TTGTGGTTGA     9619
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TGGTTGAAAG ATTATATTTC TTGTCTTGAT AACTTAGCCA AAATAGTTAA CTATTGTGCT     9679
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      213


TTTTACATAT TGGAACAGTG CTCTTGGCCA TTTGCATGAC AAAGAA---- --GAGGCTCC     9733
                      A  L  G  H   L  H  D   K  E         E  A  P 
                         |  |  +         |   .  +            |  | 
.......... ........   F  L  G  Y   G  G  D   S  D  A   T  A  A  P      227


TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC     9793
  S  Q  L   K  L  V   G  V  Q  A   S  G  G   V  F  A   G  A  V  T 
  |  +      |  +  |      |  |  |   +  |  |   +  .  |   |  |  .  . 
  S  K  S   K  I  V   M  V  Q  A   A  G  G   I  I  A   G  A  T  A      247


CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT     9853
  S  F  V   T  T  P   I  D  T  I   K  T  R   L  Q                 
  |     +   |  |  |   +  |  |  |   |  |  |   |  |                 
  S  S  I   T  T  P   L  D  T  I   K  T  R   L  Q .... ..........      262


TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC     9913
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC     9973
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG    10033
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA    10093
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC    10153
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA    10213
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA    10273
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA    10333
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC    10393
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC    10453
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT    10513
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      262


AGGTTATGGA TAAT---GAA AATAAGCCAA AAGCCAGGGA AGTTGTCAAA AGATTGATTG    10570
   V  M  D   N     E   N  K  P   K  A  R  E   V  V  K   R  L  I   
   |  |      +     |   |  +  |   .  |  +  +   |  |  |   +  |  +   
.. V  M  G   H  Q  E   N  R  P   S  A  K  Q   V  V  K   K  L  L        281


CTGAAGATGG ATGGAAAGGT TTGTACAGAG GGTTGGGTCC AAGATTTTTC AGCTCATCAG    10630
A  E  D  G   W  K  G   L  Y  R   G  L  G  P   R  F  F   S  S  S   
|  |  |  |   |  |  |   .  |  |   |  |  |  |   |  |  |   |     |   
A  E  D  G   W  K  G   F  Y  R   G  L  G  P   R  F  F   S  M  S        301


CTTGGGGAAC CTCAATGATA GTATGCTACG AGTACCTGAG TATGTTTCGT CTTCCCTTGT    10690
A  W  G  T   S  M  I   V  C  Y   E  Y  L                          
|  |  |  |   |  |  |   +     |   |  |  |                          
A  W  G  T   S  M  I   L  T  Y   E  Y  L  . .......... ..........      314


CAAATGTACA CATGCATATG TAGTGTTATA TATCACTGCA TCCCATGCAG GTTAATTTTA    10750
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


AGTACCCAGA TACTTCTTCT CATTTAGAAT TTAGTTAAAA TGACATCATT CAGGTCAGTT    10810
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


GGCATCTCCA GTACACTGCT TTTGTAAGTT GTATCATAAA TCCCATTTGC AATGAAATTT    10870
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TTGACTCAAG TTGCAGCCTG TAACTTTTCT ATATTTTTCG AATAAAGCTA TCACCGTACA    10930
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TGAAACCTGC TTCTGTTAAT GCCAAGGAGC GCACATTATT TCCTGTAGAC CGGCTTGGAT    10990
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


GTTGAACAAT TGGCACATGC AAGTAGCAAA GAGCAGCCTT GTGCTTGCAA CAATCTGGTC    11050
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


CACCTGTGGA TATGTTCGCT GTGAAAGAAA CCAATTAGTC CTTGTATGAA ACATGGTATT    11110
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


AGCGCTTCAT GAATAAAACC ACTGATTCTG ATTTCTTATT TTCAATGAAT GGATGGGCAT    11170
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TACCAAAGTT ATCATGATTA AAGATCTATT TCATATAAGT TTATTTTTAT ACATTAGAGT    11230
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TTATTTAGAG AACAAGGTAT ATTTAGTTTT GGTAATTTTG TGAACTGCAC TCAGACGACT    11290
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      314


TTGGTATTCT TACTGTAATT TTGTTTTGTT TTCCTACAGA GCGCTTGTGT GCTAAAGTTG    11350
                                          K   R  L  C   A  K  V   
                                          |   |  |  |   |     +   
.......... .......... .......... .........K   R  L  C   A  -  I        320


AAGAGGTCTG A    11361
E  E  V  *  
|  +        
E  D  -  *        323





********************************************************************************
Query protein sequence    7 (File: 11358653)

     1  DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
    61  RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
   121  TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
   181  GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRLAMNVLS FLEFGFATKA TIPLIQYLLL
   241  LGRFLGYGGD SDATAAPSKS KIVMVQAAGG IIAGATASSI TTPLDTIKTR LQVMGHQENR
   301  PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM SAWGTSMILT YEYLKRLCAI ED-

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   8000   8058 (  59 n);  Protein     1    20 (  20 aa); score: 0.467
  Intron  1   8059   8338 ( 280 n);  Pd: 1.000   Pa: 0.998
 Exon  2   8339   8720 ( 382 n);  Protein    21   147 ( 127 aa); score: 0.712
  Intron  2   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  3   8849   9101 ( 253 n);  Protein   148   233 (  86 aa); score: 0.657
  Intron  3   9102   9437 ( 336 n);  Pd: 0.000   Pa: 0.346
 Exon  4   9438   9477 (  40 n);  Protein   234   246 (  13 aa); score: 0.042
  Intron  4   9478   9697 ( 220 n);  Pd: 0.924   Pa: 0.863
 Exon  5   9698   9839 ( 142 n);  Protein   247   292 (  46 aa); score: 0.555
  Intron  5   9840  10515 ( 676 n);  Pd: 0.947   Pa: 0.987
 Exon  6  10516  10669 ( 154 n);  Protein   293   344 (  52 aa); score: 0.754
  Intron  6  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  7  11330  11361 (  32 n);  Protein   345   352 (   8 aa); score: 0.590

MATCH	21326110+	11358653	0.667	1062	1.003	P
PGS_21326110+_11358653	(8000  8058,8339  8720,8849  9101,9438  9477,9698  9839,10516  10669,11330  11361)

Alignment:

ACAACCTCTA GGGCCGCCAA GATCCCGTCG CTCCACCAGA CGGAGATCAA CTGGGACAAG     8059
 T  T  S   R  A  A  K   I  P  S   L  H  Q   T  E  I  N   W  D  N  
    |         .  +  +   |     |   .     |   |  |  |  |   |  |  .  
 D  T  P   P  T  S  R   I  A  S   F  G  Q   T  E  I  N   W  D  K.       20


TAAGCAACCC ACCGGTCTTG TAATCCTTAG GTTCCCATTT CGTGCCGATT TCCGCCTTTC     8119
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TCGCGGTCCT AGTCTTCTAG AAATTTAGCT CCGATCGTTG AACTGTGTCC CCCGCCCTAT     8179
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TCCGGATTGG CTTTACCGCC TTAGGATTAG GAACTGTTGT TTGAGGATTT GACATGGCTG     8239
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


TTCATGCTCC TGGACGAAGC GCAATTTGGG GATTCGTAGC TTATTCCGTT GTTTCCGATT     8299
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........       20


CTTTTACTGT TCTCAAATCG TGGCGCTGAA ATGTTGCAGC CTCGACAAGA CCAAGCTCTA     8359
                                             L  D  K   T  K  L  Y 
                                             |  |  |      +  .  | 
.......... .......... .......... .........   L  D  K   R  R  F  Y       27


CGTGGTGGGC GCAGGCATGT TCAGCGGCGT CACCGTGGCG CTGTATCCTG TCTCGGTGGT     8419
  V  V  G   A  G  M   F  S  G  V   T  V  A   L  Y  P   V  S  V  V 
  +     |   |  |  +   |  +  |  |   |  |  |   |  |  |   |  |  |  | 
  I  N  G   A  G  L   F  T  G  V   T  V  A   L  Y  P   V  S  V  V       47


CAAGACCCGG ATGCAGGTTG CCTCTGGGGA CGCCATGAGG AGGAACGCGC TGGCTACCTT     8479
  K  T  R   M  Q  V   A  S  G  D   A  M  R   R  N  A   L  A  T  F 
  |  |  |   +  |  |   |  |     +         .   |  +  |   .  +  .    
  K  T  R   L  Q  V   A  S  K  E   I  A  E   R  S  A   F  S  V  V       67


CAAGAACATC CTCAAGATGG ACGGCGTGCC AGGGCTGTAC CGGGGGTTTG CTACCGTTAT     8539
  K  N  I   L  K  M   D  G  V  P   G  L  Y   R  G  F   A  T  V  I 
  |  .  |   |  |      |  |  |  |   |  |  |   |  |  |   .  |  |  | 
  K  G  I   L  K  N   D  G  V  P   G  L  Y   R  G  F   G  T  V  I       87


CATTGGGGCT GTACCAACTA GGATCATCTT CCTCACAGCG CTTGAGACAA CCAAAGCAGC     8599
  I  G  A   V  P  T   R  I  I  F   L  T  A   L  E  T   T  K  A  A 
     |  |   |  |  .   |  |  |  |   |  |  |   |  |  |   |  |     + 
  T  G  A   V  P  A   R  I  I  F   L  T  A   L  E  T   T  K  I  S      107


CTCGCTTAAG CTTGTTGAGC CCTTCAAGCT GTCAGAGCCG GTGCGGGCTG CCTTTGCCAA     8659
  S  L  K   L  V  E   P  F  K  L   S  E  P   V  R  A   A  F  A  N 
  +  .  |   |  |      |  .  +  |   |  |  |   .  +  |   |  .  |  | 
  A  F  K   L  V  A   P  L  E  L   S  E  P   T  Q  A   A  I  A  N      127


TGGCCTTGCT GGTCTGTCAG CGTCTACATG TTCGCAGGCT ATTTTTGTTC CAATTGATGT     8719
  G  L  A   G  L  S   A  S  T  C   S  Q  A   I  F  V   P  I  D  V 
  |  +  |   |  +  +   |  |         |  |  |   +  |  |   |  |  |  | 
  G  I  A   G  M  T   A  S  L  F   S  Q  A   V  F  V   P  I  D  V      147


GGTATGCCTC TCATGTGCCT TCTATGTGAT GTTGTATAGA GAAAAAATAT CTTACAATAT     8779
                                                                  
                                                                  
 ......... .......... .......... .......... .......... ..........      147


GTTGATGTTA AATGCTAATT ACAATACTAG ACTACTGTTT TCATTCTGTT GTGCATTGGA     8839
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      147


ATGTTTCAGA TTAGCCAGAA ATTGATGGTT CAAGGATATT CTGGTAATGC CAGATACAAA     8899
           I  S  Q  K   L  M  V   Q  G  Y   S  G  N  A   R  Y  K  
           +  |  |  |   |  |  |   |  |  |   |  |  +  |      |     
.........  V  S  Q  K   L  M  V   Q  G  Y   S  G  H  A   T  Y  T       164


GGTGGATTAG ATGTTGCTCG AAAGGTCATA AAGGCTGATG GCATTAGGGG GCTGTACAGA     8959
 G  G  L   D  V  A  R   K  V  I   K  A  D   G  I  R  G   L  Y  R  
 |  |  +   |  |  |      |  +  |   |  +      |  +  |  |   |  |  |  
 G  G  I   D  V  A  T   K  I  I   K  S  Y   G  V  R  G   L  Y  R       184


GGATTTGGAC TGTCTGTTAT GACCTATGCT CCATCCAGTG CTGTGTGGTG GGCAAGTTAT     9019
 G  F  G   L  S  V  M   T  Y  A   P  S  S   A  V  W  W   A  S  Y  
 |  |  |   |  |  |  |   |  |  +   |  |  |   |  .  |  |   |  |  |  
 G  F  G   L  S  V  M   T  Y  S   P  S  S   A  A  W  W   A  S  Y       204


GGTTCCAGCC AGCGCATAAT TTGGAGGTTA GCTTATCTGA TTGGTTCATC GTTAT-GTTC     9078
 G  S  S   Q  R  I  I   W  R  L   A  Y  L   I  G  S  S   L     F  
 |  |  |   |  |  +  |   |  |  |   |         +     |      |     |  
 G  S  S   Q  R  V  I   W  R  L   A  M  N   V  L  S  F   L  E  F       224


---CTCTCAG CCCTGTGTAC TATGTAATAT TTACGAGAAA AAGACCAGTA ATACATTTCT     9135
    L  S   A  L  C  T   M  Y                                      
    .  +   .     .  |   +                                         
 G  F  A   T  K  A  T   I  P.... .......... .......... ..........      233


ACTTAATAGT TATTTGAATT GGTACTTTCC ATCTGTCCAA AACCTTTTCA AACTTCCCCT     9195
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      233


CTTGATGCTC AAACTGCAGC TATAATTGCA ATTTTGTTTT CTGATGCTTG TTCTTCCATG     9255
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      233


TCAATATGTA CATATCTTTT TTAGAAAACA AGAATGCATC TCAATGCATG TGCTGTATTG     9315
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      233


TTTTGATTAG ATTTATCATA GCGATCAATC ACATTTTCTT TACAGATAAA AATAGTCGGA     9375
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      233


AGGATAAGTT GGATAACTGA CCAAAGTGGA AATATGATCT TACATATTTT TATCTCTGGC     9435
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      233


AGCTTAGAGA ACTTAATTAC CAACCTG-A- AACAATGTGA TGAAGTAACT ACACAAAACC     9493
    L  E   N  L  I  T   N  L      N  N  V   M  N                  
    |      .     +         |      .  .      +  .                  
..  L  I   Q  Y  L  L   L  L  -   G  R  F   L  G...... ..........      246


ACATATAGTT TCATGCACTC TGCAAAACTA AATTGAAACT CTTAGTGTGC TCTTAATGCT     9553
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      246


GTTAAGAGGG TGTATGCAAG TTTACTGGAA TCAGTACCTT TTGTTAGTTT ATTTCTTTGT     9613
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      246


GGTTGATGGT TGAAAGATTA TATTTCTTGT CTTGATAACT TAGCCAAAAT AGTTAACTAT     9673
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      246


TGTGCTTTTT ACATATTGGA ACAGTGCTCT TGGCCATTTG CATGACAAAG AAGAGGCTCC     9733
                            A  L   G  H  L   H  D  K   E  E  A  P 
                                   |            |            |  | 
.......... .......... ....  -  Y   G  G  D   S  D  A   T  A  A  P      257


TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC     9793
  S  Q  L   K  L  V   G  V  Q  A   S  G  G   V  F  A   G  A  V  T 
  |  +      |  +  |      |  |  |   +  |  |   +  .  |   |  |  .  . 
  S  K  S   K  I  V   M  V  Q  A   A  G  G   I  I  A   G  A  T  A      277


CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT     9853
  S  F  V   T  T  P   I  D  T  I   K  T  R   L  Q                 
  |     +   |  |  |   +  |  |  |   |  |  |   |  |                 
  S  S  I   T  T  P   L  D  T  I   K  T  R   L  Q .... ..........      292


TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC     9913
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC     9973
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG    10033
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA    10093
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC    10153
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA    10213
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA    10273
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA    10333
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC    10393
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC    10453
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT    10513
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      292


AGGTTATGGA TAAT---GAA AATAAGCCAA AAGCCAGGGA AGTTGTCAAA AGATTGATTG    10570
   V  M  D   N     E   N  K  P   K  A  R  E   V  V  K   R  L  I   
   |  |      +     |   |  +  |   .  |  +  +   |  |  |   +  |  +   
.. V  M  G   H  Q  E   N  R  P   S  A  K  Q   V  V  K   K  L  L        311


CTGAAGATGG ATGGAAAGGT TTGTACAGAG GGTTGGGTCC AAGATTTTTC AGCTCATCAG    10630
A  E  D  G   W  K  G   L  Y  R   G  L  G  P   R  F  F   S  S  S   
|  |  |  |   |  |  |   .  |  |   |  |  |  |   |  |  |   |     |   
A  E  D  G   W  K  G   F  Y  R   G  L  G  P   R  F  F   S  M  S        331


CTTGGGGAAC CTCAATGATA GTATGCTACG AGTACCTGAG TATGTTTCGT CTTCCCTTGT    10690
A  W  G  T   S  M  I   V  C  Y   E  Y  L                          
|  |  |  |   |  |  |   +     |   |  |  |                          
A  W  G  T   S  M  I   L  T  Y   E  Y  L  . .......... ..........      344


CAAATGTACA CATGCATATG TAGTGTTATA TATCACTGCA TCCCATGCAG GTTAATTTTA    10750
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


AGTACCCAGA TACTTCTTCT CATTTAGAAT TTAGTTAAAA TGACATCATT CAGGTCAGTT    10810
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


GGCATCTCCA GTACACTGCT TTTGTAAGTT GTATCATAAA TCCCATTTGC AATGAAATTT    10870
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


TTGACTCAAG TTGCAGCCTG TAACTTTTCT ATATTTTTCG AATAAAGCTA TCACCGTACA    10930
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


TGAAACCTGC TTCTGTTAAT GCCAAGGAGC GCACATTATT TCCTGTAGAC CGGCTTGGAT    10990
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


GTTGAACAAT TGGCACATGC AAGTAGCAAA GAGCAGCCTT GTGCTTGCAA CAATCTGGTC    11050
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


CACCTGTGGA TATGTTCGCT GTGAAAGAAA CCAATTAGTC CTTGTATGAA ACATGGTATT    11110
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


AGCGCTTCAT GAATAAAACC ACTGATTCTG ATTTCTTATT TTCAATGAAT GGATGGGCAT    11170
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


TACCAAAGTT ATCATGATTA AAGATCTATT TCATATAAGT TTATTTTTAT ACATTAGAGT    11230
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


TTATTTAGAG AACAAGGTAT ATTTAGTTTT GGTAATTTTG TGAACTGCAC TCAGACGACT    11290
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      344


TTGGTATTCT TACTGTAATT TTGTTTTGTT TTCCTACAGA GCGCTTGTGT GCTAAAGTTG    11350
                                          K   R  L  C   A  K  V   
                                          |   |  |  |   |     +   
.......... .......... .......... .........K   R  L  C   A  -  I        350


AAGAGGTCTG A    11361
E  E  V  *  
|  +        
E  D  -  *        353





********************************************************************************
Query protein sequence    8 (File: 13365793)

     1  AAAAAAETSE ASTAGLALAE ANINWQRRIL RSDGIPGAFR GFGTSAVGAL PGRVFALTSL
    61  EVSKEMAFKY SEHFDMSEAS RIAVANGIAG LVSSIFSSAY FVPLDVICQR LMAQGLPGMA
   121  TYRGPFDVIS KVVRTEGLRG LYRGFGITML TQSPASALWW SSYGGAQHAI WRSLGYGIDS
   181  QKKPSQSELV VVQATAGTIA GACSSIITTP IDTIKTRLQV MDNYGRGRPS VMKTTRVLLE
   241  EDGWRGFYRG FGPRFLNMSL WGTSMIVTYE LIKRLSVKPE -

Predicted gene structure (within gDNA segment 7800 to 11800):

 Exon  1   8403   8720 ( 318 n);  Protein     1   106 ( 106 aa); score: 0.385
  Intron  1   8721   8848 ( 128 n);  Pd: 0.797   Pa: 0.243
 Exon  2   8849   9045 ( 197 n);  Protein   107   172 (  66 aa); score: 0.596
  Intron  2   9046   9697 ( 652 n);  Pd: 0.462   Pa: 0.863
 Exon  3   9698   9839 ( 142 n);  Protein   173   219 (  47 aa); score: 0.591
  Intron  3   9840  10515 ( 676 n);  Pd: 0.947   Pa: 0.987
 Exon  4  10516  10669 ( 154 n);  Protein   220   272 (  53 aa); score: 0.500
  Intron  4  10670  11329 ( 660 n);  Pd: 0.990   Pa: 0.997
 Exon  5  11330  11361 (  32 n);  Protein   273   280 (   8 aa); score: 0.379

MATCH	21326110+	13365793	0.496	843	1.000	P
PGS_21326110+_13365793	(8403  8720,8849  9045,9698  9839,10516  10669,11330  11361)

Alignment:

TATCCTGTCT CGGTGGTCAA GACCCGGATG CAGGTTGCCT CTGGGGACGC CATGAGGAGG     8462
 Y  P  V   S  V  V  K   T  R  M   Q  V  A   S  G  D  A   M  R  R  
       .   +  .  .  +   |               .   +  |     |   +     .  
 A  A  A   A  A  A  E   T  S  E   A  S  T   A  G  L  A   L  A  E        20


AACGCGCTGG CTACCTTCAA GAACATCCTC AAGATGGACG GCGTGCCAGG GCTGTACCGG     8522
 N  A  L   A  T  F  K   N  I  L   K  M  D   G  V  P  G   L  Y  R  
       +            +   .  |  |   +     |   |  +  |  |      +  |  
 A  N  I   N  W  Q  R   R  I  L   R  S  D   G  I  P  G   A  F  R        40


GGGTTTGCTA CCGTTATCAT TGGGGCTGTA CCAACTAGGA TCATCTTCCT CACAGCGCTT     8582
 G  F  A   T  V  I  I   G  A  V   P  T  R   I  I  F  L   T  A  L  
 |  |  .   |        +   |  |  +   |     |   +  .     |   |  +  |  
 G  F  G   T  S  A  V   G  A  L   P  G  R   V  F  A  L   T  S  L        60


GAGACAACCA AAGCAGCCTC GCTTAAGCTT GTTGAGCCCT TCAAGCTGTC AGAGCCGGTG     8642
 E  T  T   K  A  A  S   L  K  L   V  E  P   F  K  L  S   E  P  V  
 |  .  +   |        +   .  |         |      |     +  |   |        
 E  V  S   K  E  M  A   F  K  Y   S  E  H   F  D  M  S   E  A  S        80


CGGGCTGCCT TTGCCAATGG CCTTGCTGGT CTGTCAGCGT CTACATGTTC GCAGGCTATT     8702
 R  A  A   F  A  N  G   L  A  G   L  S  A   S  T  C  S   Q  A  I  
 |     |      |  |  |   +  |  |   |     +   |        |   .  |     
 R  I  A   V  A  N  G   I  A  G   L  V  S   S  I  F  S   S  A  Y       100


TTTGTTCCAA TTGATGTGGT ATGCCTCTCA TGTGCCTTCT ATGTGATGTT GTATAGAGAA     8762
 F  V  P   I  D  V                                                
 |  |  |   +  |  |                                                
 F  V  P   L  D  V .. .......... .......... .......... ..........      106


AAAATATCTT ACAATATGTT GATGTTAAAT GCTAATTACA ATACTAGACT ACTGTTTTCA     8822
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      106


TTCTGTTGTG CATTGGAATG TTTCAGATTA GCCAGAAATT GATGGTTCAA GGATATTCTG     8882
                             I   S  Q  K  L   M  V  Q   G  Y  S   
                             |      |  +  |   |  .  |   |         
.......... .......... ...... I   C  Q  R  L   M  A  Q   G  L  P        117


GTAATGCCAG ATACAAAGGT GGATTAGATG TTGCTCGAAA GGTCATAAAG GCTGATGGCA     8942
G  N  A  R   Y  K  G   G  L  D   V  A  R  K   V  I  K   A  D  G   
|     |      |  +  |      .  |   |        |   |  +  +   .  +  |   
G  M  A  T   Y  R  G   P  F  D   V  I  S  K   V  V  R   T  E  G        137


TTAGGGGGCT GTACAGAGGA TTTGGACTGT CTGTTATGAC CTATGCTCCA TCCAGTGCTG     9002
I  R  G  L   Y  R  G   F  G  L   S  V  M  T   Y  A  P   S  S  A   
+  |  |  |   |  |  |   |  |  +   +  +  +  |      +  |   +  |  |   
L  R  G  L   Y  R  G   F  G  I   T  M  L  T   Q  S  P   A  S  A        157


TGTGGTGGGC AAGTTATGGT TCCAGCCAGC GCATAATTTG GAGGTTAGCT TATCTGATTG     9062
V  W  W  A   S  Y  G   S  S  Q   R  I  I  W   S                   
+  |  |  +   |  |  |   .  +  |   .     |  |                       
L  W  W  S   S  Y  G   G  A  Q   H  A  I  W   R....... ..........      172


GTTCATCGTT ATGTTCCTCT CAGCCCTGTG TACTATGTAA TATTTACGAG AAAAAGACCA     9122
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


GTAATACATT TCTACTTAAT AGTTATTTGA ATTGGTACTT TCCATCTGTC CAAAACCTTT     9182
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


TCAAACTTCC CCTCTTGATG CTCAAACTGC AGCTATAATT GCAATTTTGT TTTCTGATGC     9242
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


TTGTTCTTCC ATGTCAATAT GTACATATCT TTTTTAGAAA ACAAGAATGC ATCTCAATGC     9302
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


ATGTGCTGTA TTGTTTTGAT TAGATTTATC ATAGCGATCA ATCACATTTT CTTTACAGAT     9362
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


AAAAATAGTC GGAAGGATAA GTTGGATAAC TGACCAAAGT GGAAATATGA TCTTACATAT     9422
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


TTTTATCTCT GGCAGCTTAG AGAACTTAAT TACCAACCTG AAACAATGTG ATGAAGTAAC     9482
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


TACACAAAAC CACATATAGT TTCATGCACT CTGCAAAACT AAATTGAAAC TCTTAGTGTG     9542
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


CTCTTAATGC TGTTAAGAGG GTGTATGCAA GTTTACTGGA ATCAGTACCT TTTGTTAGTT     9602
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


TATTTCTTTG TGGTTGATGG TTGAAAGATT ATATTTCTTG TCTTGATAAC TTAGCCAAAA     9662
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      172


TAGTTAACTA TTGTGCTTTT TACATATTGG AACAGTGCTC TTGGCCATTT GCATGACAAA     9722
                                        A   L  G  H  L   H  D  K  
                                        +   |  |  +         |  .  
.......... .......... .......... .....  S   L  G  Y  G   I  D  S       180


GAAGAGGCTC CTAGCCAATT GAAACTAGTT GGTGTTCAAG CATCAGGGGG GGTTTTTGCC     9782
 E  E  A   P  S  Q  L   K  L  V   G  V  Q   A  S  G  G   V  F  A  
 +  +      |  |  |      +  |  |      |  |   |  +  .  |   .  .  |  
 Q  K  K   P  S  Q  S   E  L  V   V  V  Q   A  T  A  G   T  I  A       200


GGTGCCGTGA CCTCTTTTGT TACGACTCCC ATAGATACAA TAAAGACCAG GCTGCAGGTA     9842
 G  A  V   T  S  F  V   T  T  P   I  D  T   I  K  T  R   L  Q     
 |  |      +  |  .  +   |  |  |   |  |  |   |  |  |  |   |  |     
 G  A  C   S  S  I  I   T  T  P   I  D  T   I  K  T  R   L  Q ...      219


CTGTGTGACA TTCTGTTTGC TGATTACTCT TGTAATTTGA TTTGTGTGGG TATATTTTGT     9902
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


GAGGCTTACC CTTGTGACTT AATGATTCTT GTCTTTACAT TTATGCTGCT CATTTGCAAT     9962
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


AATTTGATTC CTTATCAATG CAATGCCACT AAGTTTAGGG GAATGGATAT TTTGTTTTGG    10022
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


AAGTATATTT GATGTCAGAC TTGAAGACCT AAATGTTCTT TTATACTGAT ATTTCCTCCA    10082
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


ATGGCGGGCT ATTGAGGTGC TGGACTGGAA TGCTGTCTAT ATTAAACAAT ATATACTTCT    10142
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


ATGTTTACAG CTGTTTGTTT TCTGCTGACA TACCATGACC AATTTGTCAT GGTTTCAGTT    10202
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


ATGAGGTCAG AAAAAAAGAA ACTTCCATTG GGAAAACTTG ATATCTATTA CTTCATTATT    10262
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


TATAGTGAGT AACAAAAGTT AGCACTTTCA AACTGACTAA AGTATGCCAG GGACGTATCA    10322
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


TGCATTTTAC AACATGCTCC ACATATCTCC AAATATCACA TATTACGCTT GTAGTGGTAA    10382
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


ACTGATAATA CATCTACCAA CACTGAAAGT TCTCACAAGT CAGAACCCTA TATTTGACAG    10442
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


TTGTGGTCTC CCTCCTTCCC TCTGCATTTG TTGCTACAGA TGATTACACT GAGTTTTGTT    10502
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      219


TCTTGTCATT TAGGTTATGG ATAAT----- -GAAAATAAG CCAAAAGCCA GGGAAGTTGT    10556
               V  M   D  N         E  N  K   P  K  A   R  E  V  V 
               |  |   |  |         .  .  +   |  .  .      +  .  . 
.......... ... V  M   D  N  Y  G   R  G  R   P  S  V   M  K  T  T      235


CAAAAGATTG ATTGCTGAAG ATGGATGGAA AGGTTTGTAC AGAGGGTTGG GTCCAAGATT    10616
  K  R  L   I  A  E   D  G  W  K   G  L  Y   R  G  L   G  P  R  F 
  +     |   +     |   |  |  |  +   |  .  |   |  |  .   |  |  |  | 
  R  V  L   L  E  E   D  G  W  R   G  F  Y   R  G  F   G  P  R  F      255


TTTCAGCTCA TCAGCTTGGG GAACCTCAAT GATAGTATGC TACGAGTACC TGAGTATGTT    10676
  F  S  S   S  A  W   G  T  S  M   I  V  C   Y  E  Y   L          
  .  +      |     |   |  |  |  |   |  |      |  |      +          
  L  N  M   S  L  W   G  T  S  M   I  V  T   Y  E  L   I  .......      272


TCGTCTTCCC TTGTCAAATG TACACATGCA TATGTAGTGT TATATATCAC TGCATCCCAT    10736
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


GCAGGTTAAT TTTAAGTACC CAGATACTTC TTCTCATTTA GAATTTAGTT AAAATGACAT    10796
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


CATTCAGGTC AGTTGGCATC TCCAGTACAC TGCTTTTGTA AGTTGTATCA TAAATCCCAT    10856
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


TTGCAATGAA ATTTTTGACT CAAGTTGCAG CCTGTAACTT TTCTATATTT TTCGAATAAA    10916
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


GCTATCACCG TACATGAAAC CTGCTTCTGT TAATGCCAAG GAGCGCACAT TATTTCCTGT    10976
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


AGACCGGCTT GGATGTTGAA CAATTGGCAC ATGCAAGTAG CAAAGAGCAG CCTTGTGCTT    11036
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


GCAACAATCT GGTCCACCTG TGGATATGTT CGCTGTGAAA GAAACCAATT AGTCCTTGTA    11096
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


TGAAACATGG TATTAGCGCT TCATGAATAA AACCACTGAT TCTGATTTCT TATTTTCAAT    11156
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


GAATGGATGG GCATTACCAA AGTTATCATG ATTAAAGATC TATTTCATAT AAGTTTATTT    11216
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


TTATACATTA GAGTTTATTT AGAGAACAAG GTATATTTAG TTTTGGTAAT TTTGTGAACT    11276
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      272


GCACTCAGAC GACTTTGGTA TTCTTACTGT AATTTTGTTT TGTTTTCCTA CAGAGCGCTT    11336
                                                          K  R  L 
                                                          |  |  | 
.......... .......... .......... .......... .......... ...K  R  L      275


GTGTGCTAAA GTTGAAGAGG TCTGA    11361
  C  A  K   V  E  E   V  * 
     .  |         |        
  S  V  K   -  P  E   -  *       281





********************************************************************************
Query protein sequence   19 (File: 23308305)

     1  NLGAAEEESA QEIHLPADIN WEMLDKSKFF VLGAALFSGV SGALYPAVLM KTRQQVCHSQ
    61  GSCIKTAFTL VRHEGLRGLY RGFGTSLMGT IPARALYMTA LEVTKSNVGS AAVSLGLTEA
   121  KAAAVANAVG GLSAAMAAQL VWTPVDVVSQ RLMVQGSAGL VNASRCNYVN GFDAFRKIVR
   181  ADGPKGLYRG FGISILTYAP SNAVWWASYS VAQRMVWGGI GCYVCKKDEE SGNNSTTMKP
   241  DSKTIMAVQG VSAAIAGSVS ALITMPLDTI KTRLQVLDGE DSSNNGKRGP SIGQTVRNLV
   301  REGGWTACYR GLGPRCASMS MSATTMITTY EFLKRLSAKN HDGFYSKS-

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1  11799  11690 ( 110 n);  Protein   292   327 (  36 aa); score: 0.126
  Intron  1  11689  10943 ( 747 n);  Pd: 0.998   Pa: 0.506
 Exon  2  10942  10876 (  67 n);  Protein   328   348 (  21 aa); score: 0.000

MATCH	21326110-	23308305	0.080	177	0.169	P
PGS_21326110-_23308305	(11799  11690,10942  10876)

Alignment:

ATAATTGAAA CCG-CATGAA AAGAAGAGTC AAAACAGCAT -CTCCTGGAA ACATTGTACC    11742
 I  I  E   T     M  K   R  R  V   K  T  A      S  W  K   H  C  T  
 |     +   |     +  +   .     |   +     .      .  |         |     
 I  G  Q   T  -  V  R   N  L  V   R  E  G   -  G  W  T   A  C  Y       309


ATGTCATTAC AA-C-TGGTC TCAATCATCC AGCCTCGCTA AAACAAGCCT TCACGTATAA    11684
 M  S  L   Q     W  S   Q  S  S   S  L  A   K  T  S  L   H        
    .  |                   |      |  +  +      |  +  +            
 R  G  L   G  P  R  C   A  S  M   S  M  S   A  T  T  M   I ......      327


AAGAATGATA AATGTGTACA TCTGATCTCG TTTATATTCA TCTAAAATGA TCAGCCTAAT    11624
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


CCATATTCAG CATAAGAGGC AAAAAAAAAT ATAGGCCCCT GCATTTTTTT GAGAATACTC    11564
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


TGCTCAGAGA ACCAAAGTTT GTAAGGACTA TTGTCTAGCA TCAAATGACG TCTTTAACAC    11504
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


CATCAAACAT CCATGTTTAA TACTTTCACC TATGTCAGCA AGAACTGAAG CTTCCGTTGG    11444
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


TGTACAGCAT TATTACACTT CTGCTAATAC AAAACTTCCA GAATTGTAGG AATTGCGTTC    11384
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


TGAGTTTAAG GCAGCTCAGA AATCAGACCT CTTCAACTTT AGCACACAAG CGCTCTGTAG    11324
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


GAAAACAAAA CAAAATTACA GTAAGAATAC CAAAGTCGTC TGAGTGCAGT TCACAAAATT    11264
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


ACCAAAACTA AATATACCTT GTTCTCTAAA TAAACTCTAA TGTATAAAAA TAAACTTATA    11204
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


TGAAATAGAT CTTTAATCAT GATAACTTTG GTAATGCCCA TCCATTCATT GAAAATAAGA    11144
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


AATCAGAATC AGTGGTTTTA TTCATGAAGC GCTAATACCA TGTTTCATAC AAGGACTAAT    11084
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


TGGTTTCTTT CACAGCGAAC ATATCCACAG GTGGACCAGA TTGTTGCAAG CACAAGGCTG    11024
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


CTCTTTGCTA CTTGCATGTG CCAATTGTTC AACATCCAAG CCGGTCTACA GGAAATAATG    10964
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      327


TGCGCTCCTT GGCATTAACA GAAGCAGGTT TCATGTACGG TGATA-G-CT TTATTCGAAA    10906
                        K  Q  V   S  C  T   V  I     L   Y  S  K  
                                  .                  |      +  |  
.......... .......... . T  T  Y   E  F  L   K  R  -  L   S  A  K       339


AATATAGAAA AGTTACAGGC TGCAACTTGA    10876
 N  I  E   K  L  Q  A   A  T  *  
 |     +      .     +      +     
 N  H  D   G  F  Y  S   K  S  *       349





********************************************************************************
Query protein sequence   11 (File: 21326111)

     1  DTTSRAAKIP SLHQTEINWD NLDKTKLYVV GAGMFSGVTV ALYPVSVVKT RMQVASGDAM
    61  RRNALATFKN ILKMDGVPGL YRGFATVIIG AVPTRIIFLT ALETTKAASL KLVEPFKLSE
   121  PVRAAFANGL AGLSASTCSQ AIFVPIDVIS QKLMVQGYSG NARYKGGLDV ARKVIKADGI
   181  RGLYRGFGLS VMTYAPSSAV WWASYGSSQR IIWSALGHLH DKEEAPSQLK LVGVQASGGV
   241  FAGAVTSFVT TPIDTIKTRL QVMDNENKPK AREVVKRLIA EDGWKGLYRG LGPRFFSSSA
   301  WGTSMIVCYE YLKRLCAKVE EV-

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1  11799  11740 (  60 n);  Protein   271   289 (  19 aa); score: 0.205
  Intron  1  11739  11069 ( 671 n);  Pd: 0.880   Pa: 0.693
 Exon  2  11068  10966 ( 103 n);  Protein   290   322 (  33 aa); score: 0.042

MATCH	21326110-	21326111	0.104	163	0.168	P
PGS_21326110-_21326111	(11799  11740,11068  10966)

Alignment:

ATAATTGAAA CCG-CATGAA AAGAAGAGTC AAAA-CAGCA TCTCCTGGAA AC-ATTGTAC    11743
 I  I  E   T     M  K   R  R  V   K     S   I  S  W  K      L  Y  
       |   .     +  |   |     +         .      .  |  |      |  |  
 A  R  E   V  -  V  K   R  L  I   A  -  E   D  G  W  K   G  L  Y       288


CATGTCATTA CAACTGGTCT CAATCATCCA GCCTCGCTAA AACAAGCCTT CACGTATAAA    11683
 H                                                                
 .                                                                
 R ....... .......... .......... .......... .......... ..........      289


AGAATGATAA ATGTGTACAT CTGATCTCGT TTATATTCAT CTAAAATGAT CAGCCTAATC    11623
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


CATATTCAGC ATAAGAGGCA AAAAAAAATA TAGGCCCCTG CATTTTTTTG AGAATACTCT    11563
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


GCTCAGAGAA CCAAAGTTTG TAAGGACTAT TGTCTAGCAT CAAATGACGT CTTTAACACC    11503
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


ATCAAACATC CATGTTTAAT ACTTTCACCT ATGTCAGCAA GAACTGAAGC TTCCGTTGGT    11443
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


GTACAGCATT ATTACACTTC TGCTAATACA AAACTTCCAG AATTGTAGGA ATTGCGTTCT    11383
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


GAGTTTAAGG CAGCTCAGAA ATCAGACCTC TTCAACTTTA GCACACAAGC GCTCTGTAGG    11323
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


AAAACAAAAC AAAATTACAG TAAGAATACC AAAGTCGTCT GAGTGCAGTT CACAAAATTA    11263
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


CCAAAACTAA ATATACCTTG TTCTCTAAAT AAACTCTAAT GTATAAAAAT AAACTTATAT    11203
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


GAAATAGATC TTTAATCATG ATAACTTTGG TAATGCCCAT CCATTCATTG AAAATAAGAA    11143
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


ATCAGAATCA GTGGTTTTAT TCATGAAGCG CTAATACCAT GTTTCATACA AGGACTAATT    11083
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      289


GGTTTCTTTC ACAGCGAACA TATCCA-C-A GGTGGACCAG ATTGTTGCAA GCACAAGGCT    11025
                R  T   Y  P      R  W  T  R   L  L  Q   A  Q  G   
                          |      |  +               .   |     |   
.......... .... G  L   G  P  -   R  F  F  S   S  -  S   A  W  G        302


GCTCTTTGCT AC-TTGCATG TGCCAATTGT TCAACATCCA AGCCGGTCTA CAGGAA-A-T    10968
C  S  L  L      C  M   C  Q  L   F  N  I  Q   A  G  L   Q  E      
   |  +  +      |            |      .  +      |     +   +  |      
T  S  M  I   V  C  Y   E  Y  L   K  R  L  C   A  K  V   E  E  V        322


AA    10966
* 
  
*       323





********************************************************************************
Query protein sequence   12 (File: 12061241)

     1  DTTTRAKIPS LHHQTEINWD NLDKTKLYVV GAGMFSGVTV ALYPVSVIKT RMQVATGEAV
    61  RRNAAATFRN ILKVDGVPGL YRGFGTVITG AIPARIIFLT ALETTKAASL KLVEPFKLSE
   121  PVQAAFANGL GGLSASLCSQ AVFVPIDVVS QKLMVQGYSG HVRYKGGLDV AQQIIKADGI
   181  RGLYRGFGLS VMTYSPSSAV WWASYGSSQR IIWSAFDRWN DKESSPSQLT IVGVQATGGI
   241  IAGAVTSCVT TPIDTIKTRL QVNQNKPKAM EVVRRLIAED GWKGFYRGLG PRFFSSSAWG
   301  TSMIVCYEYL KRLCAKVEEV -

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1  11799  11740 (  60 n);  Protein   269   287 (  19 aa); score: 0.176
  Intron  1  11739  11069 ( 671 n);  Pd: 0.880   Pa: 0.693
 Exon  2  11068  10966 ( 103 n);  Protein   288   320 (  33 aa); score: 0.042

MATCH	21326110-	12061241	0.094	163	0.169	P
PGS_21326110-_12061241	(11799  11740,11068  10966)

Alignment:

ATAATTGAAA CCG-CATGAA AAGAAGAGTC AAAA-CAGCA TCTCCTGGAA AC-ATTGTAC    11743
 I  I  E   T     M  K   R  R  V   K     S   I  S  W  K      L  Y  
    +  |   .     +  +   |     +         .      .  |  |      .  |  
 A  M  E   V  -  V  R   R  L  I   A  -  E   D  G  W  K   G  F  Y       286


CATGTCATTA CAACTGGTCT CAATCATCCA GCCTCGCTAA AACAAGCCTT CACGTATAAA    11683
 H                                                                
 .                                                                
 R ....... .......... .......... .......... .......... ..........      287


AGAATGATAA ATGTGTACAT CTGATCTCGT TTATATTCAT CTAAAATGAT CAGCCTAATC    11623
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


CATATTCAGC ATAAGAGGCA AAAAAAAATA TAGGCCCCTG CATTTTTTTG AGAATACTCT    11563
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


GCTCAGAGAA CCAAAGTTTG TAAGGACTAT TGTCTAGCAT CAAATGACGT CTTTAACACC    11503
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


ATCAAACATC CATGTTTAAT ACTTTCACCT ATGTCAGCAA GAACTGAAGC TTCCGTTGGT    11443
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


GTACAGCATT ATTACACTTC TGCTAATACA AAACTTCCAG AATTGTAGGA ATTGCGTTCT    11383
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


GAGTTTAAGG CAGCTCAGAA ATCAGACCTC TTCAACTTTA GCACACAAGC GCTCTGTAGG    11323
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


AAAACAAAAC AAAATTACAG TAAGAATACC AAAGTCGTCT GAGTGCAGTT CACAAAATTA    11263
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


CCAAAACTAA ATATACCTTG TTCTCTAAAT AAACTCTAAT GTATAAAAAT AAACTTATAT    11203
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


GAAATAGATC TTTAATCATG ATAACTTTGG TAATGCCCAT CCATTCATTG AAAATAAGAA    11143
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


ATCAGAATCA GTGGTTTTAT TCATGAAGCG CTAATACCAT GTTTCATACA AGGACTAATT    11083
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      287


GGTTTCTTTC ACAGCGAACA TATCCA-C-A GGTGGACCAG ATTGTTGCAA GCACAAGGCT    11025
                R  T   Y  P      R  W  T  R   L  L  Q   A  Q  G   
                          |      |  +               .   |     |   
.......... .... G  L   G  P  -   R  F  F  S   -  S  S   A  W  G        300


GCTCTTTGCT AC-TTGCATG TGCCAATTGT TCAACATCCA AGCCGGTCTA CAGGAA-A-T    10968
C  S  L  L      C  M   C  Q  L   F  N  I  Q   A  G  L   Q  E      
   |  +  +      |            |      .  +      |     +   +  |      
T  S  M  I   V  C  Y   E  Y  L   K  R  L  C   A  K  V   E  E  V        320


AA    10966
* 
  
*       321





********************************************************************************
Query protein sequence   20 (File: 21594326)

     1  SLGALMEEKR RATTSSSSSQ VHMSNDIDWQ MLDKSRFFFL GAALFSGVST ALYPIVVLKT
    61  RQQVSPTRVS CANISLAIAR LEGLKGFYKG FGTSLLGTIP ARALYMTALE ITKSSVGQAT
   121  VRLGLSDTTS LAVANGAAGL TSAVAAQTVW TPIDIVSQRL MVQGDVSLSK HLPGVMNSCR
   181  YRNGFDAFRK ILYTDGPRGF YRGFGISILT YAPSNAVWWA SYSLAQKSIW SRYKHSYNHK
   241  EDAGGSVVVQ ALSSATASGC SALVTMPVDT IKTRLQVLDA EENGRRRAMT VMQSVKSLMK
   301  EGGVGACYRG LGPRWVAMSM SATTMITTYE FLKRLATKKQ K-

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1  11799  11740 (  60 n);  Protein   291   309 (  19 aa); score: 0.209
  Intron  1  11739  11069 ( 671 n);  Pd: 0.880   Pa: 0.693
 Exon  2  11068  10966 ( 103 n);  Protein   310   341 (  32 aa); score: 0.044

MATCH	21326110-	21594326	0.108	163	0.159	P
PGS_21326110-_21594326	(11799  11740,11068  10966)

Alignment:

ATAATTGAAA CCG-CATGAA AAGAAGAGTC AAAACAGCAT CTCCTGGA-A -ACATTGTAC    11743
 I  I  E   T     M  K   R  R  V   K  T  A   S  P  G      T  L  Y  
 +  +  +   +     +  |         +   |     .   .     |      .     |  
 V  M  Q   S  -  V  K   S  L  M   K  E  G   G  V  G  -   A  C  Y       308


CATGTCATTA CAACTGGTCT CAATCATCCA GCCTCGCTAA AACAAGCCTT CACGTATAAA    11683
 H                                                                
 .                                                                
 R ....... .......... .......... .......... .......... ..........      309


AGAATGATAA ATGTGTACAT CTGATCTCGT TTATATTCAT CTAAAATGAT CAGCCTAATC    11623
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


CATATTCAGC ATAAGAGGCA AAAAAAAATA TAGGCCCCTG CATTTTTTTG AGAATACTCT    11563
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


GCTCAGAGAA CCAAAGTTTG TAAGGACTAT TGTCTAGCAT CAAATGACGT CTTTAACACC    11503
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


ATCAAACATC CATGTTTAAT ACTTTCACCT ATGTCAGCAA GAACTGAAGC TTCCGTTGGT    11443
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


GTACAGCATT ATTACACTTC TGCTAATACA AAACTTCCAG AATTGTAGGA ATTGCGTTCT    11383
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


GAGTTTAAGG CAGCTCAGAA ATCAGACCTC TTCAACTTTA GCACACAAGC GCTCTGTAGG    11323
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


AAAACAAAAC AAAATTACAG TAAGAATACC AAAGTCGTCT GAGTGCAGTT CACAAAATTA    11263
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


CCAAAACTAA ATATACCTTG TTCTCTAAAT AAACTCTAAT GTATAAAAAT AAACTTATAT    11203
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


GAAATAGATC TTTAATCATG ATAACTTTGG TAATGCCCAT CCATTCATTG AAAATAAGAA    11143
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


ATCAGAATCA GTGGTTTTAT TCATGAAGCG CTAATACCAT GTTTCATACA AGGACTAATT    11083
                                                                  
                                                                  
.......... .......... .......... .......... .......... ..........      309


GGTTTCTTTC ACAGCGAACA TATCCA-C-A GGTGGACCAG ATTGTTGCAA GCACAAGGCT    11025
                R  T   Y  P      R  W  T  R   L  L  Q   A  Q  G   
                          |      |  |  .      +     .   +     .   
.......... .... G  L   G  P  -   R  W  V  A   M  S  M   S  -  A        322


GCTCTTTGCT ACTTGCATGT GCCAATTGTT CAACATCCAA GCCGGTCTAC AGGAAATAA    10966
C  S  L  L   L  A  C   A  N  C   S  T  S  K   P  V  Y   R  K  * 
   +  +  +      .                .                      +  |    
T  T  M  I   T  T  Y   E  F  L   K  R  L  A   T  K  K   Q  K  *       342





********************************************************************************
Query protein sequence   15 (File: 21553961)

     1  DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
    61  RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
   121  TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
   181  GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRFLGYGGD SDATTAPSKS KIVMVQAAGG
   241  IIAGATASSI TTPLDTIKTR LQVMGHQENR PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM
   301  SAWGTSMILT YEYLKRLCAI ED-

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1  11799  11659 ( 141 n);  Protein   276   322 (  47 aa); score: 0.023

MATCH	21326110-	21553961	0.023	141	0.146	P
PGS_21326110-_21553961	(11799  11659)

Alignment:

ATAATTGAAA -CCGCATG-A -AAAGAAGAG TCAAAACAGC ATCTCCTGGA AACATTGT-A    11744
 I  I  E      R  M      K  E  E   S  K  Q   H  L  L  E   T  L     
 +  +  +      +  +         |  +   .     +      .     .      |     
 V  V  K   -  K  L  L   A  E  D   G  W  K   G  F  Y  R   G  L  G       294


CCATGTCATT ACAACTGGTC TCAATCATCC AGC-C-TCGC TAAAACAAGC CTTCACGTAT    11686
 P  C  H   Y  N  W  S   Q  S  S   S     S   L  K  Q  A   F  T  Y  
 |         +  +     |         .   +     |   +        .   +     |  
 P  R  F   F  S  M  S   A  W  G   T  -  S   M  I  L  T   Y  E  Y       313


---AAAAGAA TGATAAATGT GTACATCTGA    11659
    K  R   M  I  N  V   Y  I  *  
    |  |   +        +            
 L  K  R   L  C  A  I   E  D  *       323





********************************************************************************
Query protein sequence   16 (File: 15292889)

     1  DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
    61  RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
   121  TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
   181  GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRFLGYGGD SDATAAPSKS KIVMVQAAGG
   241  IIAGATASSI TTPLDTIKTR LQVMGHQENR PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM
   301  SAWGTSMILT YEYLKRLCAI ED-

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1  11799  11659 ( 141 n);  Protein   276   322 (  47 aa); score: 0.023

MATCH	21326110-	15292889	0.023	141	0.146	P
PGS_21326110-_15292889	(11799  11659)

Alignment:

ATAATTGAAA -CCGCATG-A -AAAGAAGAG TCAAAACAGC ATCTCCTGGA AACATTGT-A    11744
 I  I  E      R  M      K  E  E   S  K  Q   H  L  L  E   T  L     
 +  +  +      +  +         |  +   .     +      .     .      |     
 V  V  K   -  K  L  L   A  E  D   G  W  K   G  F  Y  R   G  L  G       294


CCATGTCATT ACAACTGGTC TCAATCATCC AGC-C-TCGC TAAAACAAGC CTTCACGTAT    11686
 P  C  H   Y  N  W  S   Q  S  S   S     S   L  K  Q  A   F  T  Y  
 |         +  +     |         .   +     |   +        .   +     |  
 P  R  F   F  S  M  S   A  W  G   T  -  S   M  I  L  T   Y  E  Y       313


---AAAAGAA TGATAAATGT GTACATCTGA    11659
    K  R   M  I  N  V   Y  I  *  
    |  |   +        +            
 L  K  R   L  C  A  I   E  D  *       323





********************************************************************************
Query protein sequence   17 (File: 11358653)

     1  DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
    61  RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
   121  TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
   181  GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRLAMNVLS FLEFGFATKA TIPLIQYLLL
   241  LGRFLGYGGD SDATAAPSKS KIVMVQAAGG IIAGATASSI TTPLDTIKTR LQVMGHQENR
   301  PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM SAWGTSMILT YEYLKRLCAI ED-

Predicted gene structure (within gDNA segment 11800 to 7800):

 Exon  1  11799  11659 ( 141 n);  Protein   306   352 (  47 aa); score: 0.023

MATCH	21326110-	11358653	0.023	141	0.133	P
PGS_21326110-_11358653	(11799  11659)

Alignment:

ATAATTGAAA -CCGCATG-A -AAAGAAGAG TCAAAACAGC ATCTCCTGGA AACATTGT-A    11744
 I  I  E      R  M      K  E  E   S  K  Q   H  L  L  E   T  L     
 +  +  +      +  +         |  +   .     +      .     .      |     
 V  V  K   -  K  L  L   A  E  D   G  W  K   G  F  Y  R   G  L  G       324


CCATGTCATT ACAACTGGTC TCAATCATCC AGC-C-TCGC TAAAACAAGC CTTCACGTAT    11686
 P  C  H   Y  N  W  S   Q  S  S   S     S   L  K  Q  A   F  T  Y  
 |         +  +     |         .   +     |   +        .   +     |  
 P  R  F   F  S  M  S   A  W  G   T  -  S   M  I  L  T   Y  E  Y       343


---AAAAGAA TGATAAATGT GTACATCTGA    11659
    K  R   M  I  N  V   Y  I  *  
    |  |   +        +            
 L  K  R   L  C  A  I   E  D  *       353