HOMEWORK #6

1. Please help him to identify this gene (name, accession #, authors, ...)
   name: Tetrahymena pyriformis mRNA for hemoglobin (GI 217409 [GenBank])
   accession #: D13920
   authors: Takagi,T., Iwaasa,H., Yuasa,H., Shikama,K., Takemasa,T. and Watanabe,Y. (Biochim. Biophys. Acta 1173 (1), 75-78 (1993))
2. Which organism does this gene belong to?
   Tetrahymena pyriformis. Lineage: Eukaryota; Alveolata; Ciliophora; Oligohymenophorea; Hymenostomatida; Tetrahymenina; Tetrahymena.
3. How many nucleotide, protein and structure records are known for this organism?
   Nucleotide (95)
   Protein (231)
   Structure (0)
4. Run a BLAST search for this gene. List the top 10 most similar sequences.

NO.1
dbj|D13920.1|TETHEMOGP
Tetrahymena pyriformis mRNA for hemoglobin

Length = 587
Score = 1148 bits (579), Expect = 0.0
Identities = 579/579 (100%) Strand = Plus / Plus

similar sequence

This hit is the query sequence itself (D13920), so query and subject are identical and the alignment is not listed.
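
The hit identifiers in this list follow the pipe-delimited form db|accession.version|locus, where dbj, gb and emb mark DDBJ, GenBank and EMBL records. A minimal Python sketch for splitting one apart; the function name parse_hit_id and the returned field names are illustrative assumptions, not part of BLAST:

```python
# Hypothetical helper: split a BLAST hit identifier such as
# "dbj|D13920.1|TETHEMOGP" into its parts.
SOURCE_DATABASES = {"dbj": "DDBJ", "gb": "GenBank", "emb": "EMBL"}

def parse_hit_id(hit_id):
    # The identifier is expected to have exactly three pipe-separated fields.
    db_tag, versioned_accession, locus = hit_id.strip().split("|")
    # "D13920.1" -> accession "D13920", version "1"
    accession, _, version = versioned_accession.partition(".")
    return {
        "database": SOURCE_DATABASES.get(db_tag, db_tag),
        "accession": accession,
        "version": int(version) if version else None,
        "locus": locus,
    }

print(parse_hit_id("dbj|D13920.1|TETHEMOGP"))
```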

 

NO.2
dbj|D13919.1|TETHEMOGT
Tetrahymena thermophila mRNA for hemoglobin

Length = 494
Score = 69.9 bits (35), Expect = 1e-09
Identities = 161/203 (79%) Strand = Plus / Plus

similar sequence

Query: 86  ttctacaagaaggtcttagctgatgaaagagtcaagcatttcttcaagaacaccgacatg 145
           ||||||||||| |||||||| ||||| |||||||| |||| ||| || |||||  | |||
Sbjct: 86  ttctacaagaaagtcttagcagatgatagagtcaaacattactttaaaaacactaatatg 145

Query: 146 gatcaccaaaccaagcaataaactgacttcctcaccatgctcttaggtggtcccaaccat 205
           || |||||| | ||| ||||    || || || || ||||| ||||| || ||||| |||
Sbjct: 146 gaacaccaagctaagtaataggaagattttcttactatgcttttaggaggacccaatcat 205

Query: 206 tacaagggtaaaaatatgactgaagctcacaagggtatgaacttgcaaaacttgcacttt 265
           || || || ||||| ||| |||||||||| || ||||||||| |  ||||||  ||||||
Sbjct: 206 tataaaggaaaaaacatggctgaagctcataaaggtatgaacctttaaaactctcacttt 265

Query: 266 gatgccatcattgaaaaccttgc 288
           || || |||||||| || |||||
Sbjct: 266 gacgctatcattgagaatcttgc 288
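
The identity count reported for hit NO.2 can be re-derived from the aligned rows themselves. A minimal sketch, assuming the Query and Sbjct rows are copied without their coordinates (count_identities is an illustrative name, not a BLAST function):

```python
def count_identities(query_row, sbjct_row):
    """Return (matches, aligned_length) for two equal-length aligned rows."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    # A column counts as an identity when both rows carry the same base;
    # gap characters ("-") never count as matches.
    matches = sum(
        q == s and q != "-"
        for q, s in zip(query_row.lower(), sbjct_row.lower())
    )
    return matches, len(query_row)

# The four Query/Sbjct row pairs of hit NO.2 (query positions 86-288).
ROWS = [
    ("ttctacaagaaggtcttagctgatgaaagagtcaagcatttcttcaagaacaccgacatg",
     "ttctacaagaaagtcttagcagatgatagagtcaaacattactttaaaaacactaatatg"),
    ("gatcaccaaaccaagcaataaactgacttcctcaccatgctcttaggtggtcccaaccat",
     "gaacaccaagctaagtaataggaagattttcttactatgcttttaggaggacccaatcat"),
    ("tacaagggtaaaaatatgactgaagctcacaagggtatgaacttgcaaaacttgcacttt",
     "tataaaggaaaaaacatggctgaagctcataaaggtatgaacctttaaaactctcacttt"),
    ("gatgccatcattgaaaaccttgc",
     "gacgctatcattgagaatcttgc"),
]

total_matches = sum(count_identities(q, s)[0] for q, s in ROWS)
total_length = sum(count_identities(q, s)[1] for q, s in ROWS)
print(total_matches, total_length)  # 161 203
```

The totals agree with the reported "Identities = 161/203".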


NO.3
gb|U56081.1|CEU56081
Caenorhabditis elegans T-BOX 12 (Ce-tbx-12) mRNA, complete cds

Length = 1384
Score = 48.1 bits (24), Expect = 0.005
Identities = 24/24 (100%) Strand = Plus / Minus

similar sequence

Query: 564 ccatggtacccgatcctcgaattc 587
           ||||||||||||||||||||||||
Sbjct: 36  ccatggtacccgatcctcgaattc 13
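
Hit NO.3 is a Plus / Minus match: the subject coordinates run downward (36 to 13) because the query aligns to the reverse complement of the database entry. A small sketch of that relationship, written without any external library (reverse_complement is an illustrative helper):

```python
# Translation table mapping each base to its complement (both cases).
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")

def reverse_complement(seq):
    """Complement every base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]

aligned = "ccatggtacccgatcctcgaattc"  # Query 564-587 as printed in the alignment

# Read back onto the plus strand of the subject, the same 24 bases
# would appear in this orientation (subject positions 13 to 36).
print(reverse_complement(aligned))  # gaattcgaggatcgggtaccatgg
```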

NO.4
emb|AL049870.3|CNS0000S
Human chromosome 14 DNA sequence , complete sequence

Length = 205035
Score = 48.1 bits (24), Expect = 0.005
Identities = 30/32 (93%) Strand = Plus / Plus

similar sequence

Query: 500    taatgagagtatttattgtgtattgttatgta 531
              ||||||||||||||||| | ||||||||||||
Sbjct: 187601 taatgagagtatttattttatattgttatgta 187632
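
The Score and Identities lines of each hit have a regular layout, so they can be pulled apart with two regular expressions. The sketch below is only an illustration for the text format shown in this report (parse_hit_summary and its field names are assumptions). It also recomputes the percentage: 30/32 is 93.75%, which the report prints as 93%, so the value appears to be truncated rather than rounded.

```python
import re

SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\)\s+Strand = (\w+) / (\w+)"
)

def parse_hit_summary(score_line, identities_line):
    """Extract the numeric fields from the two summary lines of one hit."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identities_line)
    if score is None or ident is None:
        raise ValueError("unrecognised summary line")
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
        "matches": matches,
        "length": length,
        "percent_reported": int(ident.group(3)),
        # Integer division truncates, which reproduces the printed value here.
        "percent_truncated": 100 * matches // length,
        "strands": (ident.group(4), ident.group(5)),
    }

summary = parse_hit_summary(
    "Score = 48.1 bits (24), Expect = 0.005",
    "Identities = 30/32 (93%) Strand = Plus / Plus",
)
print(summary)
```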

 

NO.5

gb|AF194867.1|AF194867
Portulaca oleracea NADH dehydrogenase (ndhF) gene, partial cds;
chloroplast gene for chloroplast product

Length = 2087
Score = 46.1 bits (23), Expect = 0.019
Identities = 23/23 (100%) Strand = Plus / Plus

similar sequence

Query: 436 tcatcaaataaatagtaattcta 458
           |||||||||||||||||||||||
Sbjct: 122 tcatcaaataaatagtaattcta 144

NO.6

gb|AF194854.1|AF194854
Portulaca molokiniensis NADH dehydrogenase (ndhF) gene, partial cds;
chloroplast gene for chloroplast product

Length = 2086
Score = 46.1 bits (23), Expect = 0.019
Identities = 23/23 (100%) Strand = Plus / Plus

similar sequence

Query: 436 tcatcaaataaatagtaattcta 458
           |||||||||||||||||||||||
Sbjct: 121 tcatcaaataaatagtaattcta 143

NO.7

gb|AF194853.1|AF194853
Portulaca grandiflora NADH dehydrogenase (ndhF) gene, partial cds;
chloroplast gene for chloroplast product

Length = 2141
Score = 46.1 bits (23), Expect = 0.019
Identities = 23/23 (100%) Strand = Plus / Plus

similar sequence

Query: 436 tcatcaaataaatagtaattcta 458
           |||||||||||||||||||||||
Sbjct: 185 tcatcaaataaatagtaattcta 207

NO.8

emb|AJ235828.1|MBR235828
Mostuea brunonis chloroplast ndhF gene

Length = 2202
Score = 46.1 bits (23), Expect = 0.019
Identities = 23/23 (100%) Strand = Plus / Plus

similar sequence

Query: 436 tcatcaaataaatagtaattcta 458
           |||||||||||||||||||||||
Sbjct: 180 tcatcaaataaatagtaattcta 202

NO.9

gb|AF130178.1|AF130178
Montinia caryophyllacea NADH dehydrogenase subunit F (ndhF) gene, partial cds; chloroplast gene for chloroplast product

Length = 2197
Score = 44.1 bits (22), Expect = 0.077
Identities = 22/22 (100%) Strand = Plus / Plus

similar sequence

Query: 438 atcaaataaatagtaattctac 459
           ||||||||||||||||||||||
Sbjct: 156 atcaaataaatagtaattctac 177

NO.10

gb|AE003647.1|AE003647
Drosophila melanogaster genomic scaffold 142000013386055 section 40 of 63, complete sequence

Length = 262205
Score = 40.1 bits (20), Expect = 1.2
Identities = 20/20 (100%) Strand = Plus / Plus

similar sequence

Query: 477   atttcaataaaaattattta 496
             ||||||||||||||||||||
Sbjct: 42127 atttcaataaaaattattta 42146
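
Across hits NO.3 to NO.10 the Expect value rises by roughly a factor of four for every 2 bits of score lost. That is what the usual BLAST relation E = m * n * 2^(-S') predicts when the search space m * n stays fixed (S' is the bit score). A rough sketch of the check; the helper name is an assumption, and since the report rounds both columns only approximate agreement should be expected:

```python
def rescale_expect(known_bits, known_expect, target_bits):
    """Predict E at target_bits from one (bits, E) pair of the same search.

    With a fixed search space, E is proportional to 2 ** (-bits), so the
    ratio of two E-values depends only on the difference in bit score.
    """
    return known_expect * 2 ** (known_bits - target_bits)

# Reference point: hit NO.3 (48.1 bits, Expect = 0.005).
REPORTED = [(69.9, 1e-09), (46.1, 0.019), (44.1, 0.077), (40.1, 1.2)]

for bits, reported in REPORTED:
    predicted = rescale_expect(48.1, 0.005, bits)
    print(f"{bits} bits: predicted {predicted:.3g}, reported {reported}")
```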

5. Use ORF Finder to translate this gene. Show the correct protein sequence.
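Frame +1 translation of the 587-nt sequence under the standard genetic code (* marks a stop codon):
FQNKENEQTPNYL*KARRRKCHEGCRPSLLQEGLS**KSQAFLQEHRHGSPNQAIN*LPHHALRWSQPLQ
G*KYD*SSQGYELAKLAL*CHH*KPCCYP*GARCHRCCY*RGC*GHRTHP*GYARQVR*LLLLFIFYYY*
LLLNTSSNK**FYSIKLVRFQ*KLFTVMRVFIVYCYVSFIEDDDQDKSHGTRSSN
This frame is interrupted by many stop codons, so it is not a single continuous open reading frame.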
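
One point worth checking when translating this gene: Tetrahymena nuclear genes use the ciliate genetic code (NCBI translation table 6), in which TAA and TAG encode glutamine and only TGA is a stop. The sketch below translates query positions 86-205, copied from the alignment with hit NO.2, under both codes. Reading the fragment in the frame that begins at position 86 is an assumption made for illustration, and translate and the table names are illustrative helpers, not ORF Finder's own interface.

```python
from itertools import product

# Codons in NCBI table order: each position cycles through T, C, A, G.
BASES = "TCAG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

# Amino-acid strings for NCBI translation tables 1 and 6; they differ
# only at TAA and TAG (stop in table 1, glutamine in table 6).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CILIATE_AAS = "FFLLSSSSYYQQCC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD = dict(zip(CODONS, STANDARD_AAS))
CILIATE = dict(zip(CODONS, CILIATE_AAS))

def translate(dna, table, frame=0):
    """Translate dna codon by codon from the given 0-based frame offset."""
    dna = dna.upper().replace("U", "T")
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    # Codons containing anything other than A, C, G, T become "X".
    return "".join(table.get(codon, "X") for codon in codons)

# Query positions 86-205 of D13920, as shown in the alignment with hit NO.2.
FRAGMENT = (
    "ttctacaagaaggtcttagctgatgaaagagtcaagcatttcttcaagaacaccgacatg"
    "gatcaccaaaccaagcaataaactgacttcctcaccatgctcttaggtggtcccaaccat"
)

print(translate(FRAGMENT, STANDARD))  # FYKKVLADERVKHFFKNTDMDHQTKQ*TDFLTMLLGGPNH
print(translate(FRAGMENT, CILIATE))   # FYKKVLADERVKHFFKNTDMDHQTKQQTDFLTMLLGGPNH
```

Under table 6 the in-frame TAA no longer ends the fragment. ORF Finder offers a choice of genetic code, so selecting the ciliate code there may be what is needed to recover the full protein.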