Homework 7

Retrieving GenBank

Sho-Hua cloned part of a gene sequence in the lab. A search finds 341 similar sequences in GenBank. His clone is probably one of the retroviral vectors in the top 5 hits, and its possible function is as an improved retroviral vector for gene transfer and expression.


Retrieving DNA sequence

As of 5/9/98, an NCBI search finds 1337 documents about Lycopersicon esculentum. Nine of them concern heat shock proteins. One of these is "Lycopersicon esculentum class II small heat shock protein Le-HSP17.6 mRNA, complete cds", and its sequence has been identified.
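The search above was done through the NCBI web pages. As a rough sketch of how the same kind of query could be written down for NCBI's E-utilities service, the snippet below only builds the search URL and does not contact the server. The helper names, the `nucleotide` database name, and the `[Organism]` and `[Title]` field tags are assumptions chosen for illustration, not details taken from the original homework, and the number of hits returned today will differ from the 1998 counts.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities search endpoint (assumed here).
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_query(organism: str, keyword: str) -> str:
    """Combine an organism name and a title keyword into one search term.

    The field tags [Organism] and [Title] are an assumption about how the
    search should be restricted; adjust them as needed.
    """
    return f'"{organism}"[Organism] AND "{keyword}"[Title]'


def esearch_url(term: str, db: str = "nucleotide", retmax: int = 20) -> str:
    """Return the full search URL for the given term (no request is sent)."""
    params = {"db": db, "term": term, "retmax": retmax}
    return ESEARCH + "?" + urlencode(params)


if __name__ == "__main__":
    term = build_query("Lycopersicon esculentum", "heat shock protein")
    print(term)
    print(esearch_url(term))
```

Pasting the printed URL into a browser should return an XML list of matching record IDs, which can then be looked up one by one.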



1. Sho-Hua cloned a gene in the lab. Part of its DNA sequence is listed here:
   1 tttgaaagac cccacccgta ggtggcaagc tagcttaagt aacgccactt tgcaaggcat 
  61 ggaaaaatac ataactgaga ataggaaagt tcagatcaag gtcaggaaca aagaaacagc
 121 tgaataccaa acaggatatc tgtggtaagc ggttcctgcc ccggctcagg gccaagaaca
 181 gatgagacag ctgagtgatg ggccaaacag gatatctgtg gtaagcagtt cctgccccgg
 241 ctcggggcca agaacagatg gtccccagat gcggtccagc cctcagcagt ttctagtgaa
 301 tcatcagatg tttccagggt gccccaagga cctgaaaatg accctgtacc ttatttgaac
 361 taaccaatca gttcgcttct cgcttctgtt cgcgcgcttc cgctctccga gctcaataaa
 421 agagcccaca acccctcact cggcgcgcca gtcttccgat agactgcgtc gcccgggtac
 481 ccgtattccc aataaagcct cttgctgttt gcatccgaat cgtggtctcg ctgttccttg
 541 ggagggtctc ctctgagtga ttgactaccc acgacggggg tctttcattt gggggctcgt
 601 ccgggatttg gagacccctg cccagggacc accgacccac caccgggagg taagctggcc
 661 agcaacttat ctgtgtctgt ccgattgtct agtgtctatg tttgatgtta tgcgcctgcg
 721 tctgtactag ttagctaact agctctgtat ctggcggacc cgtggtggaa ctgacgagtt
 781 ctgaacaccc ggccgcaacc ctgggagacg tcccagggac tttgggggcc gtttttgtgg
 841 cccgacctga ggaagggagt cgatgtggaa tccgaccccg tcaggatatg tggttctggt
 901 aggagacgag aacctaaaac agttcccgcc tccgtctgaa tttttgcttt cggtttggaa
 961 ccgaagccgc gcgtcttgtc tgctgcagca tcgttctgtg ttgtctctgt ctgactgtgt
1021 ttctgtattt gtctgaaaat tagggccaga ctgttaccac tcccttaagt ttgaccttag
1081 gtcactggaa agatgtcgag cggatcgctc acaaccagtc ggtagatgtc aagaagagac
1141 gttgggttac cttctgctct gcagaatggc caacctttaa cgtcggatgg ccgcgagacg
1201 gcacctttaa ccgagacctc atcacccagg ttaagatcaa ggtcttttca cctggcccgc
1261 atggacaccc agaccaggtc ccctacatcg tgacctggga agccttggct tttgaccccc
1321 ctccctgggt caagcccttt gtacacccta agcctccgcc tcctcttcct ccatccgccc
1381 cgtctctccc ccttgaacct cctcgttcga ccccgcctcg atcctccctt tatccagccc
1441 tcactccttc tctaggcggg aattcgttag cttggtaagt gaccagctac agtcggaaac
1501 catcagcaag caggtatgta ctctccaggg tgggcctggc ttccccagtc aagactccag
1561 ggatttgagg gacgctgtgg gctcttctct tacatgtacc ttttgctagc ctcaaccctg
1621 actatcttcc aggtcattgt tccaacatgg ccctgtggat cgacaggatg caactcctgt
1681 cttgcattgc actaagtctt gcacttgtca caaacagtgc acctacttca agttctacaa
1741 agaaaacaca gctgcaactg gagcatttac tgctggattt acagatgatt ttgaatggaa
1801 ttaataatta caagaatccc aaactcaccc gcatgctcac atttaagttt tacatgccca
1861 agaaggccac agaactgaaa catctgcagt gtctagaaga agaactcaaa cctctggagg
1921 aagtgctaaa tttagctcaa agcaaaaact ttcacttaag gcctagggac ttaatcagca
1981 atatcaacgt aatagttctc gagctaaagg gatctgaaac aacattcatg tgtgaatatg
2041 ctgatgagac agccaccatt gtggaatttc tgaacagatg gattaccttt tgtcaaagca
2101 tcatctcaac actaacttga taattaagtg cttcccactt aaaacatatc aggatccgct
2161 gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccagcag gcagaagtat
2221 gcaaagcatg catctcaatt agtcagcaac caggtgtgga aagtccccag gctccccagc
2281 aggcagaagt atgcaaagca tgcatctcaa ttagtcagca accatagtcc cgcccctaac
2341 tccgcccatc ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact
2401 aatttttttt atttatgcag aggccgaggc cgcctcggcc tctgagctat tccagaagta
2461 gtgaggaggc ttttttggag gcctaggctt ttgcaaaaag cttgggctgc aggtcgaggc
2521 ggatctgatc aagagacagg atgaggatcg tttcgcatga ttgaacaaga tggattgcac
2581 gcaggttctc cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca
2641 atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt
2701 gtcaagaccg acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg
2761 tggctggcca cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga
2821 agggactggc tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct
2881 cctgccgaga aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg
Please help him identify the gene and its possible function.
2. How many nucleotide and protein sequences of Lycopersicon esculentum are known? Please find its class II small heat shock protein mRNA, complete cds.
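The sequence in question 1 is printed in the GenBank layout, with a position number at the start of each line and the bases in blocks of ten. Before pasting it into a similarity search it can help to strip the numbers and spaces and, if wanted, rewrap it as FASTA. The following is a minimal sketch using only the Python standard library; the function names and the record name "clone" are made up for this example.

```python
import re


def clean_sequence(listing: str) -> str:
    """Remove position numbers, spaces and line breaks from a GenBank-style
    listing and return the bases as one lowercase string."""
    return re.sub(r"[^A-Za-z]", "", listing).lower()


def gc_content(seq: str) -> float:
    """Fraction of bases that are g or c (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("g") + seq.count("c")) / len(seq)


def to_fasta(name: str, seq: str, width: int = 60) -> str:
    """Format a sequence as FASTA text with lines of at most `width` bases."""
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([">" + name] + lines)


if __name__ == "__main__":
    # First two lines of the listing in question 1.
    listing = """
       1 tttgaaagac cccacccgta ggtggcaagc tagcttaagt aacgccactt tgcaaggcat
      61 ggaaaaatac ataactgaga ataggaaagt tcagatcaag gtcaggaaca aagaaacagc
    """
    seq = clean_sequence(listing)
    print(len(seq))
    print(to_fasta("clone", seq))
```

The same functions work on the full listing; only the position numbers and spaces are dropped, so the base order is unchanged.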