HOMEWORK#5
Retrieving DNA sequence / GenBank
* due on 5/14
----------------------------------------------------------------------------
1. Sho-Ming cloned a gene in the lab. The DNA sequence is listed here:
AATTAATTCTCTCAATCAAATCCAATTTTCTCCCTATAAAAACCCTAAGGTCCTATA
GTGTTCTATATCCAACACTAGCTCCTACTCCCTAAAGCATTTATTATATCTTCCCCT
AGCTAGATACTTCATTCCACAAATAGTTTGCAGCTTTTTCTTTTCCTCTAAAACAAT
GGAAATAGCTGGGAAAATTGCATGCTTTGTGGTATTGTGCATGGTGGTAGCTGCACC
CTGCGCAGAAGCCATAACCTGTGGCCAGGTTACGTCGAATTTGGCACCTTGTCTTGC
TTATCTTAGAAACACGGGGCCTCTGGGACGTTGTTGCGGTGGCGTTAAGGCTCTGGT
GAATTCTGCAAGGACCACAGAAGATCGTCAAATTGCATGCACTTGCCTGAAATCAGC
TGCAGGTGCTATTTCTGGAATCAATTTGGGCAAAGCTGCTGGTCTCCCTAGTACTTG
TGGTGTCAATATTCCTTACAAGATCAGCCCTTCCACTGACTGCTCCAAGTACCTCAC
TTTTTTTCTCTCTCATGCTATTCTTATCCTTATATTCTATCTGCTTCATTTTCGCTT
ATCTTTTAAATTTTTTATTCGGAATCTTTATACCA
Please help him to identify the gene and its protein sequence.
Data searched from NCBI BLAST Search:
pir||S22168 lipid transfer protein - common tobacco gi|19883 (X62395)
lipid transferase [Nicotiana tabacum]
Length = 114
Plus Strand HSPs:
Query:
170MEIAGKIACFVVLCMVVAAPCAEAITCGQVTSNLAPCLAYLRNTGPLGRCCGGVKALVNS 349
MEIAGKIACFVVLCMVVAAPCAEAITCGQVTSNLAPCLAYLRNTGPLGRCCGGVKALVNS
Sbjct:
1MEIAGKIACFVVLCMVVAAPCAEAITCGQVTSNLAPCLAYLRNTGPLGRCCGGVKALVNS 60
Query:
350ARTTEDRQIACTCLKSAAGAISGINLGKAAGLPSTCGVNIPYKISPSTDCSK 505
ARTTEDRQIACTCLKSAAGAISGINLGKAAGLPSTCGVNIPYKISPSTDCSK
Sbjct:
61ARTTEDRQIACTCLKSAAGAISGINLGKAAGLPSTCGVNIPYKISPSTDCSK 112
2. How many nucleotide and protein sequence of Lycopersicon esculentum were
know?
Nucleotide (742) Protein (1019)
Please find its class II small heat shock protein mRNA, complete cds.
1773290 ---------------------------------------------------
Definition Lycopersicon esculentum class II small heat shock protein Le-
HSP17.6 mRNA, complete cds.
GenBank Name: LEU72396, Accession: U72396
NCBI Seq ID: 1773290
Updated Jan 13, 1997
Citation REF [1]
D.K. Kadyrzhanova, K.E. Vlachonasios, P. Ververidis & D.R.
Dilley (1996). A heat-treatment chilling tolerance related
cDNA from tomato fruit encoding a small heat-shock protein
class II. Unpublished
Citation REF [2]
Data Submission: D.K. Kadyrzhanova, K.E. Vlachonasios, P.
Ververidis & D.R. Dilley (1996).
Created Oct 23, 1996
Updated Jan 14, 1997
Coding region Comments: heat treatment/chilling 1773290: 108..584
tolerance related protein from
tomato fruit.
Sequence 738 nt, linear rna
1 tacggctgcg agaagacgac agaaggggac tgcaattaca aatcaaacca
51 aaattgacaa atttcacgca caaaatcaca atatccaaaa atttctcaat
101 actgaaaatg gatttgaggt tgttgggtat cgataacaca ccactcttcc
151 acactctcca ccatatgatg gaagctgccg gtgaagattc cgacaagtct
201 gtcaatgcac catcaaggaa ctatgttcgt gatgctaagg ccatggctgc
251 tacaccagcg gatgtgaagg agtatcctaa ttcgtatgtt tttgttgtgg
301 atatgccagg gttgaaatct ggagatatca aagtgcaggt ggaagaagac
351 aatgtgctgt tgattagtgg tgaaaggaag agggaagaag agaaagaagg
401 tgcaaagttt attaggatgg agagaagggt tgggaaattc atgaggaagt
451 ttagtctgcc agagaatgcg aatactgatg caatttctgc agtttgtcaa
501 gatggagttc tgactgttac tgttcagaaa ttgcctcctc ctgagccaaa
551 gaaacccaaa acaattgagg tgaaagttgc ttgaagttat ggactctgtt
601 ttgatggttt gtggtatgat gtagtagaaa taaagttgta ggagtagtga
651 acttttcctt tcatctttct gctatgtttt cacgtctgtt tgaatgttac
701 aatagccatg ggtattgttt gttttgatgc caaaaaaa