HOMEWORK #4

Retrieving DNA sequence / GenBank

1. Sho-Hua cloned a gene in the lab.

Answer: the cloned gene matches the following database entry.


sp|Q42952|NLT1_TOBAC NONSPECIFIC LIPID-TRANSFER PROTEIN 1 PRECURSOR
            (LTP 1) pir||S22168 lipid transfer protein - common tobacco
            gi|19883 (X62395) lipid transferase [Nicotiana tabacum]
            Length = 114

Plus Strand HSPs:
Score = 590 (270.3 bits), Expect = 1.2e-72, P = 1.2e-72
Identities = 112/112 (100%), Positives = 112/112 (100%), Frame = +2

The aligned protein sequence (translated query against the database subject) is below:
Query:   170 MEIAGKIACFVVLCMVVAAPCAEAITCGQVTSNLAPCLAYLRNTGPLGRCCGGVKALVNS 349
             MEIAGKIACFVVLCMVVAAPCAEAITCGQVTSNLAPCLAYLRNTGPLGRCCGGVKALVNS
Sbjct:     1 MEIAGKIACFVVLCMVVAAPCAEAITCGQVTSNLAPCLAYLRNTGPLGRCCGGVKALVNS 60

Query:   350 ARTTEDRQIACTCLKSAAGAISGINLGKAAGLPSTCGVNIPYKISPSTDCSK 505
             ARTTEDRQIACTCLKSAAGAISGINLGKAAGLPSTCGVNIPYKISPSTDCSK
Sbjct:    61 ARTTEDRQIACTCLKSAAGAISGINLGKAAGLPSTCGVNIPYKISPSTDCSK 112
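
The coordinates in this hit can be cross-checked with a little arithmetic. The query is a nucleotide sequence translated in frame +2, so each aligned residue should correspond to three query nucleotides. The sketch below is only an illustration; the helper names `hsp_frame` and `hsp_codons` are hypothetical and not part of any BLAST tool.

```python
def hsp_frame(query_start):
    """Forward reading frame (1, 2 or 3) implied by a 1-based query start."""
    return (query_start - 1) % 3 + 1


def hsp_codons(query_start, query_end):
    """Number of codons covered by a 1-based, inclusive nucleotide range."""
    span = query_end - query_start + 1
    if span % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return span // 3


# Query nucleotide ranges of the two alignment blocks shown above.
blocks = [(170, 349), (350, 505)]

frames = [hsp_frame(start) for start, _ in blocks]
codons = [hsp_codons(start, end) for start, end in blocks]
total_residues = sum(codons)

# Identities reported by BLAST: 112 matches over 112 aligned positions.
percent_identity = 100.0 * 112 / 112

print(frames, codons, total_residues, percent_identity)
```

Both blocks start in frame 2, and together they cover 60 + 52 = 112 codons, which agrees with the reported "Identities = 112/112" and "Frame = +2".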

2. How many nucleotide and protein sequences of Lycopersicon esculentum are known?

For Lycopersicon esculentum, 817 nucleotide (gene) sequences and 1092 protein sequences are known.

This is the complete nucleotide sequence of HSP21, a heat shock protein of Lycopersicon esculentum:

BASE COUNT      229 a    117 c    183 g    209 t
ORIGIN      
        1 tacggctgcg agaagacgac agaaggggac tgcaattaca aatcaaacca aaattgacaa
       61 atttcacgca caaaatcaca atatccaaaa atttctcaat actgaaaatg gatttgaggt
      121 tgttgggtat cgataacaca ccactcttcc acactctcca ccatatgatg gaagctgccg
      181 gtgaagattc cgacaagtct gtcaatgcac catcaaggaa ctatgttcgt gatgctaagg
      241 ccatggctgc tacaccagcg gatgtgaagg agtatcctaa ttcgtatgtt tttgttgtgg
      301 atatgccagg gttgaaatct ggagatatca aagtgcaggt ggaagaagac aatgtgctgt
      361 tgattagtgg tgaaaggaag agggaagaag agaaagaagg tgcaaagttt attaggatgg
      421 agagaagggt tgggaaattc atgaggaagt ttagtctgcc agagaatgcg aatactgatg
      481 caatttctgc agtttgtcaa gatggagttc tgactgttac tgttcagaaa ttgcctcctc
      541 ctgagccaaa gaaacccaaa acaattgagg tgaaagttgc ttgaagttat ggactctgtt
      601 ttgatggttt gtggtatgat gtagtagaaa taaagttgta ggagtagtga acttttcctt
      661 tcatctttct gctatgtttt cacgtctgtt tgaatgttac aatagccatg ggtattgttt
      721 gttttgatgc caaaaaaa
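
As a sanity check on the record above, the BASE COUNT line should add up to the sequence length implied by the ORIGIN block. The following is a minimal sketch in plain Python; `parse_origin` is a hypothetical helper written for this example, not a function from any library, and only the first and last ORIGIN lines are used to keep it short.

```python
from collections import Counter


def parse_origin(lines):
    """Join GenBank ORIGIN lines into one sequence, dropping offsets and spaces."""
    chunks = []
    for line in lines:
        parts = line.split()
        # Skip anything that does not start with a numeric position.
        if not parts or not parts[0].isdigit():
            continue
        chunks.append("".join(parts[1:]))
    return "".join(chunks)


# BASE COUNT values as reported in the record.
reported = {"a": 229, "c": 117, "g": 183, "t": 209}
reported_total = sum(reported.values())

# The last ORIGIN line gives the position of its first base,
# so the full length is that position plus the bases on the line, minus one.
last_line = "      721 gttttgatgc caaaaaaa"
last_start = int(last_line.split()[0])
length_from_origin = last_start + len(parse_origin([last_line])) - 1

# Per-base tally for the first ORIGIN line, as a small worked example.
first_line = ("        1 tacggctgcg agaagacgac agaaggggac "
              "tgcaattaca aatcaaacca aaattgacaa")
first_counts = Counter(parse_origin([first_line]))

print(reported_total, length_from_origin, dict(first_counts))
```

The reported counts sum to 738, which matches the length implied by the last ORIGIN line (721 + 18 - 1). Passing all ORIGIN lines to `parse_origin` and tallying with `Counter` would allow the individual a/c/g/t counts to be compared against the BASE COUNT line as well.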