Jason Lee, a student in the department of Life Sciences,
got the mRNA from a plant. Here is the sequence:
 

        1 atccactact tcatcataaa cctcacaact actattctat cttctcttct ctaattttca
       61 taatcattaa gaatggaaat ggttaacaag attgcatgct ttgtgctttt atgcatggta
      121 gtggttgcac cccatgcaga ggcactaact tgtggtcaag ttacatctac cttggctcct
      181 tgtctccctt atctaatgaa tcgcggtcct ctcggaggct gttgtggtgg tgttaagggt
      241 cttttgggtc aagcccagac tacagtagac cgacagaccg catgcacttg cctaaaatca
      301 gctgcttctt cttttacagg ccttgatttg ggcaaagctg ctagtctccc tagcacttgt
      361 agtgtcaaca tcccttacaa gatcagcccc tctactgact gctctaaagt tcagtaaagc
      421 tgatcatcag aatttggttt catgaggaga attaagaata agatagatag cattgatctt
      481 gcttatggat cctttctttc tatgttgtat cagttgtcac tttctgtttt ttctgtgttt
      541 cctttaaatt ctcgtatgta gtcgagtctt gtatcgaaat ttgacgattg attatattgt
      601 atcagttgtt actttctgtt ttcctgtgtt tcttttaaaa tcgtatgtag tcgagtcttg
      661 tatcgaaatt tcccgattgg ctatgttgta ttaatctaat ctttgataat acacatctat
      721 cttatttggt
 
 

      1. Please help him to translate the DNA sequence to protein sequence.

          MEMVNKIACF VLLCMVVVAP HAEALTCGQV TSTLAPCLPY LMNRGPLGGC CGGVKGLLGQ
          AQTTVDRQTA CTCLKSAASS FTGLDLGKAA SLPSTCSVNI PYKISPSTDC SKVQ
 

      2. Please help him to identify the complete cds of this gene.
          Use the graphic view to explain all the features of this gene.

          CDS:http://www.ncbi.nlm.nih.gov/htbin-post/Entrez/query?uid=gb|U66466.1|&form=6&db=s&Dopt=g
                1 gtaatccagc taagaacgtc agaagtaaaa caaacttgtc gtaaaatatt taatttgaag
                61 ttgtatttaa atcttaatta ttttttttta aagctatact cacatcattt caattattct
              121 ttttgtaaaa gtatctctag agcttcataa tttttttttt aaaaatcttc gatcaaactg
              181 ttagagtagg taaaagtctc acattgatgg ggaaatagac tgattatttg cttataagga
              241 tgtggacaat actcctctca tataatagca tttaagatta aattagacct aaataacata
              301 ttttagcatg atattagagt tatattcatt cttgtttgaa cttccgatcc acatctcaat
              361 tggatctaca taaaaaaggg atattaaagt aagtaaaagc cctacattaa tcgaggaatc
              421 tacttatacg aactttggtg ataaaaaaaa agactcctac acgtaagatg ttagaactag
              481 ctaccacatg actttagagc cagcataata atgtacacca tcaaaatgct ttaaattttc
              541 aacctaacaa ccaactacct ctctcactcc tccattggcc atctactcca aatttccctc
              601 tataaaaaca ctcaaccaaa acacatttct tctcatccac tacttcatca taaacctcac
              661 aactactatt ctatcttctc ttctctaatt ttcataatca ttaagaatgg aaatggttaa
              721 caagattgca tgctttgtgc ttttatgcat ggtagtggtt gcaccccatg cagaggcact
              781 aacttgtggt caagttacat ctaccttggc tccttgtctc ccttatctaa tgaatcgcgg
              841 tcctctcgga ggctgttgtg gtggtgttaa gggtcttttg ggtcaagccc agactacagt
              901 agaccgacag accgcatgca cttgcctaaa atcagctgct tcttctttta caggccttga
              961 tttgggcaaa gctgctagtc tccctagcac ttgtagtgtc aacatccctt acaagatcag
            1021 cccctctact gactgctcta agtatgttaa tttttcatct tttttgacct ataacaacac
            1081 ctaactcttc gtattaatcc tagtacgaaa aataaagtaa caaaaaaatg atatgtgcta
            1141 gcacattgtc acaatatgac atgcaagtgt gtttggtttt ctcaaaaaat aagtggattt
            1201 tttatttata ttttagtgtt aagaaatatt agtttaaaaa tatttatata tgtaattata
            1261 aagaaaaaag atactattat agttagtaca ttatgttttt gttatcatta tcattattat
            1321 tattattaat gttggttttg ttcattgtta atgcagagtt cagtaaagct gatcatcaga
            1381 atttggtttc atgaggagaa ttaagaataa gatagatagc attgatcttg cttatggatc
            1441 ctttctttct atgttgtatc agttgtcact ttctgttttt tctgtgtttc ctttaaattc
            1501 tcgtatgtag tcgagtcttg tatcgaaatt tgacgattga ttatattgta tcagttgtta
            1561 ctttctgttt tcctgtgttt cttttaaaat cgtatgtagt cgagtcttgt atcgaaattt
            1621 cccgattggc tatgttgtat taatctaatc tttgataata cacatctatc ttatttggta
            1681 tatgtactct ctcgtctatt caatattttt ggtctacttt tactagggtt tttttaatat
            1741 gcattacaca tatatatcaa attcgagtaa tatatagtat acgctattgt gtgctcattc
            1801 atctaggtac ctcctttttc taaccacttc ttacacgtac aatgctaatt attg

           Graphic view:http://www.ncbi.nlm.nih.gov/cgi-bin/Entrez/framik?gi=1519357&db=Protein
 

      3.After translation of this gene, please help him to do the protein sequence analysis
         (including  pI, mol. wt.,  secondary structure prediction, hydrophobic profile,
           homology search, prosite scanning..........)

        Molecular weight: 11715.81

        Theoretical pI: 8.36

        Secondary structure prediction

            alpha-helix: MIN: 0.67  MAX: 1.18555555555556

            beta-sheet: MIN: 0.618888888888889  MAX: 1.36555555555556

        Hydrophobic profile: MIN: -1.54444444444444  MAX: 3.51111111111111

 

           Homology search:http://life.nthu.edu.tw/~b861604/blastNNCBI.html

           Prosite scanning:
           [1] PDOC00006 PS00006 CK2_PHOSPHO_SITE
           Casein kinase II phosphorylation site

           Number of matches: 2
                       1      63-66 TTVD
                       2      82-85 TGLD

          [2] PDOC00008 PS00008 MYRISTYL
          N-myristoylation site

          Number of matches: 6
                      1      28-33 GQVTST
                      2      48-53 GGCCGG
                      3      49-54 GCCGGV
                      4      52-57 GGVKGL
                      5      59-64 GQAQTT
                      6      83-88 GLDLGK

         [3] PDOC00516 PS00597 PLANT_LTP
         Plant lipid transfer proteins signature

                          92-113 LPSTCSVNIPYKISPSTDCSKV