Homework 7

Jason Lee, a student in the department of Life Sciences,
got the mRNA from a plant.
Here is the sequence:
        1 atccactact tcatcataaa cctcacaact actattctat cttctcttct ctaattttca
       61 taatcattaa gaatggaaat ggttaacaag attgcatgct ttgtgctttt atgcatggta
      121 gtggttgcac cccatgcaga ggcactaact tgtggtcaag ttacatctac cttggctcct
      181 tgtctccctt atctaatgaa tcgcggtcct ctcggaggct gttgtggtgg tgttaagggt
      241 cttttgggtc aagcccagac tacagtagac cgacagaccg catgcacttg cctaaaatca
      301 gctgcttctt cttttacagg ccttgatttg ggcaaagctg ctagtctccc tagcacttgt
      361 agtgtcaaca tcccttacaa gatcagcccc tctactgact gctctaaagt tcagtaaagc
      421 tgatcatcag aatttggttt catgaggaga attaagaata agatagatag cattgatctt
      481 gcttatggat cctttctttc tatgttgtat cagttgtcac tttctgtttt ttctgtgttt
      541 cctttaaatt ctcgtatgta gtcgagtctt gtatcgaaat ttgacgattg attatattgt
      601 atcagttgtt actttctgtt ttcctgtgtt tcttttaaaa tcgtatgtag tcgagtcttg
      661 tatcgaaatt tcccgattgg ctatgttgta ttaatctaat ctttgataat acacatctat
      721 cttatttggt

1. Please help him to translate the DNA sequence to protein sequence.
   I H Y F I I N L T T T I L S S L L Stop F S Stop S L R Met E Met V N K I A
   C F V L L C Met V V V A P H A E A L T C G Q V T S T L A P C L P Y L Met N
   R G P L G G C C G G V K G L L G Q A Q T T V D R Q T A C T C L K S A A S S
   F T G L D L G K A A S L P S T C S V N I P Y K I S P S T D C S K V Q StopS
   StopS S E F G F Met R R I K N K I D S I D L A Y G S F L S Met L Y Q L S L
   S V F S V F P L N S R Met Stop S S L V S K F D D Stop L Y C I S C Y F L F
   S C V S F K I V C S R V L Y R N F P I G Y V V L I Stop S L I I H I Y L I W


2. Please help him to identify the complete cds of this gene.
    Use the graphic view to explain all the features of this gene.
     cds
          1  AUGGAAAUGG UUAACAAAAU CGCUUGCUUC GUUCUGCUGU GCAUGGUUGU

     51  UGUUGCUCCG CACGCUGAAG CUCUGACCUG CGGUCAGGUU ACCUCCACCC

    101  UGGCUCCGUG CCUGCCGUAC CUGAUGAACC GUGGUCCGCU GGGUGGUUGC

    151  UGCGGUGGUG UUAAAGGUCU GCUGGGUCAG GCUCAGACCA CCGUUGACCG

    201  UCAGACCGCU UGCACCUGCC UGAAAUCCGC UGCUUCCUCC UUCACCGGUC

    251  UGGACCUGGG UAAAGCUGCU UCCCUGCCGU CCACCUGCUC CGUUAACAUC

       301  CCGUACAAAA UCUCCCCGUC CACCGACUGC UCCAAAGUUC AG

        graphic view
 

3.After translation of this gene, please help him to do the protein
   sequence analysis (including  pI, mol. wt.,  secondary structure
   prediction, hydrophobic profile, homology search, prosite
   scanning..........)

    Molecular weight: 7352.4
    Theoretical pI: 8.88
    secondary structure prediction,hydrophobic profile
    homology search
    prosite scan