Homework #7


Jason Lee, a student in the department of Life Sciences,
got the mRNA from a plant.
Here is the sequence:
    
        1 atccactact tcatcataaa cctcacaact actattctat cttctcttct ctaattttca
       61 taatcattaa gaatggaaat ggttaacaag attgcatgct ttgtgctttt atgcatggta
      121 gtggttgcac cccatgcaga ggcactaact tgtggtcaag ttacatctac cttggctcct
      181 tgtctccctt atctaatgaa tcgcggtcct ctcggaggct gttgtggtgg tgttaagggt
      241 cttttgggtc aagcccagac tacagtagac cgacagaccg catgcacttg cctaaaatca
      301 gctgcttctt cttttacagg ccttgatttg ggcaaagctg ctagtctccc tagcacttgt
      361 agtgtcaaca tcccttacaa gatcagcccc tctactgact gctctaaagt tcagtaaagc
      421 tgatcatcag aatttggttt catgaggaga attaagaata agatagatag cattgatctt
      481 gcttatggat cctttctttc tatgttgtat cagttgtcac tttctgtttt ttctgtgttt
      541 cctttaaatt ctcgtatgta gtcgagtctt gtatcgaaat ttgacgattg attatattgt
      601 atcagttgtt actttctgtt ttcctgtgtt tcttttaaaa tcgtatgtag tcgagtcttg
      661 tatcgaaatt tcccgattgg ctatgttgta ttaatctaat ctttgataat acacatctat
      721 cttatttggt

1. Please help him to translate the DNA sequence to protein sequence.
Ans:>gi|1519357|gb|AAB07487.1| lipid transfer protein 2
         MEMVNKIACFVLLCMVVVAPHAEALTCGQVTSTLAPCLPY
         LMNRGPLGGCCGGVKGLLGQAQTTVDRQTACTCLKSAAS
         SFTGLDLGKAASLPSTCSVNIPYKISPSTDCSKVQ

2. Please help him to identify the complete cds of this gene.
Ans:Lycopersicon pennellii lipid transfer protein
         1 gtaatccagc taagaacgtc agaagtaaaa caaacttgtc gtaaaatatt taatttgaag
       61 ttgtatttaa atcttaatta ttttttttta aagctatact cacatcattt caattattct
      121 ttttgtaaaa gtatctctag agcttcataa tttttttttt aaaaatcttc gatcaaactg
      181 ttagagtagg taaaagtctc acattgatgg ggaaatagac tgattatttg cttataagga
      241 tgtggacaat actcctctca tataatagca tttaagatta aattagacct aaataacata
      301 ttttagcatg atattagagt tatattcatt cttgtttgaa cttccgatcc acatctcaat
      361 tggatctaca taaaaaaggg atattaaagt aagtaaaagc cctacattaa tcgaggaatc
      421 tacttatacg aactttggtg ataaaaaaaa agactcctac acgtaagatg ttagaactag
      481 ctaccacatg actttagagc cagcataata atgtacacca tcaaaatgct ttaaattttc
      541 aacctaacaa ccaactacct ctctcactcc tccattggcc atctactcca aatttccctc
      601 tataaaaaca ctcaaccaaa acacatttct tctcatccac tacttcatca taaacctcac
      661 aactactatt ctatcttctc ttctctaatt ttcataatca ttaagaatgg aaatggttaa
      721 caagattgca tgctttgtgc ttttatgcat ggtagtggtt gcaccccatg cagaggcact
      781 aacttgtggt caagttacat ctaccttggc tccttgtctc ccttatctaa tgaatcgcgg
      841 tcctctcgga ggctgttgtg gtggtgttaa gggtcttttg ggtcaagccc agactacagt
      901 agaccgacag accgcatgca cttgcctaaa atcagctgct tcttctttta caggccttga
      961 tttgggcaaa gctgctagtc tccctagcac ttgtagtgtc aacatccctt acaagatcag
     1021 cccctctact gactgctcta agtatgttaa tttttcatct tttttgacct ataacaacac
     1081 ctaactcttc gtattaatcc tagtacgaaa aataaagtaa caaaaaaatg atatgtgcta
     1141 gcacattgtc acaatatgac atgcaagtgt gtttggtttt ctcaaaaaat aagtggattt
     1201 tttatttata ttttagtgtt aagaaatatt agtttaaaaa tatttatata tgtaattata
     1261 aagaaaaaag atactattat agttagtaca ttatgttttt gttatcatta tcattattat
     1321 tattattaat gttggttttg ttcattgtta atgcagagtt cagtaaagct gatcatcaga
     1381 atttggtttc atgaggagaa ttaagaataa gatagatagc attgatcttg cttatggatc
     1441 ctttctttct atgttgtatc agttgtcact ttctgttttt tctgtgtttc ctttaaattc
     1501 tcgtatgtag tcgagtcttg tatcgaaatt tgacgattga ttatattgta tcagttgtta
     1561 ctttctgttt tcctgtgttt cttttaaaat cgtatgtagt cgagtcttgt atcgaaattt
     1621 cccgattggc tatgttgtat taatctaatc tttgataata cacatctatc ttatttggta
     1681 tatgtactct ctcgtctatt caatattttt ggtctacttt tactagggtt tttttaatat
     1741 gcattacaca tatatatcaa attcgagtaa tatatagtat acgctattgt gtgctcattc
     1801 atctaggtac ctcctttttc taaccacttc ttacacgtac aatgctaatt attg
    Use the graphic view to explain all the features of this gene.
Ans:graphic view-1
        graphic view-2

3.After translation of this gene, please help him to do the protein sequence analysis
   (including  pI, mol. wt.,  secondary structure prediction, hydrophobic profile,homology search, prosite scanning..........)
Ans:
 
Number of amino acids 114
Molecular weight 11715.8
Theoretical pI 8.36
Amino acid composition linkage
Atomic composition linkage
molecular weight linkage
hydrophobic profile linkage
secondary structure prediction alpha-helix
beta-sheet
homology search linkage
prosite scanning linkage