Version 1.4

NCSA Computational Biology - SAPS Results

Statistical Analysis of Protein Sequences.

CARDIOTOXIN PRECURSOR.

CARDIOTOXIN PRECURSOR.

SAPS.  Version of January 7, 1995.
Date run: Tue Dec 10 21:42:58 1996



SWISS-PROT ANNOTATION:
ID   Q02454
DE   Q02454, 81 bases.

number of residues:   81;   molecular weight:   9.1 kdal
 
       1  MKTLLLTTVV VTIVCLDLEY TLKCNKLVPL FYKTCPAGKN LCYKMFMVAT PKVPVKRGCI 
      61  DVCPKSSLLV KYVCCNTDRC N

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: swp23s.q)

A  :  2( 2.5%); C  :  9(11.1%); D  :  3( 3.7%); E  :  1( 1.2%); F  :  2( 2.5%)
G  :  2( 2.5%); H  :  0( 0.0%); I  :  2( 2.5%); K  : 10(12.3%); L  : 11(13.6%)
M  :  3( 3.7%); N  :  4( 4.9%); P  :  5( 6.2%); Q  :  0( 0.0%); R  :  2( 2.5%)
S  :  2( 2.5%); T  :  8( 9.9%); V  : 11(13.6%); W  :  0( 0.0%); Y  :  4( 4.9%)

KR      :   12 ( 14.8%);   ED      :    4 (  4.9%);   AGP     :    9 ( 11.1%);
KRED    :   16 ( 19.8%);   KR-ED   :    8 (  9.9%);   FIKMNY  :   25 ( 30.9%);
LVIFM   :   29 ( 35.8%);   ST      :   10 ( 12.3%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
       1  0+00000000 000000-0-0 00+00+0000 00+00000+0 000+000000 0+000++000 
      61  -000+00000 +000000-+0 0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 12/30 or 16/45 or 19/60):  none


Negative charge clusters:  not evaluated (frequency of - < 5%, too low)


Mixed charge clusters (cmin = 14/30 or 19/45 or 24/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.148  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.802  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.049  ( ED )

 Expected score/letter:  -0.605
 - now scoring for positive charge segments;    Average information/letter:   0.727
 Minimal length of displayed segments set to:  20

M_0.01= 11.01  (cv=  6.42, lambda=  0.68489, k=  0.23341, x=  4.59;
                90% confidence interval for segment length:  15 +-  15)
M_0.05=  8.63  (x=  2.21)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.049  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.802  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.148  ( KR )

 Expected score/letter: -100.00
 - now scoring for negative charge segments;    Average information/letter:   2.722
 Minimal length of displayed segments set to:  20

M_0.01=  5.91  (cv=  3.17, lambda=  1.38629, k=  0.44580, x=  2.74;
                90% confidence interval for segment length:   4 +-   4)
M_0.05=  4.73  (x=  1.56)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.198  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.802  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.605
 - now scoring for mixed charge segments;    Average information/letter:   1.223
 Minimal length of displayed segments set to:  20

M_0.01=  5.86  (cv=  3.13, lambda=  1.40180, k=  0.45603, x=  2.72;
                90% confidence interval for segment length:  10 +-   8)
M_0.05=  4.69  (x=  1.56)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.802  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.198  ( KEDR )

 Expected score/letter:  -0.778
 - now scoring for uncharged segments;    Average information/letter:   0.128
 Minimal length of displayed segments set to:  20

M_0.01= 38.95  (cv= 26.39, lambda=  0.16653, k=  0.08143, x= 12.56;
                90% confidence interval for segment length:  73 +-  63)
M_0.05= 29.16  (x=  2.78)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)|
lmin0     5 |   3 |   5 |  34 |   9 |   6 |  10 |  11 |   7 |  12 | 
lmin1     6 |   4 |   7 |  41 |  11 |   8 |  12 |  13 |   9 |  15 | 
lmin2     7 |   5 |   8 |  45 |  12 |   9 |  14 |  15 |  10 |  16 | 

There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  22:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.469
 - now scoring for hydrophobic segments
........40........80.......120.......160.......200
*******;    Average information/letter:   0.372
 Minimal length of displayed segments set to:  15

M_0.01= 27.80  (cv= 16.69, lambda=  0.26323, k=  0.18675, x= 11.10;
                90% confidence interval for segment length:  28 +-  22)
M_0.05= 21.60  (x=  4.91)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -2.704
 - now scoring for transmembrane segments
........40........80.......120.......160.......200
********;    Average information/letter:   0.323
 Minimal length of displayed segments set to:  15

M_0.01= 60.71  (cv= 37.78, lambda=  0.11632, k=  0.14485, x= 22.94;
                90% confidence interval for segment length:  32 +-  27)
M_0.05= 46.70  (x=  8.92);     M_0.30= 30.03  (x= -7.75)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-14-C-8-C-10-C-6-C-16-C-3-C-10
CC   at   74
  -4-C-1-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   6  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 5   (6-10) 1   (11-20) 0   (>=21) 1

3. Clusters of amino acid multiplets (cmin = 12/30 or 16/45 or 19/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  6)
   1 +plets (f+: 14.8%), 0 -plets (f-: 4.9%)
   Total number of charge altplets: 1 (Critical number: 6)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location	Period	Element		Copies	Core	Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location	Period	Element		Copies	Core	Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).




Report Bugs