(http://www.ch.embnet.org/cgi-bin/SAPS_form_parser)
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:14:50 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4812.3268.seq)
SWISS-PROT ANNOTATION:
ID ORFA
DE ORFA, 81 bases, 174C2750 checksum.
number of residues: 81; molecular weight: 8.5 kdal
1 MLVTGNGTFP LAQGANWSAS ALYTDALGGA TVQLQVNGVD LVDGALADNT QYVVTHGKRA
61 VVEVVVTGYT ADFNIWTYDA S
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 11(13.6%); C : 0( 0.0%); D : 6( 7.4%); E : 1( 1.2%); F : 2( 2.5%)
G : 9(11.1%); H : 1( 1.2%); I : 1( 1.2%); K : 1( 1.2%); L : 7( 8.6%)
M : 1( 1.2%); N : 5( 6.2%); P : 1( 1.2%); Q : 4( 4.9%); R : 1( 1.2%)
S : 3( 3.7%); T : 9(11.1%); V : 12(14.8%); W : 2( 2.5%); Y : 4( 4.9%)
KR : 2 ( 2.5%); ED : 7 ( 8.6%); AGP : 21 ( 25.9%);
KRED : 9 ( 11.1%); KR-ED : -5 ( -6.2%); FIKMNY : 14 ( 17.3%);
LVIFM : 23 ( 28.4%); ST : 12 ( 14.8%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000000 0000000000 0000-00000 000000000- 00-0000-00 0000000++0
61 00-0000000 0-000000-0 0
A. CHARGE CLUSTERS.
Positive charge clusters: not evaluated (frequency of + < 5%, too low)
Negative charge clusters (cmin = 8/30 or 11/45 or 13/60): none
Mixed charge clusters (cmin = 10/30 or 13/45 or 16/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.025 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.889 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.086 ( ED )
Expected score/letter: -1.012; Average information/letter: 3.911
Minimal length of displayed segments set to: 20
M_0.01= 4.68 (cv= 2.49, lambda= 1.76679, k= 0.48174, x= 2.19;
90% confidence interval for segment length: 3 +- 2)
M_0.05= 3.76 (x= 1.27)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.086 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.889 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.025 ( KR )
Expected score/letter: -0.765; Average information/letter: 1.543
Minimal length of displayed segments set to: 20
M_0.01= 7.71 (cv= 4.26, lambda= 1.03142, k= 0.35146, x= 3.45;
90% confidence interval for segment length: 7 +- 7)
M_0.05= 6.13 (x= 1.87)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.111 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.889 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.778; Average information/letter: 2.333
Minimal length of displayed segments set to: 20
M_0.01= 4.14 (cv= 2.11, lambda= 2.07944, k= 0.68056, x= 2.03;
90% confidence interval for segment length: 5 +- 4)
M_0.05= 3.36 (x= 1.24)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.889 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.111 ( KEDR )
Expected score/letter: -0.000
! expected score > -.001; too big !
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 2 | 4 | 4 | 58 | 6 | 8 | 9 | 7 | 10 | 11 | 5 | 6 |
lmin1 3 | 5 | 5 | 71 | 7 | 10 | 10 | 8 | 12 | 13 | 6 | 8 |
lmin2 4 | 6 | 6 | 78 | 8 | 11 | 12 | 10 | 13 | 15 | 7 | 9 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 3: 0
0 runs >= 38: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.086; Average information/letter: 0.274
Minimal length of displayed segments set to: 15
M_0.01= 29.09 (cv= 18.09, lambda= 0.24287, k= 0.14512, x= 10.99;
90% confidence interval for segment length: 37 +- 32)
M_0.05= 22.38 (x= 4.28)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -1.642; Average information/letter: 0.153
Minimal length of displayed segments set to: 15
M_0.01= 78.63 (cv= 53.43, lambda= 0.08225, k= 0.07983, x= 25.20;
90% confidence interval for segment length: 61 +- 58)
M_0.05= 58.81 (x= 5.38); M_0.30= 35.23 (x= -18.20)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-81-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 4 (Expected range: 0-- 14)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 1 (11-20) 1 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 10/30 or 12/45 or 15/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 1 (Expected range: 0-- 3)
1 +plets (f+: 2.5%), 0 -plets (f-: 8.6%)
Total number of charge altplets: 0 (Critical number: 3)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 0 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 7)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:14:51 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.5763.seq)
SWISS-PROT ANNOTATION:
ID ORFB
DE ORFB, 132 bases, 292DD71D checksum.
number of residues: 132; molecular weight: 14.1 kdal
1 MNKYAKVTFR ASIILWTIVF LLMCTSGCSA LTGLATDAAI GAVAKDDPLL GIDTEIVAGD
61 KQGIKSGPDT KLDDVEVNGN LTTSTIGRKT DVSGTTDTLT INEGVPFWWV LVGMSVMLLL
121 GLFIPQFKLT RK
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 9( 6.8%); C : 2( 1.5%); D : 10( 7.6%); E : 3( 2.3%); F : 5( 3.8%)
G : 13( 9.8%); H : 0( 0.0%); I : 10( 7.6%); K : 9( 6.8%); L : 16(12.1%)
M : 4( 3.0%); N : 4( 3.0%); P : 4( 3.0%); Q : 2( 1.5%); R : 3( 2.3%)
S : 7( 5.3%); T : 16(12.1%); V : 11( 8.3%); W : 3( 2.3%); Y : 1( 0.8%)
KR : 12 ( 9.1%); ED : 13 ( 9.8%); AGP : 26 ( 19.7%);
KRED : 25 ( 18.9%); KR-ED : -1 ( -0.8%); FIKMNY : 33 ( 25.0%);
LVIFM : 46 ( 34.8%); ST : 23 ( 17.4%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00+00+000+ 0000000000 0000000000 000000-000 0000+--000 00-0-0000-
61 +000+000-0 +0--0-0000 0000000++0 -00000-000 00-0000000 0000000000
121 0000000+00 ++
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 11/45 or 14/60): none
Negative charge clusters (cmin = 9/30 or 12/45 or 15/60): none
Mixed charge clusters (cmin = 14/30 or 19/45 or 23/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.091 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.811 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.098 ( ED )
Expected score/letter: -0.826; Average information/letter: 1.565
Minimal length of displayed segments set to: 20
M_0.01= 8.32 (cv= 4.81, lambda= 1.01604, k= 0.35852, x= 3.52;
90% confidence interval for segment length: 8 +- 7)
M_0.05= 6.72 (x= 1.91)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.098 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.811 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.091 ( KR )
Expected score/letter: -0.795; Average information/letter: 1.420
Minimal length of displayed segments set to: 20
M_0.01= 8.72 (cv= 5.06, lambda= 0.96448, k= 0.34189, x= 3.66;
90% confidence interval for segment length: 9 +- 8)
M_0.05= 7.03 (x= 1.97)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.189 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.811 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.621; Average information/letter: 1.303
Minimal length of displayed segments set to: 20
M_0.01= 6.01 (cv= 3.36, lambda= 1.45395, k= 0.47607, x= 2.65;
90% confidence interval for segment length: 10 +- 8)
M_0.05= 4.89 (x= 1.53)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.811 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.189 ( KEDR )
Expected score/letter: -0.705; Average information/letter: 0.109
Minimal length of displayed segments set to: 20
M_0.01= 44.80 (cv= 32.04, lambda= 0.15237, k= 0.07014, x= 12.75;
90% confidence interval for segment length: 90 +- 78)
M_0.05= 34.10 (x= 2.05)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 4 | 6 | 37 | 8 | 8 | 11 | 10 | 10 | 13 | NA | NA |
lmin1 5 | 5 | 7 | 45 | 10 | 10 | 13 | 12 | 12 | 16 | NA | NA |
lmin2 6 | 6 | 8 | 50 | 11 | 11 | 14 | 13 | 14 | 17 | NA | NA |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 25: 1, at 11;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.545; Average information/letter: 0.412
Minimal length of displayed segments set to: 15
M_0.01= 28.16 (cv= 17.43, lambda= 0.28010, k= 0.20294, x= 10.73;
90% confidence interval for segment length: 28 +- 21)
M_0.05= 22.34 (x= 4.91)
1) From 104 to 124 (type 1): length= 21, score=27.00 *
104 GVPFWWVLVG MSVMLLLGLF I
L: 5(23.8%); G: 3(14.3%); V: 4(19.0%);
# of segments (>=15 residues) exceeding M_0.05: 1
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -2.174; Average information/letter: 0.209
Minimal length of displayed segments set to: 15
M_0.01= 80.81 (cv= 54.59, lambda= 0.08944, k= 0.10489, x= 26.22;
90% confidence interval for segment length: 50 +- 43)
M_0.05= 62.59 (x= 8.00); M_0.30= 40.91 (x= -13.68)
1) From 13 to 44 (type 1): length= 32, score=48.00
(pocket at 36 to 37: length= 2, score=-19.00)
13 IILWTIVFLL MCTSGCSALT GLA |TD| AAIGA VA
L: 5(15.6%); A: 6(18.8%); T: 4(12.5%); I: 4(12.5%);
2) From 104 to 124 (type 1): length= 21, score=60.00
104 GVPFWWVLVG MSVMLLLGLF I
L: 5(23.8%); G: 3(14.3%); V: 4(19.0%);
# of segments (>=15 residues) exceeding M_0.30: 2
2. SPACINGS OF C.
H2N-23-C-3-C-104-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 10 (Expected range: 0-- 18)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 4 (11-20) 5 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 13/30 or 16/45 or 19/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 4 (Expected range: 0-- 7)
2 +plets (f+: 9.1%), 2 -plets (f-: 9.8%)
Total number of charge altplets: 2 (Critical number: 8)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 0 (11-20) 1 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
85- 104 5 T.... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
18- 23 1 i 6 6 0
110- 125 2 i. 8 8 ! 0
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
58- 133 (3.) A( 75)A 1 of 10 0.0035 large maximal spacing
59- 63 (2.) G( 4)G 14 of 14 0.0008 large minimal spacing
63- 67 (2.) G( 4)G 13 of 14 0.0008 matching minimum
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:14:53 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.902.seq)
SWISS-PROT ANNOTATION:
ID ORFC
DE ORFC, 77 bases, E0767B7B checksum.
number of residues: 77; molecular weight: 9.0 kdal
1 MAKKVQMTTY KCEVCRHEEE HKTEKRNPMY RDCPECNTRV GMKAISPFTS INIKRYGTDT
61 EPSKYEIIKN RPKGGNF
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 2( 2.6%); C : 4( 5.2%); D : 2( 2.6%); E : 8(10.4%); F : 2( 2.6%)
G : 4( 5.2%); H : 2( 2.6%); I : 5( 6.5%); K : 10(13.0%); L : 0( 0.0%)
M : 4( 5.2%); N : 5( 6.5%); P : 5( 6.5%); Q : 1( 1.3%); R : 6( 7.8%)
S : 3( 3.9%); T : 7( 9.1%); V : 3( 3.9%); W : 0( 0.0%); Y : 4( 5.2%)
KR : 16 ( 20.8%); ED : 10 ( 13.0%); AGP : 11 ( 14.3%);
KRED : 26 ( 33.8%); KR-ED : 6 ( 7.8%); FIKMNY : 30 ( 39.0%);
LVIFM : 14 ( 18.2%); ST : 10 ( 13.0%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00++000000 +0-00+0--- 0+0-++0000 +-00-000+0 00+0000000 000++000-0
61 -00+0-00+0 +0+0000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 15/30 or 20/45 or 25/60): none
Negative charge clusters (cmin = 11/30 or 14/45 or 18/60): none
Mixed charge clusters (cmin = 20/30 or 27/45 or 34/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.208 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.662 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.130 ( ED )
Expected score/letter: -0.506; Average information/letter: 0.388
Minimal length of displayed segments set to: 20
M_0.01= 14.87 (cv= 9.12, lambda= 0.47604, k= 0.15459, x= 5.74;
90% confidence interval for segment length: 26 +- 28)
M_0.05= 11.44 (x= 2.32)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.130 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.662 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.208 ( KR )
Expected score/letter: -0.818; Average information/letter: 1.136
Minimal length of displayed segments set to: 20
M_0.01= 9.50 (cv= 5.31, lambda= 0.81842, k= 0.31128, x= 4.19;
90% confidence interval for segment length: 10 +- 9)
M_0.05= 7.51 (x= 2.20)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.338 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.662 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.325; Average information/letter: 0.316
Minimal length of displayed segments set to: 20
M_0.01= 10.55 (cv= 6.45, lambda= 0.67373, k= 0.15915, x= 4.10;
90% confidence interval for segment length: 32 +- 33)
M_0.05= 8.13 (x= 1.68)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.662 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.338 ( KEDR )
Expected score/letter: -2.039; Average information/letter: 0.502
Minimal length of displayed segments set to: 20
M_0.01= 19.34 (cv= 10.92, lambda= 0.39788, k= 0.28690, x= 8.42;
90% confidence interval for segment length: 22 +- 11)
M_0.05= 15.24 (x= 4.33)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 6 | 4 | 8 | 19 | 9 | 8 | 12 | 10 | 9 | 13 | 6 | 7 |
lmin1 7 | 6 | 10 | 23 | 12 | 10 | 15 | 13 | 11 | 15 | 7 | 9 |
lmin2 8 | 7 | 11 | 26 | 13 | 11 | 16 | 14 | 12 | 17 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 4: 0
- runs >= 3: 1, at 18;
* runs >= 5: 0
0 runs >= 13: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -3.169; Average information/letter: 1.256
Minimal length of displayed segments set to: 15
M_0.01= 13.28 (cv= 7.10, lambda= 0.61141, k= 0.43982, x= 6.18;
90% confidence interval for segment length: 9 +- 5)
M_0.05= 10.62 (x= 3.51)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -6.364; Average information/letter: 1.353
Minimal length of displayed segments set to: 15
M_0.01= 25.79 (cv= 14.27, lambda= 0.30448, k= 0.33572, x= 11.52;
90% confidence interval for segment length: 8 +- 5)
M_0.05= 20.44 (x= 6.17); M_0.30= 14.07 (x= -0.20)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-11
CEVC at 12
-17
CPEC at 33 (l= 25)
-41-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 5 (Expected range: 0-- 12)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 3 (6-10) 2 (11-20) 0 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 11/30 or 15/45 or 18/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 4 (Expected range: 0-- 10)
3 +plets (f+: 20.8%), 1 -plets (f-: 13.0%)
Total number of charge altplets: 2 (Critical number: 11)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 2 (6-10) 0 (11-20) 1 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
4- 45 7 *.0..0. 6 6 /0/./2/././1/./
16- 27 2 *0 6 6 /0/2/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:14:54 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4810.3398.seq)
SWISS-PROT ANNOTATION:
ID ORFC2
DE ORFC2, 250 bases, C0857596 checksum.
number of residues: 250; molecular weight: 27.1 kdal
1 MDQPLVESSP PPILFGLSLS LEVTTFRSVL DNLVLARFSV RSVSLDVDTG ERANRFHSDT
61 GVTLRAITIH RVALLSFVFL LVTAHFTLVS SHLYFLRHNY FLVSLNCGIN NPSSSITDIP
121 TSTHQNGTPS LMVKVSVVPD TSVLRPIVLV VRLPLTSTSS NLVSGPDLIP CLSPATISVS
181 IPNSGSSFAT APIAASVASP VSAEHPDVHI NRNTIVHRMM DALKVTFAYL FILNTSTDIV
241 VYSPVNLYRR
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 14( 5.6%); C : 2( 0.8%); D : 11( 4.4%); E- : 4( 1.6%); F : 12( 4.8%)
G--: 7( 2.8%); H : 9( 3.6%); I : 15( 6.0%); K--: 2( 0.8%); L : 31(12.4%)
M : 4( 1.6%); N : 13( 5.2%); P+ : 18( 7.2%); Q--: 2( 0.8%); R : 14( 5.6%)
S++: 35(14.0%); T+ : 21( 8.4%); V++: 31(12.4%); W : 0( 0.0%); Y : 5( 2.0%)
KR : 16 ( 6.4%); ED : 15 ( 6.0%); AGP : 39 ( 15.6%);
KRED : 31 ( 12.4%); KR-ED : 1 ( 0.4%); FIKMNY : 51 ( 20.4%);
LVIFM : 93 ( 37.2%); ST ++: 56 ( 22.4%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0-0000-000 0000000000 0-0000+000 -00000+000 +0000-0-00 -+00+000-0
61 0000+00000 +000000000 0000000000 000000+000 0000000000 0000000-00
121 0000000000 000+00000- 0000+00000 0+00000000 000000-000 0000000000
181 0000000000 0000000000 000-00-000 0+00000+00 -00+000000 0000000-00
241 00000000++
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 7/30 or 9/45 or 11/60): none
Negative charge clusters (cmin = 7/30 or 9/45 or 10/60): none
Mixed charge clusters (cmin = 10/30 or 14/45 or 17/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.064 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.876 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.060 ( ED )
Expected score/letter: -0.868; Average information/letter: 2.123
Minimal length of displayed segments set to: 20
M_0.01= 7.55 (cv= 4.52, lambda= 1.22156, k= 0.40458, x= 3.03;
90% confidence interval for segment length: 6 +- 5)
M_0.05= 6.21 (x= 1.69)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.060 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.876 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.064 ( KR )
Expected score/letter: -0.884; Average information/letter: 2.245
Minimal length of displayed segments set to: 20
M_0.01= 7.33 (cv= 4.38, lambda= 1.26053, k= 0.41326, x= 2.95;
90% confidence interval for segment length: 6 +- 5)
M_0.05= 6.04 (x= 1.66)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.124 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.876 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.752; Average information/letter: 2.121
Minimal length of displayed segments set to: 20
M_0.01= 4.95 (cv= 2.82, lambda= 1.95508, k= 0.64555, x= 2.13;
90% confidence interval for segment length: 7 +- 4)
M_0.05= 4.12 (x= 1.30)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.876 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.124 ( KEDR )
Expected score/letter: -0.116
Average information/letter: 0.004 < .10; too small !
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 4 | 5 | 61 | 8 | 8 | 10 | 10 | 10 | 12 | 7 | 9 |
lmin1 5 | 5 | 6 | 74 | 10 | 9 | 12 | 12 | 12 | 15 | 9 | 11 |
lmin2 6 | 6 | 7 | 82 | 11 | 11 | 13 | 14 | 13 | 17 | 10 | 12 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 3: 0
0 runs >= 40: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.488; Average information/letter: 0.470
Minimal length of displayed segments set to: 15
M_0.01= 27.28 (cv= 17.53, lambda= 0.31501, k= 0.21676, x= 9.75;
90% confidence interval for segment length: 26 +- 20)
M_0.05= 22.10 (x= 4.58)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -1.860; Average information/letter: 0.188
Minimal length of displayed segments set to: 15
M_0.01= 87.96 (cv= 62.63, lambda= 0.08817, k= 0.09380, x= 25.33;
90% confidence interval for segment length: 59 +- 52)
M_0.05= 69.47 (x= 6.85); M_0.30= 47.48 (x= -15.15)
1) From 72 to 96 (type 1): length= 25, score=52.00
72 VALLSFVFLL VTAHFTLVSS HLYFL
L: 7(28.0%); S: 3(12.0%); V: 4(16.0%); F: 4(16.0%);
# of segments (>=15 residues) exceeding M_0.30: 1
2. SPACINGS OF C.
H2N-106-C-63-C-79-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 16 (Expected range: 6-- 31)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 4 (6-10) 5 (11-20) 4 (>=21) 4
3. Clusters of amino acid multiplets (cmin = 11/30 or 15/45 or 17/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 1 (Expected range: 0-- 6)
1 +plets (f+: 6.4%), 0 -plets (f-: 6.0%)
Total number of charge altplets: 1 (Critical number: 8)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 0 (11-20) 0 (>=21) 1
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
133- 152 5 V.... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
13- 24 2 i0 6 6 /0/2/
56- 103 8 i.0.0... 6 6 /0/./2/./0/./././
67- 122 7 i...0.0 8 8 /0/./././2/./2/
74- 82 1 i 8 6 1
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
12- 112 (1.) P( 100)P 1 of 19 0.0012 large maximal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:14:56 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.6375.8536.seq)
SWISS-PROT ANNOTATION:
ID ORFD
DE ORFD, 224 bases, B6C47028 checksum.
number of residues: 224; molecular weight: 24.1 kdal
1 MKDLIRTGLV EEMIRPMVGP LIGGMGGGGP VTTPQTPMNF NHNEGAFAIL IDGGDGNDYV
61 DDSGDPIIER NELEPTVYGS LSGNNRTSIG SGKYYYEFLI VAEPASGRIG LGLSALDNRS
121 STTDIRHDFW NGWYSDGVKI NNPNYSSNGS QTTWAGTFGV GDRVGIGLDQ SLDDIILYVN
181 GVEMDRIELT NTGTNNFMTP YLLGDGGKLD HSAQAYLPSG FSEY
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A- : 8( 3.6%); C : 0( 0.0%); D+ : 18( 8.0%); E : 11( 4.9%); F : 7( 3.1%)
G++: 34(15.2%); H : 3( 1.3%); I : 16( 7.1%); K- : 4( 1.8%); L : 18( 8.0%)
M : 7( 3.1%); N+ : 17( 7.6%); P : 11( 4.9%); Q- : 4( 1.8%); R : 9( 4.0%)
S : 17( 7.6%); T : 15( 6.7%); V : 11( 4.9%); W : 3( 1.3%); Y : 11( 4.9%)
KR - : 13 ( 5.8%); ED : 29 ( 12.9%); AGP : 53 ( 23.7%);
KRED : 42 ( 18.8%); KR-ED - : -16 ( -7.1%); FIKMNY : 62 ( 27.7%);
LVIFM : 59 ( 26.3%); ST : 32 ( 14.3%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0+-00+0000 --00+00000 0000000000 0000000000 000-000000 0-00-00-00
61 --00-000-+ 0-0-000000 00000+0000 00+000-000 00-0000+00 000000-0+0
121 000-0+0-00 00000-00+0 0000000000 0000000000 0-+00000-0 00--000000
181 00-0-+0-00 0000000000 0000-00+0- 0000000000 00-0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 6/30 or 8/45 or 10/60): none
Negative charge clusters (cmin = 11/30 or 14/45 or 18/60): none
Mixed charge clusters (cmin = 14/30 or 18/45 or 23/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.058 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.813 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.129 ( ED )
Expected score/letter: -0.955; Average information/letter: 2.402
Minimal length of displayed segments set to: 20
M_0.01= 7.10 (cv= 4.19, lambda= 1.29049, k= 0.42742, x= 2.91;
90% confidence interval for segment length: 6 +- 5)
M_0.05= 5.84 (x= 1.64)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.129 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.813 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.058 ( KR )
Expected score/letter: -0.670; Average information/letter: 0.936
Minimal length of displayed segments set to: 20
M_0.01= 11.18 (cv= 6.95, lambda= 0.77910, k= 0.27219, x= 4.23;
90% confidence interval for segment length: 13 +- 13)
M_0.05= 9.09 (x= 2.14)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.188 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.813 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.625; Average information/letter: 1.322
Minimal length of displayed segments set to: 20
M_0.01= 6.33 (cv= 3.69, lambda= 1.46634, k= 0.48077, x= 2.64;
90% confidence interval for segment length: 10 +- 8)
M_0.05= 5.22 (x= 1.53)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.813 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.188 ( KEDR )
Expected score/letter: -0.688; Average information/letter: 0.105
Minimal length of displayed segments set to: 20
M_0.01= 49.09 (cv= 36.31, lambda= 0.14905, k= 0.06756, x= 12.78;
90% confidence interval for segment length: 101 +- 84)
M_0.05= 38.15 (x= 1.85)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 3 | 5 | 6 | 40 | 7 | 10 | 11 | 9 | 12 | 14 | 6 | 7 |
lmin1 5 | 6 | 7 | 49 | 9 | 12 | 14 | 11 | 14 | 16 | 7 | 9 |
lmin2 6 | 7 | 9 | 54 | 10 | 13 | 15 | 12 | 16 | 18 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 27: 1, at 16;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.795; Average information/letter: 0.569
Minimal length of displayed segments set to: 15
M_0.01= 23.86 (cv= 14.90, lambda= 0.36328, k= 0.26054, x= 8.96;
90% confidence interval for segment length: 22 +- 15)
M_0.05= 19.37 (x= 4.47)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.219; Average information/letter: 0.445
Minimal length of displayed segments set to: 15
M_0.01= 56.69 (cv= 36.84, lambda= 0.14688, k= 0.18536, x= 19.84;
90% confidence interval for segment length: 27 +- 20)
M_0.05= 45.59 (x= 8.75); M_0.30= 32.39 (x= -4.46)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-224-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 19 (Expected range: 3-- 26)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 8 (6-10) 3 (11-20) 8 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 14/30 or 17/45 or 21/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 3 (Expected range: 0-- 10)
0 +plets (f+: 5.8%), 3 -plets (f-: 12.9%)
Total number of charge altplets: 4 (Critical number: 10)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 1 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
23- 29 1 G 6 4 1
52- 63 3 D.. 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
1- 28 4 i..0 7 7 /0/././2/
4- 57 9 i..0000.0 6 6 /0/././2/2/2/2/./2/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
38- 184 (2.) M( 146)M 1 of 8 0.0041 large 1. maximal spacing
198- 225 (4.) M( 27)M 2 of 8 0.9836 small 2. maximal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:14:58 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.8483.seq)
SWISS-PROT ANNOTATION:
ID ORFE
DE ORFE, 185 bases, 52D098DD checksum.
number of residues: 185; molecular weight: 21.1 kdal
1 MLLTPELKKE RWEAAYAGEI HVPKDMNRAT AALFWEKRHE ARTIERGKYP VFNLSMRDHD
61 DTLSFPQLYF QCDSDYEAAM VILGSWGHWQ RLCEAKWFAE KLALWQEEKA QRDIAHGRAK
121 IKALADAGNL SACKFLASGG LSDKPKQEKP SNPKPAKNPE PAPTAEEDDF LKHGLSILEK
181 NSEHK
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 23(12.4%); C : 3( 1.6%); D : 11( 5.9%); E : 18( 9.7%); F : 7( 3.8%)
G : 9( 4.9%); H : 7( 3.8%); I : 6( 3.2%); K : 19(10.3%); L : 18( 9.7%)
M : 4( 2.2%); N : 6( 3.2%); P : 11( 5.9%); Q : 6( 3.2%); R : 9( 4.9%)
S : 10( 5.4%); T : 5( 2.7%); V : 3( 1.6%); W : 6( 3.2%); Y : 4( 2.2%)
KR : 28 ( 15.1%); ED : 29 ( 15.7%); AGP : 43 ( 23.2%);
KRED : 57 ( 30.8%); KR-ED : -1 ( -0.5%); FIKMNY : 46 ( 24.9%);
LVIFM : 38 ( 20.5%); ST : 15 ( 8.1%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00000-0++- +0-00000-0 000+-00+00 00000-++0- 0+00-+0+00 000000+-0-
61 -000000000 00-0-0-000 0000000000 +00-0+000- +00000--+0 0+-0000+0+
121 0+000-0000 000+000000 00-+0+0-+0 000+00+00- 00000----0 0+000000-+
181 00-0+
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 12/30 or 16/45 or 20/60): none
Negative charge clusters (cmin = 12/30 or 16/45 or 20/60): none
Mixed charge clusters (cmin = 19/30 or 26/45 or 32/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.151 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.692 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.157 ( ED )
Expected score/letter: -0.703; Average information/letter: 0.835
Minimal length of displayed segments set to: 20
M_0.01= 12.02 (cv= 7.41, lambda= 0.70492, k= 0.25905, x= 4.61;
90% confidence interval for segment length: 15 +- 14)
M_0.05= 9.70 (x= 2.30)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.157 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.692 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.151 ( KR )
Expected score/letter: -0.681; Average information/letter: 0.778
Minimal length of displayed segments set to: 20
M_0.01= 12.39 (cv= 7.68, lambda= 0.67984, k= 0.24781, x= 4.71;
90% confidence interval for segment length: 16 +- 15)
M_0.05= 10.00 (x= 2.32)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.308 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.692 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.384; Average information/letter: 0.448
Minimal length of displayed segments set to: 20
M_0.01= 10.23 (cv= 6.45, lambda= 0.80898, k= 0.21288, x= 3.77;
90% confidence interval for segment length: 27 +- 24)
M_0.05= 8.21 (x= 1.76)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.692 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.308 ( KEDR )
Expected score/letter: -1.773; Average information/letter: 0.418
Minimal length of displayed segments set to: 20
M_0.01= 24.09 (cv= 14.94, lambda= 0.34931, k= 0.24483, x= 9.14;
90% confidence interval for segment length: 29 +- 16)
M_0.05= 19.42 (x= 4.47)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 5 | 8 | 23 | 9 | 9 | 13 | 10 | 11 | 14 | 7 | 9 |
lmin1 7 | 7 | 10 | 28 | 11 | 11 | 16 | 13 | 13 | 17 | 8 | 11 |
lmin2 8 | 8 | 11 | 32 | 13 | 13 | 17 | 14 | 15 | 19 | 10 | 12 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 1, at 166;
* runs >= 5: 0
0 runs >= 16: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.589; Average information/letter: 0.885
Minimal length of displayed segments set to: 15
M_0.01= 18.18 (cv= 10.78, lambda= 0.48410, k= 0.36062, x= 7.40;
90% confidence interval for segment length: 14 +- 8)
M_0.05= 14.81 (x= 4.03)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -5.319; Average information/letter: 0.902
Minimal length of displayed segments set to: 15
M_0.01= 39.12 (cv= 23.75, lambda= 0.21977, k= 0.29424, x= 15.37;
90% confidence interval for segment length: 14 +- 9)
M_0.05= 31.70 (x= 7.95); M_0.30= 22.88 (x= -0.88)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-71-C-20-C-39-C-52-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 10 (Expected range: 1-- 22)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 4 (6-10) 0 (11-20) 3 (>=21) 4
3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 5 (Expected range: 0-- 16)
2 +plets (f+: 15.1%), 3 -plets (f-: 15.7%)
Total number of charge altplets: 11 (Critical number: 19)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 1 (11-20) 1 (>=21) 4
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
115- 130 4 A... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:14:59 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4819.9592.seq)
SWISS-PROT ANNOTATION:
ID ORFF
DE ORFF, 99 bases, 814D438A checksum.
number of residues: 99; molecular weight: 11.7 kdal
1 MIAAIIVQGP LFWFIGSRQK KGEDMTSELK SIRMSIAKVH QRVDEELDKL DEKYATKTEV
61 EVHMAHLKEV NELQFRAIME NLDYIRKRLD EHDSKRRPE
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 6( 6.1%); C : 0( 0.0%); D : 7( 7.1%); E : 12(12.1%); F : 3( 3.0%)
G : 3( 3.0%); H : 4( 4.0%); I : 8( 8.1%); K : 10(10.1%); L : 8( 8.1%)
M : 5( 5.1%); N : 2( 2.0%); P : 2( 2.0%); Q : 4( 4.0%); R : 8( 8.1%)
S : 5( 5.1%); T : 3( 3.0%); V : 6( 6.1%); W : 1( 1.0%); Y : 2( 2.0%)
KR : 18 ( 18.2%); ED : 19 ( 19.2%); AGP : 11 ( 11.1%);
KRED : 37 ( 37.4%); KR-ED : -1 ( -1.0%); FIKMNY : 30 ( 30.3%);
LVIFM : 30 ( 30.3%); ST : 8 ( 8.1%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000000 0000000+0+ +0--000-0+ 00+0000+00 0+0---0-+0 --+000+0-0
61 -000000+-0 0-000+000- 00-00+++0- -0-0+++0-
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 13/30 or 18/45 or 22/60): none
Negative charge clusters (cmin = 14/30 or 19/45 or 23/60): none
Mixed charge clusters (cmin = 21/30 or 29/45 or 37/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.182 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.626 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.192 ( ED )
Expected score/letter: -0.646; Average information/letter: 0.622
Minimal length of displayed segments set to: 20
M_0.01= 12.93 (cv= 7.75, lambda= 0.59300, k= 0.21646, x= 5.18;
90% confidence interval for segment length: 18 +- 18)
M_0.05= 10.18 (x= 2.43)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.192 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.626 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.182 ( KR )
Expected score/letter: -0.606; Average information/letter: 0.540
Minimal length of displayed segments set to: 20
M_0.01= 13.70 (cv= 8.32, lambda= 0.55237, k= 0.19663, x= 5.38;
90% confidence interval for segment length: 20 +- 20)
M_0.05= 10.75 (x= 2.43)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.374 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.626 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.253; Average information/letter: 0.188
Minimal length of displayed segments set to: 20
M_0.01= 13.39 (cv= 8.90, lambda= 0.51622, k= 0.10182, x= 4.49;
90% confidence interval for segment length: 53 +- 55)
M_0.05= 10.23 (x= 1.33)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.626 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.374 ( KEDR )
Expected score/letter: -2.364; Average information/letter: 0.604
Minimal length of displayed segments set to: 20
M_0.01= 17.68 (cv= 10.02, lambda= 0.45839, k= 0.33610, x= 7.66;
90% confidence interval for segment length: 19 +- 8)
M_0.05= 14.13 (x= 4.10)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 5 | 9 | 18 | 9 | 9 | 13 | 9 | 10 | 13 | 7 | 8 |
lmin1 7 | 7 | 11 | 21 | 11 | 11 | 15 | 12 | 12 | 16 | 8 | 10 |
lmin2 8 | 8 | 12 | 24 | 12 | 12 | 17 | 13 | 13 | 18 | 9 | 12 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 4: 0
- runs >= 4: 0
* runs >= 6: 0
0 runs >= 12: 1, at 1;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.949; Average information/letter: 0.967
Minimal length of displayed segments set to: 15
M_0.01= 18.11 (cv= 10.11, lambda= 0.45433, k= 0.37944, x= 7.99;
90% confidence interval for segment length: 12 +- 7)
M_0.05= 14.52 (x= 4.40)
1) From 1 to 16 (type 1): length= 16, score=17.00 *
(pocket at 8 to 10: length= 3, score=-5.00)
1 MIAAIIV |QGP| LFWFIG
A: 2(12.5%); G: 2(12.5%); I: 4(25.0%); F: 2(12.5%);
# of segments (>=15 residues) exceeding M_0.05: 1
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -5.869; Average information/letter: 0.911
Minimal length of displayed segments set to: 15
M_0.01= 40.86 (cv= 23.43, lambda= 0.19613, k= 0.30667, x= 17.43;
90% confidence interval for segment length: 13 +- 8)
M_0.05= 32.55 (x= 9.12); M_0.30= 22.66 (x= -0.77)
1) From 1 to 16 (type 1): length= 16, score=33.00 *
(pocket at 8 to 10: length= 3, score=-14.00)
1 MIAAIIV |QGP| LFWFIG
A: 2(12.5%); G: 2(12.5%); I: 4(25.0%); F: 2(12.5%);
# of segments (>=15 residues) exceeding M_0.30: 1
2. SPACINGS OF C.
H2N-99-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 5 (Expected range: 0-- 14)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 3 (6-10) 0 (11-20) 1 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 10/30 or 12/45 or 15/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 7 (Expected range: 0-- 13)
3 +plets (f+: 18.2%), 4 -plets (f-: 19.2%)
Total number of charge altplets: 3 (Critical number: 15)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 5 (6-10) 0 (11-20) 2 (>=21) 1
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:01 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.4176.seq)
SWISS-PROT ANNOTATION:
ID ORFG
DE ORFG, 535 bases, 35D30714 checksum.
number of residues: 535; molecular weight: 62.4 kdal
1 MNTTASDDQN DIPLEDPRLI ELREACLHSL WIFAQAVEPH RVYGECHKEL FDWWQEMELE
61 EVLNTLALMP RDHQKSHCIA VWVCWQIFKN PAVTIAYVCA TESLAILQLY DIKQILTSDE
121 FTRLSPDMIE PMEKKRQKWA ETAIIVDHPI RKKERPRDPT VLATGLDSNN IGAHCNIMVK
181 DDVVIDKNSL TETARQKVEA KAGHLSSILT TDGMEFCVGT RYHPKDHYQT LIDMTEEVWE
241 GDQLVGERPV YAVHTRVVEV EGVFLWPRMA RESDGKMFGF DRAQLSRKKA KYRKDMRNFY
301 CQYYNDPNAI NEGGIEKSMF LYYDKEKVVR RKGSWWVGKR KVNIIACQDF AYSVQTGSDW
361 TALFILGMDK DKRIYILDIV RYQTKKPSVY WKNLETAYNK WKFKWVRAEA VAAQEVIIEA
421 LREYAEKEQC PIRIKAYKPN SQSGDKETRI SQTLEPLYEN GDIYHYRGGL CDMLEEELTQ
481 DRPPHDDLKD CLHIGVGFDK LKPPVEDDDE EDYDNYHDPH ASYSYDPNNR FGGLQ
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 34( 6.4%); C : 12( 2.2%); D+ : 44( 8.2%); E : 45( 8.4%); F : 15( 2.8%)
G : 25( 4.7%); H : 16( 3.0%); I : 33( 6.2%); K : 40( 7.5%); L : 39( 7.3%)
M : 14( 2.6%); N : 20( 3.7%); P : 24( 4.5%); Q : 22( 4.1%); R : 31( 5.8%)
S : 22( 4.1%); T : 25( 4.7%); V : 34( 6.4%); W : 14( 2.6%); Y : 26( 4.9%)
KR : 71 ( 13.3%); ED + : 89 ( 16.6%); AGP : 83 ( 15.5%);
KRED + : 160 ( 29.9%); KR-ED : -18 ( -3.4%); FIKMNY : 148 ( 27.7%);
LVIFM : 135 ( 25.2%); ST : 47 ( 8.8%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 000000--00 -000--0+00 -0+-000000 0000000-00 +000-00+-0 0-000-0-0-
61 -000000000 +-00+00000 00000000+0 0000000000 0-00000000 -0+00000--
121 00+000-00- 00-+++0+00 -00000-000 +++-+0+-00 000000-000 000000000+
181 --000-+000 0-00+0+0-0 +000000000 0-00-00000 +000+-0000 00-00--00-
241 0-0000-+00 00000+00-0 -000000+00 +-0-0+0000 -+0000+++0 +0++-0+000
301 00000-0000 0-000-+000 000-+-+00+ ++000000++ +0000000-0 00000000-0
361 00000000-+ -++0000-00 +000++0000 0+00-0000+ 0+0+00+0-0 0000-000-0
421 0+-00-+-00 00+0+00+00 0000-+-0+0 0000-000-0 0-0000+000 0-00---000
481 -+000--0+- 00000000-+ 0+000----- --0-000-00 00000-000+ 00000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 11/30 or 15/45 or 18/60): none
Negative charge clusters (cmin = 13/30 or 17/45 or 21/60): none
Mixed charge clusters (cmin = 19/30 or 25/45 or 32/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.133 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.701 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.166 ( ED )
Expected score/letter: -0.768; Average information/letter: 1.045
Minimal length of displayed segments set to: 20
M_0.01= 12.18 (cv= 7.92, lambda= 0.79365, k= 0.29549, x= 4.26;
90% confidence interval for segment length: 13 +- 12)
M_0.05= 10.12 (x= 2.21)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.166 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.701 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.133 ( KR )
Expected score/letter: -0.634; Average information/letter: 0.671
Minimal length of displayed segments set to: 20
M_0.01= 14.82 (cv= 9.92, lambda= 0.63353, k= 0.22521, x= 4.91;
90% confidence interval for segment length: 20 +- 18)
M_0.05= 12.25 (x= 2.34)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.299 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.701 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.402; Average information/letter: 0.494
Minimal length of displayed segments set to: 20
M_0.01= 11.05 (cv= 7.38, lambda= 0.85175, k= 0.23040, x= 3.68;
90% confidence interval for segment length: 28 +- 23)
M_0.05= 9.14 (x= 1.76)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.701 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.299 ( KEDR )
Expected score/letter: -1.692; Average information/letter: 0.393
Minimal length of displayed segments set to: 20
M_0.01= 28.16 (cv= 18.78, lambda= 0.33455, k= 0.23167, x= 9.38;
90% confidence interval for segment length: 35 +- 18)
M_0.05= 23.29 (x= 4.51)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 6 | 9 | 27 | 10 | 11 | 14 | 11 | 12 | 16 | 7 | 9 |
lmin1 7 | 8 | 11 | 33 | 12 | 13 | 17 | 14 | 15 | 19 | 9 | 11 |
lmin2 8 | 9 | 12 | 37 | 13 | 15 | 19 | 16 | 17 | 21 | 10 | 13 |
(Significance level: 0.010000; Minimal displayed length: 6)
(-) 7(0,0,0); at 506- 512: EDDDEED
(4. quartile) -------
Run count statistics:
+ runs >= 4: 0
- runs >= 4: 1, at 506;
* runs >= 6: 1, at 506;
0 runs >= 18: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.495; Average information/letter: 0.823
Minimal length of displayed segments set to: 15
M_0.01= 22.25 (cv= 14.25, lambda= 0.44078, k= 0.34199, x= 8.00;
90% confidence interval for segment length: 17 +- 9)
M_0.05= 18.56 (x= 4.30)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -5.114; Average information/letter: 0.835
Minimal length of displayed segments set to: 15
M_0.01= 47.88 (cv= 31.30, lambda= 0.20071, k= 0.28031, x= 16.58;
90% confidence interval for segment length: 17 +- 10)
M_0.05= 39.76 (x= 8.46); M_0.30= 30.10 (x= -1.20)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-25-C-19-C-31-C-5-C-14-C-75-C-41-C-83-C-45-C-82-C-40-C-19-C-44-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 5
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 9
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 34 (Expected range: 12-- 45)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 13 (6-10) 8 (11-20) 6 (>=21) 8
3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 17 (Expected range: 7-- 34)
8 +plets (f+: 13.3%), 9 -plets (f-: 16.6%)
Total number of charge altplets: 22 (Critical number: 38)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 4 (11-20) 4 (>=21) 9
3. Long charge multiplets (>= 5; Letter/Length/Position):
-/7/506
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
472- 516 9 D........ 5 5 ! 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10)
Location Period Element Copies Core Errors
287- 298 2 *. 6 6 0
462- 491 5 *.0.. 6 6 /0/./2/././
472- 516 9 -.0....0. 5 5 /0/./1/././././1/./
506- 512 1 - 7 7 ! 0
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
66- 68 (1.) L( 2)L 40 of 40 0.0018 large minimal spacing
107- 109 (1.) L( 2)L 38 of 40 0.0018 matching minimum
254- 465 (3.) H( 211)H 1 of 17 0.0048 large maximal spacing
534- 536 (1.) L( 2)L 39 of 40 0.0018 matching minimum
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:02 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4810.2920.seq)
SWISS-PROT ANNOTATION:
ID ORFH
DE ORFH, 599 bases, 6E34097E checksum.
number of residues: 599; molecular weight: 68.1 kdal
1 MSTDIKTLQK MLEGRDDDRA FIDELVVLFT NMENARAQKD REDKELMDYI DATDTRKTSN
61 SKLPFKNSTT INKLAHLHLM ITTSYMEHLL PNRNWVDFVG FDNDSVNAEK REIARSYVRG
121 KVEASNLEGV IERMVDDFAV RGFCVAHTRH VKRMTVTAEN QVIKNYSGTV TERLSPSDVF
181 WDVTADSLPK AAKCIRQLYT LGSLKREIEE GTFPLMSMED FQKLREERRT IREALADGYN
241 GRRKFDSLHK KGYGSMMNYI NEGVVEVLTF MGDFYDEEND ELWNNYEITV IDRKIIGRKQ
301 SKDTWDGSQN LHIAVYEFQK DTLCPIGPLH RLTGMQYKLD KRENFREDLH DRFLHPSLKK
361 VGDVREKGMR GGPNHVFEVE ETGDVQYMTP PAEVLQPDNQ LSITLQLMED LSGAPKESIG
421 QRTAGEKTKF EVQLLDQGQN KVFRRKVKKF ERELLTPVLN DYLEQGRNHL DASDTIKTFN
481 SELGTATFLD ITADDLNLNG QMVAQGATLF AEKANTLQNL NAILGGPLGA ALAPHMSRTK
541 LFNAVEYLGD LDAYGIFTFG IGVQEDQQLA RMAQKSTQQT EETALTQEEV GGPTTDTGQ
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 36( 6.0%); C : 3( 0.5%); D : 46( 7.7%); E : 49( 8.2%); F : 26( 4.3%)
G : 41( 6.8%); H : 13( 2.2%); I : 25( 4.2%); K : 39( 6.5%); L : 58( 9.7%)
M : 20( 3.3%); N : 32( 5.3%); P : 17( 2.8%); Q : 29( 4.8%); R : 37( 6.2%)
S : 26( 4.3%); T : 46( 7.7%); V : 36( 6.0%); W : 4( 0.7%); Y : 16( 2.7%)
KR : 76 ( 12.7%); ED + : 95 ( 15.9%); AGP : 94 ( 15.7%);
KRED : 171 ( 28.5%); KR-ED : -19 ( -3.2%); FIKMNY : 158 ( 26.4%);
LVIFM : 165 ( 27.5%); ST : 72 ( 12.0%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 000-0+000+ 00-0+---+0 00--000000 00-00+00+- +--+-00-00 -00-0++000
61 0+000+0000 00+0000000 000000-000 00+000-000 0-0-0000-+ +-00+000+0
121 +0-0000-00 0-+00--000 +0000000+0 0++00000-0 000+000000 0-+0000-00
181 0-000-000+ 00+00+0000 0000++-0-- 00000000-- 00+0+--++0 0+-000-000
241 0+++0-000+ +000000000 0-000-0000 00-00---0- -00000-000 0-++000++0
301 0+-00-0000 000000-00+ -000000000 +000000+0- ++-00+--00 -+000000++
361 00-0+-+00+ 0000000-0- -00-000000 00-0000-00 00000000-- 00000+-000
421 0+000-+0+0 -0000-0000 +00+++0++0 -+-0000000 -00-00+000 -00-00+000
481 0-0000000- 000--00000 0000000000 0-+0000000 0000000000 0000000+0+
541 00000-000- 0-00000000 0000--0000 +000+00000 --00000--0 00000-000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 11/30 or 14/45 or 17/60): none
Negative charge clusters (cmin = 12/30 or 16/45 or 20/60): none
Mixed charge clusters (cmin = 18/30 or 24/45 or 31/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.127 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.715 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.159 ( ED )
Expected score/letter: -0.778; Average information/letter: 1.105
Minimal length of displayed segments set to: 20
M_0.01= 11.95 (cv= 7.79, lambda= 0.82049, k= 0.30419, x= 4.16;
90% confidence interval for segment length: 13 +- 11)
M_0.05= 9.96 (x= 2.17)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.159 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.715 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.127 ( KR )
Expected score/letter: -0.651; Average information/letter: 0.730
Minimal length of displayed segments set to: 20
M_0.01= 14.39 (cv= 9.63, lambda= 0.66411, k= 0.23732, x= 4.76;
90% confidence interval for segment length: 19 +- 17)
M_0.05= 11.94 (x= 2.31)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.285 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.715 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.429; Average information/letter: 0.568
Minimal length of displayed segments set to: 20
M_0.01= 10.51 (cv= 6.97, lambda= 0.91746, k= 0.25763, x= 3.54;
90% confidence interval for segment length: 24 +- 20)
M_0.05= 8.73 (x= 1.76)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.715 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.285 ( KEDR )
Expected score/letter: -1.569; Average information/letter: 0.356
Minimal length of displayed segments set to: 20
M_0.01= 30.23 (cv= 20.47, lambda= 0.31241, k= 0.21169, x= 9.75;
90% confidence interval for segment length: 38 +- 21)
M_0.05= 25.01 (x= 4.54)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 6 | 9 | 29 | 10 | 11 | 14 | 11 | 12 | 16 | 7 | 8 |
lmin1 7 | 7 | 11 | 35 | 12 | 13 | 17 | 14 | 15 | 19 | 8 | 10 |
lmin2 8 | 9 | 12 | 39 | 13 | 15 | 19 | 16 | 17 | 22 | 9 | 12 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 4: 0
- runs >= 4: 0
* runs >= 6: 1, at 39;
0 runs >= 19: 1, at 514;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.554; Average information/letter: 0.873
Minimal length of displayed segments set to: 15
M_0.01= 22.28 (cv= 14.31, lambda= 0.44691, k= 0.35332, x= 7.97;
90% confidence interval for segment length: 16 +- 9)
M_0.05= 18.63 (x= 4.32)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.833; Average information/letter: 0.747
Minimal length of displayed segments set to: 15
M_0.01= 52.65 (cv= 34.81, lambda= 0.18374, k= 0.26684, x= 17.85;
90% confidence interval for segment length: 19 +- 11)
M_0.05= 43.78 (x= 8.97); M_0.30= 33.23 (x= -1.58)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-143-C-49-C-129-C-275-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 5
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 9
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 34 (Expected range: 17-- 52)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 9 (6-10) 11 (11-20) 7 (>=21) 8
3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 29 (Expected range: 7-- 35)
13 +plets (f+: 12.7%), 16 -plets (f-: 15.9%)
Total number of charge altplets: 24 (Critical number: 39)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 7 (6-10) 4 (11-20) 10 (>=21) 9
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
130- 149 5 V.... 4 4 0
520- 535 4 L... 4 4 0
577- 588 3 T.. 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10)
Location Period Element Copies Core Errors
6- 59 9 *..*..... 6 6 /0/././2/./././././
33- 59 3 *0. 9 9 /0/3/./
39- 45 1 * 7 7 0
42- 56 3 -0. 5 5 /0/1/./
345- 392 8 i....0.. 6 6 /0/././././2/././
426- 455 5 *.... 6 6 0
489- 530 7 i0.000. 6 6 /0/1/./2/1/1/./
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:04 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4716.8059.seq)
SWISS-PROT ANNOTATION:
ID ORFI
DE ORFI, 88 bases, CC7DBEB2 checksum.
number of residues: 88; molecular weight: 10.5 kdal
1 VDLLQILDSN WFELEADETK EDYESNFRAS VKTRRALNLW IQHELSKLDK EMSLSRIKEK
61 PDRAELALVV MARREQLLQM QAILEFKD
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 7( 8.0%); C : 0( 0.0%); D : 7( 8.0%); E : 11(12.5%); F : 3( 3.4%)
G : 0( 0.0%); H : 1( 1.1%); I : 4( 4.5%); K : 7( 8.0%); L : 14(15.9%)
M : 3( 3.4%); N : 3( 3.4%); P : 1( 1.1%); Q : 5( 5.7%); R : 7( 8.0%)
S : 6( 6.8%); T : 2( 2.3%); V : 4( 4.5%); W : 2( 2.3%); Y : 1( 1.1%)
KR : 14 ( 15.9%); ED : 18 ( 20.5%); AGP : 8 ( 9.1%);
KRED : 32 ( 36.4%); KR-ED : -4 ( -4.5%); FIKMNY : 21 ( 23.9%);
LVIFM : 28 ( 31.8%); ST : 8 ( 9.1%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0-00000-00 00-0-0--0+ --0-000+00 0+0++00000 000-00+0-+ -0000+0+-+
61 0-+0-00000 00++-00000 0000-0+-
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 12/30 or 16/45 or 20/60): none
Negative charge clusters (cmin = 14/30 or 20/45 or 24/60): none
Mixed charge clusters (cmin = 21/30 or 29/45 or 36/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.159 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.636 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.205 ( ED )
Expected score/letter: -0.727; Average information/letter: 0.824
Minimal length of displayed segments set to: 20
M_0.01= 11.26 (cv= 6.52, lambda= 0.68640, k= 0.25892, x= 4.73;
90% confidence interval for segment length: 14 +- 13)
M_0.05= 8.88 (x= 2.36)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.205 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.636 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.159 ( KR )
Expected score/letter: -0.545; Average information/letter: 0.437
Minimal length of displayed segments set to: 20
M_0.01= 14.62 (cv= 8.97, lambda= 0.49916, k= 0.16908, x= 5.66;
90% confidence interval for segment length: 24 +- 25)
M_0.05= 11.36 (x= 2.39)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.364 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.636 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.273; Average information/letter: 0.220
Minimal length of displayed segments set to: 20
M_0.01= 12.39 (cv= 8.00, lambda= 0.55962, k= 0.11688, x= 4.38;
90% confidence interval for segment length: 45 +- 47)
M_0.05= 9.47 (x= 1.47)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.636 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.364 ( KEDR )
Expected score/letter: -2.273; Average information/letter: 0.576
Minimal length of displayed segments set to: 20
M_0.01= 18.01 (cv= 10.15, lambda= 0.44127, k= 0.32257, x= 7.86;
90% confidence interval for segment length: 20 +- 9)
M_0.05= 14.31 (x= 4.17)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 6 | 9 | 18 | 8 | 9 | 13 | 9 | 10 | 13 | 5 | 6 |
lmin1 6 | 7 | 11 | 22 | 10 | 11 | 15 | 11 | 12 | 16 | 6 | 8 |
lmin2 7 | 8 | 12 | 24 | 12 | 13 | 17 | 13 | 14 | 17 | 7 | 9 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 4: 0
* runs >= 6: 0
0 runs >= 12: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.932; Average information/letter: 0.957
Minimal length of displayed segments set to: 15
M_0.01= 18.28 (cv= 10.10, lambda= 0.44335, k= 0.37855, x= 8.18;
90% confidence interval for segment length: 12 +- 7)
M_0.05= 14.61 (x= 4.51)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -5.659; Average information/letter: 0.844
Minimal length of displayed segments set to: 15
M_0.01= 43.16 (cv= 24.60, lambda= 0.18200, k= 0.29450, x= 18.56;
90% confidence interval for segment length: 13 +- 9)
M_0.05= 34.20 (x= 9.60); M_0.30= 23.55 (x= -1.05)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-88-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 5 (Expected range: 0-- 14)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 3 (6-10) 0 (11-20) 1 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 4 (Expected range: 0-- 12)
2 +plets (f+: 15.9%), 2 -plets (f-: 20.5%)
Total number of charge altplets: 6 (Critical number: 14)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 0 (11-20) 3 (>=21) 1
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
15- 26 3 E.. 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:05 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4716.4029.seq)
SWISS-PROT ANNOTATION:
ID ORFJ
DE ORFJ, 244 bases, 65D0A5BD checksum.
number of residues: 244; molecular weight: 26.9 kdal
1 MPNPFDQNDE HDTNVNPYEG KDSKEIMDEL VGEGKKYKSV EDAVKALAYS QHHITTLESE
61 TAKLREAQSQ AKTIDEVLAK LQQKQQPSDD NKGSDENDPA AQKPTDDIDV EATVKRLLEQ
121 HTSTAQAEAN HKQVVDAVKA KYGTKAFDVW DKAEKELGIN LEQMAQTSPA ATLKLLGISG
181 ESAPAQSAAT FKGDAKPTPT DQGERPPEGS KRLVDYMLSK GEINRSQAYN LKLKYSADPE
241 KYRA
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 28(11.5%); C : 0( 0.0%); D+ : 21( 8.6%); E : 22( 9.0%); F- : 3( 1.2%)
G : 12( 4.9%); H : 5( 2.0%); I- : 7( 2.9%); K++: 27(11.1%); L : 17( 7.0%)
M : 4( 1.6%); N : 10( 4.1%); P : 13( 5.3%); Q : 17( 7.0%); R- : 6( 2.5%)
S : 16( 6.6%); T : 15( 6.1%); V : 12( 4.9%); W : 1( 0.4%); Y : 8( 3.3%)
KR : 33 ( 13.5%); ED ++: 43 ( 17.6%); AGP : 53 ( 21.7%);
KRED + : 76 ( 31.1%); KR-ED : -10 ( -4.1%); FIKMNY : 59 ( 24.2%);
LVIFM --: 43 ( 17.6%); ST : 31 ( 12.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00000-00-- 0-000000-0 +-0+-00--0 00-0++0+00 --00+00000 0000000-0-
61 00+0+-0000 0+00--000+ 000+0000-- 0+00--0-00 00+00--0-0 -000++00-0
121 0000000-00 0+000-00+0 +000+00-00 -+0-+-0000 0-00000000 000+000000
181 -000000000 0+0-0+0000 -00-+00-00 ++00-0000+ 0-00+00000 0+0+000-0-
241 +0+0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 11/30 or 15/45 or 18/60): none
Negative charge clusters (cmin = 13/30 or 18/45 or 22/60): none
Mixed charge clusters (cmin = 19/30 or 26/45 or 33/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.135 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.689 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.176 ( ED )
Expected score/letter: -0.770; Average information/letter: 1.030
Minimal length of displayed segments set to: 20
M_0.01= 11.31 (cv= 7.01, lambda= 0.78413, k= 0.29347, x= 4.30;
90% confidence interval for segment length: 12 +- 11)
M_0.05= 9.23 (x= 2.22)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.176 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.689 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.135 ( KR )
Expected score/letter: -0.607; Average information/letter: 0.597
Minimal length of displayed segments set to: 20
M_0.01= 14.34 (cv= 9.24, lambda= 0.59473, k= 0.20871, x= 5.10;
90% confidence interval for segment length: 21 +- 20)
M_0.05= 11.60 (x= 2.36)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.311 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.689 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.377; Average information/letter: 0.431
Minimal length of displayed segments set to: 20
M_0.01= 10.74 (cv= 6.93, lambda= 0.79323, k= 0.20648, x= 3.81;
90% confidence interval for segment length: 28 +- 26)
M_0.05= 8.69 (x= 1.76)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.689 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.311 ( KEDR )
Expected score/letter: -1.803; Average information/letter: 0.428
Minimal length of displayed segments set to: 20
M_0.01= 24.55 (cv= 15.49, lambda= 0.35481, k= 0.24970, x= 9.05;
90% confidence interval for segment length: 29 +- 15)
M_0.05= 19.95 (x= 4.46)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 6 | 8 | 24 | 9 | 10 | 13 | 10 | 11 | 15 | 6 | 8 |
lmin1 6 | 7 | 10 | 29 | 11 | 12 | 16 | 13 | 14 | 18 | 8 | 10 |
lmin2 7 | 8 | 12 | 32 | 13 | 14 | 18 | 14 | 16 | 20 | 9 | 11 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 4: 0
* runs >= 6: 0
0 runs >= 16: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -3.037; Average information/letter: 1.208
Minimal length of displayed segments set to: 15
M_0.01= 15.38 (cv= 9.13, lambda= 0.60190, k= 0.43255, x= 6.25;
90% confidence interval for segment length: 11 +- 5)
M_0.05= 12.68 (x= 3.54)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -5.881; Average information/letter: 1.096
Minimal length of displayed segments set to: 15
M_0.01= 35.36 (cv= 21.69, lambda= 0.25347, k= 0.32172, x= 13.67;
90% confidence interval for segment length: 12 +- 7)
M_0.05= 28.93 (x= 7.24); M_0.30= 21.28 (x= -0.41)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-244-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 42- 46] DAVKA
[ 136- 140] DAVKA
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 14 (Expected range: 4-- 28)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 5 (6-10) 2 (11-20) 4 (>=21) 4
3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 10 (Expected range: 0-- 20)
3 +plets (f+: 13.5%), 7 -plets (f-: 17.6%)
Total number of charge altplets: 7 (Critical number: 23)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 2 (6-10) 4 (11-20) 2 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
194- 228 7 -00..00 5 5 /0/0/1/././1/0/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
159- 178 (3.) I( 19)I 8 of 8 0.0011 large minimal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:07 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4816.4029.seq)
SWISS-PROT ANNOTATION:
ID ORFK
DE ORFK, 322 bases, C287FDE1 checksum.
number of residues: 322; molecular weight: 35.3 kdal
1 MSTGNNTSNT QALIVSEIWA DEIEDILHEK LLDVNIARVV DFPDGDKLTI PSVGTPVVRS
61 RPEQGDFTFD NLDTGEISII LRDEVYAGNA ISKKLRQDSR WISNVGAMLP AEQARAIMER
121 YQTDLLALGN AQFAGQNDPN VINGVPHRFV GTGTDQTMDV TDFSRVNYVM TQSKMPMGGM
181 IGIIDPSVAH HLETITNISN ISNNPRWEGI VESGIAPDMQ FVRSVYGIDL FVSNLLADAN
241 ETINAGGDAR STTAGKCNMF MNVSDMGLLP FVVAWKEMPT TKSFIDDYND DLNTATTARW
301 GNGLVRDENL VCVLANADKV TF
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 24( 7.5%); C : 2( 0.6%); D+ : 28( 8.7%); E : 15( 4.7%); F : 12( 3.7%)
G : 24( 7.5%); H : 4( 1.2%); I : 24( 7.5%); K : 9( 2.8%); L : 22( 6.8%)
M : 13( 4.0%); N+ : 26( 8.1%); P : 13( 4.0%); Q : 10( 3.1%); R : 15( 4.7%)
S : 20( 6.2%); T : 24( 7.5%); V : 27( 8.4%); W : 5( 1.6%); Y : 5( 1.6%)
KR : 24 ( 7.5%); ED : 43 ( 13.4%); AGP : 61 ( 18.9%);
KRED : 67 ( 20.8%); KR-ED - : -19 ( -5.9%); FIKMNY : 89 ( 27.6%);
LVIFM : 98 ( 30.4%); ST : 44 ( 13.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000000 000000-000 --0--000-+ 00-0000+00 -00-0-+000 00000000+0
61 +0-00-000- 00-00-0000 0+--000000 00++0+0-0+ 0000000000 0-00+000-+
121 000-000000 0000000-00 0000000+00 0000-000-0 0-00+00000 000+000000
181 0000-00000 00-0000000 00000+0-00 0-00000-00 00+00000-0 0000000-00
241 -000000-0+ 00000+0000 0000-00000 00000+-000 0+000--00- -0000000+0
301 00000+--00 0000000-+0 00
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 7/30 or 10/45 or 12/60): none
Negative charge clusters (cmin = 11/30 or 15/45 or 18/60): none
Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.075 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.792 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.134 ( ED )
Expected score/letter: -0.910; Average information/letter: 1.961
Minimal length of displayed segments set to: 20
M_0.01= 8.26 (cv= 5.05, lambda= 1.14369, k= 0.39717, x= 3.21;
90% confidence interval for segment length: 7 +- 6)
M_0.05= 6.84 (x= 1.79)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.134 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.792 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.075 ( KR )
Expected score/letter: -0.674; Average information/letter: 0.912
Minimal length of displayed segments set to: 20
M_0.01= 11.87 (cv= 7.56, lambda= 0.76336, k= 0.26890, x= 4.31;
90% confidence interval for segment length: 14 +- 13)
M_0.05= 9.74 (x= 2.17)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.208 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.792 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.584; Average information/letter: 1.126
Minimal length of displayed segments set to: 20
M_0.01= 7.13 (cv= 4.32, lambda= 1.33657, k= 0.43045, x= 2.81;
90% confidence interval for segment length: 12 +- 10)
M_0.05= 5.91 (x= 1.59)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.792 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.208 ( KEDR )
Expected score/letter: -0.873; Average information/letter: 0.152
Minimal length of displayed segments set to: 20
M_0.01= 43.53 (cv= 31.28, lambda= 0.18461, k= 0.09648, x= 12.25;
90% confidence interval for segment length: 76 +- 57)
M_0.05= 34.70 (x= 3.42)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 6 | 38 | 8 | 10 | 12 | 10 | 12 | 14 | 6 | 7 |
lmin1 5 | 7 | 8 | 46 | 10 | 12 | 15 | 12 | 15 | 17 | 7 | 9 |
lmin2 6 | 8 | 9 | 51 | 11 | 14 | 16 | 14 | 16 | 20 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 25: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.969; Average information/letter: 0.631
Minimal length of displayed segments set to: 15
M_0.01= 24.61 (cv= 15.61, lambda= 0.36984, k= 0.27960, x= 8.99;
90% confidence interval for segment length: 21 +- 13)
M_0.05= 20.20 (x= 4.59)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.534; Average information/letter: 0.485
Minimal length of displayed segments set to: 15
M_0.01= 60.14 (cv= 39.63, lambda= 0.14569, k= 0.19929, x= 20.50;
90% confidence interval for segment length: 26 +- 19)
M_0.05= 48.95 (x= 9.32); M_0.30= 35.64 (x= -4.00)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-256-C-54-C-10-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 20 (Expected range: 6-- 32)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 7 (6-10) 4 (11-20) 4 (>=21) 6
3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 7 (Expected range: 0-- 15)
1 +plets (f+: 7.5%), 6 -plets (f-: 13.4%)
Total number of charge altplets: 7 (Critical number: 16)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 2 (6-10) 1 (11-20) 2 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
17- 36 4 -.00 5 5 /0/./0/1/
18- 65 8 i00.*... 6 6 /0/2/2/./2/./././
109- 156 8 i.0.0000 6 6 /0/./1/./1/1/2/2/
183- 236 9 i...0.000 6 6 /0/./././2/./2/2/2/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
86- 121 (2.) Y( 35)Y 6 of 6 0.0051 large minimal spacing
288- 323 (4.) Y( 35)Y 5 of 6 0.0051 matching minimum
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:08 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4815.3752.seq)
SWISS-PROT ANNOTATION:
ID ORFL
DE ORFL, 171 bases, CC9E95E8 checksum.
number of residues: 171; molecular weight: 17.9 kdal
1 MSRESITAGS TAGGTRRAAT HYGRRVEEGA NISVYHVDGN VFRQETTFKF DQLPTATLDQ
61 LHQAIPAGSR VLSATLKTHV DFTATTAVSL NIGLQERDGT EVDNDGLFAA LALPSANDFD
121 DGAGALVGAG TGLTVDAVVV VVPNVDDLLT GEATLVVEYE KADDRQQAMN G
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 22(12.9%); C : 0( 0.0%); D : 15( 8.8%); E : 9( 5.3%); F : 6( 3.5%)
G : 17( 9.9%); H : 4( 2.3%); I : 4( 2.3%); K : 3( 1.8%); L : 15( 8.8%)
M : 2( 1.2%); N : 7( 4.1%); P : 4( 2.3%); Q : 7( 4.1%); R : 9( 5.3%)
S : 8( 4.7%); T : 18(10.5%); V : 18(10.5%); W : 0( 0.0%); Y : 3( 1.8%)
KR : 12 ( 7.0%); ED : 24 ( 14.0%); AGP : 43 ( 25.1%);
KRED : 36 ( 21.1%); KR-ED : -12 ( -7.0%); FIKMNY : 25 ( 14.6%);
LVIFM : 45 ( 26.3%); ST : 26 ( 15.2%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00+-000000 00000++000 000++0--00 0000000-00 00+0-000+0 -0000000-0
61 000000000+ 000000+000 -000000000 00000-+-00 -0-0-00000 0000000-0-
121 -000000000 00000-0000 00000--000 0-00000-0- +0--+00000 0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 7/30 or 10/45 or 12/60): none
Negative charge clusters (cmin = 11/30 or 15/45 or 19/60): none
Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.070 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.789 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.140 ( ED )
Expected score/letter: -0.930; Average information/letter: 2.078
Minimal length of displayed segments set to: 20
M_0.01= 7.49 (cv= 4.35, lambda= 1.18077, k= 0.40659, x= 3.13;
90% confidence interval for segment length: 6 +- 5)
M_0.05= 6.11 (x= 1.75)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.140 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.789 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.070 ( KR )
Expected score/letter: -0.649; Average information/letter: 0.832
Minimal length of displayed segments set to: 20
M_0.01= 11.49 (cv= 7.06, lambda= 0.72862, k= 0.25453, x= 4.44;
90% confidence interval for segment length: 15 +- 14)
M_0.05= 9.26 (x= 2.20)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.211 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.789 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.579; Average information/letter: 1.104
Minimal length of displayed segments set to: 20
M_0.01= 6.72 (cv= 3.89, lambda= 1.32176, k= 0.42456, x= 2.83;
90% confidence interval for segment length: 12 +- 9)
M_0.05= 5.49 (x= 1.60)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.789 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.211 ( KEDR )
Expected score/letter: -0.895; Average information/letter: 0.158
Minimal length of displayed segments set to: 20
M_0.01= 39.41 (cv= 27.24, lambda= 0.18877, k= 0.10004, x= 12.17;
90% confidence interval for segment length: 68 +- 53)
M_0.05= 30.78 (x= 3.54)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 6 | 35 | 8 | 10 | 11 | 9 | 11 | 13 | 6 | 8 |
lmin1 5 | 6 | 8 | 42 | 9 | 12 | 14 | 11 | 14 | 16 | 8 | 9 |
lmin2 6 | 7 | 9 | 47 | 10 | 13 | 15 | 12 | 16 | 18 | 9 | 11 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 23: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.942; Average information/letter: 0.617
Minimal length of displayed segments set to: 15
M_0.01= 22.53 (cv= 13.69, lambda= 0.37566, k= 0.27810, x= 8.84;
90% confidence interval for segment length: 20 +- 13)
M_0.05= 18.19 (x= 4.50)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.240; Average information/letter: 0.413
Minimal length of displayed segments set to: 15
M_0.01= 59.67 (cv= 38.24, lambda= 0.13445, k= 0.17917, x= 21.43;
90% confidence interval for segment length: 28 +- 21)
M_0.05= 47.54 (x= 9.30); M_0.30= 33.12 (x= -5.12)
1) From 122 to 142 (type 1): length= 21, score=42.00
(pocket at 136 to 136: length= 1, score=-17.00)
122 GAGALVGAGT GLTV |D| AVVVV V
A: 4(19.0%); G: 5(23.8%); V: 7(33.3%);
# of segments (>=15 residues) exceeding M_0.30: 1
2. SPACINGS OF C.
H2N-171-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 15 (Expected range: 2-- 23)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 8 (6-10) 3 (11-20) 3 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 14/30 or 18/45 or 22/60): none
4. Long amino acid multiplets (>= 5; Letter/Length/Position):
V/5/138
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 6 (Expected range: 0-- 10)
2 +plets (f+: 7.0%), 4 -plets (f-: 14.0%)
Total number of charge altplets: 4 (Critical number: 10)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 2 (11-20) 2 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
138- 142 1 V 5 5 ! 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:10 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4815.4454.seq)
SWISS-PROT ANNOTATION:
ID ORFM
DE ORFM, 327 bases, 5948A057 checksum.
number of residues: 327; molecular weight: 35.1 kdal
1 MTDHVNLLDS ELHEPKGMQV LTGGASDVGK VVVSKGDGTT ETRKLDITEI EGAESILLSG
61 AYFDDNMATA TVDVATELKE SYLIGAAPTQ VIQLQLPDPA VYAGAPITLK RLDANHGDGS
121 EVKFIPNASE TIEGASELAI TLQNTSIVVV SDGTDWHIRA DIYPVKFKHG WFNYNDLATQ
181 STPINHTGGV DTKLTNDALG PATLVPYPPE GVTKVWDATT NQFDFTELVN GDTVDIRLDL
241 EITTTSANQE ITVYILLGVG GTTINVPIIG RIVKTAGANR IVQFSSVFMG GDNTRLNPAE
301 LYISSPSNAT VKVNGWYNRI IGVNPTY
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 25( 7.6%); C : 0( 0.0%); D : 23( 7.0%); E : 17( 5.2%); F : 8( 2.4%)
G : 29( 8.9%); H : 6( 1.8%); I : 25( 7.6%); K : 13( 4.0%); L : 26( 8.0%)
M : 4( 1.2%); N : 20( 6.1%); P : 16( 4.9%); Q : 9( 2.8%); R- : 8( 2.4%)
S : 18( 5.5%); T++: 36(11.0%); V : 30( 9.2%); W : 4( 1.2%); Y : 10( 3.1%)
KR : 21 ( 6.4%); ED : 40 ( 12.2%); AGP : 70 ( 21.4%);
KRED : 61 ( 18.7%); KR-ED - : -19 ( -5.8%); FIKMNY : 80 ( 24.5%);
LVIFM : 93 ( 28.4%); ST + : 54 ( 16.5%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00-00000-0 -00-0+0000 000000-00+ 0000+0-000 -0++0-00-0 -00-000000
61 000--00000 00-000-0+- 0000000000 0000000-00 000000000+ +0-0000-00
121 -0+000000- 00-000-000 0000000000 0-00-000+0 -0000+0+00 00000-0000
181 0000000000 -0+000-000 000000000- 000+00-000 000-00-000 0-00-0+0-0
241 -00000000- 0000000000 0000000000 +00+00000+ 0000000000 0-00+0000-
301 0000000000 0+000000+0 0000000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 7/30 or 9/45 or 11/60): none
Negative charge clusters (cmin = 10/30 or 14/45 or 17/60): none
Mixed charge clusters (cmin = 14/30 or 18/45 or 23/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.064 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.813 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.122 ( ED )
Expected score/letter: -0.930; Average information/letter: 2.209
Minimal length of displayed segments set to: 20
M_0.01= 7.73 (cv= 4.71, lambda= 1.23008, k= 0.41471, x= 3.02;
90% confidence interval for segment length: 6 +- 5)
M_0.05= 6.41 (x= 1.70)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.122 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.813 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.064 ( KR )
Expected score/letter: -0.697; Average information/letter: 1.032
Minimal length of displayed segments set to: 20
M_0.01= 11.17 (cv= 7.07, lambda= 0.81848, k= 0.28798, x= 4.10;
90% confidence interval for segment length: 13 +- 12)
M_0.05= 9.18 (x= 2.11)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.187 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.813 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.627; Average information/letter: 1.332
Minimal length of displayed segments set to: 20
M_0.01= 6.56 (cv= 3.93, lambda= 1.47262, k= 0.48315, x= 2.63;
90% confidence interval for segment length: 10 +- 8)
M_0.05= 5.45 (x= 1.52)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.813 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.187 ( KEDR )
Expected score/letter: -0.679; Average information/letter: 0.103
Minimal length of displayed segments set to: 20
M_0.01= 52.09 (cv= 39.29, lambda= 0.14737, k= 0.06626, x= 12.80;
90% confidence interval for segment length: 108 +- 88)
M_0.05= 41.03 (x= 1.74)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 6 | 42 | 8 | 10 | 12 | 9 | 12 | 14 | 6 | 8 |
lmin1 5 | 6 | 8 | 51 | 10 | 12 | 14 | 12 | 14 | 17 | 8 | 10 |
lmin2 6 | 7 | 9 | 57 | 11 | 13 | 16 | 13 | 16 | 19 | 9 | 11 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 28: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.865; Average information/letter: 0.610
Minimal length of displayed segments set to: 15
M_0.01= 24.37 (cv= 15.53, lambda= 0.37293, k= 0.27205, x= 8.84;
90% confidence interval for segment length: 22 +- 14)
M_0.05= 20.00 (x= 4.47)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.092; Average information/letter: 0.407
Minimal length of displayed segments set to: 15
M_0.01= 64.25 (cv= 43.05, lambda= 0.13448, k= 0.17394, x= 21.20;
90% confidence interval for segment length: 31 +- 23)
M_0.05= 52.13 (x= 9.08); M_0.30= 37.71 (x= -5.34)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-327-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 31- 39] VVVSKGDGT
[ 148- 154] VVVS__DGT
______________________________
[ 50- 53] IEGA
[ 132- 135] IEGA
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 20 (Expected range: 7-- 34)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 4 (6-10) 6 (11-20) 7 (>=21) 4
3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 3 (Expected range: 0-- 13)
2 +plets (f+: 6.4%), 1 -plets (f-: 12.2%)
Total number of charge altplets: 1 (Critical number: 14)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 1 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
179- 210 8 T....... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:11 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.6375.9592.seq)
SWISS-PROT ANNOTATION:
ID ORFN
DE ORFN, 911 bases, 9272424C checksum.
number of residues: 911; molecular weight: 100.2 kdal
1 MAARKGAVNR FTPVRGWVTE GNLANYGQDV ALDVENMDIE KTGLTQRRFG LFAETSSEQF
61 LSTFTATARA RGLLAVKEWR EAWGDKDVNM LIFHAGYKVH VVQDTAPLRD ANILLTIDLL
121 EAGIKLDGVI DSPVHISVGV GFAIITNPRI EPVLIKLDDV DDEGVPTLSY EPLTLLIRTR
181 ELLTPYTTGT NYGDTLTPEE EWNLYNSGWA TITRATKDKS GSGTVYVNPV QYYFDKRGVY
241 PSHSVLYNSM KQESAKEIVA LNVFSPWADE KINFGTTTPP LGRYIHSAYY FDSAAILSLG
301 IGNLTPPTSD GTTEGSGPAE EEISNPIGLD NIGTVNNLKL IAEGTVRWTV KDRPRCSGYH
361 NGHVYFGDRD KNGKTRILVS QLVNSLDNIP KCFQDADPTA EEINDLIATD GFTMYPVGMG
421 APITMVEFNK RLLLLCTNGV WAIRGTSGGG ATATDFTLDK VASVEFNSPQ SVVDIGTAIV
481 FWSERGIIAI GVNDFGDLTS NNLTENTIDE YYDSLDRDII KNVKGTFIND ENRVYWVVPN
541 KQDSNGEYKT DGELVLVLNL DTGGFYKHTV SGGPLLHAPF RRLVNTRAEV SIPITETDGT
601 VITDTLGDPV TVTRTVTTTG VDGLAYFASF DDGVNGQFNF IAEHQPWGFA DWANVPNMTR
661 VNYSSYVDFA YEYPEVMIGN ISLPYIHSYY LTGIRVQTEQ YTTETAHLSF HRVQAHQTTA
721 LGTVTFHKVD MMVSTGMQVI SFHKDDLLRT EAVTLVNPDA ETGDATGWTV TAGTLDVRTA
781 APLYQGSYYF WSDSNANFAA YQDIDPVGGG YITAGELANN VIEAKLSWAA RGNTDLGTVY
841 IECLDAIGTV LASTDTTRFS GHDTWTRYGD AVVLPTDTDT IRVWIVGTLV ATNDVNYYVD
901 DIQLNLEVHN V
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 65( 7.1%); C : 4( 0.4%); D : 69( 7.6%); E : 46( 5.0%); F : 34( 3.7%)
G : 78( 8.6%); H : 18( 2.0%); I : 54( 5.9%); K : 28( 3.1%); L : 72( 7.9%)
M : 12( 1.3%); N : 54( 5.9%); P : 37( 4.1%); Q : 20( 2.2%); R : 38( 4.2%)
S : 47( 5.2%); T++: 97(10.6%); V : 81( 8.9%); W : 17( 1.9%); Y : 40( 4.4%)
KR : 66 ( 7.2%); ED : 115 ( 12.6%); AGP : 180 ( 19.8%);
KRED : 181 ( 19.9%); KR-ED - : -49 ( -5.4%); FIKMNY : 222 ( 24.4%);
LVIFM : 253 ( 27.8%); ST + : 144 ( 15.8%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 000++0000+ 0000+0000- 00000000-0 00-0-00-0- +00000++00 000-000-00
61 00000000+0 +00000+-0+ -000-+-000 0000000+00 000-0000+- 0000000-00
121 -000+0-000 -000000000 00000000+0 -0000+0--0 ---0000000 -000000+0+
181 -000000000 000-0000-- -000000000 000+00+-+0 0000000000 0000-++000
241 0000000000 +0-00+-000 00000000-- +000000000 00+0000000 0-00000000
301 000000000- 000-00000- --0000000- 00000000+0 00-000+000 +-+0+00000
361 0000000-+- +00+0+0000 000000-000 +000-0-000 --00-0000- 0000000000
421 000000-00+ +000000000 000+000000 0000-000-+ 0000-00000 000-000000
481 000-+00000 000-00-000 0000-000-- 00-00-+-00 +00+00000- -0+0000000
541 +0-000-0+0 -0-0000000 -00000+000 0000000000 ++0000+0-0 00000-0-00
601 000-000-00 000+000000 0-00000000 --00000000 00-0000000 -00000000+
661 0000000-00 0-00-00000 0000000000 0000+000-0 000-000000 0+00000000
721 0000000+0- 0000000000 000+--00+0 -0000000-0 -00-000000 00000-0+00
781 0000000000 00-0000000 00-0-00000 00000-0000 00-0+00000 +000-00000
841 0-00-00000 0000-00+00 00-000+00- 000000-0-0 0+00000000 000-00000-
901 -00000-000 0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 8/30 or 11/45 or 13/60): none
Negative charge clusters (cmin = 11/30 or 15/45 or 19/60): none
Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.072 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.801 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.126 ( ED )
Expected score/letter: -0.909; Average information/letter: 2.001
Minimal length of displayed segments set to: 20
M_0.01= 9.06 (cv= 5.88, lambda= 1.15932, k= 0.39980, x= 3.18;
90% confidence interval for segment length: 8 +- 6)
M_0.05= 7.65 (x= 1.77)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.126 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.801 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.072 ( KR )
Expected score/letter: -0.694; Average information/letter: 0.994
Minimal length of displayed segments set to: 20
M_0.01= 12.69 (cv= 8.52, lambda= 0.80008, k= 0.28253, x= 4.17;
90% confidence interval for segment length: 15 +- 13)
M_0.05= 10.65 (x= 2.13)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.199 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.801 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.603; Average information/letter: 1.212
Minimal length of displayed segments set to: 20
M_0.01= 7.62 (cv= 4.89, lambda= 1.39455, k= 0.45321, x= 2.73;
90% confidence interval for segment length: 13 +- 9)
M_0.05= 6.45 (x= 1.56)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.801 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.199 ( KEDR )
Expected score/letter: -0.788; Average information/letter: 0.130
Minimal length of displayed segments set to: 20
M_0.01= 52.97 (cv= 40.44, lambda= 0.16852, k= 0.08306, x= 12.53;
90% confidence interval for segment length: 99 +- 72)
M_0.05= 43.30 (x= 2.86)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 7 | 44 | 9 | 11 | 13 | 11 | 13 | 16 | 7 | 9 |
lmin1 6 | 7 | 9 | 53 | 11 | 13 | 16 | 13 | 16 | 19 | 8 | 11 |
lmin2 7 | 8 | 10 | 59 | 12 | 15 | 18 | 15 | 18 | 21 | 10 | 12 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 4: 0
* runs >= 5: 0
0 runs >= 29: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.888; Average information/letter: 0.604
Minimal length of displayed segments set to: 15
M_0.01= 27.32 (cv= 18.41, lambda= 0.37014, k= 0.27181, x= 8.91;
90% confidence interval for segment length: 24 +- 15)
M_0.05= 22.92 (x= 4.51)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.239; Average information/letter: 0.436
Minimal length of displayed segments set to: 15
M_0.01= 69.13 (cv= 48.50, lambda= 0.14050, k= 0.18236, x= 20.63;
90% confidence interval for segment length: 32 +- 22)
M_0.05= 57.53 (x= 9.03); M_0.30= 43.73 (x= -4.77)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-355-C-35-C-43-C-406-C-68-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 5
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 9
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 55 (Expected range: 33-- 77)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 11 (6-10) 12 (11-20) 18 (>=21) 15
3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 16 (Expected range: 4-- 30)
5 +plets (f+: 7.2%), 11 -plets (f-: 12.6%)
Total number of charge altplets: 16 (Critical number: 31)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 2 (6-10) 0 (11-20) 2 (>=21) 13
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
108- 131 6 L..... 4 4 0
226- 253 7 Y...... 4 4 0
432- 435 1 L 4 4 0
554- 561 2 L. 4 4 0
611- 620 2 T. 5 5 ! 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
134- 145 2 i0 6 6 /0/1/
140- 169 5 i.... 6 6 0
555- 584 5 i..00 6 6 /0/././2/1/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:13 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4819.2365.seq)
SWISS-PROT ANNOTATION:
ID ORFO
DE ORFO, 359 bases, EC7412AA checksum.
number of residues: 359; molecular weight: 38.7 kdal
1 MAILDFNTGI QMGLVGDTAA VDGKGTTDAS KIGHRVLEGF LIDRVNHRDE VAHLTWTSRN
61 EVWVSWYMWQ EEYDLNAGRD GRAVIAFRGI DASGSEVYWG AIDGQAGASD RTASFYLYGA
121 GESPSDTSSG TSIIQNIDLM PYDGVRSRVD VHVKLDNVNG EVDVYINQNL VGSFVGDTIL
181 NPDLTTVSNC HFGMLGEYIS GLVSKNNDCM FSAAFAADES TVPITMVQSE VNANGTLQQM
241 TGDYTDVSLL GDWDDATKVT ADAAGQTSTF GKSALPAQYQ TGYELIAVGV NSRTAVALDY
301 TIASLAHVLD NGTTVKEGTE IPLDDLLQYR KTIFHTDADD NPWTATAFDA TQIGLKAKL
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 35( 9.7%); C : 2( 0.6%); D++: 35( 9.7%); E : 14( 3.9%); F : 11( 3.1%)
G : 34( 9.5%); H : 7( 1.9%); I : 19( 5.3%); K : 10( 2.8%); L : 27( 7.5%)
M : 8( 2.2%); N : 18( 5.0%); P- : 7( 1.9%); Q : 13( 3.6%); R : 12( 3.3%)
S : 25( 7.0%); T+ : 32( 8.9%); V : 30( 8.4%); W : 7( 1.9%); Y : 13( 3.6%)
KR : 22 ( 6.1%); ED : 49 ( 13.6%); AGP : 76 ( 21.2%);
KRED : 71 ( 19.8%); KR-ED --: -27 ( -7.5%); FIKMNY : 79 ( 22.0%);
LVIFM : 95 ( 26.5%); ST + : 57 ( 15.9%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000-00000 000000-000 0-0+000-00 +000+00-00 00-+000+-- 00000000+0
61 -000000000 --0-0000+- 0+00000+00 -0000-0000 00-000000- +000000000
121 0-000-0000 0000000-00 00-00+0+0- 000+0-0000 -0-0000000 000000-000
181 00-0000000 000000-000 0000+00-00 0000000--0 000000000- 0000000000
241 00-00-0000 0-0--00+00 0-00000000 0+00000000 000-000000 00+00000-0
301 000000000- 00000+-00- 000--0000+ +00000-0-- 00000000-0 00000+0+0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 7/30 or 9/45 or 11/60): none
Negative charge clusters (cmin = 11/30 or 15/45 or 18/60): none
Mixed charge clusters (cmin = 14/30 or 19/45 or 24/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.061 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.802 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.136 ( ED )
Expected score/letter: -0.953; Average information/letter: 2.314
Minimal length of displayed segments set to: 20
M_0.01= 7.64 (cv= 4.67, lambda= 1.25986, k= 0.42251, x= 2.97;
90% confidence interval for segment length: 6 +- 5)
M_0.05= 6.34 (x= 1.67)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.136 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.802 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.061 ( KR )
Expected score/letter: -0.652; Average information/letter: 0.861
Minimal length of displayed segments set to: 20
M_0.01= 12.27 (cv= 7.90, lambda= 0.74465, k= 0.25940, x= 4.37;
90% confidence interval for segment length: 15 +- 14)
M_0.05= 10.08 (x= 2.18)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.198 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.802 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.604; Average information/letter: 1.221
Minimal length of displayed segments set to: 20
M_0.01= 6.93 (cv= 4.20, lambda= 1.40028, k= 0.45544, x= 2.72;
90% confidence interval for segment length: 11 +- 9)
M_0.05= 5.76 (x= 1.56)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.802 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.198 ( KEDR )
Expected score/letter: -0.780; Average information/letter: 0.128
Minimal length of displayed segments set to: 20
M_0.01= 47.80 (cv= 35.24, lambda= 0.16694, k= 0.08177, x= 12.56;
90% confidence interval for segment length: 90 +- 70)
M_0.05= 38.04 (x= 2.79)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 6 | 40 | 8 | 10 | 12 | 9 | 12 | 14 | 6 | 8 |
lmin1 5 | 7 | 8 | 49 | 10 | 12 | 14 | 12 | 15 | 17 | 8 | 10 |
lmin2 6 | 8 | 9 | 54 | 11 | 14 | 16 | 13 | 17 | 20 | 9 | 11 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 27: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.858; Average information/letter: 0.584
Minimal length of displayed segments set to: 15
M_0.01= 25.15 (cv= 16.14, lambda= 0.36442, k= 0.26713, x= 9.00;
90% confidence interval for segment length: 23 +- 15)
M_0.05= 20.67 (x= 4.53)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.175; Average information/letter: 0.419
Minimal length of displayed segments set to: 15
M_0.01= 62.64 (cv= 42.07, lambda= 0.13985, k= 0.17844, x= 20.57;
90% confidence interval for segment length: 30 +- 22)
M_0.05= 50.98 (x= 8.91); M_0.30= 37.12 (x= -4.95)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-189-C-18-C-150-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 15- 18] VGDT
[ 175- 178] VGDT
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 17 (Expected range: 8-- 36)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 4 (6-10) 5 (11-20) 4 (>=21) 5
3. Clusters of amino acid multiplets (cmin = 9/30 or 12/45 or 14/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 7 (Expected range: 0-- 15)
1 +plets (f+: 6.1%), 6 -plets (f-: 13.6%)
Total number of charge altplets: 5 (Critical number: 15)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 1 (11-20) 1 (>=21) 5
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
171- 218 8 i000..0. 6 6 /0/1/2/1/././1/./
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
23- 25 (1.) G( 2)G 35 of 35 0.0006 large minimal spacing
62- 64 (1.) V( 2)V 31 of 31 0.0034 large minimal spacing
119- 121 (2.) G( 2)G 34 of 35 0.0006 matching minimum
120- 213 (2.) A( 93)A 1 of 36 0.0005 large maximal spacing
149- 151 (2.) V( 2)V 27 of 31 0.0034 matching minimum
151- 153 (2.) V( 2)V 28 of 31 0.0034 matching minimum
162- 164 (2.) V( 2)V 29 of 31 0.0034 matching minimum
288- 290 (4.) V( 2)V 30 of 31 0.0034 matching minimum
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:14 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4814.3475.seq)
SWISS-PROT ANNOTATION:
ID ORFP
DE ORFP, 98 bases, 5019B0EA checksum.
number of residues: 98; molecular weight: 11.0 kdal
1 MIPSDNHSSL MTASFDWSID SRTRKVTTQQ EAYKYDSRRL AQGGAESAYP YEVVDTKLRI
61 RGQGRAVRIR FQSNGGKDLR LLGYSILGVD FGDSEGNS
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 6( 6.1%); C : 0( 0.0%); D : 8( 8.2%); E : 4( 4.1%); F : 3( 3.1%)
G : 10(10.2%); H : 1( 1.0%); I : 5( 5.1%); K : 4( 4.1%); L : 7( 7.1%)
M : 2( 2.0%); N : 3( 3.1%); P : 2( 2.0%); Q : 5( 5.1%); R : 10(10.2%)
S : 12(12.2%); T : 5( 5.1%); V : 5( 5.1%); W : 1( 1.0%); Y : 5( 5.1%)
KR : 14 ( 14.3%); ED : 12 ( 12.2%); AGP : 18 ( 18.4%);
KRED : 26 ( 26.5%); KR-ED : 2 ( 2.0%); FIKMNY : 22 ( 22.4%);
LVIFM : 22 ( 22.4%); ST : 17 ( 17.3%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000-00000 00000-000- 0+0++00000 -00+0-0++0 00000-0000 0-00-0+0+0
61 +000+00+0+ 000000+-0+ 000000000- 00-0-000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 11/30 or 15/45 or 19/60): none
Negative charge clusters (cmin = 10/30 or 14/45 or 17/60): none
Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.143 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.735 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.122 ( ED )
Expected score/letter: -0.694; Average information/letter: 0.875
Minimal length of displayed segments set to: 20
M_0.01= 10.72 (cv= 6.26, lambda= 0.73288, k= 0.26481, x= 4.46;
90% confidence interval for segment length: 13 +- 13)
M_0.05= 8.50 (x= 2.24)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.122 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.735 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.143 ( KR )
Expected score/letter: -0.776; Average information/letter: 1.139
Minimal length of displayed segments set to: 20
M_0.01= 9.54 (cv= 5.46, lambda= 0.83914, k= 0.30840, x= 4.08;
90% confidence interval for segment length: 10 +- 10)
M_0.05= 7.60 (x= 2.14)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.265 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.735 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.469; Average information/letter: 0.690
Minimal length of displayed segments set to: 20
M_0.01= 7.84 (cv= 4.50, lambda= 1.01857, k= 0.29989, x= 3.33;
90% confidence interval for segment length: 17 +- 15)
M_0.05= 6.24 (x= 1.73)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.735 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.265 ( KEDR )
Expected score/letter: -1.388; Average information/letter: 0.300
Minimal length of displayed segments set to: 20
M_0.01= 26.76 (cv= 16.40, lambda= 0.27954, k= 0.18165, x= 10.35;
90% confidence interval for segment length: 36 +- 24)
M_0.05= 20.93 (x= 4.52)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 4 | 7 | 25 | 9 | 8 | 12 | 10 | 9 | 13 | 5 | 6 |
lmin1 6 | 6 | 8 | 31 | 11 | 10 | 14 | 12 | 12 | 16 | 6 | 8 |
lmin2 7 | 7 | 10 | 34 | 12 | 11 | 16 | 14 | 13 | 18 | 7 | 9 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 17: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.531; Average information/letter: 0.899
Minimal length of displayed segments set to: 15
M_0.01= 17.14 (cv= 9.61, lambda= 0.47701, k= 0.36371, x= 7.52;
90% confidence interval for segment length: 13 +- 8)
M_0.05= 13.72 (x= 4.11)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.408; Average information/letter: 0.696
Minimal length of displayed segments set to: 15
M_0.01= 40.87 (cv= 24.04, lambda= 0.19072, k= 0.24920, x= 16.83;
90% confidence interval for segment length: 16 +- 11)
M_0.05= 32.33 (x= 8.29); M_0.30= 22.16 (x= -1.88)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-98-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 8 (Expected range: 0-- 14)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 3 (6-10) 3 (11-20) 2 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 13/30 or 17/45 or 20/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 2 (Expected range: 0-- 9)
2 +plets (f+: 14.3%), 0 -plets (f-: 12.2%)
Total number of charge altplets: 1 (Critical number: 10)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 1 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:16 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.2920.seq)
SWISS-PROT ANNOTATION:
ID ORFQ
DE ORFQ, 250 bases, A5C6E0EF checksum.
number of residues: 250; molecular weight: 28.9 kdal
1 MPKRTLLQIV KKMAQKTGSD EVTSLSEDSI EIQDMVDCAL EVLEDIIYRN DWEFLKDRPA
61 QLEAGTNAIE LSIPDNVRKI QTLRYRYEDA GVQNCFRTLR YMYPHEFMER LQNNKPTDPD
121 TTTVTINGVE LYPKTNRHPR YWTSFDEQNV VLDSYDATQN PTGVDATDSA IIATLYLDFT
181 GSDADSWVAP IPESLFTLWE QEAVAEAFVQ FRQTENPRAE RRSRRTYVQQ IKKEPVTHKD
241 EGSDEVNYGR
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 16( 6.4%); C : 2( 0.8%); D+ : 21( 8.4%); E : 22( 8.8%); F : 8( 3.2%)
G- : 8( 3.2%); H : 3( 1.2%); I : 13( 5.2%); K : 11( 4.4%); L : 17( 6.8%)
M : 5( 2.0%); N : 12( 4.8%); P : 13( 5.2%); Q : 14( 5.6%); R : 18( 7.2%)
S : 13( 5.2%); T+ : 22( 8.8%); V : 17( 6.8%); W : 4( 1.6%); Y : 11( 4.4%)
KR : 29 ( 11.6%); ED + : 43 ( 17.2%); AGP : 37 ( 14.8%);
KRED + : 72 ( 28.8%); KR-ED - : -14 ( -5.6%); FIKMNY : 60 ( 24.0%);
LVIFM : 60 ( 24.0%); ST : 35 ( 14.0%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00++000000 ++000+000- -00000--00 -00-00-000 -00--000+0 -0-00+-+00
61 00-000000- 0000-00++0 000+0+0--0 000000+00+ 00000-00-+ 0000+00-0-
121 000000000- 000+00+00+ 00000--000 00-00-0000 0000-00-00 0000000-00
181 00-0-00000 00-000000- 0-000-0000 0+00-00+0- ++0++00000 0++-0000+-
241 -00--0000+
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 10/30 or 13/45 or 16/60): none
Negative charge clusters (cmin = 13/30 or 17/45 or 22/60): none
Mixed charge clusters (cmin = 18/30 or 25/45 or 31/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.116 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.712 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.172 ( ED )
Expected score/letter: -0.824; Average information/letter: 1.266
Minimal length of displayed segments set to: 20
M_0.01= 10.22 (cv= 6.27, lambda= 0.88084, k= 0.32762, x= 3.96;
90% confidence interval for segment length: 10 +- 9)
M_0.05= 8.37 (x= 2.11)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.172 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.712 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.116 ( KR )
Expected score/letter: -0.600; Average information/letter: 0.606
Minimal length of displayed segments set to: 20
M_0.01= 14.16 (cv= 9.13, lambda= 0.60472, k= 0.21014, x= 5.03;
90% confidence interval for segment length: 20 +- 20)
M_0.05= 11.46 (x= 2.33)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.288 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.712 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.424; Average information/letter: 0.554
Minimal length of displayed segments set to: 20
M_0.01= 9.66 (cv= 6.10, lambda= 0.90512, k= 0.25249, x= 3.56;
90% confidence interval for segment length: 23 +- 20)
M_0.05= 7.86 (x= 1.76)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.712 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.288 ( KEDR )
Expected score/letter: -1.592; Average information/letter: 0.363
Minimal length of displayed segments set to: 20
M_0.01= 27.13 (cv= 17.44, lambda= 0.31652, k= 0.21542, x= 9.68;
90% confidence interval for segment length: 34 +- 19)
M_0.05= 21.98 (x= 4.53)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 6 | 8 | 26 | 9 | 10 | 13 | 10 | 12 | 15 | 6 | 7 |
lmin1 6 | 7 | 10 | 32 | 11 | 12 | 16 | 12 | 14 | 18 | 7 | 9 |
lmin2 7 | 8 | 11 | 35 | 12 | 14 | 18 | 14 | 16 | 20 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
(-00) 18(0,2,0); at 28- 47: DSIEIQDMVDCALEVLEDII
(1. quartile) -00-00-00-000-00--00
Run count statistics:
+ runs >= 3: 0
- runs >= 4: 0
* runs >= 5: 0
0 runs >= 17: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.764; Average information/letter: 1.030
Minimal length of displayed segments set to: 15
M_0.01= 17.96 (cv= 10.80, lambda= 0.51118, k= 0.39034, x= 7.16;
90% confidence interval for segment length: 13 +- 7)
M_0.05= 14.77 (x= 3.97)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -5.244; Average information/letter: 0.905
Minimal length of displayed segments set to: 15
M_0.01= 41.68 (cv= 25.91, lambda= 0.21312, k= 0.28987, x= 15.77;
90% confidence interval for segment length: 14 +- 9)
M_0.05= 34.03 (x= 8.13); M_0.30= 24.93 (x= -0.97)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-37-C-56-C-155-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 17- 20] TGSD
[ 180- 183] TGSD
______________________________
[ 18- 22] GSDEV
[ 242- 246] GSDEV
______________________________
[ 82- 85] TLRY
[ 98- 101] TLRY
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 11 (Expected range: 3-- 26)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 4 (6-10) 2 (11-20) 2 (>=21) 4
3. Clusters of amino acid multiplets (cmin = 9/30 or 11/45 or 14/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 13 (Expected range: 0-- 18)
6 +plets (f+: 11.6%), 7 -plets (f-: 17.2%)
Total number of charge altplets: 5 (Critical number: 21)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 3 (6-10) 7 (11-20) 1 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
37- 60 4 *... 6 6 0
70- 123 9 *.0...000 6 6 /0/./2/./././0/2/2/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
16- 49 (1.) +( 33)+ 2 of 30 0.0305 large 2. maximal spacing
140- 212 (3.) +( 72)+ 1 of 30 0.0008 large 1. maximal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:17 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.2250.424.seq)
SWISS-PROT ANNOTATION:
ID ORFR
DE ORFR, 90 bases, 60E5FB1F checksum.
number of residues: 90; molecular weight: 10.3 kdal
1 MDDRRFNRQK KARETKIGTD STGAEYWLVP FSEAKVGHRK FVIRNSKKQV PVQLQGMFTD
61 AATAERVYKA YMDQSKAKKK APAKAEADAE
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 13(14.4%); C : 0( 0.0%); D : 6( 6.7%); E : 6( 6.7%); F : 4( 4.4%)
G : 4( 4.4%); H : 1( 1.1%); I : 2( 2.2%); K : 13(14.4%); L : 2( 2.2%)
M : 3( 3.3%); N : 2( 2.2%); P : 3( 3.3%); Q : 5( 5.6%); R : 7( 7.8%)
S : 4( 4.4%); T : 5( 5.6%); V : 6( 6.7%); W : 1( 1.1%); Y : 3( 3.3%)
KR : 20 ( 22.2%); ED : 12 ( 13.3%); AGP : 20 ( 22.2%);
KRED : 32 ( 35.6%); KR-ED : 8 ( 8.9%); FIKMNY : 27 ( 30.0%);
LVIFM : 17 ( 18.9%); ST : 9 ( 10.0%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0--++00+0+ +0+-0+000- 0000-00000 00-0+000++ 000+00++00 000000000-
61 0000-+00+0 00-00+0+++ 000+0-0-0-
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 15/30 or 21/45 or 26/60): none
Negative charge clusters (cmin = 11/30 or 15/45 or 18/60): none
Mixed charge clusters (cmin = 21/30 or 28/45 or 36/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.222 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.644 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.133 ( ED )
Expected score/letter: -0.467; Average information/letter: 0.318
Minimal length of displayed segments set to: 20
M_0.01= 16.54 (cv= 10.50, lambda= 0.42848, k= 0.13343, x= 6.04;
90% confidence interval for segment length: 32 +- 34)
M_0.05= 12.73 (x= 2.23)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.133 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.644 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.222 ( KR )
Expected score/letter: -0.822; Average information/letter: 1.114
Minimal length of displayed segments set to: 20
M_0.01= 9.84 (cv= 5.59, lambda= 0.80543, k= 0.30877, x= 4.25;
90% confidence interval for segment length: 10 +- 10)
M_0.05= 7.82 (x= 2.23)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.356 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.644 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.289; Average information/letter: 0.248
Minimal length of displayed segments set to: 20
M_0.01= 11.86 (cv= 7.57, lambda= 0.59471, k= 0.12950, x= 4.30;
90% confidence interval for segment length: 41 +- 42)
M_0.05= 9.12 (x= 1.56)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.644 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.356 ( KEDR )
Expected score/letter: -2.200; Average information/letter: 0.553
Minimal length of displayed segments set to: 20
M_0.01= 18.55 (cv= 10.52, lambda= 0.42768, k= 0.31161, x= 8.03;
90% confidence interval for segment length: 21 +- 10)
M_0.05= 14.74 (x= 4.22)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 6 | 4 | 8 | 18 | 10 | 8 | 13 | 10 | 9 | 13 | 5 | 6 |
lmin1 7 | 6 | 10 | 22 | 12 | 10 | 15 | 13 | 11 | 16 | 6 | 8 |
lmin2 9 | 7 | 12 | 25 | 13 | 11 | 17 | 14 | 12 | 18 | 7 | 9 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 4: 0
- runs >= 3: 0
* runs >= 6: 0
0 runs >= 12: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -3.033; Average information/letter: 1.072
Minimal length of displayed segments set to: 15
M_0.01= 15.15 (cv= 8.30, lambda= 0.54224, k= 0.41168, x= 6.85;
90% confidence interval for segment length: 11 +- 6)
M_0.05= 12.14 (x= 3.84)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -5.911; Average information/letter: 1.017
Minimal length of displayed segments set to: 15
M_0.01= 32.94 (cv= 18.64, lambda= 0.24136, k= 0.31700, x= 14.30;
90% confidence interval for segment length: 11 +- 7)
M_0.05= 26.19 (x= 7.55); M_0.30= 18.15 (x= -0.49)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-90-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 6 (Expected range: 0-- 14)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 3 (6-10) 0 (11-20) 3 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 6 (Expected range: 0-- 12)
5 +plets (f+: 22.2%), 1 -plets (f-: 13.3%)
Total number of charge altplets: 3 (Critical number: 13)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 3 (6-10) 1 (11-20) 1 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
81- 90 2 A. 5 5 ! 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:19 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4816.2920.seq)
SWISS-PROT ANNOTATION:
ID ORFS
DE ORFS, 160 bases, 9B2946CF checksum.
number of residues: 160; molecular weight: 18.0 kdal
1 MTMQIMRATV DWFIDNGGPS LAMAHLEEAM PVTAEKNEFR EVDYAMYAEL EMSGSLLVLV
61 AIDDENNNEF VGYVVGTASP SIHNRGYFEF STTAFYTVPH VREQGVARQL FEALQAVCRQ
121 SGVTEINYSV SEGQPMTHEV VKKLGLIKSE TMYSMKVKHE
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 13( 8.1%); C : 1( 0.6%); D : 5( 3.1%); E : 17(10.6%); F : 7( 4.4%)
G : 10( 6.2%); H : 5( 3.1%); I : 6( 3.8%); K : 6( 3.8%); L : 10( 6.2%)
M : 10( 6.2%); N : 7( 4.4%); P : 5( 3.1%); Q : 6( 3.8%); R : 6( 3.8%)
S : 11( 6.9%); T : 10( 6.2%); V : 17(10.6%); W : 1( 0.6%); Y : 7( 4.4%)
KR : 12 ( 7.5%); ED : 22 ( 13.8%); AGP : 28 ( 17.5%);
KRED : 34 ( 21.2%); KR-ED : -10 ( -6.2%); FIKMNY : 43 ( 26.9%);
LVIFM : 50 ( 31.2%); ST : 21 ( 13.1%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 000000+000 -000-00000 000000--00 0000-+0-0+ -0-00000-0 -000000000
61 00---000-0 0000000000 0000+000-0 0000000000 0+-0000+00 0-000000+0
121 0000-00000 0-000000-0 0++0000+0- 00000+0+0-
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 8/30 or 10/45 or 12/60): none
Negative charge clusters (cmin = 11/30 or 15/45 or 18/60): none
Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.075 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.788 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.138 ( ED )
Expected score/letter: -0.913; Average information/letter: 1.956
Minimal length of displayed segments set to: 20
M_0.01= 7.67 (cv= 4.45, lambda= 1.14070, k= 0.39702, x= 3.22;
90% confidence interval for segment length: 6 +- 6)
M_0.05= 6.24 (x= 1.79)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.138 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.788 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.075 ( KR )
Expected score/letter: -0.663; Average information/letter: 0.869
Minimal length of displayed segments set to: 20
M_0.01= 11.20 (cv= 6.82, lambda= 0.74394, k= 0.26140, x= 4.38;
90% confidence interval for segment length: 14 +- 13)
M_0.05= 9.01 (x= 2.19)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.213 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.788 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.575; Average information/letter: 1.087
Minimal length of displayed segments set to: 20
M_0.01= 6.72 (cv= 3.87, lambda= 1.30992, k= 0.41984, x= 2.85;
90% confidence interval for segment length: 12 +- 10)
M_0.05= 5.48 (x= 1.60)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.788 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.213 ( KEDR )
Expected score/letter: -0.912; Average information/letter: 0.163
Minimal length of displayed segments set to: 20
M_0.01= 38.52 (cv= 26.42, lambda= 0.19212, k= 0.10291, x= 12.11;
90% confidence interval for segment length: 65 +- 51)
M_0.05= 30.04 (x= 3.62)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 6 | 34 | 8 | 9 | 11 | 9 | 11 | 13 | 6 | 8 |
lmin1 5 | 6 | 8 | 41 | 9 | 11 | 14 | 11 | 14 | 16 | 8 | 10 |
lmin2 6 | 7 | 9 | 46 | 11 | 13 | 15 | 13 | 15 | 18 | 9 | 11 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 1, at 63;
* runs >= 4: 0
0 runs >= 23: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.850; Average information/letter: 0.557
Minimal length of displayed segments set to: 15
M_0.01= 24.32 (cv= 14.85, lambda= 0.34175, k= 0.25571, x= 9.47;
90% confidence interval for segment length: 22 +- 15)
M_0.05= 19.55 (x= 4.70)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.444; Average information/letter: 0.466
Minimal length of displayed segments set to: 15
M_0.01= 55.35 (cv= 34.97, lambda= 0.14514, k= 0.19363, x= 20.38;
90% confidence interval for segment length: 25 +- 19)
M_0.05= 44.12 (x= 9.15); M_0.30= 30.76 (x= -4.21)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-117-C-42-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 9 (Expected range: 0-- 19)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 2 (6-10) 3 (11-20) 3 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 3 (Expected range: 0-- 9)
1 +plets (f+: 7.5%), 2 -plets (f-: 13.8%)
Total number of charge altplets: 3 (Critical number: 10)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 1 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
26- 121 8 i..0.... 10 6 /2/././4/././././
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
6- 23 (1.) M( 17)M 2 of 11 0.9902 small 2. maximal spacing
36- 142 (3.) K( 106)K 1 of 7 0.0085 large maximal spacing
52- 136 (3.) M( 84)M 1 of 11 0.0046 large 1. maximal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:20 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4811.1533.seq)
SWISS-PROT ANNOTATION:
ID ORFT
DE ORFT, 203 bases, 17DE115E checksum.
number of residues: 203; molecular weight: 20.7 kdal
1 MSKAVRFIGG AVQVIAGIYT ANPALIASGV GSIAGGIAEQ KREDAAKQAV QEQRKAVAIN
61 NAQQAIQRQR QIRQTIAEAR VRRALIQSRG FEGGPAGAGD SIYGDAASAI GAANTQQAAA
121 FGISAARNRA AMFGLEARSS NSFDAFAGAA NLFGQGYTAY KGGSFEGLFG SGTADTGRTS
181 DFGRSGNNSA NFLEFGGAVS GGR
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A++: 41(20.2%); C : 0( 0.0%); D : 6( 3.0%); E : 8( 3.9%); F : 12( 5.9%)
G++: 31(15.3%); H--: 0( 0.0%); I : 14( 6.9%); K : 5( 2.5%); L--: 6( 3.0%)
M- : 2( 1.0%); N : 10( 4.9%); P--: 2( 1.0%); Q+ : 15( 7.4%); R : 16( 7.9%)
S : 16( 7.9%); T : 7( 3.4%); V- : 8( 3.9%); W : 0( 0.0%); Y : 4( 2.0%)
KR : 21 ( 10.3%); ED : 14 ( 6.9%); AGP ++: 74 ( 36.5%);
KRED : 35 ( 17.2%); KR-ED : 7 ( 3.4%); FIKMNY : 47 ( 23.2%);
LVIFM --: 42 ( 20.7%); ST : 23 ( 11.3%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00+00+0000 0000000000 0000000000 00000000-0 ++--00+000 0-0++00000
61 0000000+0+ 00+0000-0+ 0++00000+0 0-0000000- 0000-00000 0000000000
121 000000+0+0 00000-0+00 000-000000 0000000000 +0000-0000 0000-00+00
181 -00+000000 000-000000 00+
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 12/45 or 15/60): none
Negative charge clusters (cmin = 7/30 or 9/45 or 11/60): none
Mixed charge clusters (cmin = 13/30 or 17/45 or 22/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.103 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.828 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.069 ( ED )
Expected score/letter: -0.759; Average information/letter: 1.306
Minimal length of displayed segments set to: 20
M_0.01= 9.48 (cv= 5.72, lambda= 0.92817, k= 0.32682, x= 3.75;
90% confidence interval for segment length: 10 +- 9)
M_0.05= 7.72 (x= 2.00)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.069 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.828 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.103 ( KR )
Expected score/letter: -0.897; Average information/letter: 2.054
Minimal length of displayed segments set to: 20
M_0.01= 7.60 (cv= 4.49, lambda= 1.18462, k= 0.40240, x= 3.11;
90% confidence interval for segment length: 6 +- 5)
M_0.05= 6.22 (x= 1.74)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.172 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.828 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.655; Average information/letter: 1.483
Minimal length of displayed segments set to: 20
M_0.01= 5.90 (cv= 3.39, lambda= 1.56862, k= 0.51868, x= 2.51;
90% confidence interval for segment length: 9 +- 7)
M_0.05= 4.86 (x= 1.48)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.828 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.172 ( KEDR )
Expected score/letter: -0.552
Average information/letter: 0.073 < .10; too small !
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 4 | 6 | 43 | 9 | 8 | 11 | 11 | 9 | 13 | NA | NA |
lmin1 6 | 5 | 7 | 52 | 11 | 9 | 13 | 13 | 12 | 16 | NA | NA |
lmin2 7 | 6 | 8 | 58 | 12 | 11 | 15 | 15 | 13 | 18 | NA | NA |
(Significance level: 0.010000; Minimal displayed length: 6)
(*00) 14(0,0,0); at 173- 186: TADTGRTSDFGRSG
(4. quartile) 00-00+00-00+00
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 1, at 41;
0 runs >= 29: 1, at 7;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.557; Average information/letter: 0.452
Minimal length of displayed segments set to: 15
M_0.01= 25.64 (cv= 16.18, lambda= 0.32831, k= 0.22429, x= 9.46;
90% confidence interval for segment length: 27 +- 19)
M_0.05= 20.68 (x= 4.49)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -2.650; Average information/letter: 0.317
Minimal length of displayed segments set to: 15
M_0.01= 65.23 (cv= 43.27, lambda= 0.12280, k= 0.14912, x= 21.96;
90% confidence interval for segment length: 36 +- 28)
M_0.05= 51.96 (x= 8.69); M_0.30= 36.17 (x= -7.10)
1) From 7 to 38 (type 1): length= 32, score=50.00
(pocket at 19 to 23: length= 5, score=-17.00)
7 FIGGAVQVIA GI |YTANP| ALI ASGVGSIAGG IA
A: 7(21.9%); G: 7(21.9%); I: 6(18.8%);
# of segments (>=15 residues) exceeding M_0.30: 1
2. SPACINGS OF C.
H2N-203-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 9- 12] GGAV
[ 196- 199] GGAV
______________________________
[ 111- 114] GAAN
[ 148- 151] GAAN
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 19 (Expected range: 5-- 30)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 8 (6-10) 6 (11-20) 4 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 14/30 or 18/45 or 22/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 4 (Expected range: 0-- 8)
3 +plets (f+: 10.3%), 1 -plets (f-: 6.9%)
Total number of charge altplets: 1 (Critical number: 10)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 1 (11-20) 0 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
90- 117 7 G...... 4 4 0
107- 142 6 A..... 6 6 ! 0
162- 181 5 G.... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 7)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
36- 90 (2.) G( 54)G 1 of 32 0.0009 large maximal spacing
57- 81 (2.) V( 24)V 2 of 9 0.9841 small 2. maximal spacing
81- 199 (3.) V( 118)V 1 of 9 0.0070 large 1. maximal spacing
123- 204 (4.) I( 81)I 1 of 15 0.0088 large maximal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:22 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4813.6672.seq)
SWISS-PROT ANNOTATION:
ID ORFU
DE ORFU, 1536 bases, 7CD7314B checksum.
number of residues: 1536; molecular weight: 169.7 kdal
1 MAGELNLDTY QKLLAEDNAP EPSLESYMQM LNAAPPRGPV TDARYTGQAA MIAGNPYEIE
61 SLAMAAASGD TKLYRSQQAM SWYDDRVHDA HTQLQQTQFG SAEAMLGAAQ LQSERIRMLQ
121 DAKSHPLADE VAFATNAAYK GDPEEITRAA VDVALANEIG ELAESTGIGE KIAEFGSFLI
181 PFKYAADIAD INEQISANPQ LAELAGEDLE TVVGAWKALP PERQLALVKP LKEAIMKATA
241 SFGWTDGNAT KTAGLLAQFF EVDPAAQINS EFAEDIVFGA IDVTPIGAIR GLKPVEKLRH
301 AKIIRKVAED AVADTSAAKV AARAGDKESA AKHTVASLVD DNTAAALGTT RDDAAASALP
361 LETSEWFTRV LDDDELPAAV ADEMNNMFAI ANGFARTLKE ESGLMQVGLL NRSERNQVVQ
421 SFFEKMEDVS EEYLTEGLHM DNLKLVDEDA KGFTYQYTLR DKSRPVAEGE KERLIVRSGK
481 VKFSINDVTG TYSATVEGAS AQNFFGNILS PAAWSRRTAE GDFNLEVKRA LQSDDLAVAY
541 QDKVGAFLDW ASEPVAGLTQ GKARERVSAV LQAGDEWVNP ATQVEGTVFS PTELAAGVNT
601 PLGKIHLTDP REVEAYYRYR LYGDATFINE DFVLRREKEL AGMKDVKLQG LEDGGRAIGK
661 PFEDLSSALG SVRDKRGQGV WDSKIGRTVD ITDDYVRKVY DEGDVLVRLE KDWNTKGTGE
721 LDLSGEFVQY ARVQKDAISE MPDRILNYRS GYVPKINEAR FVVSQRMPKI ARGRPNLTGS
781 QALRAFDSLS DAKLFREEQI ARWMEKHGTD RATAEQIFPE VDLLDEGLGA ARLEEAVGAH
841 TGLYHGTRSK DKLLFGLSGQ ELSRVNPLEA FQRHSKHLGA FISQNEVRIG REKRWLNTVR
901 QEFPDIPIRG FEDTPLPNTP KGKAAERVRR VIREWNGIPS REEELSDAMW QRLHDWALNG
961 ARRFGYADKE SVKSIQWLRS NDPYAVMKTI VMHNLLGVFN IAQLYTQASA MSVALGKFPA
1021 KFAPRVINDT AWMHVLDNIV DDKMLGKVYN ILFNGRAIDE TTKATYDAFR RTGLKEAVFN
1081 NSDMARIGSH GLGVTRRIIS NADNLSLMLY RSGELAARRA TFAAEFQNWM RTTGKKVPSD
1141 LELSEILEEV NKDLLELGPA NRAYWQGGRG TGDVRQLMGV ATQFMQVGAK TMELAGKGLG
1201 GVNRGGFTNR QKARIFLSQL AMFGAAGVPL GGLITQAITS GLGIEQMPEE AAELANQGFV
1261 GFGARLMLGD VEVSDRFALG AQTTQMLEDI ITSDDPLWIK ALGPAGSGVF GRAWDAIQQL
1321 RPLMNADWGH EGVVTEDGLM LAASVLGEIP SSGRNWVKYW MMKNHHAIID QRKRVLIDAS
1381 DKGGFDSRTE LGRLLGFTPT AEQNMRMLQL DQKAVDEMVQ DYANYRVQLL HRAIWEYKLD
1441 PQTVVAIEEA HKVMDRTIPD YVKTKGRELV NKRIYGPRQE SKEDELVKKF LQRTAPDKIS
1501 EDFILDSNAE LSTSVKPISQ PFRNILGGSQ VPQEDE
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A :175(11.4%); C : 0( 0.0%); D :104( 6.8%); E :118( 7.7%); F : 57( 3.7%)
G :123( 8.0%); H : 21( 1.4%); I : 71( 4.6%); K : 81( 5.3%); L :140( 9.1%)
M : 42( 2.7%); N : 59( 3.8%); P : 57( 3.7%); Q : 69( 4.5%); R :101( 6.6%)
S : 81( 5.3%); T : 78( 5.1%); V : 99( 6.4%); W : 24( 1.6%); Y : 36( 2.3%)
KR : 182 ( 11.8%); ED : 222 ( 14.5%); AGP : 355 ( 23.1%);
KRED : 404 ( 26.3%); KR-ED : -40 ( -2.6%); FIKMNY : 346 ( 22.5%);
LVIFM : 409 ( 26.6%); ST : 159 ( 10.4%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 000-000-00 0+000--000 -000-00000 000000+000 0-0+000000 0000000-0-
61 000000000- 0+00+00000 000--+00-0 0000000000 00-0000000 000-+0+000
121 -0+00000-- 000000000+ 0-0--00+00 0-00000-00 -00-00000- +00-000000
181 00+000-00- 00-0000000 00-000--0- 000000+000 0-+00000+0 0+-000+000
241 00000-0000 +000000000 -0-0000000 -00--00000 0-0000000+ 00+00-+0+0
301 0+00++00-- 000-0000+0 00+00-+-00 0+0000000- -000000000 +--0000000
361 0-00-000+0 0----00000 0--0000000 00000+00+- -000000000 0+0-+00000
421 000-+0--00 --000-0000 -00+00---0 +00000000+ -+0+000-0- +-+000+00+
481 0+0000-000 000000-000 0000000000 00000++00- 0-000-0++0 000--00000
541 0-+00000-0 00-0000000 0+0+-+0000 0000--0000 0000-00000 00-0000000
601 000+0000-0 +-0-000+0+ 000-00000- -000++-+-0 000+-0+000 0--00+000+
661 00--000000 00+-++0000 0-0+00+00- 00--00++00 --0-000+0- +-000+000-
721 0-000-0000 0+00+-000- 00-+0000+0 0000+00-0+ 00000+00+0 0+0+000000
781 000+00-000 -0+00+--00 0+00-+000- +000-0000- 0-00--0000 0+0--00000
841 0000000+0+ -+00000000 -00+0000-0 00+00+0000 00000-0+00 +-++00000+
901 0-00-000+0 0--0000000 +0+00-+0++ 00+-000000 +---00-000 0+00-00000
961 0++0000-+- 00+00000+0 0-00000+00 0000000000 0000000000 000000+000
1021 +000+000-0 000000-000 --+000+000 00000+00-- 00+000-00+ +000+-0000
1081 00-00+0000 00000++000 00-0000000 +00-000++0 0000-00000 +000++000-
1141 0-00-00--0 0+-00-0000 0+000000+0 00-0+00000 000000000+ 00-000+000
1201 000+00000+ 0+0+000000 0000000000 0000000000 0000-000-- 00-0000000
1261 0000+0000- 0-00-+0000 0000000--0 000--0000+ 0000000000 0+00-00000
1321 +00000-000 -0000--000 0000000-00 000+000+00 00+000000- 0+++000-00
1381 -+000-0+0- 00+0000000 0-000+0000 -0+00--000 -0000+0000 0+000-0+0-
1441 0000000--0 0+00-+000- 00+0+0+-00 0++0000+0- 0+---00++0 00+000-+00
1501 --000-000- 00000+0000 00+0000000 000---
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 12/30 or 16/45 or 19/60): none
Negative charge clusters (cmin = 13/30 or 18/45 or 22/60): none
Mixed charge clusters (cmin = 19/30 or 26/45 or 32/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.118 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.737 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.145 ( ED )
Expected score/letter: -0.789; Average information/letter: 1.194
Minimal length of displayed segments set to: 20
M_0.01= 12.53 (cv= 8.53, lambda= 0.86057, k= 0.31625, x= 4.01;
90% confidence interval for segment length: 13 +- 11)
M_0.05= 10.64 (x= 2.11)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.145 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.737 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.118 ( KR )
Expected score/letter: -0.685; Average information/letter: 0.853
Minimal length of displayed segments set to: 20
M_0.01= 14.63 (cv= 10.14, lambda= 0.72392, k= 0.26058, x= 4.50;
90% confidence interval for segment length: 18 +- 15)
M_0.05= 12.38 (x= 2.25)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.263 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.737 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.474; Average information/letter: 0.705
Minimal length of displayed segments set to: 20
M_0.01= 10.43 (cv= 7.12, lambda= 1.03033, k= 0.30481, x= 3.31;
90% confidence interval for segment length: 22 +- 17)
M_0.05= 8.85 (x= 1.73)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.737 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.263 ( KEDR )
Expected score/letter: -1.367; Average information/letter: 0.294
Minimal length of displayed segments set to: 20
M_0.01= 37.03 (cv= 26.60, lambda= 0.27581, k= 0.17823, x= 10.43;
90% confidence interval for segment length: 50 +- 28)
M_0.05= 31.12 (x= 4.52)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 6 | 6 | 9 | 35 | 10 | 11 | 15 | 12 | 13 | 17 | 7 | 8 |
lmin1 7 | 8 | 11 | 42 | 13 | 14 | 18 | 15 | 16 | 21 | 8 | 10 |
lmin2 8 | 9 | 12 | 47 | 14 | 15 | 20 | 17 | 18 | 23 | 9 | 12 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 4: 0
- runs >= 4: 1, at 372;
* runs >= 6: 0
0 runs >= 23: 2, at 989; 1215;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.188; Average information/letter: 0.683
Minimal length of displayed segments set to: 15
M_0.01= 27.49 (cv= 18.78, lambda= 0.39060, k= 0.30152, x= 8.71;
90% confidence interval for segment length: 23 +- 13)
M_0.05= 23.32 (x= 4.53)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.127; Average information/letter: 0.580
Minimal length of displayed segments set to: 15
M_0.01= 65.16 (cv= 45.69, lambda= 0.16058, k= 0.22901, x= 19.47;
90% confidence interval for segment length: 26 +- 16)
M_0.05= 55.01 (x= 9.32); M_0.30= 42.93 (x= -2.76)
1) From 1215 to 1244 (type 1): length= 30, score=48.00
(pocket at 1235 to 1236: length= 2, score=-12.00)
(pocket at 1218 to 1219: length= 2, score=-12.00)
1215 IFL |SQ| LAMFG AAGVPLGGLI |TQ| AITSGLGI
L: 5(16.7%); A: 4(13.3%); G: 6(20.0%); I: 4(13.3%);
# of segments (>=15 residues) exceeding M_0.30: 1
2. SPACINGS OF C.
H2N-1536-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 5
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 9
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 93 (Expected range: 64--121)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 26 (6-10) 22 (11-20) 26 (>=21) 20
3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 55 (Expected range: 26-- 67)
16 +plets (f+: 11.8%), 39 -plets (f-: 14.5%)
Total number of charge altplets: 40 (Critical number: 73)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 9 (6-10) 5 (11-20) 18 (>=21) 24
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
33- 72 10 A......... 4 4 0
249- 280 8 A....... 4 4 0
318- 341 6 A..... 4 4 0
823- 862 10 L......... 4 4 0
1002-1025 6 A..... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10)
Location Period Element Copies Core Errors
259- 312 9 i....0... 6 6 /0/././././2/./././
283- 309 3 i*. 8 6 /1/3/./
296- 349 9 *00.0...0 6 6 /0/2/2/./2/./././2/
675- 728 9 *.0.0.... 6 6 /0/./2/./2/././././
805- 829 5 -.000 5 5 /0/./1/0/0/
1370-1393 4 *0.. 6 6 /0/2/././
1463-1502 5 *.... 8 8 0
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:24 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4814.3475.seq)
SWISS-PROT ANNOTATION:
ID ORFV
DE ORFV, 611 bases, 30B9D8E1 checksum.
number of residues: 611; molecular weight: 66.6 kdal
1 MAGGPFQTSG NNAGDLTPVN LQGGFTPVAP KSNDAVNAVE SVTQAVGTIA TLNTERLKKE
61 SVANIRQEVK AVRDALQISK YPTLQTTYFS EEALQDPYIK SVYKNFQEIK GATDQGRLSQ
121 EFAIERMEAL MSQAINRRPQ FADDIRNAAN TAAGANISSK LFSQIMTLTP QQKALQELQA
181 EATKLGIPVE TYQGMVQQQF FREQMSEQIE YAKKQGTASL NDLSQQVGLE VTNVTRNLQN
241 GLLERIRAGG VVDVPVTVAE ARQEFTMLRN KVLSNIPANV PSSQVSQVVD MIDAEEQRII
301 TQIENGTMAK VLQTKGTLFE EIAKDNARTN APDEMQILAI FGNGQSGYAA VNDYTKYKNN
361 PQAMAGLYAV DEGGVLTVAS AATQQFKAGQ ILFQGKQAAN DQERRLAGYF GGIFLQKDVN
421 QQGENPTPPQ QVLRVVDVVS KMGEEYSTAT LGDAKVAANL RAYKETHPQL INHFNSDVHS
481 LKRQYQQLKA EGAFTDENVQ IKNGRVSVTG NTIVSQFGAG SSNPANITGG TGSITAIQRF
541 LRHANLTMKM GQTYQGNGVF PESVFRNSNA FLKDLKDATD VEPADLTGGT NGQDEVIRYD
601 FDGNGKLIRL E
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 62(10.1%); C : 0( 0.0%); D : 28( 4.6%); E : 38( 6.2%); F : 24( 3.9%)
G : 49( 8.0%); H : 4( 0.7%); I : 31( 5.1%); K : 32( 5.2%); L : 43( 7.0%)
M : 14( 2.3%); N+ : 43( 7.0%); P : 21( 3.4%); Q++: 57( 9.3%); R : 27( 4.4%)
S : 32( 5.2%); T : 44( 7.2%); V : 46( 7.5%); W : 0( 0.0%); Y : 16( 2.6%)
KR : 59 ( 9.7%); ED : 66 ( 10.8%); AGP : 132 ( 21.6%);
KRED : 125 ( 20.5%); KR-ED : -7 ( -1.1%); FIKMNY : 160 ( 26.2%);
LVIFM : 158 ( 25.9%); ST : 76 ( 12.4%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000000 0000-00000 0000000000 +00-00000- 0000000000 0000-+0++-
61 00000+0-0+ 00+-00000+ 0000000000 --000-000+ 000+000-0+ 000-00+000
121 -000-+0-00 000000++00 00--0+0000 000000000+ 0000000000 00+000-000
181 -00+00000- 0000000000 0+-000-00- 00++000000 0-0000000- 00000+0000
241 000-+0+000 00-000000- 0+0-0000+0 +000000000 000000000- 00-0--0+00
301 000-00000+ 0000+0000- -00+-00+00 00--000000 0000000000 00-00+0+00
361 0000000000 --00000000 000000+000 00000+0000 -0-++00000 000000+-00
421 000-000000 000+00-000 +00--00000 00-0+00000 +00+-00000 000000-000
481 0++00000+0 -0000--000 0+00+00000 0000000000 0000000000 00000000+0
541 0+000000+0 0000000000 0-000+0000 00+-0+-00- 0-00-00000 000--00+0-
601 0-000+00+0 -
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 12/45 or 14/60): none
Negative charge clusters (cmin = 10/30 or 13/45 or 16/60): none
Mixed charge clusters (cmin = 14/30 or 20/45 or 24/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.097 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.795 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.108 ( ED )
Expected score/letter: -0.818; Average information/letter: 1.477
Minimal length of displayed segments set to: 20
M_0.01= 10.16 (cv= 6.54, lambda= 0.98069, k= 0.34955, x= 3.62;
90% confidence interval for segment length: 10 +- 8)
M_0.05= 8.50 (x= 1.96)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.108 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.795 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.097 ( KR )
Expected score/letter: -0.773; Average information/letter: 1.275
Minimal length of displayed segments set to: 20
M_0.01= 10.90 (cv= 7.07, lambda= 0.90744, k= 0.32447, x= 3.83;
90% confidence interval for segment length: 11 +- 10)
M_0.05= 9.10 (x= 2.03)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.205 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.795 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.591; Average information/letter: 1.157
Minimal length of displayed segments set to: 20
M_0.01= 7.51 (cv= 4.72, lambda= 1.35789, k= 0.43887, x= 2.78;
90% confidence interval for segment length: 13 +- 10)
M_0.05= 6.31 (x= 1.58)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.795 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.205 ( KEDR )
Expected score/letter: -0.841; Average information/letter: 0.144
Minimal length of displayed segments set to: 20
M_0.01= 48.27 (cv= 35.91, lambda= 0.17865, k= 0.09146, x= 12.36;
90% confidence interval for segment length: 86 +- 63)
M_0.05= 39.15 (x= 3.24)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 5 | 7 | 41 | 9 | 10 | 13 | 11 | 12 | 15 | 5 | 7 |
lmin1 6 | 6 | 9 | 50 | 11 | 12 | 15 | 14 | 14 | 18 | 7 | 8 |
lmin2 7 | 7 | 10 | 55 | 13 | 13 | 17 | 16 | 16 | 21 | 8 | 9 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 5: 0
0 runs >= 27: 1, at 506;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.146; Average information/letter: 0.753
Minimal length of displayed segments set to: 15
M_0.01= 23.22 (cv= 15.09, lambda= 0.42509, k= 0.31827, x= 8.13;
90% confidence interval for segment length: 19 +- 11)
M_0.05= 19.39 (x= 4.29)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.964; Average information/letter: 0.603
Minimal length of displayed segments set to: 15
M_0.01= 56.96 (cv= 38.25, lambda= 0.16770, k= 0.23129, x= 18.70;
90% confidence interval for segment length: 23 +- 15)
M_0.05= 47.24 (x= 8.98); M_0.30= 35.67 (x= -2.58)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-611-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 5
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 9
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 43 (Expected range: 19-- 56)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 16 (6-10) 9 (11-20) 9 (>=21) 10
3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 14 (Expected range: 1-- 22)
5 +plets (f+: 9.7%), 9 -plets (f-: 10.8%)
Total number of charge altplets: 12 (Critical number: 25)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 0 (11-20) 3 (>=21) 11
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
252- 259 2 V. 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:25 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4819.4029.seq)
SWISS-PROT ANNOTATION:
ID ORFW
DE ORFW, 291 bases, E6E6A4C8 checksum.
number of residues: 291; molecular weight: 32.5 kdal
1 MDKTKLFVMT LMVTASLFVW SKQDMPQVEF RGQRFSFPEG ATEEQISGVL RDYMETQPTA
61 NGMSTPDRSA AIRTENGYAP TESYFGETMD GEEPKLQDVP KLTAEQRRFA EAIAQVETGG
121 LDNRFIRTKV KPNGTAAGSS AYGTYQITHG LLSGYLDKGV VKLTEQERAA AEELLRRQEI
181 ALTIGGRDRA KYQKGGIYQS QGVRWAKAYG FENVDEFLDA FDYGGTLGLD DDAEFQTLYE
241 SFARKMLNKH LDEAGGNPFK AARIWHGGPK GAGATTDNYE RKVRRIYEQG N
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 28( 9.6%); C : 0( 0.0%); D : 17( 5.8%); E : 25( 8.6%); F : 14( 4.8%)
G : 31(10.7%); H : 3( 1.0%); I : 10( 3.4%); K : 17( 5.8%); L : 20( 6.9%)
M : 8( 2.7%); N : 9( 3.1%); P : 10( 3.4%); Q : 16( 5.5%); R : 20( 6.9%)
S : 12( 4.1%); T : 22( 7.6%); V- : 13( 4.5%); W : 3( 1.0%); Y : 13( 4.5%)
KR : 37 ( 12.7%); ED : 42 ( 14.4%); AGP : 69 ( 23.7%);
KRED : 79 ( 27.1%); KR-ED : -5 ( -1.7%); FIKMNY : 71 ( 24.4%);
LVIFM - : 65 ( 22.3%); ST : 34 ( 11.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0-+0+00000 0000000000 0+0-0000-0 +00+0000-0 00--000000 +-00-00000
61 000000-+00 00+0-00000 0-0000-00- 0--0+00-00 +000-0++00 -00000-000
121 0-0+00+0+0 +000000000 0000000000 000000-+00 0+00-0-+00 0--00++0-0
181 000000+-+0 +00+000000 000+00+000 0-00--00-0 0-0000000- --0-00000-
241 000++000+0 0--000000+ 00+000000+ 000000-00- ++0++00-00 0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 11/30 or 14/45 or 17/60): none
Negative charge clusters (cmin = 12/30 or 15/45 or 19/60): none
Mixed charge clusters (cmin = 17/30 or 24/45 or 30/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.127 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.729 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.144 ( ED )
Expected score/letter: -0.763; Average information/letter: 1.082
Minimal length of displayed segments set to: 20
M_0.01= 11.12 (cv= 6.96, lambda= 0.81533, k= 0.29999, x= 4.17;
90% confidence interval for segment length: 12 +- 11)
M_0.05= 9.12 (x= 2.17)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.144 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.729 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.127 ( KR )
Expected score/letter: -0.694; Average information/letter: 0.866
Minimal length of displayed segments set to: 20
M_0.01= 12.29 (cv= 7.80, lambda= 0.72748, k= 0.26342, x= 4.49;
90% confidence interval for segment length: 15 +- 14)
M_0.05= 10.05 (x= 2.25)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.271 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.729 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.457; Average information/letter: 0.651
Minimal length of displayed segments set to: 20
M_0.01= 9.14 (cv= 5.75, lambda= 0.98714, k= 0.28673, x= 3.39;
90% confidence interval for segment length: 20 +- 17)
M_0.05= 7.49 (x= 1.74)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.729 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.271 ( KEDR )
Expected score/letter: -1.443; Average information/letter: 0.317
Minimal length of displayed segments set to: 20
M_0.01= 29.76 (cv= 19.59, lambda= 0.28961, k= 0.19088, x= 10.17;
90% confidence interval for segment length: 39 +- 23)
M_0.05= 24.13 (x= 4.54)
1) From 132 to 156 (type 1): length= 25, score=25.00 *
132 PNGTAAGSSA YGTYQITHGL LSGYL
L: 3(12.0%); A: 3(12.0%); G: 5(20.0%); S: 3(12.0%);
T: 3(12.0%); Y: 3(12.0%);
# of segments (>=20 residues) exceeding M_0.05: 1
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 5 | 8 | 28 | 9 | 10 | 13 | 11 | 11 | 15 | 5 | 7 |
lmin1 6 | 7 | 10 | 34 | 11 | 12 | 16 | 13 | 14 | 18 | 7 | 8 |
lmin2 7 | 8 | 11 | 38 | 13 | 13 | 18 | 15 | 16 | 20 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 1, at 230;
* runs >= 5: 0
0 runs >= 19: 1, at 132;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.368; Average information/letter: 0.784
Minimal length of displayed segments set to: 15
M_0.01= 20.86 (cv= 12.90, lambda= 0.43988, k= 0.33438, x= 7.97;
90% confidence interval for segment length: 17 +- 10)
M_0.05= 17.16 (x= 4.26)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.485; Average information/letter: 0.689
Minimal length of displayed segments set to: 15
M_0.01= 47.45 (cv= 30.26, lambda= 0.18750, k= 0.25267, x= 17.20;
90% confidence interval for segment length: 19 +- 12)
M_0.05= 38.76 (x= 8.50); M_0.30= 28.42 (x= -1.84)
1) From 6 to 20 (type 1): length= 15, score=39.00 *
6 LFVMTLMVTA SLFVW
L: 3(20.0%); V: 3(20.0%); T: 2(13.3%); F: 2(13.3%);
M: 2(13.3%);
# of segments (>=15 residues) exceeding M_0.30: 1
2. SPACINGS OF C.
H2N-291-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 22 (Expected range: 5-- 30)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 7 (6-10) 7 (11-20) 4 (>=21) 5
3. Clusters of amino acid multiplets (cmin = 13/30 or 16/45 or 19/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 11 (Expected range: 0-- 19)
5 +plets (f+: 12.7%), 6 -plets (f-: 14.4%)
Total number of charge altplets: 7 (Critical number: 22)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 2 (6-10) 2 (11-20) 3 (>=21) 5
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:27 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4812.1811.seq)
SWISS-PROT ANNOTATION:
ID ORFX
DE ORFX, 56 bases, CAC39C85 checksum.
number of residues: 56; molecular weight: 6.4 kdal
1 MNKVIKKGIW WVEVNKDGKE TGRASRKRFE PKPEPEVKKK AAPKKKAASK KDEGEQ
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 5( 8.9%); C : 0( 0.0%); D : 2( 3.6%); E : 7(12.5%); F : 1( 1.8%)
G : 4( 7.1%); H : 0( 0.0%); I : 2( 3.6%); K : 15(26.8%); L : 0( 0.0%)
M : 1( 1.8%); N : 2( 3.6%); P : 4( 7.1%); Q : 1( 1.8%); R : 3( 5.4%)
S : 2( 3.6%); T : 1( 1.8%); V : 4( 7.1%); W : 2( 3.6%); Y : 0( 0.0%)
KR : 18 ( 32.1%); ED : 9 ( 16.1%); AGP : 13 ( 23.2%);
KRED : 27 ( 48.2%); KR-ED : 9 ( 16.1%); FIKMNY : 21 ( 37.5%);
LVIFM : 8 ( 14.3%); ST : 3 ( 5.4%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00+00++000 00-00+-0+- 00+00+++0- 0+0-0-0+++ 000+++000+ +--0-0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 19/30 or 26/45 or 33/60): none
Negative charge clusters (cmin = 12/30 or 17/45 or 21/60): none
Mixed charge clusters (cmin = 25/30 or 35/45 or 44/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.321 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.518 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.161 ( ED )
Expected score/letter: -0.196
Average information/letter: 0.046 < .10; too small !
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.161 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.518 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.321 ( KR )
Expected score/letter: -0.839; Average information/letter: 0.956
Minimal length of displayed segments set to: 20
M_0.01= 10.35 (cv= 5.65, lambda= 0.71304, k= 0.28786, x= 4.71;
90% confidence interval for segment length: 11 +- 11)
M_0.05= 8.06 (x= 2.42)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.482 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.518 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.036
Average information/letter: 0.004 < .10; too small !
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.518 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.482 ( KEDR )
Expected score/letter: -3.339; Average information/letter: 0.924
Minimal length of displayed segments set to: 20
M_0.01= 12.01 (cv= 6.14, lambda= 0.65551, k= 0.46981, x= 5.87;
90% confidence interval for segment length: 12 +- 3)
M_0.05= 9.52 (x= 3.38)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 7 | 5 | 11 | 12 | 10 | 7 | 12 | 9 | 7 | 11 | NA | NA |
lmin1 9 | 6 | 13 | 15 | 12 | 9 | 15 | 11 | 9 | 14 | NA | NA |
lmin2 10 | 7 | 15 | 16 | 13 | 10 | 17 | 13 | 10 | 15 | NA | NA |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 5: 0
- runs >= 3: 0
* runs >= 7: 0
0 runs >= 8: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -3.946; Average information/letter: 1.491
Minimal length of displayed segments set to: 15
M_0.01= 11.32 (cv= 5.75, lambda= 0.69947, k= 0.49170, x= 5.56;
90% confidence interval for segment length: 8 +- 3)
M_0.05= 8.99 (x= 3.23)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -8.000; Average information/letter: 1.542
Minimal length of displayed segments set to: 15
M_0.01= 24.53 (cv= 12.89, lambda= 0.31220, k= 0.38065, x= 11.64;
90% confidence interval for segment length: 7 +- 4)
M_0.05= 19.31 (x= 6.42); M_0.30= 13.10 (x= 0.21)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-56-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
______________________________
Simple tandem repeat:
[ 38- 43] KKKAAP
[ 44- 49] KKKAAS
[ 50- 51] KK
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 7 (Expected range: 0-- 13)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 5 (6-10) 2 (11-20) 0 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 18/30 or 23/45 or 28/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 6 (Expected range: 0-- 12)
5 +plets (f+: 32.1%), 1 -plets (f-: 16.1%)
Total number of charge altplets: 3 (Critical number: 13)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 4 (6-10) 2 (11-20) 1 (>=21) 0
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
32- 55 6 KK.... 4 4 ! /0/1/././././
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
26- 53 2 *. 12 8 2
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:28 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4811.4029.seq)
SWISS-PROT ANNOTATION:
ID ORF1
DE ORF1, 174 bases, 33A7F57 checksum.
number of residues: 174; molecular weight: 19.2 kdal
1 MKALALKTVQ SNIRSIVRNS NPAHLQEVFA STIWHGLIHS NIMPAAMKAL RDSDAPSKLK
61 AGISKYMPMK WDKKREAYVF NAAKREKLLS DLALTDESSF EDLANSLPQL FEKKSEPTAY
121 SRDAYLANVA KKLEKEGEQN VQDIIGLVSA MLDNPELVSA AAKAVFKQTA AKAA
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 27(15.5%); C : 0( 0.0%); D : 9( 5.2%); E : 11( 6.3%); F : 5( 2.9%)
G : 4( 2.3%); H : 3( 1.7%); I : 8( 4.6%); K : 19(10.9%); L : 18(10.3%)
M : 6( 3.4%); N : 9( 5.2%); P : 7( 4.0%); Q : 6( 3.4%); R : 6( 3.4%)
S : 16( 9.2%); T : 5( 2.9%); V : 9( 5.2%); W : 2( 1.1%); Y : 4( 2.3%)
KR : 25 ( 14.4%); ED : 20 ( 11.5%); AGP : 38 ( 21.8%);
KRED : 45 ( 25.9%); KR-ED : 5 ( 2.9%); FIKMNY : 51 ( 29.3%);
LVIFM : 46 ( 26.4%); ST : 21 ( 12.1%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0+0000+000 000+000+00 000000-000 0000000000 0000000+00 +-0-000+0+
61 0000+0000+ 0-+++-0000 000++-+000 -0000--000 --00000000 0-++0-0000
121 0+-0000000 ++0-+-0-00 00-0000000 00-00-0000 00+000+000 0+00
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 11/30 or 15/45 or 19/60): none
Negative charge clusters (cmin = 10/30 or 13/45 or 16/60): none
Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.144 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.741 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.115 ( ED )
Expected score/letter: -0.684; Average information/letter: 0.857
Minimal length of displayed segments set to: 20
M_0.01= 11.58 (cv= 7.10, lambda= 0.72678, k= 0.26115, x= 4.48;
90% confidence interval for segment length: 14 +- 14)
M_0.05= 9.34 (x= 2.24)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.115 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.741 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.144 ( KR )
Expected score/letter: -0.799; Average information/letter: 1.241
Minimal length of displayed segments set to: 20
M_0.01= 9.81 (cv= 5.86, lambda= 0.87967, k= 0.32271, x= 3.94;
90% confidence interval for segment length: 10 +- 9)
M_0.05= 7.96 (x= 2.09)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.259 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.741 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.483; Average information/letter: 0.733
Minimal length of displayed segments set to: 20
M_0.01= 8.17 (cv= 4.90, lambda= 1.05315, k= 0.31435, x= 3.27;
90% confidence interval for segment length: 17 +- 15)
M_0.05= 6.62 (x= 1.72)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.741 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.259 ( KEDR )
Expected score/letter: -1.328; Average information/letter: 0.282
Minimal length of displayed segments set to: 20
M_0.01= 29.77 (cv= 19.21, lambda= 0.26862, k= 0.17163, x= 10.56;
90% confidence interval for segment length: 41 +- 26)
M_0.05= 23.70 (x= 4.50)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 4 | 7 | 28 | 9 | 9 | 12 | 11 | 10 | 14 | 6 | 7 |
lmin1 6 | 6 | 9 | 34 | 11 | 11 | 15 | 13 | 12 | 17 | 7 | 9 |
lmin2 7 | 7 | 10 | 38 | 13 | 12 | 16 | 15 | 14 | 19 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 1, at 73;
- runs >= 3: 0
* runs >= 5: 1, at 72;
0 runs >= 19: 1, at 28;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.270; Average information/letter: 0.743
Minimal length of displayed segments set to: 15
M_0.01= 20.84 (cv= 12.48, lambda= 0.41336, k= 0.31793, x= 8.36;
90% confidence interval for segment length: 17 +- 10)
M_0.05= 16.89 (x= 4.41)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.195; Average information/letter: 0.617
Minimal length of displayed segments set to: 15
M_0.01= 49.17 (cv= 30.50, lambda= 0.16916, k= 0.23655, x= 18.67;
90% confidence interval for segment length: 19 +- 13)
M_0.05= 39.53 (x= 9.04); M_0.30= 28.07 (x= -2.43)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-174-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 147- 150] LVSA
[ 157- 160] LVSA
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 11 (Expected range: 2-- 23)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 3 (6-10) 3 (11-20) 4 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 6 (Expected range: 0-- 12)
4 +plets (f+: 14.4%), 2 -plets (f-: 11.5%)
Total number of charge altplets: 7 (Critical number: 14)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 1 (11-20) 3 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
141- 148 (4.) V( 7)V 10 of 10 0.0097 large minimal spacing
158- 165 (4.) V( 7)V 9 of 10 0.0097 matching minimum
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:30 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4816.2920.seq)
SWISS-PROT ANNOTATION:
ID ORF2
DE ORF2, 116 bases, 4FE4F868 checksum.
number of residues: 116; molecular weight: 13.2 kdal
1 MKLRESIAPL RDAVSGMPAL HGHQTFFENM ASQYEHCKIK CVNPIHAMRW NVWFAIPKTA
61 RDRIIVQSVG TADYVGGYFE SVNDSHIDSL LRKAMPHDYM ARFLAHAQLK AAKQGR
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 14(12.1%); C : 2( 1.7%); D : 6( 5.2%); E : 4( 3.4%); F : 5( 4.3%)
G : 6( 5.2%); H : 7( 6.0%); I : 7( 6.0%); K : 7( 6.0%); L : 7( 6.0%)
M : 6( 5.2%); N : 4( 3.4%); P : 5( 4.3%); Q : 5( 4.3%); R : 8( 6.9%)
S : 7( 6.0%); T : 3( 2.6%); V : 7( 6.0%); W : 2( 1.7%); Y : 4( 3.4%)
KR : 15 ( 12.9%); ED : 10 ( 8.6%); AGP : 25 ( 21.6%);
KRED : 25 ( 21.6%); KR-ED : 5 ( 4.3%); FIKMNY : 33 ( 28.4%);
LVIFM : 32 ( 27.6%); ST : 10 ( 8.6%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0+0+-00000 +-00000000 0000000-00 0000-00+0+ 00000000+0 0000000+00
61 +-+0000000 00-000000- 000-000-00 0++0000-00 0+0000000+ 00+00+
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 11/30 or 14/45 or 18/60): none
Negative charge clusters (cmin = 8/30 or 11/45 or 13/60): none
Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.129 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.784 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.086 ( ED )
Expected score/letter: -0.698; Average information/letter: 0.976
Minimal length of displayed segments set to: 20
M_0.01= 10.25 (cv= 6.03, lambda= 0.78816, k= 0.28033, x= 4.22;
90% confidence interval for segment length: 12 +- 12)
M_0.05= 8.19 (x= 2.15)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.086 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.784 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.129 ( KR )
Expected score/letter: -0.871; Average information/letter: 1.702
Minimal length of displayed segments set to: 20
M_0.01= 7.93 (cv= 4.51, lambda= 1.05510, k= 0.37413, x= 3.43;
90% confidence interval for segment length: 7 +- 6)
M_0.05= 6.39 (x= 1.88)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.216 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.784 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.569; Average information/letter: 1.061
Minimal length of displayed segments set to: 20
M_0.01= 6.55 (cv= 3.68, lambda= 1.29198, k= 0.41266, x= 2.88;
90% confidence interval for segment length: 12 +- 10)
M_0.05= 5.29 (x= 1.61)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.784 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.216 ( KEDR )
Expected score/letter: -0.940; Average information/letter: 0.171
Minimal length of displayed segments set to: 20
M_0.01= 36.11 (cv= 24.10, lambda= 0.19722, k= 0.10732, x= 12.01;
90% confidence interval for segment length: 60 +- 47)
M_0.05= 27.85 (x= 3.74)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 4 | 6 | 32 | 9 | 8 | 11 | 10 | 9 | 13 | 7 | 9 |
lmin1 6 | 5 | 8 | 39 | 11 | 9 | 13 | 13 | 11 | 16 | 9 | 12 |
lmin2 7 | 6 | 9 | 43 | 12 | 11 | 15 | 14 | 13 | 18 | 10 | 13 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 21: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.793; Average information/letter: 0.538
Minimal length of displayed segments set to: 15
M_0.01= 22.94 (cv= 13.69, lambda= 0.34729, k= 0.24954, x= 9.25;
90% confidence interval for segment length: 21 +- 15)
M_0.05= 18.24 (x= 4.56)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.724; Average information/letter: 0.550
Minimal length of displayed segments set to: 15
M_0.01= 48.07 (cv= 29.20, lambda= 0.16282, k= 0.21706, x= 18.87;
90% confidence interval for segment length: 21 +- 15)
M_0.05= 38.06 (x= 8.86); M_0.30= 26.15 (x= -3.05)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-36-C-3-C-75-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 5 (Expected range: 0-- 14)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 0 (11-20) 3 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 9/30 or 11/45 or 13/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 1 (Expected range: 0-- 8)
1 +plets (f+: 12.9%), 0 -plets (f-: 8.6%)
Total number of charge altplets: 3 (Critical number: 9)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 0 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
20- 90 (2.) L( 70)L 1 of 8 0.0092 large 1. maximal spacing
80- 84 (3.) -( 4)- 11 of 11 0.0085 large minimal spacing
84- 88 (3.) -( 4)- 10 of 11 0.0085 matching minimum
91- 104 (4.) L( 13)L 2 of 8 0.9892 small 2. maximal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:31 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4716.4029.seq)
SWISS-PROT ANNOTATION:
ID ORF3
DE ORF3, 56 bases, 9BBCFDEF checksum.
number of residues: 56; molecular weight: 6.2 kdal
1 MNIDTVFKWL AFISATITIP MDPSQGAAYF VGVAVLMTVY LFIAAPIVWV YVRITS
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 7(12.5%); C : 0( 0.0%); D : 2( 3.6%); E : 0( 0.0%); F : 4( 7.1%)
G : 2( 3.6%); H : 0( 0.0%); I : 7(12.5%); K : 1( 1.8%); L : 3( 5.4%)
M : 3( 5.4%); N : 1( 1.8%); P : 3( 5.4%); Q : 1( 1.8%); R : 1( 1.8%)
S : 3( 5.4%); T : 5( 8.9%); V : 8(14.3%); W : 2( 3.6%); Y : 3( 5.4%)
KR : 2 ( 3.6%); ED : 2 ( 3.6%); AGP : 12 ( 21.4%);
KRED : 4 ( 7.1%); KR-ED : 0 ( 0.0%); FIKMNY : 19 ( 33.9%);
LVIFM : 25 ( 44.6%); ST : 8 ( 14.3%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 000-000+00 0000000000 0-00000000 0000000000 0000000000 00+000
A. CHARGE CLUSTERS.
Positive charge clusters: not evaluated (frequency of + < 5%, too low)
Negative charge clusters: not evaluated (frequency of - < 5%, too low)
Mixed charge clusters (cmin = 7/30 or 10/45 or 12/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.036 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.929 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.036 ( ED )
Expected score/letter: -0.929; Average information/letter: 3.156
Minimal length of displayed segments set to: 20
M_0.01= 5.04 (cv= 2.59, lambda= 1.55610, k= 0.45704, x= 2.45;
90% confidence interval for segment length: 4 +- 3)
M_0.05= 3.99 (x= 1.41)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.036 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.929 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.036 ( KR )
Expected score/letter: -0.929; Average information/letter: 3.156
Minimal length of displayed segments set to: 20
M_0.01= 5.04 (cv= 2.59, lambda= 1.55610, k= 0.45704, x= 2.45;
90% confidence interval for segment length: 4 +- 3)
M_0.05= 3.99 (x= 1.41)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.071 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.929 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.857; Average information/letter: 3.172
Minimal length of displayed segments set to: 20
M_0.01= 3.27 (cv= 1.57, lambda= 2.56495, k= 0.79121, x= 1.70;
90% confidence interval for segment length: 4 +- 2)
M_0.05= 2.64 (x= 1.07)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.929 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.071 ( KEDR )
Expected score/letter: 0.357
! expected score > -.001; too big !
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 3 | 3 | 3 | 81 | 6 | 6 | 7 | 7 | 7 | 9 | NA | NA |
lmin1 4 | 4 | 4 | 100 | 7 | 7 | 9 | 9 | 9 | 11 | NA | NA |
lmin2 4 | 4 | 5 | 110 | 8 | 8 | 10 | 10 | 10 | 12 | NA | NA |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 3: 0
0 runs >= 54: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -0.250
Average information/letter: 0.019 < .10; too small !
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: 0.161
! expected score > -.001; too big !
2. SPACINGS OF C.
H2N-56-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 2 (Expected range: 0-- 11)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 0 (6-10) 0 (11-20) 2 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 8/30 or 10/45 or 12/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 0 (Expected range: 0-- 2)
0 +plets (f+: 3.6%), 0 -plets (f-: 3.6%)
Total number of charge altplets: 0 (Critical number: 2)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 0 (>=21) 1
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 4)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
3- 56 9 i.0...0.0 6 6 /0/./2/./././2/./1/
31- 44 2 i0 7 7 /0/2/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:32 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4815.7504.seq)
SWISS-PROT ANNOTATION:
ID ORF4
DE ORF4, 46 bases, E74BD5A1 checksum.
number of residues: 46; molecular weight: 5.6 kdal
1 MKYIDKLTAY HNGKYYLPVE AFDCCTSNWD ADDWIRVIDY YNGWTK
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 3( 6.5%); C : 2( 4.3%); D : 6(13.0%); E : 1( 2.2%); F : 1( 2.2%)
G : 2( 4.3%); H : 1( 2.2%); I : 3( 6.5%); K : 4( 8.7%); L : 2( 4.3%)
M : 1( 2.2%); N : 3( 6.5%); P : 1( 2.2%); Q : 0( 0.0%); R : 1( 2.2%)
S : 1( 2.2%); T : 3( 6.5%); V : 2( 4.3%); W : 3( 6.5%); Y : 6(13.0%)
KR : 5 ( 10.9%); ED : 7 ( 15.2%); AGP : 6 ( 13.0%);
KRED : 12 ( 26.1%); KR-ED : -2 ( -4.3%); FIKMNY : 18 ( 39.1%);
LVIFM : 9 ( 19.6%); ST : 4 ( 8.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0+00-+0000 000+00000- 00-000000- 0--00+00-0 00000+
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 10/30 or 13/45 or 16/60): none
Negative charge clusters (cmin = 12/30 or 16/45 or 20/60): none
Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.109 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.739 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.152 ( ED )
Expected score/letter: -0.826; Average information/letter: 1.344
Minimal length of displayed segments set to: 20
M_0.01= 8.00 (cv= 4.18, lambda= 0.91693, k= 0.33649, x= 3.83;
90% confidence interval for segment length: 8 +- 8)
M_0.05= 6.23 (x= 2.05)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.152 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.739 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.109 ( KR )
Expected score/letter: -0.652; Average information/letter: 0.766
Minimal length of displayed segments set to: 20
M_0.01= 10.23 (cv= 5.58, lambda= 0.68621, k= 0.24362, x= 4.65;
90% confidence interval for segment length: 13 +- 14)
M_0.05= 7.85 (x= 2.27)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.261 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.739 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.478; Average information/letter: 0.719
Minimal length of displayed segments set to: 20
M_0.01= 6.97 (cv= 3.68, lambda= 1.04145, k= 0.30946, x= 3.29;
90% confidence interval for segment length: 15 +- 14)
M_0.05= 5.40 (x= 1.73)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.739 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.261 ( KEDR )
Expected score/letter: -1.348; Average information/letter: 0.288
Minimal length of displayed segments set to: 20
M_0.01= 24.55 (cv= 14.06, lambda= 0.27230, k= 0.17501, x= 10.49;
90% confidence interval for segment length: 33 +- 24)
M_0.05= 18.57 (x= 4.51)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 4 | 6 | 23 | 7 | 8 | 11 | 8 | 9 | 12 | 5 | 6 |
lmin1 5 | 6 | 8 | 28 | 9 | 10 | 13 | 10 | 12 | 14 | 7 | 8 |
lmin2 6 | 7 | 9 | 31 | 10 | 11 | 14 | 11 | 13 | 16 | 8 | 9 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 16: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.043; Average information/letter: 0.610
Minimal length of displayed segments set to: 15
M_0.01= 18.57 (cv= 9.91, lambda= 0.38639, k= 0.28511, x= 8.66;
90% confidence interval for segment length: 17 +- 11)
M_0.05= 14.35 (x= 4.44)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.565; Average information/letter: 0.795
Minimal length of displayed segments set to: 15
M_0.01= 31.92 (cv= 17.29, lambda= 0.22141, k= 0.25621, x= 14.63;
90% confidence interval for segment length: 13 +- 9)
M_0.05= 24.56 (x= 7.26); M_0.30= 15.80 (x= -1.49)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-23
CC at 24
-21-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 4 (Expected range: 0-- 9)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 0 (6-10) 4 (11-20) 1 (>=21) 0
3. Clusters of amino acid multiplets (cmin = 14/30 or 18/45 or 21/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 1 (Expected range: 0-- 5)
0 +plets (f+: 10.9%), 1 -plets (f-: 15.2%)
Total number of charge altplets: 1 (Critical number: 6)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 1 (>=21) 1
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 7)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:34 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.2250.8890.seq)
SWISS-PROT ANNOTATION:
ID ORF5
DE ORF5, 84 bases, 936E4563 checksum.
number of residues: 84; molecular weight: 10.2 kdal
1 MDKVMERTHF ELPDKLDTPM IHSHYYAWLD KHCNHTWHNI HTRAKYLASL GYDRDKHYLT
61 QDDLTVLKCI LHHHFFINVP IESE
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 3( 3.6%); C : 2( 2.4%); D : 8( 9.5%); E : 4( 4.8%); F : 3( 3.6%)
G : 1( 1.2%); H : 11(13.1%); I : 5( 6.0%); K : 6( 7.1%); L : 9(10.7%)
M : 3( 3.6%); N : 3( 3.6%); P : 3( 3.6%); Q : 1( 1.2%); R : 3( 3.6%)
S : 3( 3.6%); T : 6( 7.1%); V : 3( 3.6%); W : 2( 2.4%); Y : 5( 6.0%)
KR : 9 ( 10.7%); ED : 12 ( 14.3%); AGP : 7 ( 8.3%);
KRED : 21 ( 25.0%); KR-ED : -3 ( -3.6%); FIKMNY : 25 ( 29.8%);
LVIFM : 23 ( 27.4%); ST : 9 ( 10.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0-+00-+000 -00-+0-000 000000000- +000000000 00+0+00000 00-+-+0000
61 0--0000+00 0000000000 0-0-
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 13/45 or 16/60): none
Negative charge clusters (cmin = 11/30 or 15/45 or 19/60): none
Mixed charge clusters (cmin = 16/30 or 22/45 or 28/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.107 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.750 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.143 ( ED )
Expected score/letter: -0.821; Average information/letter: 1.355
Minimal length of displayed segments set to: 20
M_0.01= 8.60 (cv= 4.80, lambda= 0.92375, k= 0.33723, x= 3.80;
90% confidence interval for segment length: 8 +- 8)
M_0.05= 6.84 (x= 2.04)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.143 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.750 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.107 ( KR )
Expected score/letter: -0.679; Average information/letter: 0.855
Minimal length of displayed segments set to: 20
M_0.01= 10.55 (cv= 6.08, lambda= 0.72822, k= 0.26045, x= 4.47;
90% confidence interval for segment length: 13 +- 13)
M_0.05= 8.32 (x= 2.23)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.250 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.750 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.500; Average information/letter: 0.792
Minimal length of displayed segments set to: 20
M_0.01= 7.22 (cv= 4.03, lambda= 1.09861, k= 0.33333, x= 3.19;
90% confidence interval for segment length: 14 +- 13)
M_0.05= 5.74 (x= 1.70)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.750 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.250 ( KEDR )
Expected score/letter: -1.250; Average information/letter: 0.259
Minimal length of displayed segments set to: 20
M_0.01= 28.25 (cv= 17.41, lambda= 0.25450, k= 0.15869, x= 10.84;
90% confidence interval for segment length: 40 +- 28)
M_0.05= 21.85 (x= 4.44)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 6 | 27 | 8 | 9 | 11 | 9 | 10 | 13 | 9 | 11 |
lmin1 5 | 6 | 8 | 32 | 10 | 11 | 14 | 11 | 12 | 15 | 11 | 14 |
lmin2 6 | 7 | 9 | 36 | 11 | 12 | 15 | 13 | 14 | 17 | 12 | 15 |
(Significance level: 0.010000; Minimal displayed length: 6)
(H..) 14(0,0,0); at 30- 43: DKHCNHTWHNIHTR
(2. quartile)
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 1, at 53;
0 runs >= 18: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.250; Average information/letter: 0.785
Minimal length of displayed segments set to: 15
M_0.01= 18.06 (cv= 10.14, lambda= 0.43713, k= 0.32058, x= 7.92;
90% confidence interval for segment length: 15 +- 9)
M_0.05= 14.33 (x= 4.19)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.845; Average information/letter: 0.858
Minimal length of displayed segments set to: 15
M_0.01= 37.68 (cv= 21.54, lambda= 0.20572, k= 0.27804, x= 16.14;
90% confidence interval for segment length: 13 +- 9)
M_0.05= 29.75 (x= 8.22); M_0.30= 20.33 (x= -1.21)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-32-C-35-C-15-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 4 (Expected range: 0-- 12)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 2 (11-20) 0 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 9/30 or 12/45 or 14/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 1 (Expected range: 0-- 7)
0 +plets (f+: 10.7%), 1 -plets (f-: 14.3%)
Total number of charge altplets: 5 (Critical number: 9)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 0 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
32- 43 3 H.. 4 4 ! 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:35 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4813.2920.seq)
SWISS-PROT ANNOTATION:
ID ORF6
DE ORF6, 120 bases, 3DC19403 checksum.
number of residues: 120; molecular weight: 13.2 kdal
1 MSMQYTILLG AGAHIDDTTS AKAMRADLVQ THIKSWTEQL CAVFGGCTMS IVDGNYVMNN
61 GELCTESSFQ FTCIKFGDNT VEQAQAWIEV KLIARDMCKV FEQECVLISR HACDAELVGV
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 11( 9.2%); C : 7( 5.8%); D : 7( 5.8%); E : 8( 6.7%); F : 5( 4.2%)
G : 8( 6.7%); H : 3( 2.5%); I : 8( 6.7%); K : 5( 4.2%); L : 8( 6.7%)
M : 6( 5.0%); N : 4( 3.3%); P : 0( 0.0%); Q : 7( 5.8%); R : 3( 2.5%)
S : 7( 5.8%); T : 9( 7.5%); V : 10( 8.3%); W : 2( 1.7%); Y : 2( 1.7%)
KR : 8 ( 6.7%); ED : 15 ( 12.5%); AGP : 19 ( 15.8%);
KRED : 23 ( 19.2%); KR-ED : -7 ( -5.8%); FIKMNY : 30 ( 25.0%);
LVIFM : 37 ( 30.8%); ST : 16 ( 13.3%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000000 00000--000 0+00+0-000 000+000-00 0000000000 00-0000000
61 0-000-0000 0000+00-00 0-000000-0 +000+-00+0 0-0-00000+ 000-0-0000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 7/30 or 9/45 or 11/60): none
Negative charge clusters (cmin = 10/30 or 14/45 or 17/60): none
Mixed charge clusters (cmin = 14/30 or 19/45 or 23/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.067 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.808 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.125 ( ED )
Expected score/letter: -0.925; Average information/letter: 2.146
Minimal length of displayed segments set to: 20
M_0.01= 7.03 (cv= 3.96, lambda= 1.20847, k= 0.41054, x= 3.07;
90% confidence interval for segment length: 6 +- 5)
M_0.05= 5.68 (x= 1.72)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.125 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.808 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.067 ( KR )
Expected score/letter: -0.692; Average information/letter: 1.001
Minimal length of displayed segments set to: 20
M_0.01= 10.10 (cv= 5.95, lambda= 0.80491, k= 0.28337, x= 4.15;
90% confidence interval for segment length: 12 +- 11)
M_0.05= 8.07 (x= 2.12)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.192 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.808 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.617; Average information/letter: 1.280
Minimal length of displayed segments set to: 20
M_0.01= 6.00 (cv= 3.33, lambda= 1.43922, k= 0.47045, x= 2.67;
90% confidence interval for segment length: 10 +- 8)
M_0.05= 4.87 (x= 1.54)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.808 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.192 ( KEDR )
Expected score/letter: -0.725; Average information/letter: 0.114
Minimal length of displayed segments set to: 20
M_0.01= 43.33 (cv= 30.62, lambda= 0.15635, k= 0.07326, x= 12.71;
90% confidence interval for segment length: 86 +- 73)
M_0.05= 32.90 (x= 2.28)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 3 | 4 | 6 | 36 | 7 | 9 | 11 | 9 | 11 | 13 | 6 | 7 |
lmin1 5 | 6 | 7 | 44 | 9 | 11 | 13 | 11 | 13 | 15 | 7 | 9 |
lmin2 6 | 7 | 8 | 49 | 10 | 12 | 14 | 12 | 15 | 17 | 8 | 11 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 24: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.617; Average information/letter: 0.449
Minimal length of displayed segments set to: 15
M_0.01= 26.07 (cv= 15.86, lambda= 0.30179, k= 0.21876, x= 10.21;
90% confidence interval for segment length: 25 +- 18)
M_0.05= 20.67 (x= 4.81)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -2.958; Average information/letter: 0.368
Minimal length of displayed segments set to: 15
M_0.01= 59.18 (cv= 37.43, lambda= 0.12792, k= 0.16250, x= 21.76;
90% confidence interval for segment length: 30 +- 24)
M_0.05= 46.44 (x= 9.01); M_0.30= 31.28 (x= -6.15)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-40-C-5-C-16-C-8-C-24-C-6-C-7-C-7-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 6 (Expected range: 0-- 15)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 3 (11-20) 1 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 10/30 or 12/45 or 15/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 1 (Expected range: 0-- 7)
0 +plets (f+: 6.7%), 1 -plets (f-: 12.5%)
Total number of charge altplets: 1 (Critical number: 8)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 1 (>=21) 1
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:37 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4814.3752.seq)
SWISS-PROT ANNOTATION:
ID ORF7
DE ORF7, 134 bases, 250A6EBF checksum.
number of residues: 134; molecular weight: 14.9 kdal
1 MSSAWSFKDP DDLELETPDG DCVYVCGSGE RIMYGAGKGE ADNLVYYVNV GDYQRAIDVL
61 RKYKDSSVTG WGHQLLADLE NYMLKSTPDP VASIDEADSF GIGQPNRKST KQWEPDTPDP
121 DVDLMKSIRD ACHR
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 8( 6.0%); C : 3( 2.2%); D : 18(13.4%); E : 7( 5.2%); F : 2( 1.5%)
G : 11( 8.2%); H : 2( 1.5%); I : 5( 3.7%); K : 8( 6.0%); L : 9( 6.7%)
M : 4( 3.0%); N : 4( 3.0%); P : 8( 6.0%); Q : 4( 3.0%); R : 6( 4.5%)
S : 11( 8.2%); T : 5( 3.7%); V : 9( 6.7%); W : 3( 2.2%); Y : 7( 5.2%)
KR : 14 ( 10.4%); ED : 25 ( 18.7%); AGP : 27 ( 20.1%);
KRED : 39 ( 29.1%); KR-ED : -11 ( -8.2%); FIKMNY : 30 ( 22.4%);
LVIFM : 29 ( 21.6%); ST : 16 ( 11.9%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000+-0 --0-0-00-0 -00000000- +000000+0- 0-00000000 0-00+00-00
61 ++0+-00000 0000000-0- 0000+000-0 0000--0-00 000000++00 +00-0-00-0
121 -0-00+00+- 000+
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 12/45 or 15/60): none
Negative charge clusters (cmin = 14/30 or 18/45 or 23/60): none
Mixed charge clusters (cmin = 18/30 or 25/45 or 31/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.104 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.709 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.187 ( ED )
Expected score/letter: -0.873; Average information/letter: 1.459
Minimal length of displayed segments set to: 20
M_0.01= 8.90 (cv= 5.16, lambda= 0.94942, k= 0.35218, x= 3.75;
90% confidence interval for segment length: 8 +- 8)
M_0.05= 7.19 (x= 2.03)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.187 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.709 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.104 ( KR )
Expected score/letter: -0.545; Average information/letter: 0.489
Minimal length of displayed segments set to: 20
M_0.01= 14.34 (cv= 9.02, lambda= 0.54322, k= 0.18130, x= 5.32;
90% confidence interval for segment length: 23 +- 23)
M_0.05= 11.34 (x= 2.32)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.291 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.709 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.418; Average information/letter: 0.537
Minimal length of displayed segments set to: 20
M_0.01= 9.09 (cv= 5.50, lambda= 0.89032, k= 0.24635, x= 3.59;
90% confidence interval for segment length: 22 +- 20)
M_0.05= 7.26 (x= 1.76)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.709 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.291 ( KEDR )
Expected score/letter: -1.619; Average information/letter: 0.371
Minimal length of displayed segments set to: 20
M_0.01= 24.83 (cv= 15.24, lambda= 0.32148, k= 0.21991, x= 9.60;
90% confidence interval for segment length: 31 +- 18)
M_0.05= 19.76 (x= 4.53)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 6 | 7 | 24 | 8 | 10 | 12 | 9 | 11 | 14 | 5 | 7 |
lmin1 5 | 7 | 9 | 29 | 10 | 12 | 15 | 11 | 14 | 17 | 7 | 8 |
lmin2 6 | 8 | 11 | 32 | 11 | 13 | 17 | 13 | 15 | 19 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 4: 0
* runs >= 5: 0
0 runs >= 16: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.522; Average information/letter: 0.862
Minimal length of displayed segments set to: 15
M_0.01= 17.98 (cv= 10.41, lambda= 0.47050, k= 0.35436, x= 7.57;
90% confidence interval for segment length: 14 +- 8)
M_0.05= 14.52 (x= 4.11)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.978; Average information/letter: 0.836
Minimal length of displayed segments set to: 15
M_0.01= 38.23 (cv= 22.81, lambda= 0.21476, k= 0.27593, x= 15.42;
90% confidence interval for segment length: 14 +- 9)
M_0.05= 30.64 (x= 7.83); M_0.30= 21.61 (x= -1.20)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-21-C-3-C-105-C-2-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 87- 90] TPDP
[ 117- 120] TPDP
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 5 (Expected range: 0-- 17)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 2 (11-20) 1 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 8/30 or 10/45 or 12/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 4 (Expected range: 0-- 12)
2 +plets (f+: 10.4%), 2 -plets (f-: 18.7%)
Total number of charge altplets: 4 (Critical number: 13)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 2 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:38 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.2250.8967.seq)
SWISS-PROT ANNOTATION:
ID ORF8
DE ORF8, 56 bases, 857665F2 checksum.
number of residues: 56; molecular weight: 6.6 kdal
1 MQFYDVLEVL FESANGGYKV YSELDECVYT LTAEDLKHER ELGNEVYDPY QLVFFN
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 2( 3.6%); C : 1( 1.8%); D : 4( 7.1%); E : 8(14.3%); F : 4( 7.1%)
G : 3( 5.4%); H : 1( 1.8%); I : 0( 0.0%); K : 2( 3.6%); L : 7(12.5%)
M : 1( 1.8%); N : 3( 5.4%); P : 1( 1.8%); Q : 2( 3.6%); R : 1( 1.8%)
S : 2( 3.6%); T : 2( 3.6%); V : 6(10.7%); W : 0( 0.0%); Y : 6(10.7%)
KR : 3 ( 5.4%); ED : 12 ( 21.4%); AGP : 6 ( 10.7%);
KRED : 15 ( 26.8%); KR-ED : -9 (-16.1%); FIKMNY : 16 ( 28.6%);
LVIFM : 18 ( 32.1%); ST : 4 ( 7.1%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000-00-00 0-000000+0 00-0--0000 000--0+0-+ -000-00-00 000000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 6/30 or 8/45 or 10/60): none
Negative charge clusters (cmin = 15/30 or 20/45 or 25/60): none
Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.054 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.732 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.214 ( ED )
Expected score/letter: -1.054; Average information/letter: 2.671
Minimal length of displayed segments set to: 20
M_0.01= 5.80 (cv= 2.98, lambda= 1.34903, k= 0.44752, x= 2.81;
90% confidence interval for segment length: 4 +- 4)
M_0.05= 4.59 (x= 1.61)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.214 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.732 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.054 ( KR )
Expected score/letter: -0.411; Average information/letter: 0.279
Minimal length of displayed segments set to: 20
M_0.01= 15.52 (cv= 9.61, lambda= 0.41883, k= 0.11951, x= 5.91;
90% confidence interval for segment length: 34 +- 38)
M_0.05= 11.63 (x= 2.02)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.268 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.732 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.464; Average information/letter: 0.674
Minimal length of displayed segments set to: 20
M_0.01= 7.36 (cv= 4.00, lambda= 1.00552, k= 0.29443, x= 3.36;
90% confidence interval for segment length: 16 +- 15)
M_0.05= 5.74 (x= 1.74)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.732 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.268 ( KEDR )
Expected score/letter: -1.411; Average information/letter: 0.307
Minimal length of displayed segments set to: 20
M_0.01= 24.46 (cv= 14.19, lambda= 0.28371, k= 0.18547, x= 10.28;
90% confidence interval for segment length: 33 +- 22)
M_0.05= 18.72 (x= 4.53)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 3 | 5 | 6 | 23 | 6 | 10 | 11 | 7 | 11 | 12 | 5 | 6 |
lmin1 4 | 7 | 8 | 28 | 8 | 12 | 13 | 9 | 13 | 15 | 7 | 8 |
lmin2 5 | 8 | 9 | 31 | 9 | 13 | 15 | 10 | 15 | 16 | 7 | 9 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 4: 0
* runs >= 4: 0
0 runs >= 16: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.000; Average information/letter: 0.560
Minimal length of displayed segments set to: 15
M_0.01= 22.10 (cv= 12.21, lambda= 0.32960, k= 0.26130, x= 9.88;
90% confidence interval for segment length: 19 +- 13)
M_0.05= 17.15 (x= 4.94)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.179; Average information/letter: 0.565
Minimal length of displayed segments set to: 15
M_0.01= 47.73 (cv= 26.97, lambda= 0.14926, k= 0.22291, x= 20.76;
90% confidence interval for segment length: 18 +- 14)
M_0.05= 36.81 (x= 9.84); M_0.30= 23.82 (x= -3.15)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-26-C-29-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 2 (Expected range: 0-- 11)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 0 (11-20) 1 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 8/30 or 10/45 or 12/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 2 (Expected range: 0-- 7)
0 +plets (f+: 5.4%), 2 -plets (f-: 21.4%)
Total number of charge altplets: 1 (Critical number: 6)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 1 (11-20) 0 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:40 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.8967.seq)
SWISS-PROT ANNOTATION:
ID ORF9
DE ORF9, 69 bases, DEE0CBC3 checksum.
number of residues: 69; molecular weight: 8.2 kdal
1 MPTVYVIKMI QRGTDEPDYQ IEKQFTGKSI AEQLSLAIDH AEWLNERNEA SWRHYFASQY
61 PPDEIIWNG
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 5( 7.2%); C : 0( 0.0%); D : 4( 5.8%); E : 7(10.1%); F : 2( 2.9%)
G : 3( 4.3%); H : 2( 2.9%); I : 7(10.1%); K : 3( 4.3%); L : 3( 4.3%)
M : 2( 2.9%); N : 3( 4.3%); P : 4( 5.8%); Q : 5( 7.2%); R : 3( 4.3%)
S : 4( 5.8%); T : 3( 4.3%); V : 2( 2.9%); W : 3( 4.3%); Y : 4( 5.8%)
KR : 6 ( 8.7%); ED : 11 ( 15.9%); AGP : 12 ( 17.4%);
KRED : 17 ( 24.6%); KR-ED : -5 ( -7.2%); FIKMNY : 21 ( 30.4%);
LVIFM : 16 ( 23.2%); ST : 7 ( 10.1%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000+00 0+00--0-00 0-+0000+00 0-000000-0 0-000-+0-0 00+0000000
61 00--00000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 8/30 or 11/45 or 13/60): none
Negative charge clusters (cmin = 12/30 or 16/45 or 20/60): none
Mixed charge clusters (cmin = 16/30 or 22/45 or 28/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.087 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.754 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.159 ( ED )
Expected score/letter: -0.899; Average information/letter: 1.731
Minimal length of displayed segments set to: 20
M_0.01= 7.45 (cv= 4.01, lambda= 1.05591, k= 0.37880, x= 3.44;
90% confidence interval for segment length: 7 +- 6)
M_0.05= 5.90 (x= 1.89)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.159 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.754 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.087 ( KR )
Expected score/letter: -0.609; Average information/letter: 0.673
Minimal length of displayed segments set to: 20
M_0.01= 11.34 (cv= 6.54, lambda= 0.64712, k= 0.22378, x= 4.80;
90% confidence interval for segment length: 16 +- 16)
M_0.05= 8.82 (x= 2.28)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.246 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.754 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.507; Average information/letter: 0.818
Minimal length of displayed segments set to: 20
M_0.01= 6.94 (cv= 3.79, lambda= 1.11803, k= 0.34142, x= 3.15;
90% confidence interval for segment length: 14 +- 12)
M_0.05= 5.48 (x= 1.70)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.754 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.246 ( KEDR )
Expected score/letter: -1.217; Average information/letter: 0.250
Minimal length of displayed segments set to: 20
M_0.01= 28.00 (cv= 17.04, lambda= 0.24855, k= 0.15325, x= 10.96;
90% confidence interval for segment length: 40 +- 29)
M_0.05= 21.44 (x= 4.40)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 6 | 26 | 7 | 9 | 11 | 8 | 10 | 12 | 6 | 7 |
lmin1 5 | 6 | 8 | 32 | 9 | 11 | 13 | 10 | 13 | 15 | 7 | 9 |
lmin2 6 | 7 | 9 | 35 | 10 | 12 | 15 | 12 | 14 | 17 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 18: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.333; Average information/letter: 0.825
Minimal length of displayed segments set to: 15
M_0.01= 16.90 (cv= 9.23, lambda= 0.45870, k= 0.33919, x= 7.67;
90% confidence interval for segment length: 14 +- 9)
M_0.05= 13.35 (x= 4.12)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.841; Average information/letter: 0.849
Minimal length of displayed segments set to: 15
M_0.01= 35.52 (cv= 19.91, lambda= 0.21261, k= 0.27735, x= 15.60;
90% confidence interval for segment length: 13 +- 9)
M_0.05= 27.85 (x= 7.94); M_0.30= 18.73 (x= -1.18)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-69-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 2 (Expected range: 0-- 10)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 2 (6-10) 0 (11-20) 0 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 7/30 or 9/45 or 11/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 2 (Expected range: 0-- 7)
0 +plets (f+: 8.7%), 2 -plets (f-: 15.9%)
Total number of charge altplets: 2 (Critical number: 7)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 1 (11-20) 1 (>=21) 1
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 7)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:41 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4816.9244.seq)
SWISS-PROT ANNOTATION:
ID ORF10
DE ORF10, 65 bases, 33B5E21F checksum.
number of residues: 65; molecular weight: 7.2 kdal
1 MEWINDIIFA IAKEAGCSPT QVRVLKWDMD GELTVCTCEL PRRGLTPIGI PDTMFASVSG
61 VHTLH
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 4( 6.2%); C : 3( 4.6%); D : 4( 6.2%); E : 4( 6.2%); F : 2( 3.1%)
G : 5( 7.7%); H : 2( 3.1%); I : 6( 9.2%); K : 2( 3.1%); L : 5( 7.7%)
M : 3( 4.6%); N : 1( 1.5%); P : 4( 6.2%); Q : 1( 1.5%); R : 3( 4.6%)
S : 3( 4.6%); T : 6( 9.2%); V : 5( 7.7%); W : 2( 3.1%); Y : 0( 0.0%)
KR : 5 ( 7.7%); ED : 8 ( 12.3%); AGP : 13 ( 20.0%);
KRED : 13 ( 20.0%); KR-ED : -3 ( -4.6%); FIKMNY : 14 ( 21.5%);
LVIFM : 21 ( 32.3%); ST : 9 ( 13.8%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0-000-0000 00+-000000 00+00+0-0- 0-000000-0 0++0000000 0-00000000
61 00000
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 8/30 or 10/45 or 12/60): none
Negative charge clusters (cmin = 10/30 or 14/45 or 17/60): none
Mixed charge clusters (cmin = 14/30 or 19/45 or 24/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.077 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.800 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.123 ( ED )
Expected score/letter: -0.892; Average information/letter: 1.891
Minimal length of displayed segments set to: 20
M_0.01= 6.98 (cv= 3.72, lambda= 1.12285, k= 0.39067, x= 3.26;
90% confidence interval for segment length: 6 +- 6)
M_0.05= 5.53 (x= 1.81)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.123 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.800 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.077 ( KR )
Expected score/letter: -0.708; Average information/letter: 1.040
Minimal length of displayed segments set to: 20
M_0.01= 9.21 (cv= 5.10, lambda= 0.81802, k= 0.28992, x= 4.11;
90% confidence interval for segment length: 10 +- 11)
M_0.05= 7.22 (x= 2.12)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.200 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.800 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.600; Average information/letter: 1.200
Minimal length of displayed segments set to: 20
M_0.01= 5.75 (cv= 3.01, lambda= 1.38629, k= 0.45000, x= 2.74;
90% confidence interval for segment length: 10 +- 8)
M_0.05= 4.58 (x= 1.57)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.800 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.200 ( KEDR )
Expected score/letter: -0.800; Average information/letter: 0.133
Minimal length of displayed segments set to: 20
M_0.01= 36.94 (cv= 24.44, lambda= 0.17079, k= 0.08492, x= 12.50;
90% confidence interval for segment length: 68 +- 59)
M_0.05= 27.39 (x= 2.95)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 3 | 4 | 5 | 32 | 7 | 8 | 10 | 8 | 10 | 12 | 6 | 7 |
lmin1 5 | 5 | 7 | 39 | 9 | 10 | 12 | 10 | 12 | 15 | 7 | 9 |
lmin2 5 | 6 | 8 | 43 | 10 | 11 | 14 | 12 | 13 | 16 | 8 | 10 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 21: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.600; Average information/letter: 0.443
Minimal length of displayed segments set to: 15
M_0.01= 24.05 (cv= 13.88, lambda= 0.30079, k= 0.21416, x= 10.17;
90% confidence interval for segment length: 24 +- 18)
M_0.05= 18.63 (x= 4.75)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -2.846; Average information/letter: 0.346
Minimal length of displayed segments set to: 15
M_0.01= 56.27 (cv= 34.01, lambda= 0.12276, k= 0.15453, x= 22.26;
90% confidence interval for segment length: 29 +- 24)
M_0.05= 42.99 (x= 8.98); M_0.30= 27.19 (x= -6.81)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-16-C-18
CTC at 36
-27-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 2 (Expected range: 0-- 10)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 0 (6-10) 1 (11-20) 0 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 7/30 or 9/45 or 11/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 1 (Expected range: 0-- 5)
1 +plets (f+: 7.7%), 0 -plets (f-: 12.3%)
Total number of charge altplets: 1 (Critical number: 6)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 0 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:43 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4810.9522.seq)
SWISS-PROT ANNOTATION:
ID ORF11
DE ORF11, 104 bases, 5E0A9B22 checksum.
number of residues: 104; molecular weight: 12.0 kdal
1 MNHLLYHSGD RVVFVAPQYG VYRPLPAYVL KTVQTGVPFS CPHCGKKVKL HPDNVQRYGY
61 YLDCWYGVYG TDCCKPFIYS QLKRVHCPSE HPYKFINGAI TLVE
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 3( 2.9%); C : 6( 5.8%); D : 4( 3.8%); E : 2( 1.9%); F : 4( 3.8%)
G : 8( 7.7%); H : 6( 5.8%); I : 3( 2.9%); K : 7( 6.7%); L : 8( 7.7%)
M : 1( 1.0%); N : 3( 2.9%); P : 9( 8.7%); Q : 4( 3.8%); R : 4( 3.8%)
S : 4( 3.8%); T : 4( 3.8%); V : 12(11.5%); W : 1( 1.0%); Y : 11(10.6%)
KR : 11 ( 10.6%); ED : 6 ( 5.8%); AGP : 20 ( 19.2%);
KRED : 17 ( 16.3%); KR-ED : 5 ( 4.8%); FIKMNY : 29 ( 27.9%);
LVIFM : 28 ( 26.9%); ST : 8 ( 7.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 000000000- +000000000 00+0000000 +000000000 00000++0+0 00-000+000
61 00-0000000 0-00+00000 00++00000- 000+000000 000-
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 13/45 or 15/60): none
Negative charge clusters (cmin = 6/30 or 8/45 or 10/60): none
Mixed charge clusters (cmin = 13/30 or 17/45 or 21/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.106 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.837 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.058 ( ED )
Expected score/letter: -0.740; Average information/letter: 1.254
Minimal length of displayed segments set to: 20
M_0.01= 8.89 (cv= 5.10, lambda= 0.91119, k= 0.31939, x= 3.80;
90% confidence interval for segment length: 9 +- 9)
M_0.05= 7.10 (x= 2.01)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.058 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.837 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.106 ( KR )
Expected score/letter: -0.933; Average information/letter: 2.378
Minimal length of displayed segments set to: 20
M_0.01= 6.50 (cv= 3.60, lambda= 1.29026, k= 0.42442, x= 2.90;
90% confidence interval for segment length: 5 +- 4)
M_0.05= 5.24 (x= 1.64)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.163 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.837 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.673; Average information/letter: 1.585
Minimal length of displayed segments set to: 20
M_0.01= 5.29 (cv= 2.84, lambda= 1.63269, k= 0.54156, x= 2.44;
90% confidence interval for segment length: 8 +- 6)
M_0.05= 4.29 (x= 1.44)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.837 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.163 ( KEDR )
Expected score/letter: -0.471
Average information/letter: 0.056 < .10; too small !
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 3 | 5 | 42 | 8 | 7 | 10 | 10 | 8 | 12 | 7 | 9 |
lmin1 5 | 4 | 6 | 51 | 10 | 9 | 12 | 12 | 10 | 15 | 9 | 11 |
lmin2 6 | 5 | 7 | 56 | 11 | 10 | 13 | 14 | 12 | 16 | 10 | 13 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 3: 0
0 runs >= 28: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.356; Average information/letter: 0.377
Minimal length of displayed segments set to: 15
M_0.01= 25.66 (cv= 15.74, lambda= 0.29509, k= 0.18764, x= 9.92;
90% confidence interval for segment length: 29 +- 22)
M_0.05= 20.13 (x= 4.40)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -3.125; Average information/letter: 0.474
Minimal length of displayed segments set to: 15
M_0.01= 49.38 (cv= 30.29, lambda= 0.15333, k= 0.18757, x= 19.09;
90% confidence interval for segment length: 23 +- 19)
M_0.05= 38.75 (x= 8.46); M_0.30= 26.10 (x= -4.19)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-40
CPHC at 41
-19-C-8
CC at 73 (l= 34)
-12-C-17-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 19- 22] YGVY
[ 66- 69] YGVY
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 5 (Expected range: 0-- 14)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 1 (11-20) 2 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 9/30 or 12/45 or 14/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 2 (Expected range: 0-- 5)
2 +plets (f+: 10.6%), 0 -plets (f-: 5.8%)
Total number of charge altplets: 1 (Critical number: 6)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 0 (>=21) 3
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
13- 44 8 V....... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 7)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
18- 34 (2.) Q( 16)Q 5 of 5 0.0028 large minimal spacing
56- 81 (3.) Q( 25)Q 1 of 5 0.9977 small maximal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:44 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4813.5345.seq)
SWISS-PROT ANNOTATION:
ID ORF12
DE ORF12, 322 bases, A14E237E checksum.
number of residues: 322; molecular weight: 36.3 kdal
1 MAISLSSLLH AGEFGQFMTN RRTVLTVCRH NGYRSELGGG STIIWRESGT VDHLHGTLDH
61 HRNDVSALRL VRRVPNLRRA ANGATYWTIA GYRARIECRR RPNSQDEEWK ASIHREDGSW
121 KCNIEYNMYG IPRTMPDGSD FDLVWSHDDV DRERQAIREI DKMPDPAHKD NAPHQHGVAA
181 TLGHLQSNGG LRGHSVGDVF PYVPFYKGAS CEQWVRCPDG TEEGPFSSIA AMHQFIEMNE
241 LVKEQERLPV FTEPVLQMGA AAASHIPVTP ESAEQIIEVL EYIGVPPQDM ENIGVIDDTG
301 DGFYSLREYL DIVEQSNKLL ED
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 22( 6.8%); C : 5( 1.6%); D : 22( 6.8%); E : 26( 8.1%); F : 9( 2.8%)
G : 27( 8.4%); H+ : 15( 4.7%); I : 19( 5.9%); K : 7( 2.2%); L : 22( 6.8%)
M : 9( 2.8%); N : 13( 4.0%); P : 17( 5.3%); Q : 12( 3.7%); R : 25( 7.8%)
S : 21( 6.5%); T : 14( 4.3%); V : 21( 6.5%); W : 6( 1.9%); Y : 10( 3.1%)
KR : 32 ( 9.9%); ED : 48 ( 14.9%); AGP : 66 ( 20.5%);
KRED : 80 ( 24.8%); KR-ED - : -16 ( -5.0%); FIKMNY : 67 ( 20.8%);
LVIFM : 80 ( 24.8%); ST : 35 ( 10.9%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000000 00-0000000 ++000000+0 000+0-0000 00000+-000 0-000000-0
61 0+0-0000+0 0++0000++0 0000000000 00+0+0-0++ +0000---0+ 0000+--000
121 +000-00000 00+000-00- 0-00000--0 -+-+000+-0 -+00-000+- 0000000000
181 0000000000 0+00000-00 000000+000 0-000+00-0 0--0000000 000000-00-
241 00+-0-+000 00-0000000 0000000000 -00-000-00 -0000000-0 -00000--00
301 -00000+-00 -00-000+00 --
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 12/45 or 15/60): none
Negative charge clusters (cmin = 12/30 or 16/45 or 19/60): none
Mixed charge clusters (cmin = 16/30 or 22/45 or 28/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.099 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.752 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.149 ( ED )
Expected score/letter: -0.851; Average information/letter: 1.489
Minimal length of displayed segments set to: 20
M_0.01= 9.60 (cv= 5.94, lambda= 0.97201, k= 0.35331, x= 3.66;
90% confidence interval for segment length: 9 +- 8)
M_0.05= 7.93 (x= 1.99)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.149 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.752 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.099 ( KR )
Expected score/letter: -0.652; Average information/letter: 0.783
Minimal length of displayed segments set to: 20
M_0.01= 12.87 (cv= 8.28, lambda= 0.69717, k= 0.24662, x= 4.59;
90% confidence interval for segment length: 17 +- 15)
M_0.05= 10.54 (x= 2.25)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.248 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.752 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.503; Average information/letter: 0.803
Minimal length of displayed segments set to: 20
M_0.01= 8.39 (cv= 5.22, lambda= 1.10691, k= 0.33679, x= 3.17;
90% confidence interval for segment length: 17 +- 14)
M_0.05= 6.92 (x= 1.70)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.752 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.248 ( KEDR )
Expected score/letter: -1.236; Average information/letter: 0.255
Minimal length of displayed segments set to: 20
M_0.01= 33.81 (cv= 22.92, lambda= 0.25195, k= 0.15636, x= 10.89;
90% confidence interval for segment length: 48 +- 31)
M_0.05= 27.34 (x= 4.42)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 7 | 31 | 9 | 10 | 13 | 10 | 12 | 15 | 8 | 10 |
lmin1 6 | 7 | 9 | 38 | 11 | 12 | 15 | 13 | 14 | 18 | 9 | 12 |
lmin2 7 | 8 | 10 | 42 | 12 | 14 | 17 | 14 | 16 | 20 | 11 | 13 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 1, at 99;
- runs >= 4: 0
* runs >= 5: 0
0 runs >= 21: 1, at 171;
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.217; Average information/letter: 0.745
Minimal length of displayed segments set to: 15
M_0.01= 21.64 (cv= 13.55, lambda= 0.42615, k= 0.31633, x= 8.09;
90% confidence interval for segment length: 18 +- 11)
M_0.05= 17.82 (x= 4.27)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.410; Average information/letter: 0.700
Minimal length of displayed segments set to: 15
M_0.01= 48.42 (cv= 31.08, lambda= 0.18581, k= 0.25212, x= 17.34;
90% confidence interval for segment length: 19 +- 12)
M_0.05= 39.65 (x= 8.57); M_0.30= 29.21 (x= -1.87)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-27-C-69-C-23-C-88-C-5-C-105-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 22 (Expected range: 5-- 31)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 7 (6-10) 5 (11-20) 6 (>=21) 5
3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 10 (Expected range: 0-- 18)
4 +plets (f+: 9.9%), 6 -plets (f-: 14.9%)
Total number of charge altplets: 9 (Critical number: 20)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 3 (6-10) 1 (11-20) 1 (>=21) 6
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
260- 263 1 A 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
226- 243 3 i0. 6 6 /0/2/./
271- 330 10 -00.00..00 6 6 ! /0/1/1/./1/1/././2/1/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
24- 27 (1.) V( 3)V 22 of 22 0.0081 large minimal spacing
71- 74 (1.) V( 3)V 20 of 22 0.0081 matching minimum
196- 199 (3.) V( 3)V 21 of 22 0.0081 matching minimum
315- 323 (4.) Q( 8)Q 13 of 13 0.0093 large minimal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:46 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4817.6454.seq)
SWISS-PROT ANNOTATION:
ID ORF13
DE ORF13, 105 bases, D1ECCB18 checksum.
number of residues: 105; molecular weight: 12.4 kdal
1 MRTNNQHYQS KGRKPAPKKQ KKNANRMFFD PMVYVLDIIR KHFDEKAFLT HNQKHGHRVI
61 KYRHTTGKRL VLGTPANSRL VYDFAFAEMD EEACEQLCQE LGQVK
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 7( 6.7%); C : 2( 1.9%); D : 5( 4.8%); E : 6( 5.7%); F : 6( 5.7%)
G : 5( 4.8%); H : 6( 5.7%); I : 3( 2.9%); K : 12(11.4%); L : 7( 6.7%)
M : 4( 3.8%); N : 6( 5.7%); P : 4( 3.8%); Q : 7( 6.7%); R : 8( 7.6%)
S : 2( 1.9%); T : 5( 4.8%); V : 6( 5.7%); W : 0( 0.0%); Y : 4( 3.8%)
KR : 20 ( 19.0%); ED : 11 ( 10.5%); AGP : 16 ( 15.2%);
KRED : 31 ( 29.5%); KR-ED : 9 ( 8.6%); FIKMNY : 35 ( 33.3%);
LVIFM : 26 ( 24.8%); ST : 7 ( 6.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0+00000000 +0++000++0 ++000+000- 000000-00+ +00--+0000 000+000+00
61 +0+0000++0 00000000+0 00-0000-0- --00-0000- 0000+
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 14/30 or 19/45 or 23/60): none
Negative charge clusters (cmin = 9/30 or 12/45 or 15/60): none
Mixed charge clusters (cmin = 18/30 or 25/45 or 31/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.190 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.705 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.105 ( ED )
Expected score/letter: -0.533; Average information/letter: 0.464
Minimal length of displayed segments set to: 20
M_0.01= 14.21 (cv= 8.81, lambda= 0.52851, k= 0.17478, x= 5.40;
90% confidence interval for segment length: 23 +- 24)
M_0.05= 11.13 (x= 2.32)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.105 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.705 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.190 ( KR )
Expected score/letter: -0.876; Average information/letter: 1.460
Minimal length of displayed segments set to: 20
M_0.01= 8.66 (cv= 4.91, lambda= 0.94862, k= 0.35253, x= 3.75;
90% confidence interval for segment length: 8 +- 7)
M_0.05= 6.94 (x= 2.03)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.295 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.705 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.410; Average information/letter: 0.514
Minimal length of displayed segments set to: 20
M_0.01= 8.99 (cv= 5.35, lambda= 0.87008, k= 0.23797, x= 3.64;
90% confidence interval for segment length: 22 +- 20)
M_0.05= 7.11 (x= 1.76)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.705 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.295 ( KEDR )
Expected score/letter: -1.657; Average information/letter: 0.383
Minimal length of displayed segments set to: 20
M_0.01= 23.66 (cv= 14.18, lambda= 0.32831, k= 0.22607, x= 9.48;
90% confidence interval for segment length: 29 +- 17)
M_0.05= 18.69 (x= 4.52)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 4 | 7 | 23 | 10 | 8 | 12 | 11 | 9 | 13 | 7 | 9 |
lmin1 7 | 5 | 9 | 28 | 12 | 10 | 15 | 13 | 11 | 16 | 9 | 11 |
lmin2 8 | 6 | 10 | 31 | 13 | 11 | 16 | 15 | 12 | 18 | 10 | 13 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 4: 0
- runs >= 3: 1, at 90;
* runs >= 5: 0
0 runs >= 15: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.648; Average information/letter: 0.944
Minimal length of displayed segments set to: 15
M_0.01= 17.06 (cv= 9.61, lambda= 0.48411, k= 0.36910, x= 7.44;
90% confidence interval for segment length: 13 +- 7)
M_0.05= 13.69 (x= 4.08)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -5.590; Average information/letter: 0.989
Minimal length of displayed segments set to: 15
M_0.01= 36.58 (cv= 21.05, lambda= 0.22107, k= 0.31125, x= 15.53;
90% confidence interval for segment length: 12 +- 7)
M_0.05= 29.21 (x= 8.16); M_0.30= 20.44 (x= -0.62)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-93-C-3-C-7-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 7 (Expected range: 0-- 14)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 2 (6-10) 2 (11-20) 2 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 7 (Expected range: 0-- 11)
5 +plets (f+: 19.0%), 2 -plets (f-: 10.5%)
Total number of charge altplets: 1 (Critical number: 12)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 3 (6-10) 0 (11-20) 3 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:47 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.7286.seq)
SWISS-PROT ANNOTATION:
ID ORF14
DE ORF14, 62 bases, A3CAE8C checksum.
number of residues: 62; molecular weight: 6.5 kdal
1 MAMVYGIVAI IAALCCGSIV SFGAKTKENP DVTDYIASGF ILTIGVVCLV AGLDWLNLIP
61 EM
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 7(11.3%); C : 3( 4.8%); D : 3( 4.8%); E : 2( 3.2%); F : 2( 3.2%)
G : 6( 9.7%); H : 0( 0.0%); I : 8(12.9%); K : 2( 3.2%); L : 6( 9.7%)
M : 3( 4.8%); N : 2( 3.2%); P : 2( 3.2%); Q : 0( 0.0%); R : 0( 0.0%)
S : 3( 4.8%); T : 3( 4.8%); V : 7(11.3%); W : 1( 1.6%); Y : 2( 3.2%)
KR : 2 ( 3.2%); ED : 5 ( 8.1%); AGP : 15 ( 24.2%);
KRED : 7 ( 11.3%); KR-ED : -3 ( -4.8%); FIKMNY : 19 ( 30.6%);
LVIFM : 26 ( 41.9%); ST : 6 ( 9.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000000 0000000000 0000+0+-00 -00-000000 0000000000 000-000000
61 -0
A. CHARGE CLUSTERS.
Positive charge clusters: not evaluated (frequency of + < 5%, too low)
Negative charge clusters (cmin = 8/30 or 10/45 or 13/60): none
Mixed charge clusters (cmin = 10/30 or 13/45 or 16/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.032 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.887 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.081 ( ED )
Expected score/letter: -0.984; Average information/letter: 3.408
Minimal length of displayed segments set to: 20
M_0.01= 4.92 (cv= 2.55, lambda= 1.61837, k= 0.46825, x= 2.37;
90% confidence interval for segment length: 3 +- 3)
M_0.05= 3.92 (x= 1.37)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.081 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.887 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.032 ( KR )
Expected score/letter: -0.790; Average information/letter: 1.673
Minimal length of displayed segments set to: 20
M_0.01= 7.17 (cv= 3.84, lambda= 1.07607, k= 0.36516, x= 3.34;
90% confidence interval for segment length: 7 +- 7)
M_0.05= 5.66 (x= 1.82)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.113 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.887 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.774; Average information/letter: 2.302
Minimal length of displayed segments set to: 20
M_0.01= 4.04 (cv= 2.00, lambda= 2.06142, k= 0.67566, x= 2.04;
90% confidence interval for segment length: 5 +- 4)
M_0.05= 3.25 (x= 1.25)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.887 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.113 ( KEDR )
Expected score/letter: -0.016
Average information/letter: 0.000 < .10; too small !
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 3 | 3 | 4 | 55 | 6 | 7 | 8 | 7 | 9 | 10 | NA | NA |
lmin1 4 | 5 | 5 | 67 | 7 | 9 | 10 | 9 | 11 | 13 | NA | NA |
lmin2 4 | 5 | 6 | 74 | 8 | 10 | 11 | 10 | 13 | 14 | NA | NA |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 3: 0
0 runs >= 36: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -0.339
Average information/letter: 0.029 < .10; too small !
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -0.323
Average information/letter: 0.006 < .10; too small !
2. SPACINGS OF C.
H2N-14
CC at 15
-31-C-14-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 4 (Expected range: 0-- 11)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 2 (6-10) 1 (11-20) 1 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 11/30 or 15/45 or 17/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 0 (Expected range: 0-- 3)
0 +plets (f+: 3.2%), 0 -plets (f-: 8.1%)
Total number of charge altplets: 1 (Critical number: 3)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 0 (>=21) 1
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
41- 64 3 i.. 8 8 ! 0
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:49 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.8395.seq)
SWISS-PROT ANNOTATION:
ID ORF15
DE ORF15, 131 bases, 89229DD9 checksum.
number of residues: 131; molecular weight: 15.0 kdal
1 MRHQLHPLTL EYTRLAVQDL LQSARTKVLR EFTCLEIKSK VKSDLKQYAT DLTEGQVYAQ
61 MELVGDIMVE LAKEFCEDMG IPYHYSFVER HIDEHAKSWD ASHRACDACI GAGEHALRVS
121 WLLHLQDDIG A
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 12( 9.2%); C : 4( 3.1%); D : 10( 7.6%); E : 11( 8.4%); F : 3( 2.3%)
G : 6( 4.6%); H : 8( 6.1%); I : 6( 4.6%); K : 7( 5.3%); L : 16(12.2%)
M : 4( 3.1%); N : 0( 0.0%); P : 2( 1.5%); Q : 7( 5.3%); R : 7( 5.3%)
S : 7( 5.3%); T : 6( 4.6%); V : 8( 6.1%); W : 2( 1.5%); Y : 5( 3.8%)
KR : 14 ( 10.7%); ED : 21 ( 16.0%); AGP : 20 ( 15.3%);
KRED : 35 ( 26.7%); KR-ED : -7 ( -5.3%); FIKMNY : 25 ( 19.1%);
LVIFM : 37 ( 28.2%); ST : 13 ( 9.9%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0+00000000 -00+0000-0 0000+0+00+ -0000-0+0+ 0+0-0+0000 -00-000000
61 0-000-000- 00+-00--00 00000000-+ 00--00+00- 000+00-000 000-000+00
121 000000--00 0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 13/45 or 15/60): none
Negative charge clusters (cmin = 12/30 or 17/45 or 20/60): none
Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.107 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.733 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.160 ( ED )
Expected score/letter: -0.840; Average information/letter: 1.384
Minimal length of displayed segments set to: 20
M_0.01= 9.04 (cv= 5.25, lambda= 0.92942, k= 0.34180, x= 3.79;
90% confidence interval for segment length: 9 +- 8)
M_0.05= 7.29 (x= 2.04)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.160 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.733 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.107 ( KR )
Expected score/letter: -0.626; Average information/letter: 0.690
Minimal length of displayed segments set to: 20
M_0.01= 12.30 (cv= 7.50, lambda= 0.65013, k= 0.22823, x= 4.80;
90% confidence interval for segment length: 17 +- 17)
M_0.05= 9.79 (x= 2.30)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.267 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.733 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.466; Average information/letter: 0.678
Minimal length of displayed segments set to: 20
M_0.01= 8.18 (cv= 4.83, lambda= 1.00900, k= 0.29588, x= 3.35;
90% confidence interval for segment length: 18 +- 16)
M_0.05= 6.57 (x= 1.74)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.733 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.267 ( KEDR )
Expected score/letter: -1.405; Average information/letter: 0.305
Minimal length of displayed segments set to: 20
M_0.01= 27.55 (cv= 17.25, lambda= 0.28259, k= 0.18445, x= 10.30;
90% confidence interval for segment length: 37 +- 23)
M_0.05= 21.78 (x= 4.53)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 7 | 26 | 8 | 9 | 12 | 9 | 11 | 13 | 8 | 10 |
lmin1 5 | 7 | 9 | 32 | 10 | 11 | 14 | 11 | 13 | 16 | 9 | 12 |
lmin2 6 | 8 | 10 | 35 | 11 | 13 | 16 | 13 | 15 | 18 | 10 | 13 |
(Significance level: 0.010000; Minimal displayed length: 6)
(*0) 13(0,0,0); at 35- 47: LEIKSKVKSDLKQ
(2. quartile) 0-0+0+0+0-0+0
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 5: 0
0 runs >= 17: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.115; Average information/letter: 0.639
Minimal length of displayed segments set to: 15
M_0.01= 22.10 (cv= 13.10, lambda= 0.37211, k= 0.28637, x= 9.00;
90% confidence interval for segment length: 19 +- 12)
M_0.05= 17.72 (x= 4.62)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.237; Average information/letter: 0.611
Minimal length of displayed segments set to: 15
M_0.01= 48.68 (cv= 29.58, lambda= 0.16479, k= 0.23385, x= 19.10;
90% confidence interval for segment length: 19 +- 13)
M_0.05= 38.79 (x= 9.21); M_0.30= 27.02 (x= -2.56)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-33-C-41-C-29
CDAC at 106
-22-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 3 (Expected range: 0-- 17)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 2 (6-10) 0 (11-20) 1 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 6/30 or 8/45 or 9/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 3 (Expected range: 0-- 11)
0 +plets (f+: 10.7%), 3 -plets (f-: 16.0%)
Total number of charge altplets: 3 (Critical number: 12)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 1 (6-10) 0 (11-20) 1 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
5- 24 5 L.... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
30- 85 8 *....... 7 7 0
36- 47 2 *. 6 6 0
57- 92 6 i..... 6 6 0
62- 81 4 -00. 5 5 0
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Location (Quartile) Spacing Rank P-value Interpretation
6- 84 (2.) H( 78)H 1 of 9 0.0046 large 1. maximal spacing
52- 63 (2.) L( 11)L 2 of 17 0.9945 small 2. maximal spacing
71- 117 (3.) L( 46)L 1 of 17 0.0097 large 1. maximal spacing
103- 115 (4.) H( 12)H 2 of 9 0.9978 small 2. maximal spacing
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:50 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4818.9504.seq)
SWISS-PROT ANNOTATION:
ID ORF16
DE ORF16, 145 bases, 61EEA456 checksum.
number of residues: 145; molecular weight: 16.3 kdal
1 MELQTIFNMV LQHLARQDHG ASDFEHGCAY RAEGGRKCAV GYLLSESAYH DDVEGEVIGD
61 EHINAAVEES IGELPYGGYN VLSRLQTLHD GDYDMVCSGE EVSGMSKERR VLRMVSAVEE
121 LADAFDLIVP RAFYDWIKKY GYERQ
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 12( 8.3%); C : 3( 2.1%); D : 11( 7.6%); E : 16(11.0%); F : 4( 2.8%)
G : 14( 9.7%); H : 6( 4.1%); I : 6( 4.1%); K : 4( 2.8%); L : 12( 8.3%)
M : 5( 3.4%); N : 3( 2.1%); P : 2( 1.4%); Q : 5( 3.4%); R : 9( 6.2%)
S : 9( 6.2%); T : 2( 1.4%); V : 12( 8.3%); W : 1( 0.7%); Y : 9( 6.2%)
KR : 13 ( 9.0%); ED : 27 ( 18.6%); AGP : 28 ( 19.3%);
KRED : 40 ( 27.6%); KR-ED : -14 ( -9.7%); FIKMNY : 31 ( 21.4%);
LVIFM : 39 ( 26.9%); ST : 11 ( 7.6%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0-00000000 00000+0-00 00-0-00000 +0-00++000 00000-0000 --0-0-000-
61 -000000--0 00-0000000 000+00000- 0-0-00000- -00000+-++ 00+00000--
121 00-00-0000 +000-00++0 00-+0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 8/30 or 11/45 or 14/60): none
Negative charge clusters (cmin = 14/30 or 18/45 or 23/60): none
Mixed charge clusters (cmin = 18/30 or 24/45 or 30/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.090 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.724 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.186 ( ED )
Expected score/letter: -0.917; Average information/letter: 1.718
Minimal length of displayed segments set to: 20
M_0.01= 8.25 (cv= 4.77, lambda= 1.04276, k= 0.37920, x= 3.48;
90% confidence interval for segment length: 7 +- 6)
M_0.05= 6.69 (x= 1.92)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.186 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.724 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.090 ( KR )
Expected score/letter: -0.531; Average information/letter: 0.474
Minimal length of displayed segments set to: 20
M_0.01= 14.56 (cv= 9.24, lambda= 0.53876, k= 0.17716, x= 5.33;
90% confidence interval for segment length: 24 +- 24)
M_0.05= 11.54 (x= 2.30)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.276 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.724 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.448; Average information/letter: 0.624
Minimal length of displayed segments set to: 20
M_0.01= 8.60 (cv= 5.16, lambda= 0.96508, k= 0.27750, x= 3.44;
90% confidence interval for segment length: 19 +- 17)
M_0.05= 6.91 (x= 1.75)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.724 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.276 ( KEDR )
Expected score/letter: -1.483; Average information/letter: 0.329
Minimal length of displayed segments set to: 20
M_0.01= 26.80 (cv= 16.77, lambda= 0.29675, k= 0.19742, x= 10.03;
90% confidence interval for segment length: 35 +- 21)
M_0.05= 21.31 (x= 4.54)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 6 | 7 | 26 | 8 | 10 | 12 | 9 | 11 | 14 | 7 | 9 |
lmin1 5 | 7 | 9 | 31 | 9 | 12 | 15 | 11 | 14 | 17 | 8 | 11 |
lmin2 6 | 8 | 10 | 35 | 11 | 14 | 16 | 12 | 16 | 19 | 10 | 12 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 4: 0
* runs >= 5: 0
0 runs >= 17: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.034; Average information/letter: 0.580
Minimal length of displayed segments set to: 15
M_0.01= 23.61 (cv= 14.21, lambda= 0.35018, k= 0.27007, x= 9.40;
90% confidence interval for segment length: 21 +- 13)
M_0.05= 18.96 (x= 4.74)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.207; Average information/letter: 0.585
Minimal length of displayed segments set to: 15
M_0.01= 50.02 (cv= 30.71, lambda= 0.16206, k= 0.22976, x= 19.31;
90% confidence interval for segment length: 20 +- 14)
M_0.05= 39.96 (x= 9.25); M_0.30= 27.99 (x= -2.71)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-27-C-9-C-58-C-48-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 66- 69] AVEE
[ 117- 120] AVEE
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 10 (Expected range: 0-- 18)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 1 (6-10) 6 (11-20) 2 (>=21) 2
3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 8 (Expected range: 0-- 12)
3 +plets (f+: 9.0%), 5 -plets (f-: 18.6%)
Total number of charge altplets: 2 (Critical number: 13)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 5 (11-20) 2 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
20- 47 7 G...... 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
67- 108 7 i.00.0* 6 6 /0/./2/2/./2/2/
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:52 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4814.2277.seq)
SWISS-PROT ANNOTATION:
ID ORF17
DE ORF17, 306 bases, 96841AB3 checksum.
number of residues: 306; molecular weight: 34.8 kdal
1 MPSTSSYLVL STIGLRSTGM RDSSFDTDIL DMTTDLCIGE RSDHLVCPCC RGGDSGERSL
61 LVWCHADGLA YRCYRVKCGL SGKIGQSGYR PVSTKMRKPK CHTRQLHPEP LPNDVLDWYL
121 DYFWWADAKM LRVNGVLWDE TTERILYPIK SMTGTHEGYL ARRYDDLVLD KSNFQGGKAK
181 AYYNSLPTDY KMTCMMTPLK AQFDEWVVVV EDYPSAMRIN TEIPCVALSG TSIQDATLME
241 LVRAGKRKVC FVLDADATSK AASMVYNYGL HFEHLTFVPL FDADPKDMED EDFDDLVDTI
301 RRQLDV
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 17( 5.6%); C+ : 11( 3.6%); D++: 31(10.1%); E : 13( 4.2%); F : 9( 2.9%)
G : 19( 6.2%); H : 7( 2.3%); I- : 10( 3.3%); K : 16( 5.2%); L : 31(10.1%)
M : 13( 4.2%); N : 6( 2.0%); P : 14( 4.6%); Q : 6( 2.0%); R : 19( 6.2%)
S : 21( 6.9%); T : 21( 6.9%); V : 21( 6.9%); W : 6( 2.0%); Y : 15( 4.9%)
KR : 35 ( 11.4%); ED : 44 ( 14.4%); AGP : 50 ( 16.3%);
KRED : 79 ( 25.8%); KR-ED : -9 ( -2.9%); FIKMNY : 69 ( 22.5%);
LVIFM : 84 ( 27.5%); ST : 42 ( 13.7%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 0000000000 00000+0000 +-000-0-00 -000-0000- +0-0000000 +00-00-+00
61 000000-000 0+00+0+000 00+000000+ 0000+0++0+ 000+0000-0 000-00-000
121 -00000-0+0 0+000000-- 00-+00000+ 000000-000 0++0--000- +000000+0+
181 00000000-0 +00000000+ 000--00000 --00000+00 0-00000000 0000-0000-
241 00+00+++00 000-0-000+ 0000000000 00-0000000 0-0-0+-0-- --0--00-00
301 ++00-0
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 10/30 or 13/45 or 16/60): none
Negative charge clusters (cmin = 12/30 or 15/45 or 19/60): none
Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.114 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.742 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.144 ( ED )
Expected score/letter: -0.801; Average information/letter: 1.250
Minimal length of displayed segments set to: 20
M_0.01= 10.42 (cv= 6.48, lambda= 0.88281, k= 0.32379, x= 3.93;
90% confidence interval for segment length: 11 +- 9)
M_0.05= 8.57 (x= 2.09)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.144 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.742 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.114 ( KR )
Expected score/letter: -0.683; Average information/letter: 0.855
Minimal length of displayed segments set to: 20
M_0.01= 12.37 (cv= 7.88, lambda= 0.72609, k= 0.26078, x= 4.48;
90% confidence interval for segment length: 15 +- 14)
M_0.05= 10.12 (x= 2.24)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.258 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.742 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.484; Average information/letter: 0.737
Minimal length of displayed segments set to: 20
M_0.01= 8.69 (cv= 5.42, lambda= 1.05550, k= 0.31534, x= 3.26;
90% confidence interval for segment length: 18 +- 15)
M_0.05= 7.14 (x= 1.72)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.742 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.258 ( KEDR )
Expected score/letter: -1.324; Average information/letter: 0.281
Minimal length of displayed segments set to: 20
M_0.01= 31.94 (cv= 21.37, lambda= 0.26789, k= 0.17096, x= 10.58;
90% confidence interval for segment length: 44 +- 28)
M_0.05= 25.86 (x= 4.49)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 5 | 5 | 7 | 30 | 9 | 10 | 13 | 11 | 11 | 15 | 6 | 8 |
lmin1 6 | 7 | 9 | 36 | 11 | 12 | 16 | 13 | 14 | 18 | 8 | 10 |
lmin2 7 | 8 | 11 | 40 | 12 | 13 | 17 | 15 | 16 | 20 | 9 | 11 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 1, at 246;
- runs >= 3: 1, at 289;
* runs >= 5: 0
0 runs >= 20: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -2.137; Average information/letter: 0.665
Minimal length of displayed segments set to: 15
M_0.01= 23.69 (cv= 14.90, lambda= 0.38423, k= 0.29426, x= 8.79;
90% confidence interval for segment length: 20 +- 12)
M_0.05= 19.44 (x= 4.55)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.023; Average information/letter: 0.594
Minimal length of displayed segments set to: 15
M_0.01= 51.80 (cv= 33.57, lambda= 0.17049, k= 0.22491, x= 18.23;
90% confidence interval for segment length: 21 +- 15)
M_0.05= 42.24 (x= 8.67); M_0.30= 30.87 (x= -2.70)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-36-C-9
CPC at 47
CC at 49 (l= 4)
-13-C-8-C-4-C-22-C-92-C-30-C-24-C-56-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
Aligned matching blocks:
[ 39- 46] GERSDHLV
[ 56- 62] GERS_LLV
______________________________
[ 165- 168] DDLV
[ 294- 297] DDLV
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 17 (Expected range: 5-- 30)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 5 (6-10) 4 (11-20) 6 (>=21) 3
3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 10 (Expected range: 0-- 18)
4 +plets (f+: 11.4%), 6 -plets (f-: 14.4%)
Total number of charge altplets: 6 (Critical number: 21)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 3 (6-10) 2 (11-20) 0 (>=21) 6
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 5)
Location Period Element Copies Core Errors
5- 28 6 S..... 4 4 0
79- 90 3 G.. 4 4 0
207- 210 1 V 4 4 0
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9)
Location Period Element Copies Core Errors
264- 311 8 i....... 6 6 0
286- 303 3 *.. 6 6 0
--------------------------------------------------------------------------------
SPACING ANALYSIS.
There are no unusual spacings.
SAPS. Version of April 11, 1996.
Date run: Mon Oct 1 23:15:53 2001
********************************************************************************
Protein 1 (File: wwwtmp/.SAPS.4811.2554.seq)
SWISS-PROT ANNOTATION:
ID ORF18
DE ORF18, 49 bases, B905AA2E checksum.
number of residues: 49; molecular weight: 6.1 kdal
1 MFDKDTVKEW NAHLMSHNAD SVGLPFEERI TWVMEWTLMW YMMEKFYGK
--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)
A : 2( 4.1%); C : 0( 0.0%); D : 3( 6.1%); E : 5(10.2%); F : 3( 6.1%)
G : 2( 4.1%); H : 2( 4.1%); I : 1( 2.0%); K : 4( 8.2%); L : 3( 6.1%)
M : 6(12.2%); N : 2( 4.1%); P : 1( 2.0%); Q : 0( 0.0%); R : 1( 2.0%)
S : 2( 4.1%); T : 3( 6.1%); V : 3( 6.1%); W : 4( 8.2%); Y : 2( 4.1%)
KR : 5 ( 10.2%); ED : 8 ( 16.3%); AGP : 5 ( 10.2%);
KRED : 13 ( 26.5%); KR-ED : -3 ( -6.1%); FIKMNY : 18 ( 36.7%);
LVIFM : 16 ( 32.7%); ST : 5 ( 10.2%).
--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
1 00-+-00+-0 000000000- 000000--+0 0000-00000 000-+000+
A. CHARGE CLUSTERS.
Positive charge clusters (cmin = 9/30 or 12/45 or 15/60): none
Negative charge clusters (cmin = 12/30 or 17/45 or 21/60): none
Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60): none
B. HIGH SCORING (UN)CHARGED SEGMENTS.
______________________________________
High scoring positive charge segments:
score= 2.00 frequency= 0.102 ( KR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.735 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.163 ( ED )
Expected score/letter: -0.857; Average information/letter: 1.465
Minimal length of displayed segments set to: 20
M_0.01= 7.77 (cv= 4.06, lambda= 0.95881, k= 0.35148, x= 3.71;
90% confidence interval for segment length: 7 +- 7)
M_0.05= 6.07 (x= 2.01)
# of segments (>=20 residues) exceeding M_0.05: none
______________________________________
High scoring negative charge segments:
score= 2.00 frequency= 0.163 ( ED )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.735 ( LAGSVTIPNFQYHMCW )
score= -2.00 frequency= 0.102 ( KR )
Expected score/letter: -0.612; Average information/letter: 0.659
Minimal length of displayed segments set to: 20
M_0.01= 10.98 (cv= 6.12, lambda= 0.63589, k= 0.22141, x= 4.86;
90% confidence interval for segment length: 15 +- 16)
M_0.05= 8.42 (x= 2.30)
# of segments (>=20 residues) exceeding M_0.05: none
___________________________________
High scoring mixed charge segments:
score= 1.00 frequency= 0.265 ( KEDR )
score= 0.00 frequency= 0.000 ( BZX )
score= -1.00 frequency= 0.735 ( LAGSVTIPNFQYHMCW )
Expected score/letter: -0.469; Average information/letter: 0.690
Minimal length of displayed segments set to: 20
M_0.01= 7.15 (cv= 3.82, lambda= 1.01857, k= 0.29989, x= 3.33;
90% confidence interval for segment length: 15 +- 14)
M_0.05= 5.55 (x= 1.73)
# of segments (>=20 residues) exceeding M_0.05: none
________________________________
High scoring uncharged segments:
score= 1.00 frequency= 0.735 ( LAGSVTIPNFQYHMCW )
score= 0.00 frequency= 0.000 ( BZX )
score= -8.00 frequency= 0.265 ( KEDR )
Expected score/letter: -1.388; Average information/letter: 0.300
Minimal length of displayed segments set to: 20
M_0.01= 24.28 (cv= 13.92, lambda= 0.27954, k= 0.18165, x= 10.35;
90% confidence interval for segment length: 33 +- 22)
M_0.05= 18.45 (x= 4.52)
# of segments (>=20 residues) exceeding M_0.05: none
C. CHARGE RUNS AND PATTERNS.
pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0 4 | 5 | 6 | 23 | 7 | 9 | 11 | 8 | 10 | 12 | 6 | 8 |
lmin1 5 | 6 | 8 | 28 | 9 | 10 | 13 | 10 | 12 | 15 | 8 | 9 |
lmin2 6 | 7 | 9 | 31 | 10 | 12 | 14 | 11 | 13 | 16 | 9 | 11 |
(Significance level: 0.010000; Minimal displayed length: 6)
There are no charge runs or patterns exceeding the given minimal lengths.
Run count statistics:
+ runs >= 3: 0
- runs >= 3: 0
* runs >= 4: 0
0 runs >= 15: 0
--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES
1. HIGH SCORING SEGMENTS.
__________________________________
High scoring hydrophobic segments:
2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ)
-8.00 (KEDR)
Expected score/letter: -1.959; Average information/letter: 0.547
Minimal length of displayed segments set to: 15
M_0.01= 21.84 (cv= 11.93, lambda= 0.32626, k= 0.25539, x= 9.92;
90% confidence interval for segment length: 19 +- 14)
M_0.05= 16.85 (x= 4.92)
# of segments (>=15 residues) exceeding M_0.05: none
____________________________________
High scoring transmembrane segments:
5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST)
-6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)
Expected score/letter: -4.163; Average information/letter: 0.618
Minimal length of displayed segments set to: 15
M_0.01= 39.49 (cv= 21.86, lambda= 0.17807, k= 0.23239, x= 17.64;
90% confidence interval for segment length: 16 +- 12)
M_0.05= 30.34 (x= 8.48); M_0.30= 19.45 (x= -2.41)
# of segments (>=15 residues) exceeding M_0.30: none
2. SPACINGS OF C.
H2N-49-COOH
--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.
A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length: 4
B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
(i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length: 8
--------------------------------------------------------------------------------
MULTIPLETS.
A. AMINO ACID ALPHABET.
1. Total number of amino acid multiplets: 2 (Expected range: 0-- 9)
2. Histogram of spacings between consecutive amino acid multiplets:
(1-5) 0 (6-10) 1 (11-20) 1 (>=21) 1
3. Clusters of amino acid multiplets (cmin = 9/30 or 11/45 or 13/60): none
B. CHARGE ALPHABET.
1. Total number of charge multiplets: 1 (Expected range: 0-- 6)
0 +plets (f+: 10.2%), 1 -plets (f-: 16.3%)
Total number of charge altplets: 4 (Critical number: 7)
2. Histogram of spacings between consecutive charge multiplets:
(1-5) 0 (6-10) 0 (11-20) 0 (>=21) 2
--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.
A. AMINO ACID ALPHABET (core: 4; !-core: 4)
Location Period Element Copies Core Errors
There are no periodicities of the prescribed length.
B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6)
and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8)
Location Period Element Copies Core Errors
22- 49 4 i... 7 7 0
--------------------------------------------------------------------------------
SPACING ANALYSIS.
Not evaluated (sequence length < 100 aa, too short).
SAPS. Version of April 11, 1996. Date run: Mon Oct 1 23:15:55 2001 ******************************************************************************** Protein 1 (File: wwwtmp/.SAPS.4810.5050.seq) SWISS-PROT ANNOTATION: ID ORF19 DE ORF19, 408 bases, 7C4D2D59 checksum. number of residues: 408; molecular weight: 44.9 kdal 1 MESEASILSA CIADRNAYDV VMSLEANADL SPNGVILFKF ISEYYKRDKK CQSIDVDVLL 61 STISAQKPLA YEKLSKVVDN PPEPSPANLI TLLTEQRLKR LGAEMTAAFA AGDSAKGSAL 121 AAEYHDVYEM GIQTDDEDPL FEVYNDVSVA DLTQSLREGA NLSLLPDLLG DIVFNMMQGD 181 HVIIFGPVNA GKSAVNIQIA GDYCYDDQVV LYVGNEDPAD RM