SAPS output for orfs.txt.

see (http://www.ch.embnet.org/software/SAPS_form.html)

(http://www.ch.embnet.org/cgi-bin/SAPS_form_parser)


orfA VpV262 wrong way solo
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:14:50 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4812.3268.seq)

SWISS-PROT ANNOTATION:
ID   ORFA
DE   ORFA, 81 bases, 174C2750 checksum.

number of residues:   81;   molecular weight:   8.5 kdal
 
         1  MLVTGNGTFP LAQGANWSAS ALYTDALGGA TVQLQVNGVD LVDGALADNT QYVVTHGKRA 
        61  VVEVVVTGYT ADFNIWTYDA S

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 11(13.6%); C  :  0( 0.0%); D  :  6( 7.4%); E  :  1( 1.2%); F  :  2( 2.5%)
G  :  9(11.1%); H  :  1( 1.2%); I  :  1( 1.2%); K  :  1( 1.2%); L  :  7( 8.6%)
M  :  1( 1.2%); N  :  5( 6.2%); P  :  1( 1.2%); Q  :  4( 4.9%); R  :  1( 1.2%)
S  :  3( 3.7%); T  :  9(11.1%); V  : 12(14.8%); W  :  2( 2.5%); Y  :  4( 4.9%)

KR      :    2 (  2.5%);   ED      :    7 (  8.6%);   AGP     :   21 ( 25.9%);
KRED    :    9 ( 11.1%);   KR-ED   :   -5 ( -6.2%);   FIKMNY  :   14 ( 17.3%);
LVIFM   :   23 ( 28.4%);   ST      :   12 ( 14.8%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000000 0000000000 0000-00000 000000000- 00-0000-00 0000000++0 
        61  00-0000000 0-000000-0 0

A. CHARGE CLUSTERS.


Positive charge clusters:  not evaluated (frequency of + < 5%, too low)


Negative charge clusters (cmin =  8/30 or 11/45 or 13/60):  none


Mixed charge clusters (cmin = 10/30 or 13/45 or 16/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.025  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.889  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.086  ( ED )

 Expected score/letter:  -1.012;    Average information/letter:   3.911
 Minimal length of displayed segments set to:  20

M_0.01=  4.68  (cv=  2.49, lambda=  1.76679, k=  0.48174, x=  2.19;
                90% confidence interval for segment length:   3 +-   2)
M_0.05=  3.76  (x=  1.27)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.086  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.889  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.025  ( KR )

 Expected score/letter:  -0.765;    Average information/letter:   1.543
 Minimal length of displayed segments set to:  20

M_0.01=  7.71  (cv=  4.26, lambda=  1.03142, k=  0.35146, x=  3.45;
                90% confidence interval for segment length:   7 +-   7)
M_0.05=  6.13  (x=  1.87)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.111  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.889  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.778;    Average information/letter:   2.333
 Minimal length of displayed segments set to:  20

M_0.01=  4.14  (cv=  2.11, lambda=  2.07944, k=  0.68056, x=  2.03;
                90% confidence interval for segment length:   5 +-   4)
M_0.05=  3.36  (x=  1.24)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.889  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.111  ( KEDR )

 Expected score/letter:  -0.000
! expected score > -.001; too big !


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     2 |   4 |   4 |  58 |   6 |   8 |   9 |   7 |  10 |  11 |   5 |   6 | 
lmin1     3 |   5 |   5 |  71 |   7 |  10 |  10 |   8 |  12 |  13 |   6 |   8 | 
lmin2     4 |   6 |   6 |  78 |   8 |  11 |  12 |  10 |  13 |  15 |   7 |   9 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   3:   0
  0  runs >=  38:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.086;    Average information/letter:   0.274
 Minimal length of displayed segments set to:  15

M_0.01= 29.09  (cv= 18.09, lambda=  0.24287, k=  0.14512, x= 10.99;
                90% confidence interval for segment length:  37 +-  32)
M_0.05= 22.38  (x=  4.28)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -1.642;    Average information/letter:   0.153
 Minimal length of displayed segments set to:  15

M_0.01= 78.63  (cv= 53.43, lambda=  0.08225, k=  0.07983, x= 25.20;
                90% confidence interval for segment length:  61 +-  58)
M_0.05= 58.81  (x=  5.38);     M_0.30= 35.23  (x= -18.20)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-81-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   4  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 1   (11-20) 1   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 10/30 or 12/45 or 15/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  3)
   1 +plets (f+: 2.5%), 0 -plets (f-: 8.6%)
   Total number of charge altplets: 0 (Critical number: 3)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 7)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


orfB VpV262 poor SIO1 gp27 match
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:14:51 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.5763.seq)

SWISS-PROT ANNOTATION:
ID   ORFB
DE   ORFB, 132 bases, 292DD71D checksum.

number of residues:  132;   molecular weight:  14.1 kdal
 
         1  MNKYAKVTFR ASIILWTIVF LLMCTSGCSA LTGLATDAAI GAVAKDDPLL GIDTEIVAGD 
        61  KQGIKSGPDT KLDDVEVNGN LTTSTIGRKT DVSGTTDTLT INEGVPFWWV LVGMSVMLLL 
       121  GLFIPQFKLT RK

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  9( 6.8%); C  :  2( 1.5%); D  : 10( 7.6%); E  :  3( 2.3%); F  :  5( 3.8%)
G  : 13( 9.8%); H  :  0( 0.0%); I  : 10( 7.6%); K  :  9( 6.8%); L  : 16(12.1%)
M  :  4( 3.0%); N  :  4( 3.0%); P  :  4( 3.0%); Q  :  2( 1.5%); R  :  3( 2.3%)
S  :  7( 5.3%); T  : 16(12.1%); V  : 11( 8.3%); W  :  3( 2.3%); Y  :  1( 0.8%)

KR      :   12 (  9.1%);   ED      :   13 (  9.8%);   AGP     :   26 ( 19.7%);
KRED    :   25 ( 18.9%);   KR-ED   :   -1 ( -0.8%);   FIKMNY  :   33 ( 25.0%);
LVIFM   :   46 ( 34.8%);   ST      :   23 ( 17.4%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00+00+000+ 0000000000 0000000000 000000-000 0000+--000 00-0-0000- 
        61  +000+000-0 +0--0-0000 0000000++0 -00000-000 00-0000000 0000000000 
       121  0000000+00 ++

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 11/45 or 14/60):  none


Negative charge clusters (cmin =  9/30 or 12/45 or 15/60):  none


Mixed charge clusters (cmin = 14/30 or 19/45 or 23/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.091  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.811  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.098  ( ED )

 Expected score/letter:  -0.826;    Average information/letter:   1.565
 Minimal length of displayed segments set to:  20

M_0.01=  8.32  (cv=  4.81, lambda=  1.01604, k=  0.35852, x=  3.52;
                90% confidence interval for segment length:   8 +-   7)
M_0.05=  6.72  (x=  1.91)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.098  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.811  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.091  ( KR )

 Expected score/letter:  -0.795;    Average information/letter:   1.420
 Minimal length of displayed segments set to:  20

M_0.01=  8.72  (cv=  5.06, lambda=  0.96448, k=  0.34189, x=  3.66;
                90% confidence interval for segment length:   9 +-   8)
M_0.05=  7.03  (x=  1.97)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.189  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.811  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.621;    Average information/letter:   1.303
 Minimal length of displayed segments set to:  20

M_0.01=  6.01  (cv=  3.36, lambda=  1.45395, k=  0.47607, x=  2.65;
                90% confidence interval for segment length:  10 +-   8)
M_0.05=  4.89  (x=  1.53)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.811  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.189  ( KEDR )

 Expected score/letter:  -0.705;    Average information/letter:   0.109
 Minimal length of displayed segments set to:  20

M_0.01= 44.80  (cv= 32.04, lambda=  0.15237, k=  0.07014, x= 12.75;
                90% confidence interval for segment length:  90 +-  78)
M_0.05= 34.10  (x=  2.05)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   4 |   6 |  37 |   8 |   8 |  11 |  10 |  10 |  13 |  NA |  NA | 
lmin1     5 |   5 |   7 |  45 |  10 |  10 |  13 |  12 |  12 |  16 |  NA |  NA | 
lmin2     6 |   6 |   8 |  50 |  11 |  11 |  14 |  13 |  14 |  17 |  NA |  NA | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  25:   1, at   11;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.545;    Average information/letter:   0.412
 Minimal length of displayed segments set to:  15

M_0.01= 28.16  (cv= 17.43, lambda=  0.28010, k=  0.20294, x= 10.73;
                90% confidence interval for segment length:  28 +-  21)
M_0.05= 22.34  (x=  4.91)

 1) From  104 to  124 (type 1):  length= 21, score=27.00  * 
     104  GVPFWWVLVG MSVMLLLGLF I
    L:  5(23.8%);  G:  3(14.3%);  V:  4(19.0%);

# of segments (>=15 residues) exceeding M_0.05:  1


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -2.174;    Average information/letter:   0.209
 Minimal length of displayed segments set to:  15

M_0.01= 80.81  (cv= 54.59, lambda=  0.08944, k=  0.10489, x= 26.22;
                90% confidence interval for segment length:  50 +-  43)
M_0.05= 62.59  (x=  8.00);     M_0.30= 40.91  (x= -13.68)

 1) From   13 to   44 (type 1):  length= 32, score=48.00
    (pocket at   36 to   37:   length=  2, score=-19.00) 
      13  IILWTIVFLL MCTSGCSALT GLA |TD| AAIGA VA
    L:  5(15.6%);  A:  6(18.8%);  T:  4(12.5%);  I:  4(12.5%);

 2) From  104 to  124 (type 1):  length= 21, score=60.00 
     104  GVPFWWVLVG MSVMLLLGLF I
    L:  5(23.8%);  G:  3(14.3%);  V:  4(19.0%);


# of segments (>=15 residues) exceeding M_0.30:  2


2. SPACINGS OF C.


H2N-23-C-3-C-104-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  10  (Expected range:   0-- 18)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 4   (11-20) 5   (>=21) 1

3. Clusters of amino acid multiplets (cmin = 13/30 or 16/45 or 19/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   4  (Expected range:   0--  7)
   2 +plets (f+: 9.1%), 2 -plets (f-: 9.8%)
   Total number of charge altplets: 2 (Critical number: 8)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 0   (11-20) 1   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
  85- 104        5      T....            4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
  18-  23        1      i                6       6       0
 110- 125        2      i.               8       8 !     0


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  58- 133  (3.)     A(  75)A     1 of  10   0.0035   large maximal spacing
  59-  63  (2.)     G(   4)G    14 of  14   0.0008   large minimal spacing
  63-  67  (2.)     G(   4)G    13 of  14   0.0008     matching minimum



orfC VpV262 poor C ter maturase match
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:14:53 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.902.seq)

SWISS-PROT ANNOTATION:
ID   ORFC
DE   ORFC, 77 bases, E0767B7B checksum.

number of residues:   77;   molecular weight:   9.0 kdal
 
         1  MAKKVQMTTY KCEVCRHEEE HKTEKRNPMY RDCPECNTRV GMKAISPFTS INIKRYGTDT 
        61  EPSKYEIIKN RPKGGNF

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  2( 2.6%); C  :  4( 5.2%); D  :  2( 2.6%); E  :  8(10.4%); F  :  2( 2.6%)
G  :  4( 5.2%); H  :  2( 2.6%); I  :  5( 6.5%); K  : 10(13.0%); L  :  0( 0.0%)
M  :  4( 5.2%); N  :  5( 6.5%); P  :  5( 6.5%); Q  :  1( 1.3%); R  :  6( 7.8%)
S  :  3( 3.9%); T  :  7( 9.1%); V  :  3( 3.9%); W  :  0( 0.0%); Y  :  4( 5.2%)

KR      :   16 ( 20.8%);   ED      :   10 ( 13.0%);   AGP     :   11 ( 14.3%);
KRED    :   26 ( 33.8%);   KR-ED   :    6 (  7.8%);   FIKMNY  :   30 ( 39.0%);
LVIFM   :   14 ( 18.2%);   ST      :   10 ( 13.0%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00++000000 +0-00+0--- 0+0-++0000 +-00-000+0 00+0000000 000++000-0 
        61  -00+0-00+0 +0+0000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 15/30 or 20/45 or 25/60):  none


Negative charge clusters (cmin = 11/30 or 14/45 or 18/60):  none


Mixed charge clusters (cmin = 20/30 or 27/45 or 34/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.208  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.662  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.130  ( ED )

 Expected score/letter:  -0.506;    Average information/letter:   0.388
 Minimal length of displayed segments set to:  20

M_0.01= 14.87  (cv=  9.12, lambda=  0.47604, k=  0.15459, x=  5.74;
                90% confidence interval for segment length:  26 +-  28)
M_0.05= 11.44  (x=  2.32)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.130  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.662  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.208  ( KR )

 Expected score/letter:  -0.818;    Average information/letter:   1.136
 Minimal length of displayed segments set to:  20

M_0.01=  9.50  (cv=  5.31, lambda=  0.81842, k=  0.31128, x=  4.19;
                90% confidence interval for segment length:  10 +-   9)
M_0.05=  7.51  (x=  2.20)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.338  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.662  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.325;    Average information/letter:   0.316
 Minimal length of displayed segments set to:  20

M_0.01= 10.55  (cv=  6.45, lambda=  0.67373, k=  0.15915, x=  4.10;
                90% confidence interval for segment length:  32 +-  33)
M_0.05=  8.13  (x=  1.68)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.662  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.338  ( KEDR )

 Expected score/letter:  -2.039;    Average information/letter:   0.502
 Minimal length of displayed segments set to:  20

M_0.01= 19.34  (cv= 10.92, lambda=  0.39788, k=  0.28690, x=  8.42;
                90% confidence interval for segment length:  22 +-  11)
M_0.05= 15.24  (x=  4.33)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     6 |   4 |   8 |  19 |   9 |   8 |  12 |  10 |   9 |  13 |   6 |   7 | 
lmin1     7 |   6 |  10 |  23 |  12 |  10 |  15 |  13 |  11 |  15 |   7 |   9 | 
lmin2     8 |   7 |  11 |  26 |  13 |  11 |  16 |  14 |  12 |  17 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   4:   0
  -  runs >=   3:   1, at   18;
  *  runs >=   5:   0
  0  runs >=  13:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -3.169;    Average information/letter:   1.256
 Minimal length of displayed segments set to:  15

M_0.01= 13.28  (cv=  7.10, lambda=  0.61141, k=  0.43982, x=  6.18;
                90% confidence interval for segment length:   9 +-   5)
M_0.05= 10.62  (x=  3.51)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -6.364;    Average information/letter:   1.353
 Minimal length of displayed segments set to:  15

M_0.01= 25.79  (cv= 14.27, lambda=  0.30448, k=  0.33572, x= 11.52;
                90% confidence interval for segment length:   8 +-   5)
M_0.05= 20.44  (x=  6.17);     M_0.30= 14.07  (x= -0.20)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-11
CEVC at   12
    -17
CPEC at   33   (l=  25)
    -41-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   5  (Expected range:   0-- 12)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 3   (6-10) 2   (11-20) 0   (>=21) 1

3. Clusters of amino acid multiplets (cmin = 11/30 or 15/45 or 18/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   4  (Expected range:   0-- 10)
   3 +plets (f+: 20.8%), 1 -plets (f-: 13.0%)
   Total number of charge altplets: 2 (Critical number: 11)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 2   (6-10) 0   (11-20) 1   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
   4-  45        7      *.0..0.          6       6      /0/./2/././1/./
  16-  27        2      *0               6       6      /0/2/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


orfC2 VpV262 suspected false frame
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:14:54 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4810.3398.seq)

SWISS-PROT ANNOTATION:
ID   ORFC2
DE   ORFC2, 250 bases, C0857596 checksum.

number of residues:  250;   molecular weight:  27.1 kdal
 
         1  MDQPLVESSP PPILFGLSLS LEVTTFRSVL DNLVLARFSV RSVSLDVDTG ERANRFHSDT 
        61  GVTLRAITIH RVALLSFVFL LVTAHFTLVS SHLYFLRHNY FLVSLNCGIN NPSSSITDIP 
       121  TSTHQNGTPS LMVKVSVVPD TSVLRPIVLV VRLPLTSTSS NLVSGPDLIP CLSPATISVS 
       181  IPNSGSSFAT APIAASVASP VSAEHPDVHI NRNTIVHRMM DALKVTFAYL FILNTSTDIV 
       241  VYSPVNLYRR

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 14( 5.6%); C  :  2( 0.8%); D  : 11( 4.4%); E- :  4( 1.6%); F  : 12( 4.8%)
G--:  7( 2.8%); H  :  9( 3.6%); I  : 15( 6.0%); K--:  2( 0.8%); L  : 31(12.4%)
M  :  4( 1.6%); N  : 13( 5.2%); P+ : 18( 7.2%); Q--:  2( 0.8%); R  : 14( 5.6%)
S++: 35(14.0%); T+ : 21( 8.4%); V++: 31(12.4%); W  :  0( 0.0%); Y  :  5( 2.0%)

KR      :   16 (  6.4%);   ED      :   15 (  6.0%);   AGP     :   39 ( 15.6%);
KRED    :   31 ( 12.4%);   KR-ED   :    1 (  0.4%);   FIKMNY  :   51 ( 20.4%);
LVIFM   :   93 ( 37.2%);   ST    ++:   56 ( 22.4%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0-0000-000 0000000000 0-0000+000 -00000+000 +0000-0-00 -+00+000-0 
        61  0000+00000 +000000000 0000000000 000000+000 0000000000 0000000-00 
       121  0000000000 000+00000- 0000+00000 0+00000000 000000-000 0000000000 
       181  0000000000 0000000000 000-00-000 0+00000+00 -00+000000 0000000-00 
       241  00000000++

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  7/30 or  9/45 or 11/60):  none


Negative charge clusters (cmin =  7/30 or  9/45 or 10/60):  none


Mixed charge clusters (cmin = 10/30 or 14/45 or 17/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.064  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.876  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.060  ( ED )

 Expected score/letter:  -0.868;    Average information/letter:   2.123
 Minimal length of displayed segments set to:  20

M_0.01=  7.55  (cv=  4.52, lambda=  1.22156, k=  0.40458, x=  3.03;
                90% confidence interval for segment length:   6 +-   5)
M_0.05=  6.21  (x=  1.69)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.060  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.876  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.064  ( KR )

 Expected score/letter:  -0.884;    Average information/letter:   2.245
 Minimal length of displayed segments set to:  20

M_0.01=  7.33  (cv=  4.38, lambda=  1.26053, k=  0.41326, x=  2.95;
                90% confidence interval for segment length:   6 +-   5)
M_0.05=  6.04  (x=  1.66)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.124  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.876  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.752;    Average information/letter:   2.121
 Minimal length of displayed segments set to:  20

M_0.01=  4.95  (cv=  2.82, lambda=  1.95508, k=  0.64555, x=  2.13;
                90% confidence interval for segment length:   7 +-   4)
M_0.05=  4.12  (x=  1.30)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.876  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.124  ( KEDR )

 Expected score/letter:  -0.116
 Average information/letter:   0.004 < .10; too small !


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   4 |   5 |  61 |   8 |   8 |  10 |  10 |  10 |  12 |   7 |   9 | 
lmin1     5 |   5 |   6 |  74 |  10 |   9 |  12 |  12 |  12 |  15 |   9 |  11 | 
lmin2     6 |   6 |   7 |  82 |  11 |  11 |  13 |  14 |  13 |  17 |  10 |  12 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   3:   0
  0  runs >=  40:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.488;    Average information/letter:   0.470
 Minimal length of displayed segments set to:  15

M_0.01= 27.28  (cv= 17.53, lambda=  0.31501, k=  0.21676, x=  9.75;
                90% confidence interval for segment length:  26 +-  20)
M_0.05= 22.10  (x=  4.58)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -1.860;    Average information/letter:   0.188
 Minimal length of displayed segments set to:  15

M_0.01= 87.96  (cv= 62.63, lambda=  0.08817, k=  0.09380, x= 25.33;
                90% confidence interval for segment length:  59 +-  52)
M_0.05= 69.47  (x=  6.85);     M_0.30= 47.48  (x= -15.15)

 1) From   72 to   96 (type 1):  length= 25, score=52.00 
      72  VALLSFVFLL VTAHFTLVSS HLYFL
    L:  7(28.0%);  S:  3(12.0%);  V:  4(16.0%);  F:  4(16.0%);

# of segments (>=15 residues) exceeding M_0.30:  1


2. SPACINGS OF C.


H2N-106-C-63-C-79-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  16  (Expected range:   6-- 31)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 4   (6-10) 5   (11-20) 4   (>=21) 4

3. Clusters of amino acid multiplets (cmin = 11/30 or 15/45 or 17/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  6)
   1 +plets (f+: 6.4%), 0 -plets (f-: 6.0%)
   Total number of charge altplets: 1 (Critical number: 8)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 0   (11-20) 0   (>=21) 1

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 133- 152        5      V....            4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
  13-  24        2      i0               6       6      /0/2/
  56- 103        8      i.0.0...         6       6      /0/./2/./0/./././
  67- 122        7      i...0.0          8       8      /0/./././2/./2/
  74-  82        1      i                8       6       1


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  12- 112  (1.)     P( 100)P     1 of  19   0.0012   large maximal spacing



orfD VpV262 SPRY domain match
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:14:56 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.6375.8536.seq)

SWISS-PROT ANNOTATION:
ID   ORFD
DE   ORFD, 224 bases, B6C47028 checksum.

number of residues:  224;   molecular weight:  24.1 kdal
 
         1  MKDLIRTGLV EEMIRPMVGP LIGGMGGGGP VTTPQTPMNF NHNEGAFAIL IDGGDGNDYV 
        61  DDSGDPIIER NELEPTVYGS LSGNNRTSIG SGKYYYEFLI VAEPASGRIG LGLSALDNRS 
       121  STTDIRHDFW NGWYSDGVKI NNPNYSSNGS QTTWAGTFGV GDRVGIGLDQ SLDDIILYVN 
       181  GVEMDRIELT NTGTNNFMTP YLLGDGGKLD HSAQAYLPSG FSEY

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A- :  8( 3.6%); C  :  0( 0.0%); D+ : 18( 8.0%); E  : 11( 4.9%); F  :  7( 3.1%)
G++: 34(15.2%); H  :  3( 1.3%); I  : 16( 7.1%); K- :  4( 1.8%); L  : 18( 8.0%)
M  :  7( 3.1%); N+ : 17( 7.6%); P  : 11( 4.9%); Q- :  4( 1.8%); R  :  9( 4.0%)
S  : 17( 7.6%); T  : 15( 6.7%); V  : 11( 4.9%); W  :  3( 1.3%); Y  : 11( 4.9%)

KR    - :   13 (  5.8%);   ED      :   29 ( 12.9%);   AGP     :   53 ( 23.7%);
KRED    :   42 ( 18.8%);   KR-ED - :  -16 ( -7.1%);   FIKMNY  :   62 ( 27.7%);
LVIFM   :   59 ( 26.3%);   ST      :   32 ( 14.3%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0+-00+0000 --00+00000 0000000000 0000000000 000-000000 0-00-00-00 
        61  --00-000-+ 0-0-000000 00000+0000 00+000-000 00-0000+00 000000-0+0 
       121  000-0+0-00 00000-00+0 0000000000 0000000000 0-+00000-0 00--000000 
       181  00-0-+0-00 0000000000 0000-00+0- 0000000000 00-0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  6/30 or  8/45 or 10/60):  none


Negative charge clusters (cmin = 11/30 or 14/45 or 18/60):  none


Mixed charge clusters (cmin = 14/30 or 18/45 or 23/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.058  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.813  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.129  ( ED )

 Expected score/letter:  -0.955;    Average information/letter:   2.402
 Minimal length of displayed segments set to:  20

M_0.01=  7.10  (cv=  4.19, lambda=  1.29049, k=  0.42742, x=  2.91;
                90% confidence interval for segment length:   6 +-   5)
M_0.05=  5.84  (x=  1.64)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.129  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.813  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.058  ( KR )

 Expected score/letter:  -0.670;    Average information/letter:   0.936
 Minimal length of displayed segments set to:  20

M_0.01= 11.18  (cv=  6.95, lambda=  0.77910, k=  0.27219, x=  4.23;
                90% confidence interval for segment length:  13 +-  13)
M_0.05=  9.09  (x=  2.14)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.188  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.813  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.625;    Average information/letter:   1.322
 Minimal length of displayed segments set to:  20

M_0.01=  6.33  (cv=  3.69, lambda=  1.46634, k=  0.48077, x=  2.64;
                90% confidence interval for segment length:  10 +-   8)
M_0.05=  5.22  (x=  1.53)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.813  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.188  ( KEDR )

 Expected score/letter:  -0.688;    Average information/letter:   0.105
 Minimal length of displayed segments set to:  20

M_0.01= 49.09  (cv= 36.31, lambda=  0.14905, k=  0.06756, x= 12.78;
                90% confidence interval for segment length: 101 +-  84)
M_0.05= 38.15  (x=  1.85)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     3 |   5 |   6 |  40 |   7 |  10 |  11 |   9 |  12 |  14 |   6 |   7 | 
lmin1     5 |   6 |   7 |  49 |   9 |  12 |  14 |  11 |  14 |  16 |   7 |   9 | 
lmin2     6 |   7 |   9 |  54 |  10 |  13 |  15 |  12 |  16 |  18 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  27:   1, at   16;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.795;    Average information/letter:   0.569
 Minimal length of displayed segments set to:  15

M_0.01= 23.86  (cv= 14.90, lambda=  0.36328, k=  0.26054, x=  8.96;
                90% confidence interval for segment length:  22 +-  15)
M_0.05= 19.37  (x=  4.47)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.219;    Average information/letter:   0.445
 Minimal length of displayed segments set to:  15

M_0.01= 56.69  (cv= 36.84, lambda=  0.14688, k=  0.18536, x= 19.84;
                90% confidence interval for segment length:  27 +-  20)
M_0.05= 45.59  (x=  8.75);     M_0.30= 32.39  (x= -4.46)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-224-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  19  (Expected range:   3-- 26)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 8   (6-10) 3   (11-20) 8   (>=21) 1

3. Clusters of amino acid multiplets (cmin = 14/30 or 17/45 or 21/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   3  (Expected range:   0-- 10)
   0 +plets (f+: 5.8%), 3 -plets (f-: 12.9%)
   Total number of charge altplets: 4 (Critical number: 10)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 1   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
  23-  29        1      G                6       4       1
  52-  63        3      D..              4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
   1-  28        4      i..0             7       7      /0/././2/
   4-  57        9      i..0000.0        6       6      /0/././2/2/2/2/./2/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  38- 184  (2.)     M( 146)M     1 of   8   0.0041   large  1. maximal spacing
 198- 225  (4.)     M(  27)M     2 of   8   0.9836   small  2. maximal spacing



orfE VpV262 matches unannotated SIO1 frame
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:14:58 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.8483.seq)

SWISS-PROT ANNOTATION:
ID   ORFE
DE   ORFE, 185 bases, 52D098DD checksum.

number of residues:  185;   molecular weight:  21.1 kdal
 
         1  MLLTPELKKE RWEAAYAGEI HVPKDMNRAT AALFWEKRHE ARTIERGKYP VFNLSMRDHD 
        61  DTLSFPQLYF QCDSDYEAAM VILGSWGHWQ RLCEAKWFAE KLALWQEEKA QRDIAHGRAK 
       121  IKALADAGNL SACKFLASGG LSDKPKQEKP SNPKPAKNPE PAPTAEEDDF LKHGLSILEK 
       181  NSEHK

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 23(12.4%); C  :  3( 1.6%); D  : 11( 5.9%); E  : 18( 9.7%); F  :  7( 3.8%)
G  :  9( 4.9%); H  :  7( 3.8%); I  :  6( 3.2%); K  : 19(10.3%); L  : 18( 9.7%)
M  :  4( 2.2%); N  :  6( 3.2%); P  : 11( 5.9%); Q  :  6( 3.2%); R  :  9( 4.9%)
S  : 10( 5.4%); T  :  5( 2.7%); V  :  3( 1.6%); W  :  6( 3.2%); Y  :  4( 2.2%)

KR      :   28 ( 15.1%);   ED      :   29 ( 15.7%);   AGP     :   43 ( 23.2%);
KRED    :   57 ( 30.8%);   KR-ED   :   -1 ( -0.5%);   FIKMNY  :   46 ( 24.9%);
LVIFM   :   38 ( 20.5%);   ST      :   15 (  8.1%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00000-0++- +0-00000-0 000+-00+00 00000-++0- 0+00-+0+00 000000+-0- 
        61  -000000000 00-0-0-000 0000000000 +00-0+000- +00000--+0 0+-0000+0+ 
       121  0+000-0000 000+000000 00-+0+0-+0 000+00+00- 00000----0 0+000000-+ 
       181  00-0+

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 12/30 or 16/45 or 20/60):  none


Negative charge clusters (cmin = 12/30 or 16/45 or 20/60):  none


Mixed charge clusters (cmin = 19/30 or 26/45 or 32/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.151  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.692  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.157  ( ED )

 Expected score/letter:  -0.703;    Average information/letter:   0.835
 Minimal length of displayed segments set to:  20

M_0.01= 12.02  (cv=  7.41, lambda=  0.70492, k=  0.25905, x=  4.61;
                90% confidence interval for segment length:  15 +-  14)
M_0.05=  9.70  (x=  2.30)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.157  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.692  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.151  ( KR )

 Expected score/letter:  -0.681;    Average information/letter:   0.778
 Minimal length of displayed segments set to:  20

M_0.01= 12.39  (cv=  7.68, lambda=  0.67984, k=  0.24781, x=  4.71;
                90% confidence interval for segment length:  16 +-  15)
M_0.05= 10.00  (x=  2.32)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.308  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.692  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.384;    Average information/letter:   0.448
 Minimal length of displayed segments set to:  20

M_0.01= 10.23  (cv=  6.45, lambda=  0.80898, k=  0.21288, x=  3.77;
                90% confidence interval for segment length:  27 +-  24)
M_0.05=  8.21  (x=  1.76)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.692  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.308  ( KEDR )

 Expected score/letter:  -1.773;    Average information/letter:   0.418
 Minimal length of displayed segments set to:  20

M_0.01= 24.09  (cv= 14.94, lambda=  0.34931, k=  0.24483, x=  9.14;
                90% confidence interval for segment length:  29 +-  16)
M_0.05= 19.42  (x=  4.47)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   5 |   8 |  23 |   9 |   9 |  13 |  10 |  11 |  14 |   7 |   9 | 
lmin1     7 |   7 |  10 |  28 |  11 |  11 |  16 |  13 |  13 |  17 |   8 |  11 | 
lmin2     8 |   8 |  11 |  32 |  13 |  13 |  17 |  14 |  15 |  19 |  10 |  12 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   1, at  166;
  *  runs >=   5:   0
  0  runs >=  16:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.589;    Average information/letter:   0.885
 Minimal length of displayed segments set to:  15

M_0.01= 18.18  (cv= 10.78, lambda=  0.48410, k=  0.36062, x=  7.40;
                90% confidence interval for segment length:  14 +-   8)
M_0.05= 14.81  (x=  4.03)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -5.319;    Average information/letter:   0.902
 Minimal length of displayed segments set to:  15

M_0.01= 39.12  (cv= 23.75, lambda=  0.21977, k=  0.29424, x= 15.37;
                90% confidence interval for segment length:  14 +-   9)
M_0.05= 31.70  (x=  7.95);     M_0.30= 22.88  (x= -0.88)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-71-C-20-C-39-C-52-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  10  (Expected range:   1-- 22)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 4   (6-10) 0   (11-20) 3   (>=21) 4

3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   5  (Expected range:   0-- 16)
   2 +plets (f+: 15.1%), 3 -plets (f-: 15.7%)
   Total number of charge altplets: 11 (Critical number: 19)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 1   (11-20) 1   (>=21) 4

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 115- 130        4      A...             4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


orfF VpV262 weak SPP1 gp32 match
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:14:59 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4819.9592.seq)

SWISS-PROT ANNOTATION:
ID   ORFF
DE   ORFF, 99 bases, 814D438A checksum.

number of residues:   99;   molecular weight:  11.7 kdal
 
         1  MIAAIIVQGP LFWFIGSRQK KGEDMTSELK SIRMSIAKVH QRVDEELDKL DEKYATKTEV 
        61  EVHMAHLKEV NELQFRAIME NLDYIRKRLD EHDSKRRPE

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  6( 6.1%); C  :  0( 0.0%); D  :  7( 7.1%); E  : 12(12.1%); F  :  3( 3.0%)
G  :  3( 3.0%); H  :  4( 4.0%); I  :  8( 8.1%); K  : 10(10.1%); L  :  8( 8.1%)
M  :  5( 5.1%); N  :  2( 2.0%); P  :  2( 2.0%); Q  :  4( 4.0%); R  :  8( 8.1%)
S  :  5( 5.1%); T  :  3( 3.0%); V  :  6( 6.1%); W  :  1( 1.0%); Y  :  2( 2.0%)

KR      :   18 ( 18.2%);   ED      :   19 ( 19.2%);   AGP     :   11 ( 11.1%);
KRED    :   37 ( 37.4%);   KR-ED   :   -1 ( -1.0%);   FIKMNY  :   30 ( 30.3%);
LVIFM   :   30 ( 30.3%);   ST      :    8 (  8.1%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000000 0000000+0+ +0--000-0+ 00+0000+00 0+0---0-+0 --+000+0-0 
        61  -000000+-0 0-000+000- 00-00+++0- -0-0+++0-

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 13/30 or 18/45 or 22/60):  none


Negative charge clusters (cmin = 14/30 or 19/45 or 23/60):  none


Mixed charge clusters (cmin = 21/30 or 29/45 or 37/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.182  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.626  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.192  ( ED )

 Expected score/letter:  -0.646;    Average information/letter:   0.622
 Minimal length of displayed segments set to:  20

M_0.01= 12.93  (cv=  7.75, lambda=  0.59300, k=  0.21646, x=  5.18;
                90% confidence interval for segment length:  18 +-  18)
M_0.05= 10.18  (x=  2.43)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.192  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.626  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.182  ( KR )

 Expected score/letter:  -0.606;    Average information/letter:   0.540
 Minimal length of displayed segments set to:  20

M_0.01= 13.70  (cv=  8.32, lambda=  0.55237, k=  0.19663, x=  5.38;
                90% confidence interval for segment length:  20 +-  20)
M_0.05= 10.75  (x=  2.43)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.374  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.626  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.253;    Average information/letter:   0.188
 Minimal length of displayed segments set to:  20

M_0.01= 13.39  (cv=  8.90, lambda=  0.51622, k=  0.10182, x=  4.49;
                90% confidence interval for segment length:  53 +-  55)
M_0.05= 10.23  (x=  1.33)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.626  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.374  ( KEDR )

 Expected score/letter:  -2.364;    Average information/letter:   0.604
 Minimal length of displayed segments set to:  20

M_0.01= 17.68  (cv= 10.02, lambda=  0.45839, k=  0.33610, x=  7.66;
                90% confidence interval for segment length:  19 +-   8)
M_0.05= 14.13  (x=  4.10)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   5 |   9 |  18 |   9 |   9 |  13 |   9 |  10 |  13 |   7 |   8 | 
lmin1     7 |   7 |  11 |  21 |  11 |  11 |  15 |  12 |  12 |  16 |   8 |  10 | 
lmin2     8 |   8 |  12 |  24 |  12 |  12 |  17 |  13 |  13 |  18 |   9 |  12 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   4:   0
  -  runs >=   4:   0
  *  runs >=   6:   0
  0  runs >=  12:   1, at    1;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.949;    Average information/letter:   0.967
 Minimal length of displayed segments set to:  15

M_0.01= 18.11  (cv= 10.11, lambda=  0.45433, k=  0.37944, x=  7.99;
                90% confidence interval for segment length:  12 +-   7)
M_0.05= 14.52  (x=  4.40)

 1) From    1 to   16 (type 1):  length= 16, score=17.00  *
    (pocket at    8 to   10:   length=  3, score=-5.00) 
       1  MIAAIIV |QGP|  LFWFIG
    A:  2(12.5%);  G:  2(12.5%);  I:  4(25.0%);  F:  2(12.5%);

# of segments (>=15 residues) exceeding M_0.05:  1


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -5.869;    Average information/letter:   0.911
 Minimal length of displayed segments set to:  15

M_0.01= 40.86  (cv= 23.43, lambda=  0.19613, k=  0.30667, x= 17.43;
                90% confidence interval for segment length:  13 +-   8)
M_0.05= 32.55  (x=  9.12);     M_0.30= 22.66  (x= -0.77)

 1) From    1 to   16 (type 1):  length= 16, score=33.00  *
    (pocket at    8 to   10:   length=  3, score=-14.00) 
       1  MIAAIIV |QGP|  LFWFIG
    A:  2(12.5%);  G:  2(12.5%);  I:  4(25.0%);  F:  2(12.5%);

# of segments (>=15 residues) exceeding M_0.30:  1


2. SPACINGS OF C.


H2N-99-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   5  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 3   (6-10) 0   (11-20) 1   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 10/30 or 12/45 or 15/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   7  (Expected range:   0-- 13)
   3 +plets (f+: 18.2%), 4 -plets (f-: 19.2%)
   Total number of charge altplets: 3 (Critical number: 15)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 5   (6-10) 0   (11-20) 2   (>=21) 1

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core:10)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


orfG VpV262 homologue of T7 gp29 maturase B
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:01 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.4176.seq)

SWISS-PROT ANNOTATION:
ID   ORFG
DE   ORFG, 535 bases, 35D30714 checksum.

number of residues:  535;   molecular weight:  62.4 kdal
 
         1  MNTTASDDQN DIPLEDPRLI ELREACLHSL WIFAQAVEPH RVYGECHKEL FDWWQEMELE 
        61  EVLNTLALMP RDHQKSHCIA VWVCWQIFKN PAVTIAYVCA TESLAILQLY DIKQILTSDE 
       121  FTRLSPDMIE PMEKKRQKWA ETAIIVDHPI RKKERPRDPT VLATGLDSNN IGAHCNIMVK 
       181  DDVVIDKNSL TETARQKVEA KAGHLSSILT TDGMEFCVGT RYHPKDHYQT LIDMTEEVWE 
       241  GDQLVGERPV YAVHTRVVEV EGVFLWPRMA RESDGKMFGF DRAQLSRKKA KYRKDMRNFY 
       301  CQYYNDPNAI NEGGIEKSMF LYYDKEKVVR RKGSWWVGKR KVNIIACQDF AYSVQTGSDW 
       361  TALFILGMDK DKRIYILDIV RYQTKKPSVY WKNLETAYNK WKFKWVRAEA VAAQEVIIEA 
       421  LREYAEKEQC PIRIKAYKPN SQSGDKETRI SQTLEPLYEN GDIYHYRGGL CDMLEEELTQ 
       481  DRPPHDDLKD CLHIGVGFDK LKPPVEDDDE EDYDNYHDPH ASYSYDPNNR FGGLQ

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 34( 6.4%); C  : 12( 2.2%); D+ : 44( 8.2%); E  : 45( 8.4%); F  : 15( 2.8%)
G  : 25( 4.7%); H  : 16( 3.0%); I  : 33( 6.2%); K  : 40( 7.5%); L  : 39( 7.3%)
M  : 14( 2.6%); N  : 20( 3.7%); P  : 24( 4.5%); Q  : 22( 4.1%); R  : 31( 5.8%)
S  : 22( 4.1%); T  : 25( 4.7%); V  : 34( 6.4%); W  : 14( 2.6%); Y  : 26( 4.9%)

KR      :   71 ( 13.3%);   ED    + :   89 ( 16.6%);   AGP     :   83 ( 15.5%);
KRED  + :  160 ( 29.9%);   KR-ED   :  -18 ( -3.4%);   FIKMNY  :  148 ( 27.7%);
LVIFM   :  135 ( 25.2%);   ST      :   47 (  8.8%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  000000--00 -000--0+00 -0+-000000 0000000-00 +000-00+-0 0-000-0-0- 
        61  -000000000 +-00+00000 00000000+0 0000000000 0-00000000 -0+00000-- 
       121  00+000-00- 00-+++0+00 -00000-000 +++-+0+-00 000000-000 000000000+ 
       181  --000-+000 0-00+0+0-0 +000000000 0-00-00000 +000+-0000 00-00--00- 
       241  0-0000-+00 00000+00-0 -000000+00 +-0-0+0000 -+0000+++0 +0++-0+000 
       301  00000-0000 0-000-+000 000-+-+00+ ++000000++ +0000000-0 00000000-0 
       361  00000000-+ -++0000-00 +000++0000 0+00-0000+ 0+0+00+0-0 0000-000-0 
       421  0+-00-+-00 00+0+00+00 0000-+-0+0 0000-000-0 0-0000+000 0-00---000 
       481  -+000--0+- 00000000-+ 0+000----- --0-000-00 00000-000+ 00000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 11/30 or 15/45 or 18/60):  none


Negative charge clusters (cmin = 13/30 or 17/45 or 21/60):  none


Mixed charge clusters (cmin = 19/30 or 25/45 or 32/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.133  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.701  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.166  ( ED )

 Expected score/letter:  -0.768;    Average information/letter:   1.045
 Minimal length of displayed segments set to:  20

M_0.01= 12.18  (cv=  7.92, lambda=  0.79365, k=  0.29549, x=  4.26;
                90% confidence interval for segment length:  13 +-  12)
M_0.05= 10.12  (x=  2.21)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.166  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.701  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.133  ( KR )

 Expected score/letter:  -0.634;    Average information/letter:   0.671
 Minimal length of displayed segments set to:  20

M_0.01= 14.82  (cv=  9.92, lambda=  0.63353, k=  0.22521, x=  4.91;
                90% confidence interval for segment length:  20 +-  18)
M_0.05= 12.25  (x=  2.34)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.299  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.701  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.402;    Average information/letter:   0.494
 Minimal length of displayed segments set to:  20

M_0.01= 11.05  (cv=  7.38, lambda=  0.85175, k=  0.23040, x=  3.68;
                90% confidence interval for segment length:  28 +-  23)
M_0.05=  9.14  (x=  1.76)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.701  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.299  ( KEDR )

 Expected score/letter:  -1.692;    Average information/letter:   0.393
 Minimal length of displayed segments set to:  20

M_0.01= 28.16  (cv= 18.78, lambda=  0.33455, k=  0.23167, x=  9.38;
                90% confidence interval for segment length:  35 +-  18)
M_0.05= 23.29  (x=  4.51)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   6 |   9 |  27 |  10 |  11 |  14 |  11 |  12 |  16 |   7 |   9 | 
lmin1     7 |   8 |  11 |  33 |  12 |  13 |  17 |  14 |  15 |  19 |   9 |  11 | 
lmin2     8 |   9 |  12 |  37 |  13 |  15 |  19 |  16 |  17 |  21 |  10 |  13 | 
 (Significance level: 0.010000; Minimal displayed length:  6)

  (-)   7(0,0,0); at  506- 512:   EDDDEED
      (4. quartile)               -------


Run count statistics:

  +  runs >=   4:   0
  -  runs >=   4:   1, at  506;
  *  runs >=   6:   1, at  506;
  0  runs >=  18:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.495;    Average information/letter:   0.823
 Minimal length of displayed segments set to:  15

M_0.01= 22.25  (cv= 14.25, lambda=  0.44078, k=  0.34199, x=  8.00;
                90% confidence interval for segment length:  17 +-   9)
M_0.05= 18.56  (x=  4.30)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -5.114;    Average information/letter:   0.835
 Minimal length of displayed segments set to:  15

M_0.01= 47.88  (cv= 31.30, lambda=  0.20071, k=  0.28031, x= 16.58;
                90% confidence interval for segment length:  17 +-  10)
M_0.05= 39.76  (x=  8.46);     M_0.30= 30.10  (x= -1.20)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-25-C-19-C-31-C-5-C-14-C-75-C-41-C-83-C-45-C-82-C-40-C-19-C-44-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  5

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  9

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  34  (Expected range:  12-- 45)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 13   (6-10) 8   (11-20) 6   (>=21) 8

3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  17  (Expected range:   7-- 34)
   8 +plets (f+: 13.3%), 9 -plets (f-: 16.6%)
   Total number of charge altplets: 22 (Critical number: 38)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 4   (11-20) 4   (>=21) 9

3. Long charge multiplets (>= 5; Letter/Length/Position):
    -/7/506

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 472- 516        9      D........        5       5 !     0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core:10)

Location        Period  Element         Copies  Core    Errors
 287- 298        2      *.               6       6       0
 462- 491        5      *.0..            6       6      /0/./2/././
 472- 516        9      -.0....0.        5       5      /0/./1/././././1/./
 506- 512        1      -                7       7 !     0


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  66-  68  (1.)     L(   2)L    40 of  40   0.0018   large minimal spacing
 107- 109  (1.)     L(   2)L    38 of  40   0.0018     matching minimum
 254- 465  (3.)     H( 211)H     1 of  17   0.0048   large maximal spacing
 534- 536  (1.)     L(   2)L    39 of  40   0.0018     matching minimum



orfH VpV262 homologue of SIO1 gp3,5; suspected capsid protein
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:02 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4810.2920.seq)

SWISS-PROT ANNOTATION:
ID   ORFH
DE   ORFH, 599 bases, 6E34097E checksum.

number of residues:  599;   molecular weight:  68.1 kdal
 
         1  MSTDIKTLQK MLEGRDDDRA FIDELVVLFT NMENARAQKD REDKELMDYI DATDTRKTSN 
        61  SKLPFKNSTT INKLAHLHLM ITTSYMEHLL PNRNWVDFVG FDNDSVNAEK REIARSYVRG 
       121  KVEASNLEGV IERMVDDFAV RGFCVAHTRH VKRMTVTAEN QVIKNYSGTV TERLSPSDVF 
       181  WDVTADSLPK AAKCIRQLYT LGSLKREIEE GTFPLMSMED FQKLREERRT IREALADGYN 
       241  GRRKFDSLHK KGYGSMMNYI NEGVVEVLTF MGDFYDEEND ELWNNYEITV IDRKIIGRKQ 
       301  SKDTWDGSQN LHIAVYEFQK DTLCPIGPLH RLTGMQYKLD KRENFREDLH DRFLHPSLKK 
       361  VGDVREKGMR GGPNHVFEVE ETGDVQYMTP PAEVLQPDNQ LSITLQLMED LSGAPKESIG 
       421  QRTAGEKTKF EVQLLDQGQN KVFRRKVKKF ERELLTPVLN DYLEQGRNHL DASDTIKTFN 
       481  SELGTATFLD ITADDLNLNG QMVAQGATLF AEKANTLQNL NAILGGPLGA ALAPHMSRTK 
       541  LFNAVEYLGD LDAYGIFTFG IGVQEDQQLA RMAQKSTQQT EETALTQEEV GGPTTDTGQ

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 36( 6.0%); C  :  3( 0.5%); D  : 46( 7.7%); E  : 49( 8.2%); F  : 26( 4.3%)
G  : 41( 6.8%); H  : 13( 2.2%); I  : 25( 4.2%); K  : 39( 6.5%); L  : 58( 9.7%)
M  : 20( 3.3%); N  : 32( 5.3%); P  : 17( 2.8%); Q  : 29( 4.8%); R  : 37( 6.2%)
S  : 26( 4.3%); T  : 46( 7.7%); V  : 36( 6.0%); W  :  4( 0.7%); Y  : 16( 2.7%)

KR      :   76 ( 12.7%);   ED    + :   95 ( 15.9%);   AGP     :   94 ( 15.7%);
KRED    :  171 ( 28.5%);   KR-ED   :  -19 ( -3.2%);   FIKMNY  :  158 ( 26.4%);
LVIFM   :  165 ( 27.5%);   ST      :   72 ( 12.0%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  000-0+000+ 00-0+---+0 00--000000 00-00+00+- +--+-00-00 -00-0++000 
        61  0+000+0000 00+0000000 000000-000 00+000-000 0-0-0000-+ +-00+000+0 
       121  +0-0000-00 0-+00--000 +0000000+0 0++00000-0 000+000000 0-+0000-00 
       181  0-000-000+ 00+00+0000 0000++-0-- 00000000-- 00+0+--++0 0+-000-000 
       241  0+++0-000+ +000000000 0-000-0000 00-00---0- -00000-000 0-++000++0 
       301  0+-00-0000 000000-00+ -000000000 +000000+0- ++-00+--00 -+000000++ 
       361  00-0+-+00+ 0000000-0- -00-000000 00-0000-00 00000000-- 00000+-000 
       421  0+000-+0+0 -0000-0000 +00+++0++0 -+-0000000 -00-00+000 -00-00+000 
       481  0-0000000- 000--00000 0000000000 0-+0000000 0000000000 0000000+0+ 
       541  00000-000- 0-00000000 0000--0000 +000+00000 --00000--0 00000-000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 11/30 or 14/45 or 17/60):  none


Negative charge clusters (cmin = 12/30 or 16/45 or 20/60):  none


Mixed charge clusters (cmin = 18/30 or 24/45 or 31/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.127  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.715  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.159  ( ED )

 Expected score/letter:  -0.778;    Average information/letter:   1.105
 Minimal length of displayed segments set to:  20

M_0.01= 11.95  (cv=  7.79, lambda=  0.82049, k=  0.30419, x=  4.16;
                90% confidence interval for segment length:  13 +-  11)
M_0.05=  9.96  (x=  2.17)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.159  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.715  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.127  ( KR )

 Expected score/letter:  -0.651;    Average information/letter:   0.730
 Minimal length of displayed segments set to:  20

M_0.01= 14.39  (cv=  9.63, lambda=  0.66411, k=  0.23732, x=  4.76;
                90% confidence interval for segment length:  19 +-  17)
M_0.05= 11.94  (x=  2.31)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.285  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.715  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.429;    Average information/letter:   0.568
 Minimal length of displayed segments set to:  20

M_0.01= 10.51  (cv=  6.97, lambda=  0.91746, k=  0.25763, x=  3.54;
                90% confidence interval for segment length:  24 +-  20)
M_0.05=  8.73  (x=  1.76)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.715  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.285  ( KEDR )

 Expected score/letter:  -1.569;    Average information/letter:   0.356
 Minimal length of displayed segments set to:  20

M_0.01= 30.23  (cv= 20.47, lambda=  0.31241, k=  0.21169, x=  9.75;
                90% confidence interval for segment length:  38 +-  21)
M_0.05= 25.01  (x=  4.54)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   6 |   9 |  29 |  10 |  11 |  14 |  11 |  12 |  16 |   7 |   8 | 
lmin1     7 |   7 |  11 |  35 |  12 |  13 |  17 |  14 |  15 |  19 |   8 |  10 | 
lmin2     8 |   9 |  12 |  39 |  13 |  15 |  19 |  16 |  17 |  22 |   9 |  12 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   4:   0
  -  runs >=   4:   0
  *  runs >=   6:   1, at   39;
  0  runs >=  19:   1, at  514;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.554;    Average information/letter:   0.873
 Minimal length of displayed segments set to:  15

M_0.01= 22.28  (cv= 14.31, lambda=  0.44691, k=  0.35332, x=  7.97;
                90% confidence interval for segment length:  16 +-   9)
M_0.05= 18.63  (x=  4.32)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.833;    Average information/letter:   0.747
 Minimal length of displayed segments set to:  15

M_0.01= 52.65  (cv= 34.81, lambda=  0.18374, k=  0.26684, x= 17.85;
                90% confidence interval for segment length:  19 +-  11)
M_0.05= 43.78  (x=  8.97);     M_0.30= 33.23  (x= -1.58)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-143-C-49-C-129-C-275-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  5

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  9

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  34  (Expected range:  17-- 52)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 9   (6-10) 11   (11-20) 7   (>=21) 8

3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  29  (Expected range:   7-- 35)
   13 +plets (f+: 12.7%), 16 -plets (f-: 15.9%)
   Total number of charge altplets: 24 (Critical number: 39)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 7   (6-10) 4   (11-20) 10   (>=21) 9

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 130- 149        5      V....            4       4       0
 520- 535        4      L...             4       4       0
 577- 588        3      T..              4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core:10)

Location        Period  Element         Copies  Core    Errors
   6-  59        9      *..*.....        6       6      /0/././2/./././././
  33-  59        3      *0.              9       9      /0/3/./
  39-  45        1      *                7       7       0
  42-  56        3      -0.              5       5      /0/1/./
 345- 392        8      i....0..         6       6      /0/././././2/././
 426- 455        5      *....            6       6       0
 489- 530        7      i0.000.          6       6      /0/1/./2/1/1/./


--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


orfI VpV262 suspected alternative C ter for orfH
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:04 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4716.8059.seq)

SWISS-PROT ANNOTATION:
ID   ORFI
DE   ORFI, 88 bases, CC7DBEB2 checksum.

number of residues:   88;   molecular weight:  10.5 kdal
 
         1  VDLLQILDSN WFELEADETK EDYESNFRAS VKTRRALNLW IQHELSKLDK EMSLSRIKEK 
        61  PDRAELALVV MARREQLLQM QAILEFKD

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  7( 8.0%); C  :  0( 0.0%); D  :  7( 8.0%); E  : 11(12.5%); F  :  3( 3.4%)
G  :  0( 0.0%); H  :  1( 1.1%); I  :  4( 4.5%); K  :  7( 8.0%); L  : 14(15.9%)
M  :  3( 3.4%); N  :  3( 3.4%); P  :  1( 1.1%); Q  :  5( 5.7%); R  :  7( 8.0%)
S  :  6( 6.8%); T  :  2( 2.3%); V  :  4( 4.5%); W  :  2( 2.3%); Y  :  1( 1.1%)

KR      :   14 ( 15.9%);   ED      :   18 ( 20.5%);   AGP     :    8 (  9.1%);
KRED    :   32 ( 36.4%);   KR-ED   :   -4 ( -4.5%);   FIKMNY  :   21 ( 23.9%);
LVIFM   :   28 ( 31.8%);   ST      :    8 (  9.1%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0-00000-00 00-0-0--0+ --0-000+00 0+0++00000 000-00+0-+ -0000+0+-+ 
        61  0-+0-00000 00++-00000 0000-0+-

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 12/30 or 16/45 or 20/60):  none


Negative charge clusters (cmin = 14/30 or 20/45 or 24/60):  none


Mixed charge clusters (cmin = 21/30 or 29/45 or 36/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.159  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.636  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.205  ( ED )

 Expected score/letter:  -0.727;    Average information/letter:   0.824
 Minimal length of displayed segments set to:  20

M_0.01= 11.26  (cv=  6.52, lambda=  0.68640, k=  0.25892, x=  4.73;
                90% confidence interval for segment length:  14 +-  13)
M_0.05=  8.88  (x=  2.36)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.205  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.636  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.159  ( KR )

 Expected score/letter:  -0.545;    Average information/letter:   0.437
 Minimal length of displayed segments set to:  20

M_0.01= 14.62  (cv=  8.97, lambda=  0.49916, k=  0.16908, x=  5.66;
                90% confidence interval for segment length:  24 +-  25)
M_0.05= 11.36  (x=  2.39)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.364  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.636  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.273;    Average information/letter:   0.220
 Minimal length of displayed segments set to:  20

M_0.01= 12.39  (cv=  8.00, lambda=  0.55962, k=  0.11688, x=  4.38;
                90% confidence interval for segment length:  45 +-  47)
M_0.05=  9.47  (x=  1.47)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.636  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.364  ( KEDR )

 Expected score/letter:  -2.273;    Average information/letter:   0.576
 Minimal length of displayed segments set to:  20

M_0.01= 18.01  (cv= 10.15, lambda=  0.44127, k=  0.32257, x=  7.86;
                90% confidence interval for segment length:  20 +-   9)
M_0.05= 14.31  (x=  4.17)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   6 |   9 |  18 |   8 |   9 |  13 |   9 |  10 |  13 |   5 |   6 | 
lmin1     6 |   7 |  11 |  22 |  10 |  11 |  15 |  11 |  12 |  16 |   6 |   8 | 
lmin2     7 |   8 |  12 |  24 |  12 |  13 |  17 |  13 |  14 |  17 |   7 |   9 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   4:   0
  *  runs >=   6:   0
  0  runs >=  12:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.932;    Average information/letter:   0.957
 Minimal length of displayed segments set to:  15

M_0.01= 18.28  (cv= 10.10, lambda=  0.44335, k=  0.37855, x=  8.18;
                90% confidence interval for segment length:  12 +-   7)
M_0.05= 14.61  (x=  4.51)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -5.659;    Average information/letter:   0.844
 Minimal length of displayed segments set to:  15

M_0.01= 43.16  (cv= 24.60, lambda=  0.18200, k=  0.29450, x= 18.56;
                90% confidence interval for segment length:  13 +-   9)
M_0.05= 34.20  (x=  9.60);     M_0.30= 23.55  (x= -1.05)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-88-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   5  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 3   (6-10) 0   (11-20) 1   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   4  (Expected range:   0-- 12)
   2 +plets (f+: 15.9%), 2 -plets (f-: 20.5%)
   Total number of charge altplets: 6 (Critical number: 14)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 0   (11-20) 3   (>=21) 1

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
  15-  26        3      E..              4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


orfJ VpV262 SIO1 gp30 homologue
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:05 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4716.4029.seq)

SWISS-PROT ANNOTATION:
ID   ORFJ
DE   ORFJ, 244 bases, 65D0A5BD checksum.

number of residues:  244;   molecular weight:  26.9 kdal
 
         1  MPNPFDQNDE HDTNVNPYEG KDSKEIMDEL VGEGKKYKSV EDAVKALAYS QHHITTLESE 
        61  TAKLREAQSQ AKTIDEVLAK LQQKQQPSDD NKGSDENDPA AQKPTDDIDV EATVKRLLEQ 
       121  HTSTAQAEAN HKQVVDAVKA KYGTKAFDVW DKAEKELGIN LEQMAQTSPA ATLKLLGISG 
       181  ESAPAQSAAT FKGDAKPTPT DQGERPPEGS KRLVDYMLSK GEINRSQAYN LKLKYSADPE 
       241  KYRA

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 28(11.5%); C  :  0( 0.0%); D+ : 21( 8.6%); E  : 22( 9.0%); F- :  3( 1.2%)
G  : 12( 4.9%); H  :  5( 2.0%); I- :  7( 2.9%); K++: 27(11.1%); L  : 17( 7.0%)
M  :  4( 1.6%); N  : 10( 4.1%); P  : 13( 5.3%); Q  : 17( 7.0%); R- :  6( 2.5%)
S  : 16( 6.6%); T  : 15( 6.1%); V  : 12( 4.9%); W  :  1( 0.4%); Y  :  8( 3.3%)

KR      :   33 ( 13.5%);   ED    ++:   43 ( 17.6%);   AGP     :   53 ( 21.7%);
KRED  + :   76 ( 31.1%);   KR-ED   :  -10 ( -4.1%);   FIKMNY  :   59 ( 24.2%);
LVIFM --:   43 ( 17.6%);   ST      :   31 ( 12.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00000-00-- 0-000000-0 +-0+-00--0 00-0++0+00 --00+00000 0000000-0- 
        61  00+0+-0000 0+00--000+ 000+0000-- 0+00--0-00 00+00--0-0 -000++00-0 
       121  0000000-00 0+000-00+0 +000+00-00 -+0-+-0000 0-00000000 000+000000 
       181  -000000000 0+0-0+0000 -00-+00-00 ++00-0000+ 0-00+00000 0+0+000-0- 
       241  +0+0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 11/30 or 15/45 or 18/60):  none


Negative charge clusters (cmin = 13/30 or 18/45 or 22/60):  none


Mixed charge clusters (cmin = 19/30 or 26/45 or 33/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.135  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.689  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.176  ( ED )

 Expected score/letter:  -0.770;    Average information/letter:   1.030
 Minimal length of displayed segments set to:  20

M_0.01= 11.31  (cv=  7.01, lambda=  0.78413, k=  0.29347, x=  4.30;
                90% confidence interval for segment length:  12 +-  11)
M_0.05=  9.23  (x=  2.22)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.176  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.689  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.135  ( KR )

 Expected score/letter:  -0.607;    Average information/letter:   0.597
 Minimal length of displayed segments set to:  20

M_0.01= 14.34  (cv=  9.24, lambda=  0.59473, k=  0.20871, x=  5.10;
                90% confidence interval for segment length:  21 +-  20)
M_0.05= 11.60  (x=  2.36)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.311  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.689  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.377;    Average information/letter:   0.431
 Minimal length of displayed segments set to:  20

M_0.01= 10.74  (cv=  6.93, lambda=  0.79323, k=  0.20648, x=  3.81;
                90% confidence interval for segment length:  28 +-  26)
M_0.05=  8.69  (x=  1.76)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.689  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.311  ( KEDR )

 Expected score/letter:  -1.803;    Average information/letter:   0.428
 Minimal length of displayed segments set to:  20

M_0.01= 24.55  (cv= 15.49, lambda=  0.35481, k=  0.24970, x=  9.05;
                90% confidence interval for segment length:  29 +-  15)
M_0.05= 19.95  (x=  4.46)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   6 |   8 |  24 |   9 |  10 |  13 |  10 |  11 |  15 |   6 |   8 | 
lmin1     6 |   7 |  10 |  29 |  11 |  12 |  16 |  13 |  14 |  18 |   8 |  10 | 
lmin2     7 |   8 |  12 |  32 |  13 |  14 |  18 |  14 |  16 |  20 |   9 |  11 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   4:   0
  *  runs >=   6:   0
  0  runs >=  16:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -3.037;    Average information/letter:   1.208
 Minimal length of displayed segments set to:  15

M_0.01= 15.38  (cv=  9.13, lambda=  0.60190, k=  0.43255, x=  6.25;
                90% confidence interval for segment length:  11 +-   5)
M_0.05= 12.68  (x=  3.54)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -5.881;    Average information/letter:   1.096
 Minimal length of displayed segments set to:  15

M_0.01= 35.36  (cv= 21.69, lambda=  0.25347, k=  0.32172, x= 13.67;
                90% confidence interval for segment length:  12 +-   7)
M_0.05= 28.93  (x=  7.24);     M_0.30= 21.28  (x= -0.41)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-244-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[  42-  46]   DAVKA
[ 136- 140]   DAVKA


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  14  (Expected range:   4-- 28)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 5   (6-10) 2   (11-20) 4   (>=21) 4

3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  10  (Expected range:   0-- 20)
   3 +plets (f+: 13.5%), 7 -plets (f-: 17.6%)
   Total number of charge altplets: 7 (Critical number: 23)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 2   (6-10) 4   (11-20) 2   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
 194- 228        7      -00..00          5       5      /0/0/1/././1/0/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

 159- 178  (3.)     I(  19)I     8 of   8   0.0011   large minimal spacing



orfK VpV262 homologue of SIO1 gp29.1
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:07 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4816.4029.seq)

SWISS-PROT ANNOTATION:
ID   ORFK
DE   ORFK, 322 bases, C287FDE1 checksum.

number of residues:  322;   molecular weight:  35.3 kdal
 
         1  MSTGNNTSNT QALIVSEIWA DEIEDILHEK LLDVNIARVV DFPDGDKLTI PSVGTPVVRS 
        61  RPEQGDFTFD NLDTGEISII LRDEVYAGNA ISKKLRQDSR WISNVGAMLP AEQARAIMER 
       121  YQTDLLALGN AQFAGQNDPN VINGVPHRFV GTGTDQTMDV TDFSRVNYVM TQSKMPMGGM 
       181  IGIIDPSVAH HLETITNISN ISNNPRWEGI VESGIAPDMQ FVRSVYGIDL FVSNLLADAN 
       241  ETINAGGDAR STTAGKCNMF MNVSDMGLLP FVVAWKEMPT TKSFIDDYND DLNTATTARW 
       301  GNGLVRDENL VCVLANADKV TF

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 24( 7.5%); C  :  2( 0.6%); D+ : 28( 8.7%); E  : 15( 4.7%); F  : 12( 3.7%)
G  : 24( 7.5%); H  :  4( 1.2%); I  : 24( 7.5%); K  :  9( 2.8%); L  : 22( 6.8%)
M  : 13( 4.0%); N+ : 26( 8.1%); P  : 13( 4.0%); Q  : 10( 3.1%); R  : 15( 4.7%)
S  : 20( 6.2%); T  : 24( 7.5%); V  : 27( 8.4%); W  :  5( 1.6%); Y  :  5( 1.6%)

KR      :   24 (  7.5%);   ED      :   43 ( 13.4%);   AGP     :   61 ( 18.9%);
KRED    :   67 ( 20.8%);   KR-ED - :  -19 ( -5.9%);   FIKMNY  :   89 ( 27.6%);
LVIFM   :   98 ( 30.4%);   ST      :   44 ( 13.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000000 000000-000 --0--000-+ 00-0000+00 -00-0-+000 00000000+0 
        61  +0-00-000- 00-00-0000 0+--000000 00++0+0-0+ 0000000000 0-00+000-+ 
       121  000-000000 0000000-00 0000000+00 0000-000-0 0-00+00000 000+000000 
       181  0000-00000 00-0000000 00000+0-00 0-00000-00 00+00000-0 0000000-00 
       241  -000000-0+ 00000+0000 0000-00000 00000+-000 0+000--00- -0000000+0 
       301  00000+--00 0000000-+0 00

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  7/30 or 10/45 or 12/60):  none


Negative charge clusters (cmin = 11/30 or 15/45 or 18/60):  none


Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.075  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.792  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.134  ( ED )

 Expected score/letter:  -0.910;    Average information/letter:   1.961
 Minimal length of displayed segments set to:  20

M_0.01=  8.26  (cv=  5.05, lambda=  1.14369, k=  0.39717, x=  3.21;
                90% confidence interval for segment length:   7 +-   6)
M_0.05=  6.84  (x=  1.79)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.134  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.792  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.075  ( KR )

 Expected score/letter:  -0.674;    Average information/letter:   0.912
 Minimal length of displayed segments set to:  20

M_0.01= 11.87  (cv=  7.56, lambda=  0.76336, k=  0.26890, x=  4.31;
                90% confidence interval for segment length:  14 +-  13)
M_0.05=  9.74  (x=  2.17)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.208  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.792  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.584;    Average information/letter:   1.126
 Minimal length of displayed segments set to:  20

M_0.01=  7.13  (cv=  4.32, lambda=  1.33657, k=  0.43045, x=  2.81;
                90% confidence interval for segment length:  12 +-  10)
M_0.05=  5.91  (x=  1.59)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.792  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.208  ( KEDR )

 Expected score/letter:  -0.873;    Average information/letter:   0.152
 Minimal length of displayed segments set to:  20

M_0.01= 43.53  (cv= 31.28, lambda=  0.18461, k=  0.09648, x= 12.25;
                90% confidence interval for segment length:  76 +-  57)
M_0.05= 34.70  (x=  3.42)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   6 |  38 |   8 |  10 |  12 |  10 |  12 |  14 |   6 |   7 | 
lmin1     5 |   7 |   8 |  46 |  10 |  12 |  15 |  12 |  15 |  17 |   7 |   9 | 
lmin2     6 |   8 |   9 |  51 |  11 |  14 |  16 |  14 |  16 |  20 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  25:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.969;    Average information/letter:   0.631
 Minimal length of displayed segments set to:  15

M_0.01= 24.61  (cv= 15.61, lambda=  0.36984, k=  0.27960, x=  8.99;
                90% confidence interval for segment length:  21 +-  13)
M_0.05= 20.20  (x=  4.59)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.534;    Average information/letter:   0.485
 Minimal length of displayed segments set to:  15

M_0.01= 60.14  (cv= 39.63, lambda=  0.14569, k=  0.19929, x= 20.50;
                90% confidence interval for segment length:  26 +-  19)
M_0.05= 48.95  (x=  9.32);     M_0.30= 35.64  (x= -4.00)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-256-C-54-C-10-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  20  (Expected range:   6-- 32)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 7   (6-10) 4   (11-20) 4   (>=21) 6

3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   7  (Expected range:   0-- 15)
   1 +plets (f+: 7.5%), 6 -plets (f-: 13.4%)
   Total number of charge altplets: 7 (Critical number: 16)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 2   (6-10) 1   (11-20) 2   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
  17-  36        4      -.00             5       5      /0/./0/1/
  18-  65        8      i00.*...         6       6      /0/2/2/./2/./././
 109- 156        8      i.0.0000         6       6      /0/./1/./1/1/2/2/
 183- 236        9      i...0.000        6       6      /0/./././2/./2/2/2/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  86- 121  (2.)     Y(  35)Y     6 of   6   0.0051   large minimal spacing
 288- 323  (4.)     Y(  35)Y     5 of   6   0.0051     matching minimum



orfL VpV262 weak match to SIO1 gp24
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:08 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4815.3752.seq)

SWISS-PROT ANNOTATION:
ID   ORFL
DE   ORFL, 171 bases, CC9E95E8 checksum.

number of residues:  171;   molecular weight:  17.9 kdal
 
         1  MSRESITAGS TAGGTRRAAT HYGRRVEEGA NISVYHVDGN VFRQETTFKF DQLPTATLDQ 
        61  LHQAIPAGSR VLSATLKTHV DFTATTAVSL NIGLQERDGT EVDNDGLFAA LALPSANDFD 
       121  DGAGALVGAG TGLTVDAVVV VVPNVDDLLT GEATLVVEYE KADDRQQAMN G

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 22(12.9%); C  :  0( 0.0%); D  : 15( 8.8%); E  :  9( 5.3%); F  :  6( 3.5%)
G  : 17( 9.9%); H  :  4( 2.3%); I  :  4( 2.3%); K  :  3( 1.8%); L  : 15( 8.8%)
M  :  2( 1.2%); N  :  7( 4.1%); P  :  4( 2.3%); Q  :  7( 4.1%); R  :  9( 5.3%)
S  :  8( 4.7%); T  : 18(10.5%); V  : 18(10.5%); W  :  0( 0.0%); Y  :  3( 1.8%)

KR      :   12 (  7.0%);   ED      :   24 ( 14.0%);   AGP     :   43 ( 25.1%);
KRED    :   36 ( 21.1%);   KR-ED   :  -12 ( -7.0%);   FIKMNY  :   25 ( 14.6%);
LVIFM   :   45 ( 26.3%);   ST      :   26 ( 15.2%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00+-000000 00000++000 000++0--00 0000000-00 00+0-000+0 -0000000-0 
        61  000000000+ 000000+000 -000000000 00000-+-00 -0-0-00000 0000000-0- 
       121  -000000000 00000-0000 00000--000 0-00000-0- +0--+00000 0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  7/30 or 10/45 or 12/60):  none


Negative charge clusters (cmin = 11/30 or 15/45 or 19/60):  none


Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.070  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.789  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.140  ( ED )

 Expected score/letter:  -0.930;    Average information/letter:   2.078
 Minimal length of displayed segments set to:  20

M_0.01=  7.49  (cv=  4.35, lambda=  1.18077, k=  0.40659, x=  3.13;
                90% confidence interval for segment length:   6 +-   5)
M_0.05=  6.11  (x=  1.75)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.140  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.789  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.070  ( KR )

 Expected score/letter:  -0.649;    Average information/letter:   0.832
 Minimal length of displayed segments set to:  20

M_0.01= 11.49  (cv=  7.06, lambda=  0.72862, k=  0.25453, x=  4.44;
                90% confidence interval for segment length:  15 +-  14)
M_0.05=  9.26  (x=  2.20)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.211  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.789  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.579;    Average information/letter:   1.104
 Minimal length of displayed segments set to:  20

M_0.01=  6.72  (cv=  3.89, lambda=  1.32176, k=  0.42456, x=  2.83;
                90% confidence interval for segment length:  12 +-   9)
M_0.05=  5.49  (x=  1.60)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.789  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.211  ( KEDR )

 Expected score/letter:  -0.895;    Average information/letter:   0.158
 Minimal length of displayed segments set to:  20

M_0.01= 39.41  (cv= 27.24, lambda=  0.18877, k=  0.10004, x= 12.17;
                90% confidence interval for segment length:  68 +-  53)
M_0.05= 30.78  (x=  3.54)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   6 |  35 |   8 |  10 |  11 |   9 |  11 |  13 |   6 |   8 | 
lmin1     5 |   6 |   8 |  42 |   9 |  12 |  14 |  11 |  14 |  16 |   8 |   9 | 
lmin2     6 |   7 |   9 |  47 |  10 |  13 |  15 |  12 |  16 |  18 |   9 |  11 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  23:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.942;    Average information/letter:   0.617
 Minimal length of displayed segments set to:  15

M_0.01= 22.53  (cv= 13.69, lambda=  0.37566, k=  0.27810, x=  8.84;
                90% confidence interval for segment length:  20 +-  13)
M_0.05= 18.19  (x=  4.50)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.240;    Average information/letter:   0.413
 Minimal length of displayed segments set to:  15

M_0.01= 59.67  (cv= 38.24, lambda=  0.13445, k=  0.17917, x= 21.43;
                90% confidence interval for segment length:  28 +-  21)
M_0.05= 47.54  (x=  9.30);     M_0.30= 33.12  (x= -5.12)

 1) From  122 to  142 (type 1):  length= 21, score=42.00
    (pocket at  136 to  136:   length=  1, score=-17.00) 
     122  GAGALVGAGT GLTV |D| AVVVV V
    A:  4(19.0%);  G:  5(23.8%);  V:  7(33.3%);

# of segments (>=15 residues) exceeding M_0.30:  1


2. SPACINGS OF C.


H2N-171-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  15  (Expected range:   2-- 23)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 8   (6-10) 3   (11-20) 3   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 14/30 or 18/45 or 22/60):  none

4. Long amino acid multiplets (>= 5; Letter/Length/Position):
    V/5/138


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   6  (Expected range:   0-- 10)
   2 +plets (f+: 7.0%), 4 -plets (f-: 14.0%)
   Total number of charge altplets: 4 (Critical number: 10)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 2   (11-20) 2   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 138- 142        1      V                5       5 !     0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


orfM VpV262 limited match to N ter of SIO1 gp29
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:10 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4815.4454.seq)

SWISS-PROT ANNOTATION:
ID   ORFM
DE   ORFM, 327 bases, 5948A057 checksum.

number of residues:  327;   molecular weight:  35.1 kdal
 
         1  MTDHVNLLDS ELHEPKGMQV LTGGASDVGK VVVSKGDGTT ETRKLDITEI EGAESILLSG 
        61  AYFDDNMATA TVDVATELKE SYLIGAAPTQ VIQLQLPDPA VYAGAPITLK RLDANHGDGS 
       121  EVKFIPNASE TIEGASELAI TLQNTSIVVV SDGTDWHIRA DIYPVKFKHG WFNYNDLATQ 
       181  STPINHTGGV DTKLTNDALG PATLVPYPPE GVTKVWDATT NQFDFTELVN GDTVDIRLDL 
       241  EITTTSANQE ITVYILLGVG GTTINVPIIG RIVKTAGANR IVQFSSVFMG GDNTRLNPAE 
       301  LYISSPSNAT VKVNGWYNRI IGVNPTY

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 25( 7.6%); C  :  0( 0.0%); D  : 23( 7.0%); E  : 17( 5.2%); F  :  8( 2.4%)
G  : 29( 8.9%); H  :  6( 1.8%); I  : 25( 7.6%); K  : 13( 4.0%); L  : 26( 8.0%)
M  :  4( 1.2%); N  : 20( 6.1%); P  : 16( 4.9%); Q  :  9( 2.8%); R- :  8( 2.4%)
S  : 18( 5.5%); T++: 36(11.0%); V  : 30( 9.2%); W  :  4( 1.2%); Y  : 10( 3.1%)

KR      :   21 (  6.4%);   ED      :   40 ( 12.2%);   AGP     :   70 ( 21.4%);
KRED    :   61 ( 18.7%);   KR-ED - :  -19 ( -5.8%);   FIKMNY  :   80 ( 24.5%);
LVIFM   :   93 ( 28.4%);   ST    + :   54 ( 16.5%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00-00000-0 -00-0+0000 000000-00+ 0000+0-000 -0++0-00-0 -00-000000 
        61  000--00000 00-000-0+- 0000000000 0000000-00 000000000+ +0-0000-00 
       121  -0+000000- 00-000-000 0000000000 0-00-000+0 -0000+0+00 00000-0000 
       181  0000000000 -0+000-000 000000000- 000+00-000 000-00-000 0-00-0+0-0 
       241  -00000000- 0000000000 0000000000 +00+00000+ 0000000000 0-00+0000- 
       301  0000000000 0+000000+0 0000000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  7/30 or  9/45 or 11/60):  none


Negative charge clusters (cmin = 10/30 or 14/45 or 17/60):  none


Mixed charge clusters (cmin = 14/30 or 18/45 or 23/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.064  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.813  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.122  ( ED )

 Expected score/letter:  -0.930;    Average information/letter:   2.209
 Minimal length of displayed segments set to:  20

M_0.01=  7.73  (cv=  4.71, lambda=  1.23008, k=  0.41471, x=  3.02;
                90% confidence interval for segment length:   6 +-   5)
M_0.05=  6.41  (x=  1.70)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.122  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.813  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.064  ( KR )

 Expected score/letter:  -0.697;    Average information/letter:   1.032
 Minimal length of displayed segments set to:  20

M_0.01= 11.17  (cv=  7.07, lambda=  0.81848, k=  0.28798, x=  4.10;
                90% confidence interval for segment length:  13 +-  12)
M_0.05=  9.18  (x=  2.11)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.187  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.813  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.627;    Average information/letter:   1.332
 Minimal length of displayed segments set to:  20

M_0.01=  6.56  (cv=  3.93, lambda=  1.47262, k=  0.48315, x=  2.63;
                90% confidence interval for segment length:  10 +-   8)
M_0.05=  5.45  (x=  1.52)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.813  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.187  ( KEDR )

 Expected score/letter:  -0.679;    Average information/letter:   0.103
 Minimal length of displayed segments set to:  20

M_0.01= 52.09  (cv= 39.29, lambda=  0.14737, k=  0.06626, x= 12.80;
                90% confidence interval for segment length: 108 +-  88)
M_0.05= 41.03  (x=  1.74)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   6 |  42 |   8 |  10 |  12 |   9 |  12 |  14 |   6 |   8 | 
lmin1     5 |   6 |   8 |  51 |  10 |  12 |  14 |  12 |  14 |  17 |   8 |  10 | 
lmin2     6 |   7 |   9 |  57 |  11 |  13 |  16 |  13 |  16 |  19 |   9 |  11 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  28:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.865;    Average information/letter:   0.610
 Minimal length of displayed segments set to:  15

M_0.01= 24.37  (cv= 15.53, lambda=  0.37293, k=  0.27205, x=  8.84;
                90% confidence interval for segment length:  22 +-  14)
M_0.05= 20.00  (x=  4.47)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.092;    Average information/letter:   0.407
 Minimal length of displayed segments set to:  15

M_0.01= 64.25  (cv= 43.05, lambda=  0.13448, k=  0.17394, x= 21.20;
                90% confidence interval for segment length:  31 +-  23)
M_0.05= 52.13  (x=  9.08);     M_0.30= 37.71  (x= -5.34)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-327-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[  31-  39]   VVVSKGDGT
[ 148- 154]   VVVS__DGT

______________________________

[  50-  53]   IEGA
[ 132- 135]   IEGA


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  20  (Expected range:   7-- 34)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 4   (6-10) 6   (11-20) 7   (>=21) 4

3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   3  (Expected range:   0-- 13)
   2 +plets (f+: 6.4%), 1 -plets (f-: 12.2%)
   Total number of charge altplets: 1 (Critical number: 14)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 1   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 179- 210        8      T.......         4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


orfN VpV262 mosaic homologue of SIO1 gp27; suspected tail fiber
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:11 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.6375.9592.seq)

SWISS-PROT ANNOTATION:
ID   ORFN
DE   ORFN, 911 bases, 9272424C checksum.

number of residues:  911;   molecular weight: 100.2 kdal
 
         1  MAARKGAVNR FTPVRGWVTE GNLANYGQDV ALDVENMDIE KTGLTQRRFG LFAETSSEQF 
        61  LSTFTATARA RGLLAVKEWR EAWGDKDVNM LIFHAGYKVH VVQDTAPLRD ANILLTIDLL 
       121  EAGIKLDGVI DSPVHISVGV GFAIITNPRI EPVLIKLDDV DDEGVPTLSY EPLTLLIRTR 
       181  ELLTPYTTGT NYGDTLTPEE EWNLYNSGWA TITRATKDKS GSGTVYVNPV QYYFDKRGVY 
       241  PSHSVLYNSM KQESAKEIVA LNVFSPWADE KINFGTTTPP LGRYIHSAYY FDSAAILSLG 
       301  IGNLTPPTSD GTTEGSGPAE EEISNPIGLD NIGTVNNLKL IAEGTVRWTV KDRPRCSGYH 
       361  NGHVYFGDRD KNGKTRILVS QLVNSLDNIP KCFQDADPTA EEINDLIATD GFTMYPVGMG 
       421  APITMVEFNK RLLLLCTNGV WAIRGTSGGG ATATDFTLDK VASVEFNSPQ SVVDIGTAIV 
       481  FWSERGIIAI GVNDFGDLTS NNLTENTIDE YYDSLDRDII KNVKGTFIND ENRVYWVVPN 
       541  KQDSNGEYKT DGELVLVLNL DTGGFYKHTV SGGPLLHAPF RRLVNTRAEV SIPITETDGT 
       601  VITDTLGDPV TVTRTVTTTG VDGLAYFASF DDGVNGQFNF IAEHQPWGFA DWANVPNMTR 
       661  VNYSSYVDFA YEYPEVMIGN ISLPYIHSYY LTGIRVQTEQ YTTETAHLSF HRVQAHQTTA 
       721  LGTVTFHKVD MMVSTGMQVI SFHKDDLLRT EAVTLVNPDA ETGDATGWTV TAGTLDVRTA 
       781  APLYQGSYYF WSDSNANFAA YQDIDPVGGG YITAGELANN VIEAKLSWAA RGNTDLGTVY 
       841  IECLDAIGTV LASTDTTRFS GHDTWTRYGD AVVLPTDTDT IRVWIVGTLV ATNDVNYYVD 
       901  DIQLNLEVHN V

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 65( 7.1%); C  :  4( 0.4%); D  : 69( 7.6%); E  : 46( 5.0%); F  : 34( 3.7%)
G  : 78( 8.6%); H  : 18( 2.0%); I  : 54( 5.9%); K  : 28( 3.1%); L  : 72( 7.9%)
M  : 12( 1.3%); N  : 54( 5.9%); P  : 37( 4.1%); Q  : 20( 2.2%); R  : 38( 4.2%)
S  : 47( 5.2%); T++: 97(10.6%); V  : 81( 8.9%); W  : 17( 1.9%); Y  : 40( 4.4%)

KR      :   66 (  7.2%);   ED      :  115 ( 12.6%);   AGP     :  180 ( 19.8%);
KRED    :  181 ( 19.9%);   KR-ED - :  -49 ( -5.4%);   FIKMNY  :  222 ( 24.4%);
LVIFM   :  253 ( 27.8%);   ST    + :  144 ( 15.8%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  000++0000+ 0000+0000- 00000000-0 00-0-00-0- +00000++00 000-000-00 
        61  00000000+0 +00000+-0+ -000-+-000 0000000+00 000-0000+- 0000000-00 
       121  -000+0-000 -000000000 00000000+0 -0000+0--0 ---0000000 -000000+0+ 
       181  -000000000 000-0000-- -000000000 000+00+-+0 0000000000 0000-++000 
       241  0000000000 +0-00+-000 00000000-- +000000000 00+0000000 0-00000000 
       301  000000000- 000-00000- --0000000- 00000000+0 00-000+000 +-+0+00000 
       361  0000000-+- +00+0+0000 000000-000 +000-0-000 --00-0000- 0000000000 
       421  000000-00+ +000000000 000+000000 0000-000-+ 0000-00000 000-000000 
       481  000-+00000 000-00-000 0000-000-- 00-00-+-00 +00+00000- -0+0000000 
       541  +0-000-0+0 -0-0000000 -00000+000 0000000000 ++0000+0-0 00000-0-00 
       601  000-000-00 000+000000 0-00000000 --00000000 00-0000000 -00000000+ 
       661  0000000-00 0-00-00000 0000000000 0000+000-0 000-000000 0+00000000 
       721  0000000+0- 0000000000 000+--00+0 -0000000-0 -00-000000 00000-0+00 
       781  0000000000 00-0000000 00-0-00000 00000-0000 00-0+00000 +000-00000 
       841  0-00-00000 0000-00+00 00-000+00- 000000-0-0 0+00000000 000-00000- 
       901  -00000-000 0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  8/30 or 11/45 or 13/60):  none


Negative charge clusters (cmin = 11/30 or 15/45 or 19/60):  none


Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.072  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.801  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.126  ( ED )

 Expected score/letter:  -0.909;    Average information/letter:   2.001
 Minimal length of displayed segments set to:  20

M_0.01=  9.06  (cv=  5.88, lambda=  1.15932, k=  0.39980, x=  3.18;
                90% confidence interval for segment length:   8 +-   6)
M_0.05=  7.65  (x=  1.77)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.126  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.801  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.072  ( KR )

 Expected score/letter:  -0.694;    Average information/letter:   0.994
 Minimal length of displayed segments set to:  20

M_0.01= 12.69  (cv=  8.52, lambda=  0.80008, k=  0.28253, x=  4.17;
                90% confidence interval for segment length:  15 +-  13)
M_0.05= 10.65  (x=  2.13)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.199  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.801  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.603;    Average information/letter:   1.212
 Minimal length of displayed segments set to:  20

M_0.01=  7.62  (cv=  4.89, lambda=  1.39455, k=  0.45321, x=  2.73;
                90% confidence interval for segment length:  13 +-   9)
M_0.05=  6.45  (x=  1.56)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.801  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.199  ( KEDR )

 Expected score/letter:  -0.788;    Average information/letter:   0.130
 Minimal length of displayed segments set to:  20

M_0.01= 52.97  (cv= 40.44, lambda=  0.16852, k=  0.08306, x= 12.53;
                90% confidence interval for segment length:  99 +-  72)
M_0.05= 43.30  (x=  2.86)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   7 |  44 |   9 |  11 |  13 |  11 |  13 |  16 |   7 |   9 | 
lmin1     6 |   7 |   9 |  53 |  11 |  13 |  16 |  13 |  16 |  19 |   8 |  11 | 
lmin2     7 |   8 |  10 |  59 |  12 |  15 |  18 |  15 |  18 |  21 |  10 |  12 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   4:   0
  *  runs >=   5:   0
  0  runs >=  29:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.888;    Average information/letter:   0.604
 Minimal length of displayed segments set to:  15

M_0.01= 27.32  (cv= 18.41, lambda=  0.37014, k=  0.27181, x=  8.91;
                90% confidence interval for segment length:  24 +-  15)
M_0.05= 22.92  (x=  4.51)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.239;    Average information/letter:   0.436
 Minimal length of displayed segments set to:  15

M_0.01= 69.13  (cv= 48.50, lambda=  0.14050, k=  0.18236, x= 20.63;
                90% confidence interval for segment length:  32 +-  22)
M_0.05= 57.53  (x=  9.03);     M_0.30= 43.73  (x= -4.77)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-355-C-35-C-43-C-406-C-68-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  5

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  9

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  55  (Expected range:  33-- 77)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 11   (6-10) 12   (11-20) 18   (>=21) 15

3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  16  (Expected range:   4-- 30)
   5 +plets (f+: 7.2%), 11 -plets (f-: 12.6%)
   Total number of charge altplets: 16 (Critical number: 31)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 2   (6-10) 0   (11-20) 2   (>=21) 13

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 108- 131        6      L.....           4       4       0
 226- 253        7      Y......          4       4       0
 432- 435        1      L                4       4       0
 554- 561        2      L.               4       4       0
 611- 620        2      T.               5       5 !     0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
 134- 145        2      i0               6       6      /0/1/
 140- 169        5      i....            6       6       0
 555- 584        5      i..00            6       6      /0/././2/1/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


orfO VpV262 weak indications may correspond to parts of SIO1 gp26 or gp24
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:13 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4819.2365.seq)

SWISS-PROT ANNOTATION:
ID   ORFO
DE   ORFO, 359 bases, EC7412AA checksum.

number of residues:  359;   molecular weight:  38.7 kdal
 
         1  MAILDFNTGI QMGLVGDTAA VDGKGTTDAS KIGHRVLEGF LIDRVNHRDE VAHLTWTSRN 
        61  EVWVSWYMWQ EEYDLNAGRD GRAVIAFRGI DASGSEVYWG AIDGQAGASD RTASFYLYGA 
       121  GESPSDTSSG TSIIQNIDLM PYDGVRSRVD VHVKLDNVNG EVDVYINQNL VGSFVGDTIL 
       181  NPDLTTVSNC HFGMLGEYIS GLVSKNNDCM FSAAFAADES TVPITMVQSE VNANGTLQQM 
       241  TGDYTDVSLL GDWDDATKVT ADAAGQTSTF GKSALPAQYQ TGYELIAVGV NSRTAVALDY 
       301  TIASLAHVLD NGTTVKEGTE IPLDDLLQYR KTIFHTDADD NPWTATAFDA TQIGLKAKL

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 35( 9.7%); C  :  2( 0.6%); D++: 35( 9.7%); E  : 14( 3.9%); F  : 11( 3.1%)
G  : 34( 9.5%); H  :  7( 1.9%); I  : 19( 5.3%); K  : 10( 2.8%); L  : 27( 7.5%)
M  :  8( 2.2%); N  : 18( 5.0%); P- :  7( 1.9%); Q  : 13( 3.6%); R  : 12( 3.3%)
S  : 25( 7.0%); T+ : 32( 8.9%); V  : 30( 8.4%); W  :  7( 1.9%); Y  : 13( 3.6%)

KR      :   22 (  6.1%);   ED      :   49 ( 13.6%);   AGP     :   76 ( 21.2%);
KRED    :   71 ( 19.8%);   KR-ED --:  -27 ( -7.5%);   FIKMNY  :   79 ( 22.0%);
LVIFM   :   95 ( 26.5%);   ST    + :   57 ( 15.9%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000-00000 000000-000 0-0+000-00 +000+00-00 00-+000+-- 00000000+0 
        61  -000000000 --0-0000+- 0+00000+00 -0000-0000 00-000000- +000000000 
       121  0-000-0000 0000000-00 00-00+0+0- 000+0-0000 -0-0000000 000000-000 
       181  00-0000000 000000-000 0000+00-00 0000000--0 000000000- 0000000000 
       241  00-00-0000 0-0--00+00 0-00000000 0+00000000 000-000000 00+00000-0 
       301  000000000- 00000+-00- 000--0000+ +00000-0-- 00000000-0 00000+0+0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  7/30 or  9/45 or 11/60):  none


Negative charge clusters (cmin = 11/30 or 15/45 or 18/60):  none


Mixed charge clusters (cmin = 14/30 or 19/45 or 24/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.061  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.802  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.136  ( ED )

 Expected score/letter:  -0.953;    Average information/letter:   2.314
 Minimal length of displayed segments set to:  20

M_0.01=  7.64  (cv=  4.67, lambda=  1.25986, k=  0.42251, x=  2.97;
                90% confidence interval for segment length:   6 +-   5)
M_0.05=  6.34  (x=  1.67)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.136  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.802  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.061  ( KR )

 Expected score/letter:  -0.652;    Average information/letter:   0.861
 Minimal length of displayed segments set to:  20

M_0.01= 12.27  (cv=  7.90, lambda=  0.74465, k=  0.25940, x=  4.37;
                90% confidence interval for segment length:  15 +-  14)
M_0.05= 10.08  (x=  2.18)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.198  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.802  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.604;    Average information/letter:   1.221
 Minimal length of displayed segments set to:  20

M_0.01=  6.93  (cv=  4.20, lambda=  1.40028, k=  0.45544, x=  2.72;
                90% confidence interval for segment length:  11 +-   9)
M_0.05=  5.76  (x=  1.56)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.802  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.198  ( KEDR )

 Expected score/letter:  -0.780;    Average information/letter:   0.128
 Minimal length of displayed segments set to:  20

M_0.01= 47.80  (cv= 35.24, lambda=  0.16694, k=  0.08177, x= 12.56;
                90% confidence interval for segment length:  90 +-  70)
M_0.05= 38.04  (x=  2.79)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   6 |  40 |   8 |  10 |  12 |   9 |  12 |  14 |   6 |   8 | 
lmin1     5 |   7 |   8 |  49 |  10 |  12 |  14 |  12 |  15 |  17 |   8 |  10 | 
lmin2     6 |   8 |   9 |  54 |  11 |  14 |  16 |  13 |  17 |  20 |   9 |  11 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  27:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.858;    Average information/letter:   0.584
 Minimal length of displayed segments set to:  15

M_0.01= 25.15  (cv= 16.14, lambda=  0.36442, k=  0.26713, x=  9.00;
                90% confidence interval for segment length:  23 +-  15)
M_0.05= 20.67  (x=  4.53)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.175;    Average information/letter:   0.419
 Minimal length of displayed segments set to:  15

M_0.01= 62.64  (cv= 42.07, lambda=  0.13985, k=  0.17844, x= 20.57;
                90% confidence interval for segment length:  30 +-  22)
M_0.05= 50.98  (x=  8.91);     M_0.30= 37.12  (x= -4.95)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-189-C-18-C-150-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[  15-  18]   VGDT
[ 175- 178]   VGDT


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  17  (Expected range:   8-- 36)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 4   (6-10) 5   (11-20) 4   (>=21) 5

3. Clusters of amino acid multiplets (cmin =  9/30 or 12/45 or 14/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   7  (Expected range:   0-- 15)
   1 +plets (f+: 6.1%), 6 -plets (f-: 13.6%)
   Total number of charge altplets: 5 (Critical number: 15)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 1   (11-20) 1   (>=21) 5

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
 171- 218        8      i000..0.         6       6      /0/1/2/1/././1/./


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  23-  25  (1.)     G(   2)G    35 of  35   0.0006   large minimal spacing
  62-  64  (1.)     V(   2)V    31 of  31   0.0034   large minimal spacing
 119- 121  (2.)     G(   2)G    34 of  35   0.0006     matching minimum
 120- 213  (2.)     A(  93)A     1 of  36   0.0005   large maximal spacing
 149- 151  (2.)     V(   2)V    27 of  31   0.0034     matching minimum
 151- 153  (2.)     V(   2)V    28 of  31   0.0034     matching minimum
 162- 164  (2.)     V(   2)V    29 of  31   0.0034     matching minimum
 288- 290  (4.)     V(   2)V    30 of  31   0.0034     matching minimum



orfP VpV262 corresponds to C ter of SIO1 gp27
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:14 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4814.3475.seq)

SWISS-PROT ANNOTATION:
ID   ORFP
DE   ORFP, 98 bases, 5019B0EA checksum.

number of residues:   98;   molecular weight:  11.0 kdal
 
         1  MIPSDNHSSL MTASFDWSID SRTRKVTTQQ EAYKYDSRRL AQGGAESAYP YEVVDTKLRI 
        61  RGQGRAVRIR FQSNGGKDLR LLGYSILGVD FGDSEGNS

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  6( 6.1%); C  :  0( 0.0%); D  :  8( 8.2%); E  :  4( 4.1%); F  :  3( 3.1%)
G  : 10(10.2%); H  :  1( 1.0%); I  :  5( 5.1%); K  :  4( 4.1%); L  :  7( 7.1%)
M  :  2( 2.0%); N  :  3( 3.1%); P  :  2( 2.0%); Q  :  5( 5.1%); R  : 10(10.2%)
S  : 12(12.2%); T  :  5( 5.1%); V  :  5( 5.1%); W  :  1( 1.0%); Y  :  5( 5.1%)

KR      :   14 ( 14.3%);   ED      :   12 ( 12.2%);   AGP     :   18 ( 18.4%);
KRED    :   26 ( 26.5%);   KR-ED   :    2 (  2.0%);   FIKMNY  :   22 ( 22.4%);
LVIFM   :   22 ( 22.4%);   ST      :   17 ( 17.3%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000-00000 00000-000- 0+0++00000 -00+0-0++0 00000-0000 0-00-0+0+0 
        61  +000+00+0+ 000000+-0+ 000000000- 00-0-000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 11/30 or 15/45 or 19/60):  none


Negative charge clusters (cmin = 10/30 or 14/45 or 17/60):  none


Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.143  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.735  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.122  ( ED )

 Expected score/letter:  -0.694;    Average information/letter:   0.875
 Minimal length of displayed segments set to:  20

M_0.01= 10.72  (cv=  6.26, lambda=  0.73288, k=  0.26481, x=  4.46;
                90% confidence interval for segment length:  13 +-  13)
M_0.05=  8.50  (x=  2.24)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.122  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.735  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.143  ( KR )

 Expected score/letter:  -0.776;    Average information/letter:   1.139
 Minimal length of displayed segments set to:  20

M_0.01=  9.54  (cv=  5.46, lambda=  0.83914, k=  0.30840, x=  4.08;
                90% confidence interval for segment length:  10 +-  10)
M_0.05=  7.60  (x=  2.14)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.265  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.735  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.469;    Average information/letter:   0.690
 Minimal length of displayed segments set to:  20

M_0.01=  7.84  (cv=  4.50, lambda=  1.01857, k=  0.29989, x=  3.33;
                90% confidence interval for segment length:  17 +-  15)
M_0.05=  6.24  (x=  1.73)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.735  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.265  ( KEDR )

 Expected score/letter:  -1.388;    Average information/letter:   0.300
 Minimal length of displayed segments set to:  20

M_0.01= 26.76  (cv= 16.40, lambda=  0.27954, k=  0.18165, x= 10.35;
                90% confidence interval for segment length:  36 +-  24)
M_0.05= 20.93  (x=  4.52)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   4 |   7 |  25 |   9 |   8 |  12 |  10 |   9 |  13 |   5 |   6 | 
lmin1     6 |   6 |   8 |  31 |  11 |  10 |  14 |  12 |  12 |  16 |   6 |   8 | 
lmin2     7 |   7 |  10 |  34 |  12 |  11 |  16 |  14 |  13 |  18 |   7 |   9 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  17:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.531;    Average information/letter:   0.899
 Minimal length of displayed segments set to:  15

M_0.01= 17.14  (cv=  9.61, lambda=  0.47701, k=  0.36371, x=  7.52;
                90% confidence interval for segment length:  13 +-   8)
M_0.05= 13.72  (x=  4.11)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.408;    Average information/letter:   0.696
 Minimal length of displayed segments set to:  15

M_0.01= 40.87  (cv= 24.04, lambda=  0.19072, k=  0.24920, x= 16.83;
                90% confidence interval for segment length:  16 +-  11)
M_0.05= 32.33  (x=  8.29);     M_0.30= 22.16  (x= -1.88)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-98-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   8  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 3   (6-10) 3   (11-20) 2   (>=21) 1

3. Clusters of amino acid multiplets (cmin = 13/30 or 17/45 or 20/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   2  (Expected range:   0--  9)
   2 +plets (f+: 14.3%), 0 -plets (f-: 12.2%)
   Total number of charge altplets: 1 (Critical number: 10)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 1   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


orfQ VpV262 homologue of SIO1 gp28
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:16 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.2920.seq)

SWISS-PROT ANNOTATION:
ID   ORFQ
DE   ORFQ, 250 bases, A5C6E0EF checksum.

number of residues:  250;   molecular weight:  28.9 kdal
 
         1  MPKRTLLQIV KKMAQKTGSD EVTSLSEDSI EIQDMVDCAL EVLEDIIYRN DWEFLKDRPA 
        61  QLEAGTNAIE LSIPDNVRKI QTLRYRYEDA GVQNCFRTLR YMYPHEFMER LQNNKPTDPD 
       121  TTTVTINGVE LYPKTNRHPR YWTSFDEQNV VLDSYDATQN PTGVDATDSA IIATLYLDFT 
       181  GSDADSWVAP IPESLFTLWE QEAVAEAFVQ FRQTENPRAE RRSRRTYVQQ IKKEPVTHKD 
       241  EGSDEVNYGR

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 16( 6.4%); C  :  2( 0.8%); D+ : 21( 8.4%); E  : 22( 8.8%); F  :  8( 3.2%)
G- :  8( 3.2%); H  :  3( 1.2%); I  : 13( 5.2%); K  : 11( 4.4%); L  : 17( 6.8%)
M  :  5( 2.0%); N  : 12( 4.8%); P  : 13( 5.2%); Q  : 14( 5.6%); R  : 18( 7.2%)
S  : 13( 5.2%); T+ : 22( 8.8%); V  : 17( 6.8%); W  :  4( 1.6%); Y  : 11( 4.4%)

KR      :   29 ( 11.6%);   ED    + :   43 ( 17.2%);   AGP     :   37 ( 14.8%);
KRED  + :   72 ( 28.8%);   KR-ED - :  -14 ( -5.6%);   FIKMNY  :   60 ( 24.0%);
LVIFM   :   60 ( 24.0%);   ST      :   35 ( 14.0%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00++000000 ++000+000- -00000--00 -00-00-000 -00--000+0 -0-00+-+00 
        61  00-000000- 0000-00++0 000+0+0--0 000000+00+ 00000-00-+ 0000+00-0- 
       121  000000000- 000+00+00+ 00000--000 00-00-0000 0000-00-00 0000000-00 
       181  00-0-00000 00-000000- 0-000-0000 0+00-00+0- ++0++00000 0++-0000+- 
       241  -00--0000+

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 10/30 or 13/45 or 16/60):  none


Negative charge clusters (cmin = 13/30 or 17/45 or 22/60):  none


Mixed charge clusters (cmin = 18/30 or 25/45 or 31/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.116  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.712  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.172  ( ED )

 Expected score/letter:  -0.824;    Average information/letter:   1.266
 Minimal length of displayed segments set to:  20

M_0.01= 10.22  (cv=  6.27, lambda=  0.88084, k=  0.32762, x=  3.96;
                90% confidence interval for segment length:  10 +-   9)
M_0.05=  8.37  (x=  2.11)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.172  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.712  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.116  ( KR )

 Expected score/letter:  -0.600;    Average information/letter:   0.606
 Minimal length of displayed segments set to:  20

M_0.01= 14.16  (cv=  9.13, lambda=  0.60472, k=  0.21014, x=  5.03;
                90% confidence interval for segment length:  20 +-  20)
M_0.05= 11.46  (x=  2.33)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.288  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.712  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.424;    Average information/letter:   0.554
 Minimal length of displayed segments set to:  20

M_0.01=  9.66  (cv=  6.10, lambda=  0.90512, k=  0.25249, x=  3.56;
                90% confidence interval for segment length:  23 +-  20)
M_0.05=  7.86  (x=  1.76)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.712  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.288  ( KEDR )

 Expected score/letter:  -1.592;    Average information/letter:   0.363
 Minimal length of displayed segments set to:  20

M_0.01= 27.13  (cv= 17.44, lambda=  0.31652, k=  0.21542, x=  9.68;
                90% confidence interval for segment length:  34 +-  19)
M_0.05= 21.98  (x=  4.53)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   6 |   8 |  26 |   9 |  10 |  13 |  10 |  12 |  15 |   6 |   7 | 
lmin1     6 |   7 |  10 |  32 |  11 |  12 |  16 |  12 |  14 |  18 |   7 |   9 | 
lmin2     7 |   8 |  11 |  35 |  12 |  14 |  18 |  14 |  16 |  20 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)

(-00)  18(0,2,0); at   28-  47:   DSIEIQDMVDCALEVLEDII
      (1. quartile)               -00-00-00-000-00--00


Run count statistics:

  +  runs >=   3:   0
  -  runs >=   4:   0
  *  runs >=   5:   0
  0  runs >=  17:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.764;    Average information/letter:   1.030
 Minimal length of displayed segments set to:  15

M_0.01= 17.96  (cv= 10.80, lambda=  0.51118, k=  0.39034, x=  7.16;
                90% confidence interval for segment length:  13 +-   7)
M_0.05= 14.77  (x=  3.97)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -5.244;    Average information/letter:   0.905
 Minimal length of displayed segments set to:  15

M_0.01= 41.68  (cv= 25.91, lambda=  0.21312, k=  0.28987, x= 15.77;
                90% confidence interval for segment length:  14 +-   9)
M_0.05= 34.03  (x=  8.13);     M_0.30= 24.93  (x= -0.97)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-37-C-56-C-155-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[  17-  20]   TGSD
[ 180- 183]   TGSD

______________________________

[  18-  22]   GSDEV
[ 242- 246]   GSDEV

______________________________

[  82-  85]   TLRY
[  98- 101]   TLRY


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  11  (Expected range:   3-- 26)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 4   (6-10) 2   (11-20) 2   (>=21) 4

3. Clusters of amino acid multiplets (cmin =  9/30 or 11/45 or 14/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  13  (Expected range:   0-- 18)
   6 +plets (f+: 11.6%), 7 -plets (f-: 17.2%)
   Total number of charge altplets: 5 (Critical number: 21)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 3   (6-10) 7   (11-20) 1   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
  37-  60        4      *...             6       6       0
  70- 123        9      *.0...000        6       6      /0/./2/./././0/2/2/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  16-  49  (1.)     +(  33)+     2 of  30   0.0305   large  2. maximal spacing
 140- 212  (3.)     +(  72)+     1 of  30   0.0008   large  1. maximal spacing



orfR VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:17 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.2250.424.seq)

SWISS-PROT ANNOTATION:
ID   ORFR
DE   ORFR, 90 bases, 60E5FB1F checksum.

number of residues:   90;   molecular weight:  10.3 kdal
 
         1  MDDRRFNRQK KARETKIGTD STGAEYWLVP FSEAKVGHRK FVIRNSKKQV PVQLQGMFTD 
        61  AATAERVYKA YMDQSKAKKK APAKAEADAE

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 13(14.4%); C  :  0( 0.0%); D  :  6( 6.7%); E  :  6( 6.7%); F  :  4( 4.4%)
G  :  4( 4.4%); H  :  1( 1.1%); I  :  2( 2.2%); K  : 13(14.4%); L  :  2( 2.2%)
M  :  3( 3.3%); N  :  2( 2.2%); P  :  3( 3.3%); Q  :  5( 5.6%); R  :  7( 7.8%)
S  :  4( 4.4%); T  :  5( 5.6%); V  :  6( 6.7%); W  :  1( 1.1%); Y  :  3( 3.3%)

KR      :   20 ( 22.2%);   ED      :   12 ( 13.3%);   AGP     :   20 ( 22.2%);
KRED    :   32 ( 35.6%);   KR-ED   :    8 (  8.9%);   FIKMNY  :   27 ( 30.0%);
LVIFM   :   17 ( 18.9%);   ST      :    9 ( 10.0%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0--++00+0+ +0+-0+000- 0000-00000 00-0+000++ 000+00++00 000000000- 
        61  0000-+00+0 00-00+0+++ 000+0-0-0-

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 15/30 or 21/45 or 26/60):  none


Negative charge clusters (cmin = 11/30 or 15/45 or 18/60):  none


Mixed charge clusters (cmin = 21/30 or 28/45 or 36/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.222  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.644  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.133  ( ED )

 Expected score/letter:  -0.467;    Average information/letter:   0.318
 Minimal length of displayed segments set to:  20

M_0.01= 16.54  (cv= 10.50, lambda=  0.42848, k=  0.13343, x=  6.04;
                90% confidence interval for segment length:  32 +-  34)
M_0.05= 12.73  (x=  2.23)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.133  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.644  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.222  ( KR )

 Expected score/letter:  -0.822;    Average information/letter:   1.114
 Minimal length of displayed segments set to:  20

M_0.01=  9.84  (cv=  5.59, lambda=  0.80543, k=  0.30877, x=  4.25;
                90% confidence interval for segment length:  10 +-  10)
M_0.05=  7.82  (x=  2.23)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.356  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.644  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.289;    Average information/letter:   0.248
 Minimal length of displayed segments set to:  20

M_0.01= 11.86  (cv=  7.57, lambda=  0.59471, k=  0.12950, x=  4.30;
                90% confidence interval for segment length:  41 +-  42)
M_0.05=  9.12  (x=  1.56)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.644  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.356  ( KEDR )

 Expected score/letter:  -2.200;    Average information/letter:   0.553
 Minimal length of displayed segments set to:  20

M_0.01= 18.55  (cv= 10.52, lambda=  0.42768, k=  0.31161, x=  8.03;
                90% confidence interval for segment length:  21 +-  10)
M_0.05= 14.74  (x=  4.22)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     6 |   4 |   8 |  18 |  10 |   8 |  13 |  10 |   9 |  13 |   5 |   6 | 
lmin1     7 |   6 |  10 |  22 |  12 |  10 |  15 |  13 |  11 |  16 |   6 |   8 | 
lmin2     9 |   7 |  12 |  25 |  13 |  11 |  17 |  14 |  12 |  18 |   7 |   9 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   4:   0
  -  runs >=   3:   0
  *  runs >=   6:   0
  0  runs >=  12:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -3.033;    Average information/letter:   1.072
 Minimal length of displayed segments set to:  15

M_0.01= 15.15  (cv=  8.30, lambda=  0.54224, k=  0.41168, x=  6.85;
                90% confidence interval for segment length:  11 +-   6)
M_0.05= 12.14  (x=  3.84)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -5.911;    Average information/letter:   1.017
 Minimal length of displayed segments set to:  15

M_0.01= 32.94  (cv= 18.64, lambda=  0.24136, k=  0.31700, x= 14.30;
                90% confidence interval for segment length:  11 +-   7)
M_0.05= 26.19  (x=  7.55);     M_0.30= 18.15  (x= -0.49)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-90-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   6  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 3   (6-10) 0   (11-20) 3   (>=21) 1

3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   6  (Expected range:   0-- 12)
   5 +plets (f+: 22.2%), 1 -plets (f-: 13.3%)
   Total number of charge altplets: 3 (Critical number: 13)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 3   (6-10) 1   (11-20) 1   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
  81-  90        2      A.               5       5 !     0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


orfS VpV262 CD search match to acetyl transferase family
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:19 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4816.2920.seq)

SWISS-PROT ANNOTATION:
ID   ORFS
DE   ORFS, 160 bases, 9B2946CF checksum.

number of residues:  160;   molecular weight:  18.0 kdal
 
         1  MTMQIMRATV DWFIDNGGPS LAMAHLEEAM PVTAEKNEFR EVDYAMYAEL EMSGSLLVLV 
        61  AIDDENNNEF VGYVVGTASP SIHNRGYFEF STTAFYTVPH VREQGVARQL FEALQAVCRQ 
       121  SGVTEINYSV SEGQPMTHEV VKKLGLIKSE TMYSMKVKHE

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 13( 8.1%); C  :  1( 0.6%); D  :  5( 3.1%); E  : 17(10.6%); F  :  7( 4.4%)
G  : 10( 6.2%); H  :  5( 3.1%); I  :  6( 3.8%); K  :  6( 3.8%); L  : 10( 6.2%)
M  : 10( 6.2%); N  :  7( 4.4%); P  :  5( 3.1%); Q  :  6( 3.8%); R  :  6( 3.8%)
S  : 11( 6.9%); T  : 10( 6.2%); V  : 17(10.6%); W  :  1( 0.6%); Y  :  7( 4.4%)

KR      :   12 (  7.5%);   ED      :   22 ( 13.8%);   AGP     :   28 ( 17.5%);
KRED    :   34 ( 21.2%);   KR-ED   :  -10 ( -6.2%);   FIKMNY  :   43 ( 26.9%);
LVIFM   :   50 ( 31.2%);   ST      :   21 ( 13.1%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  000000+000 -000-00000 000000--00 0000-+0-0+ -0-00000-0 -000000000 
        61  00---000-0 0000000000 0000+000-0 0000000000 0+-0000+00 0-000000+0 
       121  0000-00000 0-000000-0 0++0000+0- 00000+0+0-

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  8/30 or 10/45 or 12/60):  none


Negative charge clusters (cmin = 11/30 or 15/45 or 18/60):  none


Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.075  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.788  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.138  ( ED )

 Expected score/letter:  -0.913;    Average information/letter:   1.956
 Minimal length of displayed segments set to:  20

M_0.01=  7.67  (cv=  4.45, lambda=  1.14070, k=  0.39702, x=  3.22;
                90% confidence interval for segment length:   6 +-   6)
M_0.05=  6.24  (x=  1.79)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.138  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.788  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.075  ( KR )

 Expected score/letter:  -0.663;    Average information/letter:   0.869
 Minimal length of displayed segments set to:  20

M_0.01= 11.20  (cv=  6.82, lambda=  0.74394, k=  0.26140, x=  4.38;
                90% confidence interval for segment length:  14 +-  13)
M_0.05=  9.01  (x=  2.19)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.213  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.788  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.575;    Average information/letter:   1.087
 Minimal length of displayed segments set to:  20

M_0.01=  6.72  (cv=  3.87, lambda=  1.30992, k=  0.41984, x=  2.85;
                90% confidence interval for segment length:  12 +-  10)
M_0.05=  5.48  (x=  1.60)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.788  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.213  ( KEDR )

 Expected score/letter:  -0.912;    Average information/letter:   0.163
 Minimal length of displayed segments set to:  20

M_0.01= 38.52  (cv= 26.42, lambda=  0.19212, k=  0.10291, x= 12.11;
                90% confidence interval for segment length:  65 +-  51)
M_0.05= 30.04  (x=  3.62)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   6 |  34 |   8 |   9 |  11 |   9 |  11 |  13 |   6 |   8 | 
lmin1     5 |   6 |   8 |  41 |   9 |  11 |  14 |  11 |  14 |  16 |   8 |  10 | 
lmin2     6 |   7 |   9 |  46 |  11 |  13 |  15 |  13 |  15 |  18 |   9 |  11 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   1, at   63;
  *  runs >=   4:   0
  0  runs >=  23:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.850;    Average information/letter:   0.557
 Minimal length of displayed segments set to:  15

M_0.01= 24.32  (cv= 14.85, lambda=  0.34175, k=  0.25571, x=  9.47;
                90% confidence interval for segment length:  22 +-  15)
M_0.05= 19.55  (x=  4.70)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.444;    Average information/letter:   0.466
 Minimal length of displayed segments set to:  15

M_0.01= 55.35  (cv= 34.97, lambda=  0.14514, k=  0.19363, x= 20.38;
                90% confidence interval for segment length:  25 +-  19)
M_0.05= 44.12  (x=  9.15);     M_0.30= 30.76  (x= -4.21)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-117-C-42-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   9  (Expected range:   0-- 19)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 2   (6-10) 3   (11-20) 3   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   3  (Expected range:   0--  9)
   1 +plets (f+: 7.5%), 2 -plets (f-: 13.8%)
   Total number of charge altplets: 3 (Critical number: 10)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 1   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
  26- 121        8      i..0....        10       6      /2/././4/././././


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

   6-  23  (1.)     M(  17)M     2 of  11   0.9902   small  2. maximal spacing
  36- 142  (3.)     K( 106)K     1 of   7   0.0085   large maximal spacing
  52- 136  (3.)     M(  84)M     1 of  11   0.0046   large  1. maximal spacing



orfT VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:20 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4811.1533.seq)

SWISS-PROT ANNOTATION:
ID   ORFT
DE   ORFT, 203 bases, 17DE115E checksum.

number of residues:  203;   molecular weight:  20.7 kdal
 
         1  MSKAVRFIGG AVQVIAGIYT ANPALIASGV GSIAGGIAEQ KREDAAKQAV QEQRKAVAIN 
        61  NAQQAIQRQR QIRQTIAEAR VRRALIQSRG FEGGPAGAGD SIYGDAASAI GAANTQQAAA 
       121  FGISAARNRA AMFGLEARSS NSFDAFAGAA NLFGQGYTAY KGGSFEGLFG SGTADTGRTS 
       181  DFGRSGNNSA NFLEFGGAVS GGR

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A++: 41(20.2%); C  :  0( 0.0%); D  :  6( 3.0%); E  :  8( 3.9%); F  : 12( 5.9%)
G++: 31(15.3%); H--:  0( 0.0%); I  : 14( 6.9%); K  :  5( 2.5%); L--:  6( 3.0%)
M- :  2( 1.0%); N  : 10( 4.9%); P--:  2( 1.0%); Q+ : 15( 7.4%); R  : 16( 7.9%)
S  : 16( 7.9%); T  :  7( 3.4%); V- :  8( 3.9%); W  :  0( 0.0%); Y  :  4( 2.0%)

KR      :   21 ( 10.3%);   ED      :   14 (  6.9%);   AGP   ++:   74 ( 36.5%);
KRED    :   35 ( 17.2%);   KR-ED   :    7 (  3.4%);   FIKMNY  :   47 ( 23.2%);
LVIFM --:   42 ( 20.7%);   ST      :   23 ( 11.3%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00+00+0000 0000000000 0000000000 00000000-0 ++--00+000 0-0++00000 
        61  0000000+0+ 00+0000-0+ 0++00000+0 0-0000000- 0000-00000 0000000000 
       121  000000+0+0 00000-0+00 000-000000 0000000000 +0000-0000 0000-00+00 
       181  -00+000000 000-000000 00+

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 12/45 or 15/60):  none


Negative charge clusters (cmin =  7/30 or  9/45 or 11/60):  none


Mixed charge clusters (cmin = 13/30 or 17/45 or 22/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.103  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.828  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.069  ( ED )

 Expected score/letter:  -0.759;    Average information/letter:   1.306
 Minimal length of displayed segments set to:  20

M_0.01=  9.48  (cv=  5.72, lambda=  0.92817, k=  0.32682, x=  3.75;
                90% confidence interval for segment length:  10 +-   9)
M_0.05=  7.72  (x=  2.00)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.069  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.828  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.103  ( KR )

 Expected score/letter:  -0.897;    Average information/letter:   2.054
 Minimal length of displayed segments set to:  20

M_0.01=  7.60  (cv=  4.49, lambda=  1.18462, k=  0.40240, x=  3.11;
                90% confidence interval for segment length:   6 +-   5)
M_0.05=  6.22  (x=  1.74)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.172  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.828  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.655;    Average information/letter:   1.483
 Minimal length of displayed segments set to:  20

M_0.01=  5.90  (cv=  3.39, lambda=  1.56862, k=  0.51868, x=  2.51;
                90% confidence interval for segment length:   9 +-   7)
M_0.05=  4.86  (x=  1.48)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.828  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.172  ( KEDR )

 Expected score/letter:  -0.552
 Average information/letter:   0.073 < .10; too small !


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   4 |   6 |  43 |   9 |   8 |  11 |  11 |   9 |  13 |  NA |  NA | 
lmin1     6 |   5 |   7 |  52 |  11 |   9 |  13 |  13 |  12 |  16 |  NA |  NA | 
lmin2     7 |   6 |   8 |  58 |  12 |  11 |  15 |  15 |  13 |  18 |  NA |  NA | 
 (Significance level: 0.010000; Minimal displayed length:  6)

(*00)  14(0,0,0); at  173- 186:   TADTGRTSDFGRSG
      (4. quartile)               00-00+00-00+00


Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   1, at   41;
  0  runs >=  29:   1, at    7;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.557;    Average information/letter:   0.452
 Minimal length of displayed segments set to:  15

M_0.01= 25.64  (cv= 16.18, lambda=  0.32831, k=  0.22429, x=  9.46;
                90% confidence interval for segment length:  27 +-  19)
M_0.05= 20.68  (x=  4.49)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -2.650;    Average information/letter:   0.317
 Minimal length of displayed segments set to:  15

M_0.01= 65.23  (cv= 43.27, lambda=  0.12280, k=  0.14912, x= 21.96;
                90% confidence interval for segment length:  36 +-  28)
M_0.05= 51.96  (x=  8.69);     M_0.30= 36.17  (x= -7.10)

 1) From    7 to   38 (type 1):  length= 32, score=50.00
    (pocket at   19 to   23:   length=  5, score=-17.00) 
       7  FIGGAVQVIA GI |YTANP| ALI ASGVGSIAGG IA
    A:  7(21.9%);  G:  7(21.9%);  I:  6(18.8%);

# of segments (>=15 residues) exceeding M_0.30:  1


2. SPACINGS OF C.


H2N-203-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[   9-  12]   GGAV
[ 196- 199]   GGAV

______________________________

[ 111- 114]   GAAN
[ 148- 151]   GAAN


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  19  (Expected range:   5-- 30)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 8   (6-10) 6   (11-20) 4   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 14/30 or 18/45 or 22/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   4  (Expected range:   0--  8)
   3 +plets (f+: 10.3%), 1 -plets (f-: 6.9%)
   Total number of charge altplets: 1 (Critical number: 10)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 1   (11-20) 0   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
  90- 117        7      G......          4       4       0
 107- 142        6      A.....           6       6 !     0
 162- 181        5      G....            4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 7)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  36-  90  (2.)     G(  54)G     1 of  32   0.0009   large maximal spacing
  57-  81  (2.)     V(  24)V     2 of   9   0.9841   small  2. maximal spacing
  81- 199  (3.)     V( 118)V     1 of   9   0.0070   large  1. maximal spacing
 123- 204  (4.)     I(  81)I     1 of  15   0.0088   large maximal spacing



orfU VpV262 at least partial homologue to SIO1 gp26
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:22 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4813.6672.seq)

SWISS-PROT ANNOTATION:
ID   ORFU
DE   ORFU, 1536 bases, 7CD7314B checksum.

number of residues: 1536;   molecular weight: 169.7 kdal
 
         1  MAGELNLDTY QKLLAEDNAP EPSLESYMQM LNAAPPRGPV TDARYTGQAA MIAGNPYEIE 
        61  SLAMAAASGD TKLYRSQQAM SWYDDRVHDA HTQLQQTQFG SAEAMLGAAQ LQSERIRMLQ 

       121  DAKSHPLADE VAFATNAAYK GDPEEITRAA VDVALANEIG ELAESTGIGE KIAEFGSFLI 
       181  PFKYAADIAD INEQISANPQ LAELAGEDLE TVVGAWKALP PERQLALVKP LKEAIMKATA 
       241  SFGWTDGNAT KTAGLLAQFF EVDPAAQINS EFAEDIVFGA IDVTPIGAIR GLKPVEKLRH 
       301  AKIIRKVAED AVADTSAAKV AARAGDKESA AKHTVASLVD DNTAAALGTT RDDAAASALP 
       361  LETSEWFTRV LDDDELPAAV ADEMNNMFAI ANGFARTLKE ESGLMQVGLL NRSERNQVVQ 
       421  SFFEKMEDVS EEYLTEGLHM DNLKLVDEDA KGFTYQYTLR DKSRPVAEGE KERLIVRSGK 
       481  VKFSINDVTG TYSATVEGAS AQNFFGNILS PAAWSRRTAE GDFNLEVKRA LQSDDLAVAY 
       541  QDKVGAFLDW ASEPVAGLTQ GKARERVSAV LQAGDEWVNP ATQVEGTVFS PTELAAGVNT 
       601  PLGKIHLTDP REVEAYYRYR LYGDATFINE DFVLRREKEL AGMKDVKLQG LEDGGRAIGK 
       661  PFEDLSSALG SVRDKRGQGV WDSKIGRTVD ITDDYVRKVY DEGDVLVRLE KDWNTKGTGE 
       721  LDLSGEFVQY ARVQKDAISE MPDRILNYRS GYVPKINEAR FVVSQRMPKI ARGRPNLTGS 
       781  QALRAFDSLS DAKLFREEQI ARWMEKHGTD RATAEQIFPE VDLLDEGLGA ARLEEAVGAH 
       841  TGLYHGTRSK DKLLFGLSGQ ELSRVNPLEA FQRHSKHLGA FISQNEVRIG REKRWLNTVR 
       901  QEFPDIPIRG FEDTPLPNTP KGKAAERVRR VIREWNGIPS REEELSDAMW QRLHDWALNG 
       961  ARRFGYADKE SVKSIQWLRS NDPYAVMKTI VMHNLLGVFN IAQLYTQASA MSVALGKFPA 
      1021  KFAPRVINDT AWMHVLDNIV DDKMLGKVYN ILFNGRAIDE TTKATYDAFR RTGLKEAVFN 
      1081  NSDMARIGSH GLGVTRRIIS NADNLSLMLY RSGELAARRA TFAAEFQNWM RTTGKKVPSD 
      1141  LELSEILEEV NKDLLELGPA NRAYWQGGRG TGDVRQLMGV ATQFMQVGAK TMELAGKGLG 
      1201  GVNRGGFTNR QKARIFLSQL AMFGAAGVPL GGLITQAITS GLGIEQMPEE AAELANQGFV 
      1261  GFGARLMLGD VEVSDRFALG AQTTQMLEDI ITSDDPLWIK ALGPAGSGVF GRAWDAIQQL 
      1321  RPLMNADWGH EGVVTEDGLM LAASVLGEIP SSGRNWVKYW MMKNHHAIID QRKRVLIDAS 
      1381  DKGGFDSRTE LGRLLGFTPT AEQNMRMLQL DQKAVDEMVQ DYANYRVQLL HRAIWEYKLD 
      1441  PQTVVAIEEA HKVMDRTIPD YVKTKGRELV NKRIYGPRQE SKEDELVKKF LQRTAPDKIS 
      1501  EDFILDSNAE LSTSVKPISQ PFRNILGGSQ VPQEDE

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :175(11.4%); C  :  0( 0.0%); D  :104( 6.8%); E  :118( 7.7%); F  : 57( 3.7%)
G  :123( 8.0%); H  : 21( 1.4%); I  : 71( 4.6%); K  : 81( 5.3%); L  :140( 9.1%)
M  : 42( 2.7%); N  : 59( 3.8%); P  : 57( 3.7%); Q  : 69( 4.5%); R  :101( 6.6%)
S  : 81( 5.3%); T  : 78( 5.1%); V  : 99( 6.4%); W  : 24( 1.6%); Y  : 36( 2.3%)

KR      :  182 ( 11.8%);   ED      :  222 ( 14.5%);   AGP     :  355 ( 23.1%);
KRED    :  404 ( 26.3%);   KR-ED   :  -40 ( -2.6%);   FIKMNY  :  346 ( 22.5%);
LVIFM   :  409 ( 26.6%);   ST      :  159 ( 10.4%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  000-000-00 0+000--000 -000-00000 000000+000 0-0+000000 0000000-0- 
        61  000000000- 0+00+00000 000--+00-0 0000000000 00-0000000 000-+0+000 
       121  -0+00000-- 000000000+ 0-0--00+00 0-00000-00 -00-00000- +00-000000 
       181  00+000-00- 00-0000000 00-000--0- 000000+000 0-+00000+0 0+-000+000 
       241  00000-0000 +000000000 -0-0000000 -00--00000 0-0000000+ 00+00-+0+0 
       301  0+00++00-- 000-0000+0 00+00-+-00 0+0000000- -000000000 +--0000000 
       361  0-00-000+0 0----00000 0--0000000 00000+00+- -000000000 0+0-+00000 
       421  000-+0--00 --000-0000 -00+00---0 +00000000+ -+0+000-0- +-+000+00+ 
       481  0+0000-000 000000-000 0000000000 00000++00- 0-000-0++0 000--00000 
       541  0-+00000-0 00-0000000 0+0+-+0000 0000--0000 0000-00000 00-0000000 
       601  000+0000-0 +-0-000+0+ 000-00000- -000++-+-0 000+-0+000 0--00+000+ 
       661  00--000000 00+-++0000 0-0+00+00- 00--00++00 --0-000+0- +-000+000- 
       721  0-000-0000 0+00+-000- 00-+0000+0 0000+00-0+ 00000+00+0 0+0+000000 
       781  000+00-000 -0+00+--00 0+00-+000- +000-0000- 0-00--0000 0+0--00000 
       841  0000000+0+ -+00000000 -00+0000-0 00+00+0000 00000-0+00 +-++00000+ 
       901  0-00-000+0 0--0000000 +0+00-+0++ 00+-000000 +---00-000 0+00-00000 
       961  0++0000-+- 00+00000+0 0-00000+00 0000000000 0000000000 000000+000 
      1021  +000+000-0 000000-000 --+000+000 00000+00-- 00+000-00+ +000+-0000 
      1081  00-00+0000 00000++000 00-0000000 +00-000++0 0000-00000 +000++000- 
      1141  0-00-00--0 0+-00-0000 0+000000+0 00-0+00000 000000000+ 00-000+000 
      1201  000+00000+ 0+0+000000 0000000000 0000000000 0000-000-- 00-0000000 
      1261  0000+0000- 0-00-+0000 0000000--0 000--0000+ 0000000000 0+00-00000 
      1321  +00000-000 -0000--000 0000000-00 000+000+00 00+000000- 0+++000-00 
      1381  -+000-0+0- 00+0000000 0-000+0000 -0+00--000 -0000+0000 0+000-0+0- 
      1441  0000000--0 0+00-+000- 00+0+0+-00 0++0000+0- 0+---00++0 00+000-+00 
      1501  --000-000- 00000+0000 00+0000000 000---

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 12/30 or 16/45 or 19/60):  none


Negative charge clusters (cmin = 13/30 or 18/45 or 22/60):  none


Mixed charge clusters (cmin = 19/30 or 26/45 or 32/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.118  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.737  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.145  ( ED )

 Expected score/letter:  -0.789;    Average information/letter:   1.194
 Minimal length of displayed segments set to:  20

M_0.01= 12.53  (cv=  8.53, lambda=  0.86057, k=  0.31625, x=  4.01;
                90% confidence interval for segment length:  13 +-  11)
M_0.05= 10.64  (x=  2.11)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.145  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.737  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.118  ( KR )

 Expected score/letter:  -0.685;    Average information/letter:   0.853
 Minimal length of displayed segments set to:  20

M_0.01= 14.63  (cv= 10.14, lambda=  0.72392, k=  0.26058, x=  4.50;
                90% confidence interval for segment length:  18 +-  15)
M_0.05= 12.38  (x=  2.25)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.263  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.737  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.474;    Average information/letter:   0.705
 Minimal length of displayed segments set to:  20

M_0.01= 10.43  (cv=  7.12, lambda=  1.03033, k=  0.30481, x=  3.31;
                90% confidence interval for segment length:  22 +-  17)
M_0.05=  8.85  (x=  1.73)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.737  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.263  ( KEDR )

 Expected score/letter:  -1.367;    Average information/letter:   0.294
 Minimal length of displayed segments set to:  20

M_0.01= 37.03  (cv= 26.60, lambda=  0.27581, k=  0.17823, x= 10.43;
                90% confidence interval for segment length:  50 +-  28)
M_0.05= 31.12  (x=  4.52)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     6 |   6 |   9 |  35 |  10 |  11 |  15 |  12 |  13 |  17 |   7 |   8 | 
lmin1     7 |   8 |  11 |  42 |  13 |  14 |  18 |  15 |  16 |  21 |   8 |  10 | 
lmin2     8 |   9 |  12 |  47 |  14 |  15 |  20 |  17 |  18 |  23 |   9 |  12 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   4:   0
  -  runs >=   4:   1, at  372;
  *  runs >=   6:   0
  0  runs >=  23:   2, at  989; 1215;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.188;    Average information/letter:   0.683
 Minimal length of displayed segments set to:  15

M_0.01= 27.49  (cv= 18.78, lambda=  0.39060, k=  0.30152, x=  8.71;
                90% confidence interval for segment length:  23 +-  13)
M_0.05= 23.32  (x=  4.53)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.127;    Average information/letter:   0.580
 Minimal length of displayed segments set to:  15

M_0.01= 65.16  (cv= 45.69, lambda=  0.16058, k=  0.22901, x= 19.47;
                90% confidence interval for segment length:  26 +-  16)
M_0.05= 55.01  (x=  9.32);     M_0.30= 42.93  (x= -2.76)

 1) From 1215 to 1244 (type 1):  length= 30, score=48.00
    (pocket at 1235 to 1236:   length=  2, score=-12.00)
    (pocket at 1218 to 1219:   length=  2, score=-12.00) 
    1215  IFL |SQ| LAMFG AAGVPLGGLI  |TQ| AITSGLGI
    L:  5(16.7%);  A:  4(13.3%);  G:  6(20.0%);  I:  4(13.3%);

# of segments (>=15 residues) exceeding M_0.30:  1


2. SPACINGS OF C.


H2N-1536-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  5

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  9

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  93  (Expected range:  64--121)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 26   (6-10) 22   (11-20) 26   (>=21) 20

3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  55  (Expected range:  26-- 67)
   16 +plets (f+: 11.8%), 39 -plets (f-: 14.5%)
   Total number of charge altplets: 40 (Critical number: 73)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 9   (6-10) 5   (11-20) 18   (>=21) 24

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
  33-  72       10      A.........       4       4       0
 249- 280        8      A.......         4       4       0
 318- 341        6      A.....           4       4       0
 823- 862       10      L.........       4       4       0
1002-1025        6      A.....           4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core:10)

Location        Period  Element         Copies  Core    Errors
 259- 312        9      i....0...        6       6      /0/././././2/./././
 283- 309        3      i*.              8       6      /1/3/./
 296- 349        9      *00.0...0        6       6      /0/2/2/./2/./././2/
 675- 728        9      *.0.0....        6       6      /0/./2/./2/././././
 805- 829        5      -.000            5       5      /0/./1/0/0/
1370-1393        4      *0..             6       6      /0/2/././
1463-1502        5      *....            8       8       0


--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


orfV VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:24 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4814.3475.seq)

SWISS-PROT ANNOTATION:
ID   ORFV
DE   ORFV, 611 bases, 30B9D8E1 checksum.

number of residues:  611;   molecular weight:  66.6 kdal
 
         1  MAGGPFQTSG NNAGDLTPVN LQGGFTPVAP KSNDAVNAVE SVTQAVGTIA TLNTERLKKE 
        61  SVANIRQEVK AVRDALQISK YPTLQTTYFS EEALQDPYIK SVYKNFQEIK GATDQGRLSQ 
       121  EFAIERMEAL MSQAINRRPQ FADDIRNAAN TAAGANISSK LFSQIMTLTP QQKALQELQA 
       181  EATKLGIPVE TYQGMVQQQF FREQMSEQIE YAKKQGTASL NDLSQQVGLE VTNVTRNLQN 
       241  GLLERIRAGG VVDVPVTVAE ARQEFTMLRN KVLSNIPANV PSSQVSQVVD MIDAEEQRII 
       301  TQIENGTMAK VLQTKGTLFE EIAKDNARTN APDEMQILAI FGNGQSGYAA VNDYTKYKNN 
       361  PQAMAGLYAV DEGGVLTVAS AATQQFKAGQ ILFQGKQAAN DQERRLAGYF GGIFLQKDVN 
       421  QQGENPTPPQ QVLRVVDVVS KMGEEYSTAT LGDAKVAANL RAYKETHPQL INHFNSDVHS 
       481  LKRQYQQLKA EGAFTDENVQ IKNGRVSVTG NTIVSQFGAG SSNPANITGG TGSITAIQRF 
       541  LRHANLTMKM GQTYQGNGVF PESVFRNSNA FLKDLKDATD VEPADLTGGT NGQDEVIRYD 
       601  FDGNGKLIRL E

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 62(10.1%); C  :  0( 0.0%); D  : 28( 4.6%); E  : 38( 6.2%); F  : 24( 3.9%)
G  : 49( 8.0%); H  :  4( 0.7%); I  : 31( 5.1%); K  : 32( 5.2%); L  : 43( 7.0%)
M  : 14( 2.3%); N+ : 43( 7.0%); P  : 21( 3.4%); Q++: 57( 9.3%); R  : 27( 4.4%)
S  : 32( 5.2%); T  : 44( 7.2%); V  : 46( 7.5%); W  :  0( 0.0%); Y  : 16( 2.6%)

KR      :   59 (  9.7%);   ED      :   66 ( 10.8%);   AGP     :  132 ( 21.6%);
KRED    :  125 ( 20.5%);   KR-ED   :   -7 ( -1.1%);   FIKMNY  :  160 ( 26.2%);
LVIFM   :  158 ( 25.9%);   ST      :   76 ( 12.4%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000000 0000-00000 0000000000 +00-00000- 0000000000 0000-+0++- 
        61  00000+0-0+ 00+-00000+ 0000000000 --000-000+ 000+000-0+ 000-00+000 
       121  -000-+0-00 000000++00 00--0+0000 000000000+ 0000000000 00+000-000 
       181  -00+00000- 0000000000 0+-000-00- 00++000000 0-0000000- 00000+0000 
       241  000-+0+000 00-000000- 0+0-0000+0 +000000000 000000000- 00-0--0+00 
       301  000-00000+ 0000+0000- -00+-00+00 00--000000 0000000000 00-00+0+00 
       361  0000000000 --00000000 000000+000 00000+0000 -0-++00000 000000+-00 
       421  000-000000 000+00-000 +00--00000 00-0+00000 +00+-00000 000000-000 
       481  0++00000+0 -0000--000 0+00+00000 0000000000 0000000000 00000000+0 
       541  0+000000+0 0000000000 0-000+0000 00+-0+-00- 0-00-00000 000--00+0- 
       601  0-000+00+0 -

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 12/45 or 14/60):  none


Negative charge clusters (cmin = 10/30 or 13/45 or 16/60):  none


Mixed charge clusters (cmin = 14/30 or 20/45 or 24/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.097  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.795  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.108  ( ED )

 Expected score/letter:  -0.818;    Average information/letter:   1.477
 Minimal length of displayed segments set to:  20

M_0.01= 10.16  (cv=  6.54, lambda=  0.98069, k=  0.34955, x=  3.62;
                90% confidence interval for segment length:  10 +-   8)
M_0.05=  8.50  (x=  1.96)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.108  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.795  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.097  ( KR )

 Expected score/letter:  -0.773;    Average information/letter:   1.275
 Minimal length of displayed segments set to:  20

M_0.01= 10.90  (cv=  7.07, lambda=  0.90744, k=  0.32447, x=  3.83;
                90% confidence interval for segment length:  11 +-  10)
M_0.05=  9.10  (x=  2.03)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.205  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.795  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.591;    Average information/letter:   1.157
 Minimal length of displayed segments set to:  20

M_0.01=  7.51  (cv=  4.72, lambda=  1.35789, k=  0.43887, x=  2.78;
                90% confidence interval for segment length:  13 +-  10)
M_0.05=  6.31  (x=  1.58)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.795  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.205  ( KEDR )

 Expected score/letter:  -0.841;    Average information/letter:   0.144
 Minimal length of displayed segments set to:  20

M_0.01= 48.27  (cv= 35.91, lambda=  0.17865, k=  0.09146, x= 12.36;
                90% confidence interval for segment length:  86 +-  63)
M_0.05= 39.15  (x=  3.24)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   5 |   7 |  41 |   9 |  10 |  13 |  11 |  12 |  15 |   5 |   7 | 
lmin1     6 |   6 |   9 |  50 |  11 |  12 |  15 |  14 |  14 |  18 |   7 |   8 | 
lmin2     7 |   7 |  10 |  55 |  13 |  13 |  17 |  16 |  16 |  21 |   8 |   9 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   5:   0
  0  runs >=  27:   1, at  506;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.146;    Average information/letter:   0.753
 Minimal length of displayed segments set to:  15

M_0.01= 23.22  (cv= 15.09, lambda=  0.42509, k=  0.31827, x=  8.13;
                90% confidence interval for segment length:  19 +-  11)
M_0.05= 19.39  (x=  4.29)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.964;    Average information/letter:   0.603
 Minimal length of displayed segments set to:  15

M_0.01= 56.96  (cv= 38.25, lambda=  0.16770, k=  0.23129, x= 18.70;
                90% confidence interval for segment length:  23 +-  15)
M_0.05= 47.24  (x=  8.98);     M_0.30= 35.67  (x= -2.58)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-611-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  5

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  9

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  43  (Expected range:  19-- 56)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 16   (6-10) 9   (11-20) 9   (>=21) 10

3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  14  (Expected range:   1-- 22)
   5 +plets (f+: 9.7%), 9 -plets (f-: 10.8%)
   Total number of charge altplets: 12 (Critical number: 25)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 0   (11-20) 3   (>=21) 11

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 252- 259        2      V.               4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


orfW VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:25 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4819.4029.seq)

SWISS-PROT ANNOTATION:
ID   ORFW
DE   ORFW, 291 bases, E6E6A4C8 checksum.

number of residues:  291;   molecular weight:  32.5 kdal
 
         1  MDKTKLFVMT LMVTASLFVW SKQDMPQVEF RGQRFSFPEG ATEEQISGVL RDYMETQPTA 
        61  NGMSTPDRSA AIRTENGYAP TESYFGETMD GEEPKLQDVP KLTAEQRRFA EAIAQVETGG 
       121  LDNRFIRTKV KPNGTAAGSS AYGTYQITHG LLSGYLDKGV VKLTEQERAA AEELLRRQEI 
       181  ALTIGGRDRA KYQKGGIYQS QGVRWAKAYG FENVDEFLDA FDYGGTLGLD DDAEFQTLYE 
       241  SFARKMLNKH LDEAGGNPFK AARIWHGGPK GAGATTDNYE RKVRRIYEQG N

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 28( 9.6%); C  :  0( 0.0%); D  : 17( 5.8%); E  : 25( 8.6%); F  : 14( 4.8%)
G  : 31(10.7%); H  :  3( 1.0%); I  : 10( 3.4%); K  : 17( 5.8%); L  : 20( 6.9%)
M  :  8( 2.7%); N  :  9( 3.1%); P  : 10( 3.4%); Q  : 16( 5.5%); R  : 20( 6.9%)
S  : 12( 4.1%); T  : 22( 7.6%); V- : 13( 4.5%); W  :  3( 1.0%); Y  : 13( 4.5%)

KR      :   37 ( 12.7%);   ED      :   42 ( 14.4%);   AGP     :   69 ( 23.7%);
KRED    :   79 ( 27.1%);   KR-ED   :   -5 ( -1.7%);   FIKMNY  :   71 ( 24.4%);
LVIFM - :   65 ( 22.3%);   ST      :   34 ( 11.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0-+0+00000 0000000000 0+0-0000-0 +00+0000-0 00--000000 +-00-00000 
        61  000000-+00 00+0-00000 0-0000-00- 0--0+00-00 +000-0++00 -00000-000 
       121  0-0+00+0+0 +000000000 0000000000 000000-+00 0+00-0-+00 0--00++0-0 
       181  000000+-+0 +00+000000 000+00+000 0-00--00-0 0-0000000- --0-00000- 
       241  000++000+0 0--000000+ 00+000000+ 000000-00- ++0++00-00 0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 11/30 or 14/45 or 17/60):  none


Negative charge clusters (cmin = 12/30 or 15/45 or 19/60):  none


Mixed charge clusters (cmin = 17/30 or 24/45 or 30/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.127  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.729  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.144  ( ED )

 Expected score/letter:  -0.763;    Average information/letter:   1.082
 Minimal length of displayed segments set to:  20

M_0.01= 11.12  (cv=  6.96, lambda=  0.81533, k=  0.29999, x=  4.17;
                90% confidence interval for segment length:  12 +-  11)
M_0.05=  9.12  (x=  2.17)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.144  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.729  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.127  ( KR )

 Expected score/letter:  -0.694;    Average information/letter:   0.866
 Minimal length of displayed segments set to:  20

M_0.01= 12.29  (cv=  7.80, lambda=  0.72748, k=  0.26342, x=  4.49;
                90% confidence interval for segment length:  15 +-  14)
M_0.05= 10.05  (x=  2.25)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.271  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.729  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.457;    Average information/letter:   0.651
 Minimal length of displayed segments set to:  20

M_0.01=  9.14  (cv=  5.75, lambda=  0.98714, k=  0.28673, x=  3.39;
                90% confidence interval for segment length:  20 +-  17)
M_0.05=  7.49  (x=  1.74)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.729  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.271  ( KEDR )

 Expected score/letter:  -1.443;    Average information/letter:   0.317
 Minimal length of displayed segments set to:  20

M_0.01= 29.76  (cv= 19.59, lambda=  0.28961, k=  0.19088, x= 10.17;
                90% confidence interval for segment length:  39 +-  23)
M_0.05= 24.13  (x=  4.54)

 1) From  132 to  156 (type 1):  length= 25, score=25.00  * 
     132  PNGTAAGSSA YGTYQITHGL LSGYL
    L:  3(12.0%);  A:  3(12.0%);  G:  5(20.0%);  S:  3(12.0%);
    T:  3(12.0%);  Y:  3(12.0%);

# of segments (>=20 residues) exceeding M_0.05:  1


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   5 |   8 |  28 |   9 |  10 |  13 |  11 |  11 |  15 |   5 |   7 | 
lmin1     6 |   7 |  10 |  34 |  11 |  12 |  16 |  13 |  14 |  18 |   7 |   8 | 
lmin2     7 |   8 |  11 |  38 |  13 |  13 |  18 |  15 |  16 |  20 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   1, at  230;
  *  runs >=   5:   0
  0  runs >=  19:   1, at  132;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.368;    Average information/letter:   0.784
 Minimal length of displayed segments set to:  15

M_0.01= 20.86  (cv= 12.90, lambda=  0.43988, k=  0.33438, x=  7.97;
                90% confidence interval for segment length:  17 +-  10)
M_0.05= 17.16  (x=  4.26)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.485;    Average information/letter:   0.689
 Minimal length of displayed segments set to:  15

M_0.01= 47.45  (cv= 30.26, lambda=  0.18750, k=  0.25267, x= 17.20;
                90% confidence interval for segment length:  19 +-  12)
M_0.05= 38.76  (x=  8.50);     M_0.30= 28.42  (x= -1.84)

 1) From    6 to   20 (type 1):  length= 15, score=39.00  * 
       6  LFVMTLMVTA SLFVW
    L:  3(20.0%);  V:  3(20.0%);  T:  2(13.3%);  F:  2(13.3%);
    M:  2(13.3%);

# of segments (>=15 residues) exceeding M_0.30:  1


2. SPACINGS OF C.


H2N-291-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  22  (Expected range:   5-- 30)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 7   (6-10) 7   (11-20) 4   (>=21) 5

3. Clusters of amino acid multiplets (cmin = 13/30 or 16/45 or 19/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  11  (Expected range:   0-- 19)
   5 +plets (f+: 12.7%), 6 -plets (f-: 14.4%)
   Total number of charge altplets: 7 (Critical number: 22)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 2   (6-10) 2   (11-20) 3   (>=21) 5

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


orfX VpV262 low quality frame candidate
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:27 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4812.1811.seq)

SWISS-PROT ANNOTATION:
ID   ORFX
DE   ORFX, 56 bases, CAC39C85 checksum.

number of residues:   56;   molecular weight:   6.4 kdal
 
         1  MNKVIKKGIW WVEVNKDGKE TGRASRKRFE PKPEPEVKKK AAPKKKAASK KDEGEQ

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  5( 8.9%); C  :  0( 0.0%); D  :  2( 3.6%); E  :  7(12.5%); F  :  1( 1.8%)
G  :  4( 7.1%); H  :  0( 0.0%); I  :  2( 3.6%); K  : 15(26.8%); L  :  0( 0.0%)
M  :  1( 1.8%); N  :  2( 3.6%); P  :  4( 7.1%); Q  :  1( 1.8%); R  :  3( 5.4%)
S  :  2( 3.6%); T  :  1( 1.8%); V  :  4( 7.1%); W  :  2( 3.6%); Y  :  0( 0.0%)

KR      :   18 ( 32.1%);   ED      :    9 ( 16.1%);   AGP     :   13 ( 23.2%);
KRED    :   27 ( 48.2%);   KR-ED   :    9 ( 16.1%);   FIKMNY  :   21 ( 37.5%);
LVIFM   :    8 ( 14.3%);   ST      :    3 (  5.4%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00+00++000 00-00+-0+- 00+00+++0- 0+0-0-0+++ 000+++000+ +--0-0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 19/30 or 26/45 or 33/60):  none


Negative charge clusters (cmin = 12/30 or 17/45 or 21/60):  none


Mixed charge clusters (cmin = 25/30 or 35/45 or 44/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.321  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.518  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.161  ( ED )

 Expected score/letter:  -0.196
 Average information/letter:   0.046 < .10; too small !


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.161  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.518  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.321  ( KR )

 Expected score/letter:  -0.839;    Average information/letter:   0.956
 Minimal length of displayed segments set to:  20

M_0.01= 10.35  (cv=  5.65, lambda=  0.71304, k=  0.28786, x=  4.71;
                90% confidence interval for segment length:  11 +-  11)
M_0.05=  8.06  (x=  2.42)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.482  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.518  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.036
 Average information/letter:   0.004 < .10; too small !


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.518  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.482  ( KEDR )

 Expected score/letter:  -3.339;    Average information/letter:   0.924
 Minimal length of displayed segments set to:  20

M_0.01= 12.01  (cv=  6.14, lambda=  0.65551, k=  0.46981, x=  5.87;
                90% confidence interval for segment length:  12 +-   3)
M_0.05=  9.52  (x=  3.38)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     7 |   5 |  11 |  12 |  10 |   7 |  12 |   9 |   7 |  11 |  NA |  NA | 
lmin1     9 |   6 |  13 |  15 |  12 |   9 |  15 |  11 |   9 |  14 |  NA |  NA | 
lmin2    10 |   7 |  15 |  16 |  13 |  10 |  17 |  13 |  10 |  15 |  NA |  NA | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   5:   0
  -  runs >=   3:   0
  *  runs >=   7:   0
  0  runs >=   8:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -3.946;    Average information/letter:   1.491
 Minimal length of displayed segments set to:  15

M_0.01= 11.32  (cv=  5.75, lambda=  0.69947, k=  0.49170, x=  5.56;
                90% confidence interval for segment length:   8 +-   3)
M_0.05=  8.99  (x=  3.23)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -8.000;    Average information/letter:   1.542
 Minimal length of displayed segments set to:  15

M_0.01= 24.53  (cv= 12.89, lambda=  0.31220, k=  0.38065, x= 11.64;
                90% confidence interval for segment length:   7 +-   4)
M_0.05= 19.31  (x=  6.42);     M_0.30= 13.10  (x=  0.21)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-56-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

______________________________
Simple tandem repeat: 

[  38-  43]    KKKAAP
[  44-  49]    KKKAAS
[  50-  51]    KK


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   7  (Expected range:   0-- 13)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 5   (6-10) 2   (11-20) 0   (>=21) 1

3. Clusters of amino acid multiplets (cmin = 18/30 or 23/45 or 28/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   6  (Expected range:   0-- 12)
   5 +plets (f+: 32.1%), 1 -plets (f-: 16.1%)
   Total number of charge altplets: 3 (Critical number: 13)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 4   (6-10) 2   (11-20) 1   (>=21) 0

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors
  32-  55        6      KK....           4       4 !    /0/1/././././


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 7)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
  26-  53        2      *.              12       8       2


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF1 VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:28 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4811.4029.seq)

SWISS-PROT ANNOTATION:
ID   ORF1
DE   ORF1, 174 bases, 33A7F57 checksum.

number of residues:  174;   molecular weight:  19.2 kdal
 
         1  MKALALKTVQ SNIRSIVRNS NPAHLQEVFA STIWHGLIHS NIMPAAMKAL RDSDAPSKLK 
        61  AGISKYMPMK WDKKREAYVF NAAKREKLLS DLALTDESSF EDLANSLPQL FEKKSEPTAY 
       121  SRDAYLANVA KKLEKEGEQN VQDIIGLVSA MLDNPELVSA AAKAVFKQTA AKAA

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 27(15.5%); C  :  0( 0.0%); D  :  9( 5.2%); E  : 11( 6.3%); F  :  5( 2.9%)
G  :  4( 2.3%); H  :  3( 1.7%); I  :  8( 4.6%); K  : 19(10.9%); L  : 18(10.3%)
M  :  6( 3.4%); N  :  9( 5.2%); P  :  7( 4.0%); Q  :  6( 3.4%); R  :  6( 3.4%)
S  : 16( 9.2%); T  :  5( 2.9%); V  :  9( 5.2%); W  :  2( 1.1%); Y  :  4( 2.3%)

KR      :   25 ( 14.4%);   ED      :   20 ( 11.5%);   AGP     :   38 ( 21.8%);
KRED    :   45 ( 25.9%);   KR-ED   :    5 (  2.9%);   FIKMNY  :   51 ( 29.3%);
LVIFM   :   46 ( 26.4%);   ST      :   21 ( 12.1%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0+0000+000 000+000+00 000000-000 0000000000 0000000+00 +-0-000+0+ 
        61  0000+0000+ 0-+++-0000 000++-+000 -0000--000 --00000000 0-++0-0000 
       121  0+-0000000 ++0-+-0-00 00-0000000 00-00-0000 00+000+000 0+00

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 11/30 or 15/45 or 19/60):  none


Negative charge clusters (cmin = 10/30 or 13/45 or 16/60):  none


Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.144  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.741  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.115  ( ED )

 Expected score/letter:  -0.684;    Average information/letter:   0.857
 Minimal length of displayed segments set to:  20

M_0.01= 11.58  (cv=  7.10, lambda=  0.72678, k=  0.26115, x=  4.48;
                90% confidence interval for segment length:  14 +-  14)
M_0.05=  9.34  (x=  2.24)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.115  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.741  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.144  ( KR )

 Expected score/letter:  -0.799;    Average information/letter:   1.241
 Minimal length of displayed segments set to:  20

M_0.01=  9.81  (cv=  5.86, lambda=  0.87967, k=  0.32271, x=  3.94;
                90% confidence interval for segment length:  10 +-   9)
M_0.05=  7.96  (x=  2.09)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.259  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.741  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.483;    Average information/letter:   0.733
 Minimal length of displayed segments set to:  20

M_0.01=  8.17  (cv=  4.90, lambda=  1.05315, k=  0.31435, x=  3.27;
                90% confidence interval for segment length:  17 +-  15)
M_0.05=  6.62  (x=  1.72)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.741  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.259  ( KEDR )

 Expected score/letter:  -1.328;    Average information/letter:   0.282
 Minimal length of displayed segments set to:  20

M_0.01= 29.77  (cv= 19.21, lambda=  0.26862, k=  0.17163, x= 10.56;
                90% confidence interval for segment length:  41 +-  26)
M_0.05= 23.70  (x=  4.50)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   4 |   7 |  28 |   9 |   9 |  12 |  11 |  10 |  14 |   6 |   7 | 
lmin1     6 |   6 |   9 |  34 |  11 |  11 |  15 |  13 |  12 |  17 |   7 |   9 | 
lmin2     7 |   7 |  10 |  38 |  13 |  12 |  16 |  15 |  14 |  19 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   1, at   73;
  -  runs >=   3:   0
  *  runs >=   5:   1, at   72;
  0  runs >=  19:   1, at   28;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.270;    Average information/letter:   0.743
 Minimal length of displayed segments set to:  15

M_0.01= 20.84  (cv= 12.48, lambda=  0.41336, k=  0.31793, x=  8.36;
                90% confidence interval for segment length:  17 +-  10)
M_0.05= 16.89  (x=  4.41)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.195;    Average information/letter:   0.617
 Minimal length of displayed segments set to:  15

M_0.01= 49.17  (cv= 30.50, lambda=  0.16916, k=  0.23655, x= 18.67;
                90% confidence interval for segment length:  19 +-  13)
M_0.05= 39.53  (x=  9.04);     M_0.30= 28.07  (x= -2.43)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-174-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[ 147- 150]   LVSA
[ 157- 160]   LVSA


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  11  (Expected range:   2-- 23)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 3   (6-10) 3   (11-20) 4   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   6  (Expected range:   0-- 12)
   4 +plets (f+: 14.4%), 2 -plets (f-: 11.5%)
   Total number of charge altplets: 7 (Critical number: 14)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 1   (11-20) 3   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

 141- 148  (4.)     V(   7)V    10 of  10   0.0097   large minimal spacing
 158- 165  (4.)     V(   7)V     9 of  10   0.0097     matching minimum



ORF2 VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:30 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4816.2920.seq)

SWISS-PROT ANNOTATION:
ID   ORF2
DE   ORF2, 116 bases, 4FE4F868 checksum.

number of residues:  116;   molecular weight:  13.2 kdal
 
         1  MKLRESIAPL RDAVSGMPAL HGHQTFFENM ASQYEHCKIK CVNPIHAMRW NVWFAIPKTA 
        61  RDRIIVQSVG TADYVGGYFE SVNDSHIDSL LRKAMPHDYM ARFLAHAQLK AAKQGR

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 14(12.1%); C  :  2( 1.7%); D  :  6( 5.2%); E  :  4( 3.4%); F  :  5( 4.3%)
G  :  6( 5.2%); H  :  7( 6.0%); I  :  7( 6.0%); K  :  7( 6.0%); L  :  7( 6.0%)
M  :  6( 5.2%); N  :  4( 3.4%); P  :  5( 4.3%); Q  :  5( 4.3%); R  :  8( 6.9%)
S  :  7( 6.0%); T  :  3( 2.6%); V  :  7( 6.0%); W  :  2( 1.7%); Y  :  4( 3.4%)

KR      :   15 ( 12.9%);   ED      :   10 (  8.6%);   AGP     :   25 ( 21.6%);
KRED    :   25 ( 21.6%);   KR-ED   :    5 (  4.3%);   FIKMNY  :   33 ( 28.4%);
LVIFM   :   32 ( 27.6%);   ST      :   10 (  8.6%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0+0+-00000 +-00000000 0000000-00 0000-00+0+ 00000000+0 0000000+00 
        61  +-+0000000 00-000000- 000-000-00 0++0000-00 0+0000000+ 00+00+

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 11/30 or 14/45 or 18/60):  none


Negative charge clusters (cmin =  8/30 or 11/45 or 13/60):  none


Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.129  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.784  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.086  ( ED )

 Expected score/letter:  -0.698;    Average information/letter:   0.976
 Minimal length of displayed segments set to:  20

M_0.01= 10.25  (cv=  6.03, lambda=  0.78816, k=  0.28033, x=  4.22;
                90% confidence interval for segment length:  12 +-  12)
M_0.05=  8.19  (x=  2.15)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.086  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.784  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.129  ( KR )

 Expected score/letter:  -0.871;    Average information/letter:   1.702
 Minimal length of displayed segments set to:  20

M_0.01=  7.93  (cv=  4.51, lambda=  1.05510, k=  0.37413, x=  3.43;
                90% confidence interval for segment length:   7 +-   6)
M_0.05=  6.39  (x=  1.88)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.216  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.784  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.569;    Average information/letter:   1.061
 Minimal length of displayed segments set to:  20

M_0.01=  6.55  (cv=  3.68, lambda=  1.29198, k=  0.41266, x=  2.88;
                90% confidence interval for segment length:  12 +-  10)
M_0.05=  5.29  (x=  1.61)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.784  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.216  ( KEDR )

 Expected score/letter:  -0.940;    Average information/letter:   0.171
 Minimal length of displayed segments set to:  20

M_0.01= 36.11  (cv= 24.10, lambda=  0.19722, k=  0.10732, x= 12.01;
                90% confidence interval for segment length:  60 +-  47)
M_0.05= 27.85  (x=  3.74)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   4 |   6 |  32 |   9 |   8 |  11 |  10 |   9 |  13 |   7 |   9 | 
lmin1     6 |   5 |   8 |  39 |  11 |   9 |  13 |  13 |  11 |  16 |   9 |  12 | 
lmin2     7 |   6 |   9 |  43 |  12 |  11 |  15 |  14 |  13 |  18 |  10 |  13 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  21:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.793;    Average information/letter:   0.538
 Minimal length of displayed segments set to:  15

M_0.01= 22.94  (cv= 13.69, lambda=  0.34729, k=  0.24954, x=  9.25;
                90% confidence interval for segment length:  21 +-  15)
M_0.05= 18.24  (x=  4.56)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.724;    Average information/letter:   0.550
 Minimal length of displayed segments set to:  15

M_0.01= 48.07  (cv= 29.20, lambda=  0.16282, k=  0.21706, x= 18.87;
                90% confidence interval for segment length:  21 +-  15)
M_0.05= 38.06  (x=  8.86);     M_0.30= 26.15  (x= -3.05)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-36-C-3-C-75-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   5  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 0   (11-20) 3   (>=21) 2

3. Clusters of amino acid multiplets (cmin =  9/30 or 11/45 or 13/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  8)
   1 +plets (f+: 12.9%), 0 -plets (f-: 8.6%)
   Total number of charge altplets: 3 (Critical number: 9)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  20-  90  (2.)     L(  70)L     1 of   8   0.0092   large  1. maximal spacing
  80-  84  (3.)     -(   4)-    11 of  11   0.0085   large minimal spacing
  84-  88  (3.)     -(   4)-    10 of  11   0.0085     matching minimum
  91- 104  (4.)     L(  13)L     2 of   8   0.9892   small  2. maximal spacing



ORF3 VpV262 low quality frame candidate
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:31 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4716.4029.seq)

SWISS-PROT ANNOTATION:
ID   ORF3
DE   ORF3, 56 bases, 9BBCFDEF checksum.

number of residues:   56;   molecular weight:   6.2 kdal
 
         1  MNIDTVFKWL AFISATITIP MDPSQGAAYF VGVAVLMTVY LFIAAPIVWV YVRITS

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  7(12.5%); C  :  0( 0.0%); D  :  2( 3.6%); E  :  0( 0.0%); F  :  4( 7.1%)
G  :  2( 3.6%); H  :  0( 0.0%); I  :  7(12.5%); K  :  1( 1.8%); L  :  3( 5.4%)
M  :  3( 5.4%); N  :  1( 1.8%); P  :  3( 5.4%); Q  :  1( 1.8%); R  :  1( 1.8%)
S  :  3( 5.4%); T  :  5( 8.9%); V  :  8(14.3%); W  :  2( 3.6%); Y  :  3( 5.4%)

KR      :    2 (  3.6%);   ED      :    2 (  3.6%);   AGP     :   12 ( 21.4%);
KRED    :    4 (  7.1%);   KR-ED   :    0 (  0.0%);   FIKMNY  :   19 ( 33.9%);
LVIFM   :   25 ( 44.6%);   ST      :    8 ( 14.3%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  000-000+00 0000000000 0-00000000 0000000000 0000000000 00+000

A. CHARGE CLUSTERS.


Positive charge clusters:  not evaluated (frequency of + < 5%, too low)


Negative charge clusters:  not evaluated (frequency of - < 5%, too low)


Mixed charge clusters (cmin =  7/30 or 10/45 or 12/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.036  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.929  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.036  ( ED )

 Expected score/letter:  -0.929;    Average information/letter:   3.156
 Minimal length of displayed segments set to:  20

M_0.01=  5.04  (cv=  2.59, lambda=  1.55610, k=  0.45704, x=  2.45;
                90% confidence interval for segment length:   4 +-   3)
M_0.05=  3.99  (x=  1.41)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.036  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.929  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.036  ( KR )

 Expected score/letter:  -0.929;    Average information/letter:   3.156
 Minimal length of displayed segments set to:  20

M_0.01=  5.04  (cv=  2.59, lambda=  1.55610, k=  0.45704, x=  2.45;
                90% confidence interval for segment length:   4 +-   3)
M_0.05=  3.99  (x=  1.41)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.071  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.929  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.857;    Average information/letter:   3.172
 Minimal length of displayed segments set to:  20

M_0.01=  3.27  (cv=  1.57, lambda=  2.56495, k=  0.79121, x=  1.70;
                90% confidence interval for segment length:   4 +-   2)
M_0.05=  2.64  (x=  1.07)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.929  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.071  ( KEDR )

 Expected score/letter:   0.357
! expected score > -.001; too big !


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     3 |   3 |   3 |  81 |   6 |   6 |   7 |   7 |   7 |   9 |  NA |  NA | 
lmin1     4 |   4 |   4 | 100 |   7 |   7 |   9 |   9 |   9 |  11 |  NA |  NA | 
lmin2     4 |   4 |   5 | 110 |   8 |   8 |  10 |  10 |  10 |  12 |  NA |  NA | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   3:   0
  0  runs >=  54:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -0.250
 Average information/letter:   0.019 < .10; too small !


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:   0.161
! expected score > -.001; too big !


2. SPACINGS OF C.


H2N-56-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   2  (Expected range:   0-- 11)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 0   (6-10) 0   (11-20) 2   (>=21) 1

3. Clusters of amino acid multiplets (cmin =  8/30 or 10/45 or 12/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   0  (Expected range:   0--  2)
   0 +plets (f+: 3.6%), 0 -plets (f-: 3.6%)
   Total number of charge altplets: 0 (Critical number: 2)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 1

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 4)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
   3-  56        9      i.0...0.0        6       6      /0/./2/./././2/./1/
  31-  44        2      i0               7       7      /0/2/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF4 VpV262 lowquality frame candidate
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:32 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4815.7504.seq)

SWISS-PROT ANNOTATION:
ID   ORF4
DE   ORF4, 46 bases, E74BD5A1 checksum.

number of residues:   46;   molecular weight:   5.6 kdal
 
         1  MKYIDKLTAY HNGKYYLPVE AFDCCTSNWD ADDWIRVIDY YNGWTK

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  3( 6.5%); C  :  2( 4.3%); D  :  6(13.0%); E  :  1( 2.2%); F  :  1( 2.2%)
G  :  2( 4.3%); H  :  1( 2.2%); I  :  3( 6.5%); K  :  4( 8.7%); L  :  2( 4.3%)
M  :  1( 2.2%); N  :  3( 6.5%); P  :  1( 2.2%); Q  :  0( 0.0%); R  :  1( 2.2%)
S  :  1( 2.2%); T  :  3( 6.5%); V  :  2( 4.3%); W  :  3( 6.5%); Y  :  6(13.0%)

KR      :    5 ( 10.9%);   ED      :    7 ( 15.2%);   AGP     :    6 ( 13.0%);
KRED    :   12 ( 26.1%);   KR-ED   :   -2 ( -4.3%);   FIKMNY  :   18 ( 39.1%);
LVIFM   :    9 ( 19.6%);   ST      :    4 (  8.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0+00-+0000 000+00000- 00-000000- 0--00+00-0 00000+

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 10/30 or 13/45 or 16/60):  none


Negative charge clusters (cmin = 12/30 or 16/45 or 20/60):  none


Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.109  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.739  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.152  ( ED )

 Expected score/letter:  -0.826;    Average information/letter:   1.344
 Minimal length of displayed segments set to:  20

M_0.01=  8.00  (cv=  4.18, lambda=  0.91693, k=  0.33649, x=  3.83;
                90% confidence interval for segment length:   8 +-   8)
M_0.05=  6.23  (x=  2.05)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.152  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.739  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.109  ( KR )

 Expected score/letter:  -0.652;    Average information/letter:   0.766
 Minimal length of displayed segments set to:  20

M_0.01= 10.23  (cv=  5.58, lambda=  0.68621, k=  0.24362, x=  4.65;
                90% confidence interval for segment length:  13 +-  14)
M_0.05=  7.85  (x=  2.27)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.261  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.739  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.478;    Average information/letter:   0.719
 Minimal length of displayed segments set to:  20

M_0.01=  6.97  (cv=  3.68, lambda=  1.04145, k=  0.30946, x=  3.29;
                90% confidence interval for segment length:  15 +-  14)
M_0.05=  5.40  (x=  1.73)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.739  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.261  ( KEDR )

 Expected score/letter:  -1.348;    Average information/letter:   0.288
 Minimal length of displayed segments set to:  20

M_0.01= 24.55  (cv= 14.06, lambda=  0.27230, k=  0.17501, x= 10.49;
                90% confidence interval for segment length:  33 +-  24)
M_0.05= 18.57  (x=  4.51)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   4 |   6 |  23 |   7 |   8 |  11 |   8 |   9 |  12 |   5 |   6 | 
lmin1     5 |   6 |   8 |  28 |   9 |  10 |  13 |  10 |  12 |  14 |   7 |   8 | 
lmin2     6 |   7 |   9 |  31 |  10 |  11 |  14 |  11 |  13 |  16 |   8 |   9 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  16:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.043;    Average information/letter:   0.610
 Minimal length of displayed segments set to:  15

M_0.01= 18.57  (cv=  9.91, lambda=  0.38639, k=  0.28511, x=  8.66;
                90% confidence interval for segment length:  17 +-  11)
M_0.05= 14.35  (x=  4.44)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.565;    Average information/letter:   0.795
 Minimal length of displayed segments set to:  15

M_0.01= 31.92  (cv= 17.29, lambda=  0.22141, k=  0.25621, x= 14.63;
                90% confidence interval for segment length:  13 +-   9)
M_0.05= 24.56  (x=  7.26);     M_0.30= 15.80  (x= -1.49)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-23
CC   at   24
  -21-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   4  (Expected range:   0--  9)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 0   (6-10) 4   (11-20) 1   (>=21) 0

3. Clusters of amino acid multiplets (cmin = 14/30 or 18/45 or 21/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  5)
   0 +plets (f+: 10.9%), 1 -plets (f-: 15.2%)
   Total number of charge altplets: 1 (Critical number: 6)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 1   (>=21) 1

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 7)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF5 VpV262 low quality frame candidate
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:34 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.2250.8890.seq)

SWISS-PROT ANNOTATION:
ID   ORF5
DE   ORF5, 84 bases, 936E4563 checksum.

number of residues:   84;   molecular weight:  10.2 kdal
 
         1  MDKVMERTHF ELPDKLDTPM IHSHYYAWLD KHCNHTWHNI HTRAKYLASL GYDRDKHYLT 
        61  QDDLTVLKCI LHHHFFINVP IESE

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  3( 3.6%); C  :  2( 2.4%); D  :  8( 9.5%); E  :  4( 4.8%); F  :  3( 3.6%)
G  :  1( 1.2%); H  : 11(13.1%); I  :  5( 6.0%); K  :  6( 7.1%); L  :  9(10.7%)
M  :  3( 3.6%); N  :  3( 3.6%); P  :  3( 3.6%); Q  :  1( 1.2%); R  :  3( 3.6%)
S  :  3( 3.6%); T  :  6( 7.1%); V  :  3( 3.6%); W  :  2( 2.4%); Y  :  5( 6.0%)

KR      :    9 ( 10.7%);   ED      :   12 ( 14.3%);   AGP     :    7 (  8.3%);
KRED    :   21 ( 25.0%);   KR-ED   :   -3 ( -3.6%);   FIKMNY  :   25 ( 29.8%);
LVIFM   :   23 ( 27.4%);   ST      :    9 ( 10.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0-+00-+000 -00-+0-000 000000000- +000000000 00+0+00000 00-+-+0000 
        61  0--0000+00 0000000000 0-0-

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 13/45 or 16/60):  none


Negative charge clusters (cmin = 11/30 or 15/45 or 19/60):  none


Mixed charge clusters (cmin = 16/30 or 22/45 or 28/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.107  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.750  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.143  ( ED )

 Expected score/letter:  -0.821;    Average information/letter:   1.355
 Minimal length of displayed segments set to:  20

M_0.01=  8.60  (cv=  4.80, lambda=  0.92375, k=  0.33723, x=  3.80;
                90% confidence interval for segment length:   8 +-   8)
M_0.05=  6.84  (x=  2.04)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.143  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.750  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.107  ( KR )

 Expected score/letter:  -0.679;    Average information/letter:   0.855
 Minimal length of displayed segments set to:  20

M_0.01= 10.55  (cv=  6.08, lambda=  0.72822, k=  0.26045, x=  4.47;
                90% confidence interval for segment length:  13 +-  13)
M_0.05=  8.32  (x=  2.23)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.250  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.750  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.500;    Average information/letter:   0.792
 Minimal length of displayed segments set to:  20

M_0.01=  7.22  (cv=  4.03, lambda=  1.09861, k=  0.33333, x=  3.19;
                90% confidence interval for segment length:  14 +-  13)
M_0.05=  5.74  (x=  1.70)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.750  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.250  ( KEDR )

 Expected score/letter:  -1.250;    Average information/letter:   0.259
 Minimal length of displayed segments set to:  20

M_0.01= 28.25  (cv= 17.41, lambda=  0.25450, k=  0.15869, x= 10.84;
                90% confidence interval for segment length:  40 +-  28)
M_0.05= 21.85  (x=  4.44)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   6 |  27 |   8 |   9 |  11 |   9 |  10 |  13 |   9 |  11 | 
lmin1     5 |   6 |   8 |  32 |  10 |  11 |  14 |  11 |  12 |  15 |  11 |  14 | 
lmin2     6 |   7 |   9 |  36 |  11 |  12 |  15 |  13 |  14 |  17 |  12 |  15 | 
 (Significance level: 0.010000; Minimal displayed length:  6)

(H..)  14(0,0,0); at   30-  43:   DKHCNHTWHNIHTR
      (2. quartile)               


Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   1, at   53;
  0  runs >=  18:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.250;    Average information/letter:   0.785
 Minimal length of displayed segments set to:  15

M_0.01= 18.06  (cv= 10.14, lambda=  0.43713, k=  0.32058, x=  7.92;
                90% confidence interval for segment length:  15 +-   9)
M_0.05= 14.33  (x=  4.19)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.845;    Average information/letter:   0.858
 Minimal length of displayed segments set to:  15

M_0.01= 37.68  (cv= 21.54, lambda=  0.20572, k=  0.27804, x= 16.14;
                90% confidence interval for segment length:  13 +-   9)
M_0.05= 29.75  (x=  8.22);     M_0.30= 20.33  (x= -1.21)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-32-C-35-C-15-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   4  (Expected range:   0-- 12)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 2   (11-20) 0   (>=21) 2

3. Clusters of amino acid multiplets (cmin =  9/30 or 12/45 or 14/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  7)
   0 +plets (f+: 10.7%), 1 -plets (f-: 14.3%)
   Total number of charge altplets: 5 (Critical number: 9)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors
  32-  43        3      H..              4       4 !     0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF6 VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:35 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4813.2920.seq)

SWISS-PROT ANNOTATION:
ID   ORF6
DE   ORF6, 120 bases, 3DC19403 checksum.

number of residues:  120;   molecular weight:  13.2 kdal
 
         1  MSMQYTILLG AGAHIDDTTS AKAMRADLVQ THIKSWTEQL CAVFGGCTMS IVDGNYVMNN 
        61  GELCTESSFQ FTCIKFGDNT VEQAQAWIEV KLIARDMCKV FEQECVLISR HACDAELVGV

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 11( 9.2%); C  :  7( 5.8%); D  :  7( 5.8%); E  :  8( 6.7%); F  :  5( 4.2%)
G  :  8( 6.7%); H  :  3( 2.5%); I  :  8( 6.7%); K  :  5( 4.2%); L  :  8( 6.7%)
M  :  6( 5.0%); N  :  4( 3.3%); P  :  0( 0.0%); Q  :  7( 5.8%); R  :  3( 2.5%)
S  :  7( 5.8%); T  :  9( 7.5%); V  : 10( 8.3%); W  :  2( 1.7%); Y  :  2( 1.7%)

KR      :    8 (  6.7%);   ED      :   15 ( 12.5%);   AGP     :   19 ( 15.8%);
KRED    :   23 ( 19.2%);   KR-ED   :   -7 ( -5.8%);   FIKMNY  :   30 ( 25.0%);
LVIFM   :   37 ( 30.8%);   ST      :   16 ( 13.3%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000000 00000--000 0+00+0-000 000+000-00 0000000000 00-0000000 
        61  0-000-0000 0000+00-00 0-000000-0 +000+-00+0 0-0-00000+ 000-0-0000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  7/30 or  9/45 or 11/60):  none


Negative charge clusters (cmin = 10/30 or 14/45 or 17/60):  none


Mixed charge clusters (cmin = 14/30 or 19/45 or 23/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.067  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.808  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.125  ( ED )

 Expected score/letter:  -0.925;    Average information/letter:   2.146
 Minimal length of displayed segments set to:  20

M_0.01=  7.03  (cv=  3.96, lambda=  1.20847, k=  0.41054, x=  3.07;
                90% confidence interval for segment length:   6 +-   5)
M_0.05=  5.68  (x=  1.72)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.125  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.808  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.067  ( KR )

 Expected score/letter:  -0.692;    Average information/letter:   1.001
 Minimal length of displayed segments set to:  20

M_0.01= 10.10  (cv=  5.95, lambda=  0.80491, k=  0.28337, x=  4.15;
                90% confidence interval for segment length:  12 +-  11)
M_0.05=  8.07  (x=  2.12)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.192  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.808  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.617;    Average information/letter:   1.280
 Minimal length of displayed segments set to:  20

M_0.01=  6.00  (cv=  3.33, lambda=  1.43922, k=  0.47045, x=  2.67;
                90% confidence interval for segment length:  10 +-   8)
M_0.05=  4.87  (x=  1.54)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.808  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.192  ( KEDR )

 Expected score/letter:  -0.725;    Average information/letter:   0.114
 Minimal length of displayed segments set to:  20

M_0.01= 43.33  (cv= 30.62, lambda=  0.15635, k=  0.07326, x= 12.71;
                90% confidence interval for segment length:  86 +-  73)
M_0.05= 32.90  (x=  2.28)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     3 |   4 |   6 |  36 |   7 |   9 |  11 |   9 |  11 |  13 |   6 |   7 | 
lmin1     5 |   6 |   7 |  44 |   9 |  11 |  13 |  11 |  13 |  15 |   7 |   9 | 
lmin2     6 |   7 |   8 |  49 |  10 |  12 |  14 |  12 |  15 |  17 |   8 |  11 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  24:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.617;    Average information/letter:   0.449
 Minimal length of displayed segments set to:  15

M_0.01= 26.07  (cv= 15.86, lambda=  0.30179, k=  0.21876, x= 10.21;
                90% confidence interval for segment length:  25 +-  18)
M_0.05= 20.67  (x=  4.81)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -2.958;    Average information/letter:   0.368
 Minimal length of displayed segments set to:  15

M_0.01= 59.18  (cv= 37.43, lambda=  0.12792, k=  0.16250, x= 21.76;
                90% confidence interval for segment length:  30 +-  24)
M_0.05= 46.44  (x=  9.01);     M_0.30= 31.28  (x= -6.15)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-40-C-5-C-16-C-8-C-24-C-6-C-7-C-7-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   6  (Expected range:   0-- 15)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 3   (11-20) 1   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 10/30 or 12/45 or 15/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  7)
   0 +plets (f+: 6.7%), 1 -plets (f-: 12.5%)
   Total number of charge altplets: 1 (Critical number: 8)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 1   (>=21) 1

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


ORF7 VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:37 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4814.3752.seq)

SWISS-PROT ANNOTATION:
ID   ORF7
DE   ORF7, 134 bases, 250A6EBF checksum.

number of residues:  134;   molecular weight:  14.9 kdal
 
         1  MSSAWSFKDP DDLELETPDG DCVYVCGSGE RIMYGAGKGE ADNLVYYVNV GDYQRAIDVL 
        61  RKYKDSSVTG WGHQLLADLE NYMLKSTPDP VASIDEADSF GIGQPNRKST KQWEPDTPDP 
       121  DVDLMKSIRD ACHR

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  8( 6.0%); C  :  3( 2.2%); D  : 18(13.4%); E  :  7( 5.2%); F  :  2( 1.5%)
G  : 11( 8.2%); H  :  2( 1.5%); I  :  5( 3.7%); K  :  8( 6.0%); L  :  9( 6.7%)
M  :  4( 3.0%); N  :  4( 3.0%); P  :  8( 6.0%); Q  :  4( 3.0%); R  :  6( 4.5%)
S  : 11( 8.2%); T  :  5( 3.7%); V  :  9( 6.7%); W  :  3( 2.2%); Y  :  7( 5.2%)

KR      :   14 ( 10.4%);   ED      :   25 ( 18.7%);   AGP     :   27 ( 20.1%);
KRED    :   39 ( 29.1%);   KR-ED   :  -11 ( -8.2%);   FIKMNY  :   30 ( 22.4%);
LVIFM   :   29 ( 21.6%);   ST      :   16 ( 11.9%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000+-0 --0-0-00-0 -00000000- +000000+0- 0-00000000 0-00+00-00 
        61  ++0+-00000 0000000-0- 0000+000-0 0000--0-00 000000++00 +00-0-00-0 
       121  -0-00+00+- 000+

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 12/45 or 15/60):  none


Negative charge clusters (cmin = 14/30 or 18/45 or 23/60):  none


Mixed charge clusters (cmin = 18/30 or 25/45 or 31/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.104  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.709  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.187  ( ED )


 Expected score/letter:  -0.873;    Average information/letter:   1.459
 Minimal length of displayed segments set to:  20

M_0.01=  8.90  (cv=  5.16, lambda=  0.94942, k=  0.35218, x=  3.75;
                90% confidence interval for segment length:   8 +-   8)
M_0.05=  7.19  (x=  2.03)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.187  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.709  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.104  ( KR )

 Expected score/letter:  -0.545;    Average information/letter:   0.489
 Minimal length of displayed segments set to:  20

M_0.01= 14.34  (cv=  9.02, lambda=  0.54322, k=  0.18130, x=  5.32;
                90% confidence interval for segment length:  23 +-  23)
M_0.05= 11.34  (x=  2.32)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.291  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.709  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.418;    Average information/letter:   0.537
 Minimal length of displayed segments set to:  20

M_0.01=  9.09  (cv=  5.50, lambda=  0.89032, k=  0.24635, x=  3.59;
                90% confidence interval for segment length:  22 +-  20)
M_0.05=  7.26  (x=  1.76)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.709  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.291  ( KEDR )

 Expected score/letter:  -1.619;    Average information/letter:   0.371
 Minimal length of displayed segments set to:  20

M_0.01= 24.83  (cv= 15.24, lambda=  0.32148, k=  0.21991, x=  9.60;
                90% confidence interval for segment length:  31 +-  18)
M_0.05= 19.76  (x=  4.53)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   6 |   7 |  24 |   8 |  10 |  12 |   9 |  11 |  14 |   5 |   7 | 
lmin1     5 |   7 |   9 |  29 |  10 |  12 |  15 |  11 |  14 |  17 |   7 |   8 | 
lmin2     6 |   8 |  11 |  32 |  11 |  13 |  17 |  13 |  15 |  19 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   4:   0
  *  runs >=   5:   0
  0  runs >=  16:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.522;    Average information/letter:   0.862
 Minimal length of displayed segments set to:  15

M_0.01= 17.98  (cv= 10.41, lambda=  0.47050, k=  0.35436, x=  7.57;
                90% confidence interval for segment length:  14 +-   8)
M_0.05= 14.52  (x=  4.11)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.978;    Average information/letter:   0.836
 Minimal length of displayed segments set to:  15

M_0.01= 38.23  (cv= 22.81, lambda=  0.21476, k=  0.27593, x= 15.42;
                90% confidence interval for segment length:  14 +-   9)
M_0.05= 30.64  (x=  7.83);     M_0.30= 21.61  (x= -1.20)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-21-C-3-C-105-C-2-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[  87-  90]   TPDP
[ 117- 120]   TPDP


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   5  (Expected range:   0-- 17)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 2   (11-20) 1   (>=21) 2

3. Clusters of amino acid multiplets (cmin =  8/30 or 10/45 or 12/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   4  (Expected range:   0-- 12)
   2 +plets (f+: 10.4%), 2 -plets (f-: 18.7%)
   Total number of charge altplets: 4 (Critical number: 13)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 2   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


ORF8 VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:38 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.2250.8967.seq)

SWISS-PROT ANNOTATION:
ID   ORF8
DE   ORF8, 56 bases, 857665F2 checksum.

number of residues:   56;   molecular weight:   6.6 kdal
 
         1  MQFYDVLEVL FESANGGYKV YSELDECVYT LTAEDLKHER ELGNEVYDPY QLVFFN

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  2( 3.6%); C  :  1( 1.8%); D  :  4( 7.1%); E  :  8(14.3%); F  :  4( 7.1%)
G  :  3( 5.4%); H  :  1( 1.8%); I  :  0( 0.0%); K  :  2( 3.6%); L  :  7(12.5%)
M  :  1( 1.8%); N  :  3( 5.4%); P  :  1( 1.8%); Q  :  2( 3.6%); R  :  1( 1.8%)
S  :  2( 3.6%); T  :  2( 3.6%); V  :  6(10.7%); W  :  0( 0.0%); Y  :  6(10.7%)

KR      :    3 (  5.4%);   ED      :   12 ( 21.4%);   AGP     :    6 ( 10.7%);
KRED    :   15 ( 26.8%);   KR-ED   :   -9 (-16.1%);   FIKMNY  :   16 ( 28.6%);
LVIFM   :   18 ( 32.1%);   ST      :    4 (  7.1%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000-00-00 0-000000+0 00-0--0000 000--0+0-+ -000-00-00 000000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  6/30 or  8/45 or 10/60):  none


Negative charge clusters (cmin = 15/30 or 20/45 or 25/60):  none


Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.054  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.732  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.214  ( ED )

 Expected score/letter:  -1.054;    Average information/letter:   2.671
 Minimal length of displayed segments set to:  20

M_0.01=  5.80  (cv=  2.98, lambda=  1.34903, k=  0.44752, x=  2.81;
                90% confidence interval for segment length:   4 +-   4)
M_0.05=  4.59  (x=  1.61)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.214  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.732  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.054  ( KR )

 Expected score/letter:  -0.411;    Average information/letter:   0.279
 Minimal length of displayed segments set to:  20

M_0.01= 15.52  (cv=  9.61, lambda=  0.41883, k=  0.11951, x=  5.91;
                90% confidence interval for segment length:  34 +-  38)
M_0.05= 11.63  (x=  2.02)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.268  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.732  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.464;    Average information/letter:   0.674
 Minimal length of displayed segments set to:  20

M_0.01=  7.36  (cv=  4.00, lambda=  1.00552, k=  0.29443, x=  3.36;
                90% confidence interval for segment length:  16 +-  15)
M_0.05=  5.74  (x=  1.74)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.732  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.268  ( KEDR )

 Expected score/letter:  -1.411;    Average information/letter:   0.307
 Minimal length of displayed segments set to:  20

M_0.01= 24.46  (cv= 14.19, lambda=  0.28371, k=  0.18547, x= 10.28;
                90% confidence interval for segment length:  33 +-  22)
M_0.05= 18.72  (x=  4.53)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     3 |   5 |   6 |  23 |   6 |  10 |  11 |   7 |  11 |  12 |   5 |   6 | 
lmin1     4 |   7 |   8 |  28 |   8 |  12 |  13 |   9 |  13 |  15 |   7 |   8 | 
lmin2     5 |   8 |   9 |  31 |   9 |  13 |  15 |  10 |  15 |  16 |   7 |   9 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   4:   0
  *  runs >=   4:   0
  0  runs >=  16:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.000;    Average information/letter:   0.560
 Minimal length of displayed segments set to:  15

M_0.01= 22.10  (cv= 12.21, lambda=  0.32960, k=  0.26130, x=  9.88;
                90% confidence interval for segment length:  19 +-  13)
M_0.05= 17.15  (x=  4.94)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.179;    Average information/letter:   0.565
 Minimal length of displayed segments set to:  15

M_0.01= 47.73  (cv= 26.97, lambda=  0.14926, k=  0.22291, x= 20.76;
                90% confidence interval for segment length:  18 +-  14)
M_0.05= 36.81  (x=  9.84);     M_0.30= 23.82  (x= -3.15)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-26-C-29-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   2  (Expected range:   0-- 11)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 0   (11-20) 1   (>=21) 1

3. Clusters of amino acid multiplets (cmin =  8/30 or 10/45 or 12/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   2  (Expected range:   0--  7)
   0 +plets (f+: 5.4%), 2 -plets (f-: 21.4%)
   Total number of charge altplets: 1 (Critical number: 6)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 1   (11-20) 0   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF9 VpV262 poor match to SIO1 gp17
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:40 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.8967.seq)

SWISS-PROT ANNOTATION:
ID   ORF9
DE   ORF9, 69 bases, DEE0CBC3 checksum.

number of residues:   69;   molecular weight:   8.2 kdal
 
         1  MPTVYVIKMI QRGTDEPDYQ IEKQFTGKSI AEQLSLAIDH AEWLNERNEA SWRHYFASQY 
        61  PPDEIIWNG

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  5( 7.2%); C  :  0( 0.0%); D  :  4( 5.8%); E  :  7(10.1%); F  :  2( 2.9%)
G  :  3( 4.3%); H  :  2( 2.9%); I  :  7(10.1%); K  :  3( 4.3%); L  :  3( 4.3%)
M  :  2( 2.9%); N  :  3( 4.3%); P  :  4( 5.8%); Q  :  5( 7.2%); R  :  3( 4.3%)
S  :  4( 5.8%); T  :  3( 4.3%); V  :  2( 2.9%); W  :  3( 4.3%); Y  :  4( 5.8%)

KR      :    6 (  8.7%);   ED      :   11 ( 15.9%);   AGP     :   12 ( 17.4%);
KRED    :   17 ( 24.6%);   KR-ED   :   -5 ( -7.2%);   FIKMNY  :   21 ( 30.4%);
LVIFM   :   16 ( 23.2%);   ST      :    7 ( 10.1%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000+00 0+00--0-00 0-+0000+00 0-000000-0 0-000-+0-0 00+0000000 
        61  00--00000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  8/30 or 11/45 or 13/60):  none


Negative charge clusters (cmin = 12/30 or 16/45 or 20/60):  none


Mixed charge clusters (cmin = 16/30 or 22/45 or 28/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.087  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.754  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.159  ( ED )

 Expected score/letter:  -0.899;    Average information/letter:   1.731
 Minimal length of displayed segments set to:  20

M_0.01=  7.45  (cv=  4.01, lambda=  1.05591, k=  0.37880, x=  3.44;
                90% confidence interval for segment length:   7 +-   6)
M_0.05=  5.90  (x=  1.89)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.159  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.754  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.087  ( KR )

 Expected score/letter:  -0.609;    Average information/letter:   0.673
 Minimal length of displayed segments set to:  20

M_0.01= 11.34  (cv=  6.54, lambda=  0.64712, k=  0.22378, x=  4.80;
                90% confidence interval for segment length:  16 +-  16)
M_0.05=  8.82  (x=  2.28)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.246  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.754  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.507;    Average information/letter:   0.818
 Minimal length of displayed segments set to:  20

M_0.01=  6.94  (cv=  3.79, lambda=  1.11803, k=  0.34142, x=  3.15;
                90% confidence interval for segment length:  14 +-  12)
M_0.05=  5.48  (x=  1.70)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.754  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.246  ( KEDR )

 Expected score/letter:  -1.217;    Average information/letter:   0.250
 Minimal length of displayed segments set to:  20

M_0.01= 28.00  (cv= 17.04, lambda=  0.24855, k=  0.15325, x= 10.96;
                90% confidence interval for segment length:  40 +-  29)
M_0.05= 21.44  (x=  4.40)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   6 |  26 |   7 |   9 |  11 |   8 |  10 |  12 |   6 |   7 | 
lmin1     5 |   6 |   8 |  32 |   9 |  11 |  13 |  10 |  13 |  15 |   7 |   9 | 
lmin2     6 |   7 |   9 |  35 |  10 |  12 |  15 |  12 |  14 |  17 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  18:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.333;    Average information/letter:   0.825
 Minimal length of displayed segments set to:  15

M_0.01= 16.90  (cv=  9.23, lambda=  0.45870, k=  0.33919, x=  7.67;
                90% confidence interval for segment length:  14 +-   9)
M_0.05= 13.35  (x=  4.12)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.841;    Average information/letter:   0.849
 Minimal length of displayed segments set to:  15

M_0.01= 35.52  (cv= 19.91, lambda=  0.21261, k=  0.27735, x= 15.60;
                90% confidence interval for segment length:  13 +-   9)
M_0.05= 27.85  (x=  7.94);     M_0.30= 18.73  (x= -1.18)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-69-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   2  (Expected range:   0-- 10)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 2   (6-10) 0   (11-20) 0   (>=21) 1

3. Clusters of amino acid multiplets (cmin =  7/30 or  9/45 or 11/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   2  (Expected range:   0--  7)
   0 +plets (f+: 8.7%), 2 -plets (f-: 15.9%)
   Total number of charge altplets: 2 (Critical number: 7)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 1   (11-20) 1   (>=21) 1

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 7)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF10 VpV262 poor frame candidate
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:41 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4816.9244.seq)

SWISS-PROT ANNOTATION:
ID   ORF10
DE   ORF10, 65 bases, 33B5E21F checksum.

number of residues:   65;   molecular weight:   7.2 kdal
 
         1  MEWINDIIFA IAKEAGCSPT QVRVLKWDMD GELTVCTCEL PRRGLTPIGI PDTMFASVSG 
        61  VHTLH

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  4( 6.2%); C  :  3( 4.6%); D  :  4( 6.2%); E  :  4( 6.2%); F  :  2( 3.1%)
G  :  5( 7.7%); H  :  2( 3.1%); I  :  6( 9.2%); K  :  2( 3.1%); L  :  5( 7.7%)
M  :  3( 4.6%); N  :  1( 1.5%); P  :  4( 6.2%); Q  :  1( 1.5%); R  :  3( 4.6%)
S  :  3( 4.6%); T  :  6( 9.2%); V  :  5( 7.7%); W  :  2( 3.1%); Y  :  0( 0.0%)

KR      :    5 (  7.7%);   ED      :    8 ( 12.3%);   AGP     :   13 ( 20.0%);
KRED    :   13 ( 20.0%);   KR-ED   :   -3 ( -4.6%);   FIKMNY  :   14 ( 21.5%);
LVIFM   :   21 ( 32.3%);   ST      :    9 ( 13.8%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0-000-0000 00+-000000 00+00+0-0- 0-000000-0 0++0000000 0-00000000 
        61  00000

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  8/30 or 10/45 or 12/60):  none


Negative charge clusters (cmin = 10/30 or 14/45 or 17/60):  none


Mixed charge clusters (cmin = 14/30 or 19/45 or 24/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.077  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.800  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.123  ( ED )

 Expected score/letter:  -0.892;    Average information/letter:   1.891
 Minimal length of displayed segments set to:  20

M_0.01=  6.98  (cv=  3.72, lambda=  1.12285, k=  0.39067, x=  3.26;
                90% confidence interval for segment length:   6 +-   6)
M_0.05=  5.53  (x=  1.81)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.123  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.800  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.077  ( KR )

 Expected score/letter:  -0.708;    Average information/letter:   1.040
 Minimal length of displayed segments set to:  20

M_0.01=  9.21  (cv=  5.10, lambda=  0.81802, k=  0.28992, x=  4.11;
                90% confidence interval for segment length:  10 +-  11)
M_0.05=  7.22  (x=  2.12)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.200  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.800  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.600;    Average information/letter:   1.200
 Minimal length of displayed segments set to:  20

M_0.01=  5.75  (cv=  3.01, lambda=  1.38629, k=  0.45000, x=  2.74;
                90% confidence interval for segment length:  10 +-   8)
M_0.05=  4.58  (x=  1.57)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.800  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.200  ( KEDR )

 Expected score/letter:  -0.800;    Average information/letter:   0.133
 Minimal length of displayed segments set to:  20

M_0.01= 36.94  (cv= 24.44, lambda=  0.17079, k=  0.08492, x= 12.50;
                90% confidence interval for segment length:  68 +-  59)
M_0.05= 27.39  (x=  2.95)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     3 |   4 |   5 |  32 |   7 |   8 |  10 |   8 |  10 |  12 |   6 |   7 | 
lmin1     5 |   5 |   7 |  39 |   9 |  10 |  12 |  10 |  12 |  15 |   7 |   9 | 
lmin2     5 |   6 |   8 |  43 |  10 |  11 |  14 |  12 |  13 |  16 |   8 |  10 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  21:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.600;    Average information/letter:   0.443
 Minimal length of displayed segments set to:  15

M_0.01= 24.05  (cv= 13.88, lambda=  0.30079, k=  0.21416, x= 10.17;
                90% confidence interval for segment length:  24 +-  18)
M_0.05= 18.63  (x=  4.75)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -2.846;    Average information/letter:   0.346
 Minimal length of displayed segments set to:  15

M_0.01= 56.27  (cv= 34.01, lambda=  0.12276, k=  0.15453, x= 22.26;
                90% confidence interval for segment length:  29 +-  24)
M_0.05= 42.99  (x=  8.98);     M_0.30= 27.19  (x= -6.81)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-16-C-18
CTC  at   36
   -27-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   2  (Expected range:   0-- 10)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 0   (6-10) 1   (11-20) 0   (>=21) 2

3. Clusters of amino acid multiplets (cmin =  7/30 or  9/45 or 11/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  5)
   1 +plets (f+: 7.7%), 0 -plets (f-: 12.3%)
   Total number of charge altplets: 1 (Critical number: 6)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF11 VpV262 poor frame candidate
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:43 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4810.9522.seq)

SWISS-PROT ANNOTATION:
ID   ORF11
DE   ORF11, 104 bases, 5E0A9B22 checksum.

number of residues:  104;   molecular weight:  12.0 kdal
 
         1  MNHLLYHSGD RVVFVAPQYG VYRPLPAYVL KTVQTGVPFS CPHCGKKVKL HPDNVQRYGY 
        61  YLDCWYGVYG TDCCKPFIYS QLKRVHCPSE HPYKFINGAI TLVE

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  3( 2.9%); C  :  6( 5.8%); D  :  4( 3.8%); E  :  2( 1.9%); F  :  4( 3.8%)
G  :  8( 7.7%); H  :  6( 5.8%); I  :  3( 2.9%); K  :  7( 6.7%); L  :  8( 7.7%)
M  :  1( 1.0%); N  :  3( 2.9%); P  :  9( 8.7%); Q  :  4( 3.8%); R  :  4( 3.8%)
S  :  4( 3.8%); T  :  4( 3.8%); V  : 12(11.5%); W  :  1( 1.0%); Y  : 11(10.6%)

KR      :   11 ( 10.6%);   ED      :    6 (  5.8%);   AGP     :   20 ( 19.2%);
KRED    :   17 ( 16.3%);   KR-ED   :    5 (  4.8%);   FIKMNY  :   29 ( 27.9%);
LVIFM   :   28 ( 26.9%);   ST      :    8 (  7.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  000000000- +000000000 00+0000000 +000000000 00000++0+0 00-000+000 
        61  00-0000000 0-00+00000 00++00000- 000+000000 000-

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 13/45 or 15/60):  none


Negative charge clusters (cmin =  6/30 or  8/45 or 10/60):  none


Mixed charge clusters (cmin = 13/30 or 17/45 or 21/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.106  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.837  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.058  ( ED )

 Expected score/letter:  -0.740;    Average information/letter:   1.254
 Minimal length of displayed segments set to:  20

M_0.01=  8.89  (cv=  5.10, lambda=  0.91119, k=  0.31939, x=  3.80;
                90% confidence interval for segment length:   9 +-   9)
M_0.05=  7.10  (x=  2.01)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.058  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.837  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.106  ( KR )

 Expected score/letter:  -0.933;    Average information/letter:   2.378
 Minimal length of displayed segments set to:  20

M_0.01=  6.50  (cv=  3.60, lambda=  1.29026, k=  0.42442, x=  2.90;
                90% confidence interval for segment length:   5 +-   4)
M_0.05=  5.24  (x=  1.64)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.163  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.837  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.673;    Average information/letter:   1.585
 Minimal length of displayed segments set to:  20

M_0.01=  5.29  (cv=  2.84, lambda=  1.63269, k=  0.54156, x=  2.44;
                90% confidence interval for segment length:   8 +-   6)
M_0.05=  4.29  (x=  1.44)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.837  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.163  ( KEDR )

 Expected score/letter:  -0.471
 Average information/letter:   0.056 < .10; too small !


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   3 |   5 |  42 |   8 |   7 |  10 |  10 |   8 |  12 |   7 |   9 | 
lmin1     5 |   4 |   6 |  51 |  10 |   9 |  12 |  12 |  10 |  15 |   9 |  11 | 
lmin2     6 |   5 |   7 |  56 |  11 |  10 |  13 |  14 |  12 |  16 |  10 |  13 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   3:   0
  0  runs >=  28:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.356;    Average information/letter:   0.377
 Minimal length of displayed segments set to:  15

M_0.01= 25.66  (cv= 15.74, lambda=  0.29509, k=  0.18764, x=  9.92;
                90% confidence interval for segment length:  29 +-  22)
M_0.05= 20.13  (x=  4.40)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -3.125;    Average information/letter:   0.474
 Minimal length of displayed segments set to:  15

M_0.01= 49.38  (cv= 30.29, lambda=  0.15333, k=  0.18757, x= 19.09;
                90% confidence interval for segment length:  23 +-  19)
M_0.05= 38.75  (x=  8.46);     M_0.30= 26.10  (x= -4.19)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-40
CPHC at   41
    -19-C-8
CC   at   73   (l=  34)
  -12-C-17-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[  19-  22]   YGVY
[  66-  69]   YGVY


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   5  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 1   (11-20) 2   (>=21) 2

3. Clusters of amino acid multiplets (cmin =  9/30 or 12/45 or 14/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   2  (Expected range:   0--  5)
   2 +plets (f+: 10.6%), 0 -plets (f-: 5.8%)
   Total number of charge altplets: 1 (Critical number: 6)


2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 3

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
  13-  44        8      V.......         4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 7)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  18-  34  (2.)     Q(  16)Q     5 of   5   0.0028   large minimal spacing
  56-  81  (3.)     Q(  25)Q     1 of   5   0.9977   small maximal spacing



ORF12 VpV262 weak CD search match to nucleoside phosphatase family
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:44 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4813.5345.seq)

SWISS-PROT ANNOTATION:
ID   ORF12
DE   ORF12, 322 bases, A14E237E checksum.

number of residues:  322;   molecular weight:  36.3 kdal
 
         1  MAISLSSLLH AGEFGQFMTN RRTVLTVCRH NGYRSELGGG STIIWRESGT VDHLHGTLDH 
        61  HRNDVSALRL VRRVPNLRRA ANGATYWTIA GYRARIECRR RPNSQDEEWK ASIHREDGSW 
       121  KCNIEYNMYG IPRTMPDGSD FDLVWSHDDV DRERQAIREI DKMPDPAHKD NAPHQHGVAA 
       181  TLGHLQSNGG LRGHSVGDVF PYVPFYKGAS CEQWVRCPDG TEEGPFSSIA AMHQFIEMNE 
       241  LVKEQERLPV FTEPVLQMGA AAASHIPVTP ESAEQIIEVL EYIGVPPQDM ENIGVIDDTG 
       301  DGFYSLREYL DIVEQSNKLL ED

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 22( 6.8%); C  :  5( 1.6%); D  : 22( 6.8%); E  : 26( 8.1%); F  :  9( 2.8%)
G  : 27( 8.4%); H+ : 15( 4.7%); I  : 19( 5.9%); K  :  7( 2.2%); L  : 22( 6.8%)
M  :  9( 2.8%); N  : 13( 4.0%); P  : 17( 5.3%); Q  : 12( 3.7%); R  : 25( 7.8%)
S  : 21( 6.5%); T  : 14( 4.3%); V  : 21( 6.5%); W  :  6( 1.9%); Y  : 10( 3.1%)

KR      :   32 (  9.9%);   ED      :   48 ( 14.9%);   AGP     :   66 ( 20.5%);
KRED    :   80 ( 24.8%);   KR-ED - :  -16 ( -5.0%);   FIKMNY  :   67 ( 20.8%);
LVIFM   :   80 ( 24.8%);   ST      :   35 ( 10.9%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000000 00-0000000 ++000000+0 000+0-0000 00000+-000 0-000000-0 
        61  0+0-0000+0 0++0000++0 0000000000 00+0+0-0++ +0000---0+ 0000+--000 
       121  +000-00000 00+000-00- 0-00000--0 -+-+000+-0 -+00-000+- 0000000000 
       181  0000000000 0+00000-00 000000+000 0-000+00-0 0--0000000 000000-00- 
       241  00+-0-+000 00-0000000 0000000000 -00-000-00 -0000000-0 -00000--00 
       301  -00000+-00 -00-000+00 --

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 12/45 or 15/60):  none


Negative charge clusters (cmin = 12/30 or 16/45 or 19/60):  none


Mixed charge clusters (cmin = 16/30 or 22/45 or 28/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.099  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.752  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.149  ( ED )

 Expected score/letter:  -0.851;    Average information/letter:   1.489
 Minimal length of displayed segments set to:  20

M_0.01=  9.60  (cv=  5.94, lambda=  0.97201, k=  0.35331, x=  3.66;
                90% confidence interval for segment length:   9 +-   8)
M_0.05=  7.93  (x=  1.99)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.149  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.752  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.099  ( KR )

 Expected score/letter:  -0.652;    Average information/letter:   0.783
 Minimal length of displayed segments set to:  20

M_0.01= 12.87  (cv=  8.28, lambda=  0.69717, k=  0.24662, x=  4.59;
                90% confidence interval for segment length:  17 +-  15)
M_0.05= 10.54  (x=  2.25)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.248  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.752  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.503;    Average information/letter:   0.803
 Minimal length of displayed segments set to:  20

M_0.01=  8.39  (cv=  5.22, lambda=  1.10691, k=  0.33679, x=  3.17;
                90% confidence interval for segment length:  17 +-  14)
M_0.05=  6.92  (x=  1.70)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.752  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.248  ( KEDR )

 Expected score/letter:  -1.236;    Average information/letter:   0.255
 Minimal length of displayed segments set to:  20

M_0.01= 33.81  (cv= 22.92, lambda=  0.25195, k=  0.15636, x= 10.89;
                90% confidence interval for segment length:  48 +-  31)
M_0.05= 27.34  (x=  4.42)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   7 |  31 |   9 |  10 |  13 |  10 |  12 |  15 |   8 |  10 | 
lmin1     6 |   7 |   9 |  38 |  11 |  12 |  15 |  13 |  14 |  18 |   9 |  12 | 
lmin2     7 |   8 |  10 |  42 |  12 |  14 |  17 |  14 |  16 |  20 |  11 |  13 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   1, at   99;
  -  runs >=   4:   0
  *  runs >=   5:   0
  0  runs >=  21:   1, at  171;

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.217;    Average information/letter:   0.745
 Minimal length of displayed segments set to:  15

M_0.01= 21.64  (cv= 13.55, lambda=  0.42615, k=  0.31633, x=  8.09;
                90% confidence interval for segment length:  18 +-  11)
M_0.05= 17.82  (x=  4.27)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.410;    Average information/letter:   0.700
 Minimal length of displayed segments set to:  15

M_0.01= 48.42  (cv= 31.08, lambda=  0.18581, k=  0.25212, x= 17.34;
                90% confidence interval for segment length:  19 +-  12)
M_0.05= 39.65  (x=  8.57);     M_0.30= 29.21  (x= -1.87)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-27-C-69-C-23-C-88-C-5-C-105-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  22  (Expected range:   5-- 31)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 7   (6-10) 5   (11-20) 6   (>=21) 5

3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  10  (Expected range:   0-- 18)
   4 +plets (f+: 9.9%), 6 -plets (f-: 14.9%)
   Total number of charge altplets: 9 (Critical number: 20)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 3   (6-10) 1   (11-20) 1   (>=21) 6

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
 260- 263        1      A                4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
 226- 243        3      i0.              6       6      /0/2/./
 271- 330       10      -00.00..00       6       6 !    /0/1/1/./1/1/././2/1/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

  24-  27  (1.)     V(   3)V    22 of  22   0.0081   large minimal spacing
  71-  74  (1.)     V(   3)V    20 of  22   0.0081     matching minimum
 196- 199  (3.)     V(   3)V    21 of  22   0.0081     matching minimum
 315- 323  (4.)     Q(   8)Q    13 of  13   0.0093   large minimal spacing



ORF13 VpV262 weak match to tail protein
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:46 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4817.6454.seq)

SWISS-PROT ANNOTATION:
ID   ORF13
DE   ORF13, 105 bases, D1ECCB18 checksum.

number of residues:  105;   molecular weight:  12.4 kdal
 
         1  MRTNNQHYQS KGRKPAPKKQ KKNANRMFFD PMVYVLDIIR KHFDEKAFLT HNQKHGHRVI 
        61  KYRHTTGKRL VLGTPANSRL VYDFAFAEMD EEACEQLCQE LGQVK

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  7( 6.7%); C  :  2( 1.9%); D  :  5( 4.8%); E  :  6( 5.7%); F  :  6( 5.7%)
G  :  5( 4.8%); H  :  6( 5.7%); I  :  3( 2.9%); K  : 12(11.4%); L  :  7( 6.7%)
M  :  4( 3.8%); N  :  6( 5.7%); P  :  4( 3.8%); Q  :  7( 6.7%); R  :  8( 7.6%)
S  :  2( 1.9%); T  :  5( 4.8%); V  :  6( 5.7%); W  :  0( 0.0%); Y  :  4( 3.8%)

KR      :   20 ( 19.0%);   ED      :   11 ( 10.5%);   AGP     :   16 ( 15.2%);
KRED    :   31 ( 29.5%);   KR-ED   :    9 (  8.6%);   FIKMNY  :   35 ( 33.3%);
LVIFM   :   26 ( 24.8%);   ST      :    7 (  6.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0+00000000 +0++000++0 ++000+000- 000000-00+ +00--+0000 000+000+00 
        61  +0+0000++0 00000000+0 00-0000-0- --00-0000- 0000+

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 14/30 or 19/45 or 23/60):  none


Negative charge clusters (cmin =  9/30 or 12/45 or 15/60):  none


Mixed charge clusters (cmin = 18/30 or 25/45 or 31/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.190  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.705  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.105  ( ED )

 Expected score/letter:  -0.533;    Average information/letter:   0.464
 Minimal length of displayed segments set to:  20

M_0.01= 14.21  (cv=  8.81, lambda=  0.52851, k=  0.17478, x=  5.40;
                90% confidence interval for segment length:  23 +-  24)
M_0.05= 11.13  (x=  2.32)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.105  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.705  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.190  ( KR )

 Expected score/letter:  -0.876;    Average information/letter:   1.460
 Minimal length of displayed segments set to:  20

M_0.01=  8.66  (cv=  4.91, lambda=  0.94862, k=  0.35253, x=  3.75;
                90% confidence interval for segment length:   8 +-   7)
M_0.05=  6.94  (x=  2.03)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.295  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.705  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.410;    Average information/letter:   0.514
 Minimal length of displayed segments set to:  20

M_0.01=  8.99  (cv=  5.35, lambda=  0.87008, k=  0.23797, x=  3.64;
                90% confidence interval for segment length:  22 +-  20)
M_0.05=  7.11  (x=  1.76)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.705  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.295  ( KEDR )

 Expected score/letter:  -1.657;    Average information/letter:   0.383
 Minimal length of displayed segments set to:  20

M_0.01= 23.66  (cv= 14.18, lambda=  0.32831, k=  0.22607, x=  9.48;
                90% confidence interval for segment length:  29 +-  17)
M_0.05= 18.69  (x=  4.52)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   4 |   7 |  23 |  10 |   8 |  12 |  11 |   9 |  13 |   7 |   9 | 
lmin1     7 |   5 |   9 |  28 |  12 |  10 |  15 |  13 |  11 |  16 |   9 |  11 | 
lmin2     8 |   6 |  10 |  31 |  13 |  11 |  16 |  15 |  12 |  18 |  10 |  13 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   4:   0
  -  runs >=   3:   1, at   90;
  *  runs >=   5:   0
  0  runs >=  15:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.648;    Average information/letter:   0.944
 Minimal length of displayed segments set to:  15

M_0.01= 17.06  (cv=  9.61, lambda=  0.48411, k=  0.36910, x=  7.44;
                90% confidence interval for segment length:  13 +-   7)
M_0.05= 13.69  (x=  4.08)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -5.590;    Average information/letter:   0.989
 Minimal length of displayed segments set to:  15

M_0.01= 36.58  (cv= 21.05, lambda=  0.22107, k=  0.31125, x= 15.53;
                90% confidence interval for segment length:  12 +-   7)
M_0.05= 29.21  (x=  8.16);     M_0.30= 20.44  (x= -0.62)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-93-C-3-C-7-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   7  (Expected range:   0-- 14)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 2   (6-10) 2   (11-20) 2   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   7  (Expected range:   0-- 11)
   5 +plets (f+: 19.0%), 2 -plets (f-: 10.5%)
   Total number of charge altplets: 1 (Critical number: 12)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 3   (6-10) 0   (11-20) 3   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


ORF14 VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:47 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.7286.seq)

SWISS-PROT ANNOTATION:
ID   ORF14
DE   ORF14, 62 bases, A3CAE8C checksum.

number of residues:   62;   molecular weight:   6.5 kdal
 
         1  MAMVYGIVAI IAALCCGSIV SFGAKTKENP DVTDYIASGF ILTIGVVCLV AGLDWLNLIP 
        61  EM

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  7(11.3%); C  :  3( 4.8%); D  :  3( 4.8%); E  :  2( 3.2%); F  :  2( 3.2%)
G  :  6( 9.7%); H  :  0( 0.0%); I  :  8(12.9%); K  :  2( 3.2%); L  :  6( 9.7%)
M  :  3( 4.8%); N  :  2( 3.2%); P  :  2( 3.2%); Q  :  0( 0.0%); R  :  0( 0.0%)
S  :  3( 4.8%); T  :  3( 4.8%); V  :  7(11.3%); W  :  1( 1.6%); Y  :  2( 3.2%)

KR      :    2 (  3.2%);   ED      :    5 (  8.1%);   AGP     :   15 ( 24.2%);
KRED    :    7 ( 11.3%);   KR-ED   :   -3 ( -4.8%);   FIKMNY  :   19 ( 30.6%);
LVIFM   :   26 ( 41.9%);   ST      :    6 (  9.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000000 0000000000 0000+0+-00 -00-000000 0000000000 000-000000 
        61  -0

A. CHARGE CLUSTERS.


Positive charge clusters:  not evaluated (frequency of + < 5%, too low)


Negative charge clusters (cmin =  8/30 or 10/45 or 13/60):  none


Mixed charge clusters (cmin = 10/30 or 13/45 or 16/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.032  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.887  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.081  ( ED )

 Expected score/letter:  -0.984;    Average information/letter:   3.408
 Minimal length of displayed segments set to:  20

M_0.01=  4.92  (cv=  2.55, lambda=  1.61837, k=  0.46825, x=  2.37;
                90% confidence interval for segment length:   3 +-   3)
M_0.05=  3.92  (x=  1.37)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.081  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.887  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.032  ( KR )

 Expected score/letter:  -0.790;    Average information/letter:   1.673
 Minimal length of displayed segments set to:  20

M_0.01=  7.17  (cv=  3.84, lambda=  1.07607, k=  0.36516, x=  3.34;
                90% confidence interval for segment length:   7 +-   7)
M_0.05=  5.66  (x=  1.82)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.113  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.887  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.774;    Average information/letter:   2.302
 Minimal length of displayed segments set to:  20

M_0.01=  4.04  (cv=  2.00, lambda=  2.06142, k=  0.67566, x=  2.04;
                90% confidence interval for segment length:   5 +-   4)
M_0.05=  3.25  (x=  1.25)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.887  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.113  ( KEDR )

 Expected score/letter:  -0.016
 Average information/letter:   0.000 < .10; too small !


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     3 |   3 |   4 |  55 |   6 |   7 |   8 |   7 |   9 |  10 |  NA |  NA | 
lmin1     4 |   5 |   5 |  67 |   7 |   9 |  10 |   9 |  11 |  13 |  NA |  NA | 
lmin2     4 |   5 |   6 |  74 |   8 |  10 |  11 |  10 |  13 |  14 |  NA |  NA | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   3:   0
  0  runs >=  36:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -0.339
 Average information/letter:   0.029 < .10; too small !


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -0.323
 Average information/letter:   0.006 < .10; too small !


2. SPACINGS OF C.


H2N-14
CC   at   15
  -31-C-14-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   4  (Expected range:   0-- 11)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 2   (6-10) 1   (11-20) 1   (>=21) 1

3. Clusters of amino acid multiplets (cmin = 11/30 or 15/45 or 17/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   0  (Expected range:   0--  3)
   0 +plets (f+: 3.2%), 0 -plets (f-: 8.1%)
   Total number of charge altplets: 1 (Critical number: 3)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 1

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 5)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
  41-  64        3      i..              8       8 !     0


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF15 VpV262
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:49 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.8395.seq)

SWISS-PROT ANNOTATION:
ID   ORF15
DE   ORF15, 131 bases, 89229DD9 checksum.

number of residues:  131;   molecular weight:  15.0 kdal
 
         1  MRHQLHPLTL EYTRLAVQDL LQSARTKVLR EFTCLEIKSK VKSDLKQYAT DLTEGQVYAQ 
        61  MELVGDIMVE LAKEFCEDMG IPYHYSFVER HIDEHAKSWD ASHRACDACI GAGEHALRVS 
       121  WLLHLQDDIG A

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 12( 9.2%); C  :  4( 3.1%); D  : 10( 7.6%); E  : 11( 8.4%); F  :  3( 2.3%)
G  :  6( 4.6%); H  :  8( 6.1%); I  :  6( 4.6%); K  :  7( 5.3%); L  : 16(12.2%)
M  :  4( 3.1%); N  :  0( 0.0%); P  :  2( 1.5%); Q  :  7( 5.3%); R  :  7( 5.3%)
S  :  7( 5.3%); T  :  6( 4.6%); V  :  8( 6.1%); W  :  2( 1.5%); Y  :  5( 3.8%)

KR      :   14 ( 10.7%);   ED      :   21 ( 16.0%);   AGP     :   20 ( 15.3%);
KRED    :   35 ( 26.7%);   KR-ED   :   -7 ( -5.3%);   FIKMNY  :   25 ( 19.1%);
LVIFM   :   37 ( 28.2%);   ST      :   13 (  9.9%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0+00000000 -00+0000-0 0000+0+00+ -0000-0+0+ 0+0-0+0000 -00-000000 
        61  0-000-000- 00+-00--00 00000000-+ 00--00+00- 000+00-000 000-000+00 
       121  000000--00 0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 13/45 or 15/60):  none


Negative charge clusters (cmin = 12/30 or 17/45 or 20/60):  none


Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.107  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.733  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.160  ( ED )

 Expected score/letter:  -0.840;    Average information/letter:   1.384
 Minimal length of displayed segments set to:  20

M_0.01=  9.04  (cv=  5.25, lambda=  0.92942, k=  0.34180, x=  3.79;
                90% confidence interval for segment length:   9 +-   8)
M_0.05=  7.29  (x=  2.04)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.160  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.733  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.107  ( KR )

 Expected score/letter:  -0.626;    Average information/letter:   0.690
 Minimal length of displayed segments set to:  20

M_0.01= 12.30  (cv=  7.50, lambda=  0.65013, k=  0.22823, x=  4.80;
                90% confidence interval for segment length:  17 +-  17)
M_0.05=  9.79  (x=  2.30)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.267  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.733  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.466;    Average information/letter:   0.678
 Minimal length of displayed segments set to:  20

M_0.01=  8.18  (cv=  4.83, lambda=  1.00900, k=  0.29588, x=  3.35;
                90% confidence interval for segment length:  18 +-  16)
M_0.05=  6.57  (x=  1.74)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.733  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.267  ( KEDR )

 Expected score/letter:  -1.405;    Average information/letter:   0.305
 Minimal length of displayed segments set to:  20

M_0.01= 27.55  (cv= 17.25, lambda=  0.28259, k=  0.18445, x= 10.30;
                90% confidence interval for segment length:  37 +-  23)
M_0.05= 21.78  (x=  4.53)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   7 |  26 |   8 |   9 |  12 |   9 |  11 |  13 |   8 |  10 | 
lmin1     5 |   7 |   9 |  32 |  10 |  11 |  14 |  11 |  13 |  16 |   9 |  12 | 
lmin2     6 |   8 |  10 |  35 |  11 |  13 |  16 |  13 |  15 |  18 |  10 |  13 | 
 (Significance level: 0.010000; Minimal displayed length:  6)

 (*0)  13(0,0,0); at   35-  47:   LEIKSKVKSDLKQ
      (2. quartile)               0-0+0+0+0-0+0


Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   5:   0
  0  runs >=  17:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.115;    Average information/letter:   0.639
 Minimal length of displayed segments set to:  15

M_0.01= 22.10  (cv= 13.10, lambda=  0.37211, k=  0.28637, x=  9.00;
                90% confidence interval for segment length:  19 +-  12)
M_0.05= 17.72  (x=  4.62)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.237;    Average information/letter:   0.611
 Minimal length of displayed segments set to:  15

M_0.01= 48.68  (cv= 29.58, lambda=  0.16479, k=  0.23385, x= 19.10;
                90% confidence interval for segment length:  19 +-  13)
M_0.05= 38.79  (x=  9.21);     M_0.30= 27.02  (x= -2.56)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-33-C-41-C-29
CDAC at  106
    -22-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   3  (Expected range:   0-- 17)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 2   (6-10) 0   (11-20) 1   (>=21) 1

3. Clusters of amino acid multiplets (cmin =  6/30 or  8/45 or  9/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   3  (Expected range:   0-- 11)
   0 +plets (f+: 10.7%), 3 -plets (f-: 16.0%)
   Total number of charge altplets: 3 (Critical number: 12)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 1   (6-10) 0   (11-20) 1   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
   5-  24        5      L....            4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
  30-  85        8      *.......         7       7       0
  36-  47        2      *.               6       6       0
  57-  92        6      i.....           6       6       0
  62-  81        4      -00.             5       5       0


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Location (Quartile) Spacing     Rank       P-value   Interpretation

   6-  84  (2.)     H(  78)H     1 of   9   0.0046   large  1. maximal spacing
  52-  63  (2.)     L(  11)L     2 of  17   0.9945   small  2. maximal spacing
  71- 117  (3.)     L(  46)L     1 of  17   0.0097   large  1. maximal spacing
 103- 115  (4.)     H(  12)H     2 of   9   0.9978   small  2. maximal spacing



ORF16 VpV262 weak matches to dnaJ and a phage repressor
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:50 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4818.9504.seq)

SWISS-PROT ANNOTATION:
ID   ORF16
DE   ORF16, 145 bases, 61EEA456 checksum.

number of residues:  145;   molecular weight:  16.3 kdal
 
         1  MELQTIFNMV LQHLARQDHG ASDFEHGCAY RAEGGRKCAV GYLLSESAYH DDVEGEVIGD 
        61  EHINAAVEES IGELPYGGYN VLSRLQTLHD GDYDMVCSGE EVSGMSKERR VLRMVSAVEE 
       121  LADAFDLIVP RAFYDWIKKY GYERQ

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 12( 8.3%); C  :  3( 2.1%); D  : 11( 7.6%); E  : 16(11.0%); F  :  4( 2.8%)
G  : 14( 9.7%); H  :  6( 4.1%); I  :  6( 4.1%); K  :  4( 2.8%); L  : 12( 8.3%)
M  :  5( 3.4%); N  :  3( 2.1%); P  :  2( 1.4%); Q  :  5( 3.4%); R  :  9( 6.2%)
S  :  9( 6.2%); T  :  2( 1.4%); V  : 12( 8.3%); W  :  1( 0.7%); Y  :  9( 6.2%)

KR      :   13 (  9.0%);   ED      :   27 ( 18.6%);   AGP     :   28 ( 19.3%);
KRED    :   40 ( 27.6%);   KR-ED   :  -14 ( -9.7%);   FIKMNY  :   31 ( 21.4%);
LVIFM   :   39 ( 26.9%);   ST      :   11 (  7.6%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0-00000000 00000+0-00 00-0-00000 +0-00++000 00000-0000 --0-0-000- 
        61  -000000--0 00-0000000 000+00000- 0-0-00000- -00000+-++ 00+00000-- 
       121  00-00-0000 +000-00++0 00-+0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  8/30 or 11/45 or 14/60):  none


Negative charge clusters (cmin = 14/30 or 18/45 or 23/60):  none


Mixed charge clusters (cmin = 18/30 or 24/45 or 30/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.090  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.724  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.186  ( ED )

 Expected score/letter:  -0.917;    Average information/letter:   1.718
 Minimal length of displayed segments set to:  20

M_0.01=  8.25  (cv=  4.77, lambda=  1.04276, k=  0.37920, x=  3.48;
                90% confidence interval for segment length:   7 +-   6)
M_0.05=  6.69  (x=  1.92)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.186  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.724  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.090  ( KR )

 Expected score/letter:  -0.531;    Average information/letter:   0.474
 Minimal length of displayed segments set to:  20

M_0.01= 14.56  (cv=  9.24, lambda=  0.53876, k=  0.17716, x=  5.33;
                90% confidence interval for segment length:  24 +-  24)
M_0.05= 11.54  (x=  2.30)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.276  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.724  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.448;    Average information/letter:   0.624
 Minimal length of displayed segments set to:  20

M_0.01=  8.60  (cv=  5.16, lambda=  0.96508, k=  0.27750, x=  3.44;
                90% confidence interval for segment length:  19 +-  17)
M_0.05=  6.91  (x=  1.75)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.724  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.276  ( KEDR )

 Expected score/letter:  -1.483;    Average information/letter:   0.329
 Minimal length of displayed segments set to:  20

M_0.01= 26.80  (cv= 16.77, lambda=  0.29675, k=  0.19742, x= 10.03;
                90% confidence interval for segment length:  35 +-  21)
M_0.05= 21.31  (x=  4.54)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   6 |   7 |  26 |   8 |  10 |  12 |   9 |  11 |  14 |   7 |   9 | 
lmin1     5 |   7 |   9 |  31 |   9 |  12 |  15 |  11 |  14 |  17 |   8 |  11 | 
lmin2     6 |   8 |  10 |  35 |  11 |  14 |  16 |  12 |  16 |  19 |  10 |  12 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   4:   0
  *  runs >=   5:   0
  0  runs >=  17:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.034;    Average information/letter:   0.580
 Minimal length of displayed segments set to:  15

M_0.01= 23.61  (cv= 14.21, lambda=  0.35018, k=  0.27007, x=  9.40;
                90% confidence interval for segment length:  21 +-  13)
M_0.05= 18.96  (x=  4.74)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.207;    Average information/letter:   0.585
 Minimal length of displayed segments set to:  15

M_0.01= 50.02  (cv= 30.71, lambda=  0.16206, k=  0.22976, x= 19.31;
                90% confidence interval for segment length:  20 +-  14)
M_0.05= 39.96  (x=  9.25);     M_0.30= 27.99  (x= -2.71)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-27-C-9-C-58-C-48-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[  66-  69]   AVEE
[ 117- 120]   AVEE


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  10  (Expected range:   0-- 18)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 1   (6-10) 6   (11-20) 2   (>=21) 2

3. Clusters of amino acid multiplets (cmin = 12/30 or 15/45 or 18/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   8  (Expected range:   0-- 12)
   3 +plets (f+: 9.0%), 5 -plets (f-: 18.6%)
   Total number of charge altplets: 2 (Critical number: 13)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 5   (11-20) 2   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
  20-  47        7      G......          4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
  67- 108        7      i.00.0*          6       6      /0/./2/2/./2/2/


--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


ORF17 VpV262 DNA primase
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:52 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4814.2277.seq)

SWISS-PROT ANNOTATION:
ID   ORF17
DE   ORF17, 306 bases, 96841AB3 checksum.

number of residues:  306;   molecular weight:  34.8 kdal
 
         1  MPSTSSYLVL STIGLRSTGM RDSSFDTDIL DMTTDLCIGE RSDHLVCPCC RGGDSGERSL 
        61  LVWCHADGLA YRCYRVKCGL SGKIGQSGYR PVSTKMRKPK CHTRQLHPEP LPNDVLDWYL 
       121  DYFWWADAKM LRVNGVLWDE TTERILYPIK SMTGTHEGYL ARRYDDLVLD KSNFQGGKAK 
       181  AYYNSLPTDY KMTCMMTPLK AQFDEWVVVV EDYPSAMRIN TEIPCVALSG TSIQDATLME 
       241  LVRAGKRKVC FVLDADATSK AASMVYNYGL HFEHLTFVPL FDADPKDMED EDFDDLVDTI 
       301  RRQLDV

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  : 17( 5.6%); C+ : 11( 3.6%); D++: 31(10.1%); E  : 13( 4.2%); F  :  9( 2.9%)
G  : 19( 6.2%); H  :  7( 2.3%); I- : 10( 3.3%); K  : 16( 5.2%); L  : 31(10.1%)
M  : 13( 4.2%); N  :  6( 2.0%); P  : 14( 4.6%); Q  :  6( 2.0%); R  : 19( 6.2%)
S  : 21( 6.9%); T  : 21( 6.9%); V  : 21( 6.9%); W  :  6( 2.0%); Y  : 15( 4.9%)

KR      :   35 ( 11.4%);   ED      :   44 ( 14.4%);   AGP     :   50 ( 16.3%);
KRED    :   79 ( 25.8%);   KR-ED   :   -9 ( -2.9%);   FIKMNY  :   69 ( 22.5%);
LVIFM   :   84 ( 27.5%);   ST      :   42 ( 13.7%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  0000000000 00000+0000 +-000-0-00 -000-0000- +0-0000000 +00-00-+00 
        61  000000-000 0+00+0+000 00+000000+ 0000+0++0+ 000+0000-0 000-00-000 
       121  -00000-0+0 0+000000-- 00-+00000+ 000000-000 0++0--000- +000000+0+ 
       181  00000000-0 +00000000+ 000--00000 --00000+00 0-00000000 0000-0000- 
       241  00+00+++00 000-0-000+ 0000000000 00-0000000 0-0-0+-0-- --0--00-00 
       301  ++00-0

A. CHARGE CLUSTERS.


Positive charge clusters (cmin = 10/30 or 13/45 or 16/60):  none


Negative charge clusters (cmin = 12/30 or 15/45 or 19/60):  none


Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.114  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.742  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.144  ( ED )

 Expected score/letter:  -0.801;    Average information/letter:   1.250
 Minimal length of displayed segments set to:  20

M_0.01= 10.42  (cv=  6.48, lambda=  0.88281, k=  0.32379, x=  3.93;
                90% confidence interval for segment length:  11 +-   9)
M_0.05=  8.57  (x=  2.09)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.144  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.742  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.114  ( KR )

 Expected score/letter:  -0.683;    Average information/letter:   0.855
 Minimal length of displayed segments set to:  20

M_0.01= 12.37  (cv=  7.88, lambda=  0.72609, k=  0.26078, x=  4.48;
                90% confidence interval for segment length:  15 +-  14)
M_0.05= 10.12  (x=  2.24)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.258  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.742  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.484;    Average information/letter:   0.737
 Minimal length of displayed segments set to:  20

M_0.01=  8.69  (cv=  5.42, lambda=  1.05550, k=  0.31534, x=  3.26;
                90% confidence interval for segment length:  18 +-  15)
M_0.05=  7.14  (x=  1.72)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.742  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.258  ( KEDR )

 Expected score/letter:  -1.324;    Average information/letter:   0.281
 Minimal length of displayed segments set to:  20

M_0.01= 31.94  (cv= 21.37, lambda=  0.26789, k=  0.17096, x= 10.58;
                90% confidence interval for segment length:  44 +-  28)
M_0.05= 25.86  (x=  4.49)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     5 |   5 |   7 |  30 |   9 |  10 |  13 |  11 |  11 |  15 |   6 |   8 | 
lmin1     6 |   7 |   9 |  36 |  11 |  12 |  16 |  13 |  14 |  18 |   8 |  10 | 
lmin2     7 |   8 |  11 |  40 |  12 |  13 |  17 |  15 |  16 |  20 |   9 |  11 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   1, at  246;
  -  runs >=   3:   1, at  289;
  *  runs >=   5:   0
  0  runs >=  20:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -2.137;    Average information/letter:   0.665
 Minimal length of displayed segments set to:  15

M_0.01= 23.69  (cv= 14.90, lambda=  0.38423, k=  0.29426, x=  8.79;
                90% confidence interval for segment length:  20 +-  12)
M_0.05= 19.44  (x=  4.55)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.023;    Average information/letter:   0.594
 Minimal length of displayed segments set to:  15

M_0.01= 51.80  (cv= 33.57, lambda=  0.17049, k=  0.22491, x= 18.23;
                90% confidence interval for segment length:  21 +-  15)
M_0.05= 42.24  (x=  8.67);     M_0.30= 30.87  (x= -2.70)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-36-C-9
CPC  at   47
CC   at   49   (l=   4)
  -13-C-8-C-4-C-22-C-92-C-30-C-24-C-56-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

Aligned matching blocks:


[  39-  46]   GERSDHLV
[  56-  62]   GERS_LLV

______________________________

[ 165- 168]   DDLV
[ 294- 297]   DDLV


B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:  17  (Expected range:   5-- 30)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 5   (6-10) 4   (11-20) 6   (>=21) 3

3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 16/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:  10  (Expected range:   0-- 18)
   4 +plets (f+: 11.4%), 6 -plets (f-: 14.4%)
   Total number of charge altplets: 6 (Critical number: 21)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 3   (6-10) 2   (11-20) 0   (>=21) 6

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 5)

Location        Period  Element         Copies  Core    Errors
   5-  28        6      S.....           4       4       0
  79-  90        3      G..              4       4       0
 207- 210        1      V                4       4       0


B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 9)

Location        Period  Element         Copies  Core    Errors
 264- 311        8      i.......         6       6       0
 286- 303        3      *..              6       6       0


--------------------------------------------------------------------------------
SPACING ANALYSIS.

There are no unusual spacings.


ORF18 VpV262 poor match to SIO1 gp10
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:53 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4811.2554.seq)

SWISS-PROT ANNOTATION:
ID   ORF18
DE   ORF18, 49 bases, B905AA2E checksum.

number of residues:   49;   molecular weight:   6.1 kdal
 
         1  MFDKDTVKEW NAHLMSHNAD SVGLPFEERI TWVMEWTLMW YMMEKFYGK

--------------------------------------------------------------------------------
COMPOSITIONAL ANALYSIS (extremes relative to: ECOLI.q)

A  :  2( 4.1%); C  :  0( 0.0%); D  :  3( 6.1%); E  :  5(10.2%); F  :  3( 6.1%)
G  :  2( 4.1%); H  :  2( 4.1%); I  :  1( 2.0%); K  :  4( 8.2%); L  :  3( 6.1%)
M  :  6(12.2%); N  :  2( 4.1%); P  :  1( 2.0%); Q  :  0( 0.0%); R  :  1( 2.0%)
S  :  2( 4.1%); T  :  3( 6.1%); V  :  3( 6.1%); W  :  4( 8.2%); Y  :  2( 4.1%)

KR      :    5 ( 10.2%);   ED      :    8 ( 16.3%);   AGP     :    5 ( 10.2%);
KRED    :   13 ( 26.5%);   KR-ED   :   -3 ( -6.1%);   FIKMNY  :   18 ( 36.7%);
LVIFM   :   16 ( 32.7%);   ST      :    5 ( 10.2%).

--------------------------------------------------------------------------------
CHARGE DISTRIBUTIONAL ANALYSIS
 
         1  00-+-00+-0 000000000- 000000--+0 0000-00000 000-+000+

A. CHARGE CLUSTERS.


Positive charge clusters (cmin =  9/30 or 12/45 or 15/60):  none


Negative charge clusters (cmin = 12/30 or 17/45 or 21/60):  none


Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60):  none


B. HIGH SCORING (UN)CHARGED SEGMENTS.


______________________________________
High scoring positive charge segments:

score=   2.00 frequency=   0.102  ( KR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.735  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.163  ( ED )

 Expected score/letter:  -0.857;    Average information/letter:   1.465
 Minimal length of displayed segments set to:  20

M_0.01=  7.77  (cv=  4.06, lambda=  0.95881, k=  0.35148, x=  3.71;
                90% confidence interval for segment length:   7 +-   7)
M_0.05=  6.07  (x=  2.01)

# of segments (>=20 residues) exceeding M_0.05: none


______________________________________
High scoring negative charge segments:

score=   2.00 frequency=   0.163  ( ED )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.735  ( LAGSVTIPNFQYHMCW )
score=  -2.00 frequency=   0.102  ( KR )

 Expected score/letter:  -0.612;    Average information/letter:   0.659
 Minimal length of displayed segments set to:  20

M_0.01= 10.98  (cv=  6.12, lambda=  0.63589, k=  0.22141, x=  4.86;
                90% confidence interval for segment length:  15 +-  16)
M_0.05=  8.42  (x=  2.30)

# of segments (>=20 residues) exceeding M_0.05: none


___________________________________
High scoring mixed charge segments:

score=   1.00 frequency=   0.265  ( KEDR )
score=   0.00 frequency=   0.000  ( BZX )
score=  -1.00 frequency=   0.735  ( LAGSVTIPNFQYHMCW )

 Expected score/letter:  -0.469;    Average information/letter:   0.690
 Minimal length of displayed segments set to:  20

M_0.01=  7.15  (cv=  3.82, lambda=  1.01857, k=  0.29989, x=  3.33;
                90% confidence interval for segment length:  15 +-  14)
M_0.05=  5.55  (x=  1.73)

# of segments (>=20 residues) exceeding M_0.05: none


________________________________
High scoring uncharged segments:

score=   1.00 frequency=   0.735  ( LAGSVTIPNFQYHMCW )
score=   0.00 frequency=   0.000  ( BZX )
score=  -8.00 frequency=   0.265  ( KEDR )

 Expected score/letter:  -1.388;    Average information/letter:   0.300
 Minimal length of displayed segments set to:  20

M_0.01= 24.28  (cv= 13.92, lambda=  0.27954, k=  0.18165, x= 10.35;
                90% confidence interval for segment length:  33 +-  22)
M_0.05= 18.45  (x=  4.52)

# of segments (>=20 residues) exceeding M_0.05: none


C. CHARGE RUNS AND PATTERNS.

pattern  (+)|  (-)|  (*)|  (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)|
lmin0     4 |   5 |   6 |  23 |   7 |   9 |  11 |   8 |  10 |  12 |   6 |   8 | 
lmin1     5 |   6 |   8 |  28 |   9 |  10 |  13 |  10 |  12 |  15 |   8 |   9 | 
lmin2     6 |   7 |   9 |  31 |  10 |  12 |  14 |  11 |  13 |  16 |   9 |  11 | 
 (Significance level: 0.010000; Minimal displayed length:  6)
There are no charge runs or patterns exceeding the given minimal lengths.

Run count statistics:

  +  runs >=   3:   0
  -  runs >=   3:   0
  *  runs >=   4:   0
  0  runs >=  15:   0

--------------------------------------------------------------------------------
DISTRIBUTION OF OTHER AMINO ACID TYPES

1. HIGH SCORING SEGMENTS.

__________________________________
High scoring hydrophobic segments:

   2.00 (LVIFM)   1.00 (AGYCW)   0.00 (BZX)  -2.00 (PH)  -4.00 (STNQ)
  -8.00 (KEDR)

 Expected score/letter:  -1.959;    Average information/letter:   0.547
 Minimal length of displayed segments set to:  15

M_0.01= 21.84  (cv= 11.93, lambda=  0.32626, k=  0.25539, x=  9.92;
                90% confidence interval for segment length:  19 +-  14)
M_0.05= 16.85  (x=  4.92)

# of segments (>=15 residues) exceeding M_0.05: none


____________________________________
High scoring transmembrane segments:

   5.00 (LVIF)   2.00 (AGM)   0.00 (BZX)  -1.00 (YCW)  -2.00 (ST)
  -6.00 (P)  -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED)

 Expected score/letter:  -4.163;    Average information/letter:   0.618
 Minimal length of displayed segments set to:  15

M_0.01= 39.49  (cv= 21.86, lambda=  0.17807, k=  0.23239, x= 17.64;
                90% confidence interval for segment length:  16 +-  12)
M_0.05= 30.34  (x=  8.48);     M_0.30= 19.45  (x= -2.41)

# of segments (>=15 residues) exceeding M_0.30: none


2. SPACINGS OF C.


H2N-49-COOH

--------------------------------------------------------------------------------
REPETITIVE STRUCTURES.

A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet.
Repeat core block length:  4

B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet.
   (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C)
Repeat core block length:  8

--------------------------------------------------------------------------------

MULTIPLETS.

A. AMINO ACID ALPHABET.

1. Total number of amino acid multiplets:   2  (Expected range:   0--  9)

2. Histogram of spacings between consecutive amino acid multiplets:
   (1-5) 0   (6-10) 1   (11-20) 1   (>=21) 1

3. Clusters of amino acid multiplets (cmin =  9/30 or 11/45 or 13/60):  none


B. CHARGE ALPHABET.

1. Total number of charge multiplets:   1  (Expected range:   0--  6)
   0 +plets (f+: 10.2%), 1 -plets (f-: 16.3%)
   Total number of charge altplets: 4 (Critical number: 7)

2. Histogram of spacings between consecutive charge multiplets:
   (1-5) 0   (6-10) 0   (11-20) 0   (>=21) 2

--------------------------------------------------------------------------------
PERIODICITY ANALYSIS.

A. AMINO ACID ALPHABET (core:  4; !-core: 4)

Location        Period  Element         Copies  Core    Errors

There are no periodicities of the prescribed length.

B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core:  5; !-core: 6)
   and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core:  6; !-core: 8)

Location        Period  Element         Copies  Core    Errors
  22-  49        4      i...             7       7       0


--------------------------------------------------------------------------------
SPACING ANALYSIS.

Not evaluated (sequence length < 100 aa, too short).


ORF19 VpV262 helicase
SAPS.  Version of April 11, 1996.
Date run: Mon Oct  1 23:15:55 2001



********************************************************************************
Protein    1 (File: wwwtmp/.SAPS.4810.5050.seq)

SWISS-PROT ANNOTATION:
ID   ORF19
DE   ORF19, 408 bases, 7C4D2D59 checksum.

number of residues:  408;   molecular weight:  44.9 kdal
 
         1  MESEASILSA CIADRNAYDV VMSLEANADL SPNGVILFKF ISEYYKRDKK CQSIDVDVLL 
        61  STISAQKPLA YEKLSKVVDN PPEPSPANLI TLLTEQRLKR LGAEMTAAFA AGDSAKGSAL 
       121  AAEYHDVYEM GIQTDDEDPL FEVYNDVSVA DLTQSLREGA NLSLLPDLLG DIVFNMMQGD 
       181  HVIIFGPVNA GKSAVNIQIA GDYCYDDQVV LYVGNEDPAD RM