The GCG package of sequence analysis programs.

        GCG refers to the Wisconsin Package Version 10.3, Genetics Computer Group (GCG), Madison, Wisc. GCG is implemented under Linux on bioinf.  This is a mostly outdated package.  One should not search the internal GCG sequence databases because they are years out of date.  The seqlab graphics interface is no longer implemented at UTHSCSA.  However, many of the command line analytical utilities are still useful, particularly for those who are acustomed to using this package.  Both cpu time and disk space are to be free of charge to university researchers and students. Apply for a bioinf account here.

Current program status.

User interface.

        Our license for the web browser interface SeqWeb has been discontinued.  Hence, access it at the Linux command level.  See linux and vi help links.  To access your bioinf account, you will need a client program on your computer that does makes secure shell (shh) and secure copy (scp) connections. See help on obtaining and using ssh.

Many of the GCG routines have an option to produce .png graphics files that you can download and display on your computer.  Before running the program producing the graphics file, run the command png.  This is an alternative to running the same program under seqlab, which would automatically export the file as an X-windows graphic.
 

Starting GCG.

On request, the bioinformatics center will set up your account so that GCG starts when you log in.  Alternatively:

Documentation.

Help for GCG can be obtained by typing genmanual or genhelp at the command line.  Help on individual functions can be obtained by typing genhelp <function name> from the command line.  These help files are accessed by the lynx non graphic browser by default, so you have to work through them with arrow keys and the tab and space keys.  There is a way to redirect the system to use a better browser.
 

Typical method of use.

Functions available in GCG.

The GCG software package is somewhat dated now.  More powerful alternatives to some of its functions are indicated below where available in the UTHSCSA bioinformatics system.
  1. General utilities to edit, complement, translate sequence, and to map restriction sites or peptidase cleavage sites.
  2. Facilities to retrieve sequences from GenBank, and to reformat them from a variety of formats.
  3. Automated and manual alignment programs.
  4. Blast, Fasta, and other search programs for databases.
  5. Profile and motif construction and search facilities.
  6. Implementations Neighbor Joining for constructing phylogenetic trees.
  7. A variety of algorithms for randomizing, scrambling, or randomly mutating sequence, for use in statistical testing.
  8. Sequence assembly for sequencing projects.
  9. A primer picking program.
  10. General utilities for counting codons and determining codon preferences.
  11. RNA and DNA secondary structure prediction.
  12. General facilities for creation of displays.
  13. Prediction of protein features (helical wheel, transmembrane, secretion signals, coiled-coil regions, etc.).
  14. Screening of internal repetitive sequence.

Troubleshooting GCG.

You may find some of the following information helpful in dealing with some peculiarities of the local GCG implementation.

Last update 5/22/6 - Steve Hardies