next up previous contents
: 12 Parameter descriptions : SAM (Sequence Alignment and : 10 Related programs   Ìܼ¡


11 System installation

The SAM system runs on a variety of Unix workstations (we have checked installation on workstations including DEC DECstation and Alpha, HP 715, IBM RS6000, SGI Onyx Reality Engine, Sun Sparc, Intel Pentium with the Linux operating system, and the UCSC Kestrel parallel processor.

The distribution includes an INSTALL file that discusses installation procedures.

The gnuplot, gunzip, and uncompress programs should be in the user's path, and other programs should be available as required by SAM-T2K. See Section 4.11.


11.1 Environment variables

The SAM system has several environment variables, typically set in csh using a command of the form +setenv PRIOR_PATH /projects/compbio/lib+.

PRIOR_PATH
Directory with prior libraries, regularizers, and makelogo color definitions.. See Section 8.1.1 and Section 10.10.4.
BLASTMAT
Directory for BLAST scoring matrices. See Section 10.2.8.


11.2 Runtime statistics

At the end of each run of buildmodel, a line of statistics is printed out, such as the line


-218.36  -217.00  -217.68   0.96  22 0 149
mentioned in Section 3. These numbers are quite useful for quick comparison of results when, for example, running the program many times using a shell script. The numbers are: minimum NLL-NULL score, maximum score, average score, sample deviation of scores, number of re-estimates, number of surgeries, and the length of the final model. In the above case, the scores are for the training set: if a test set were specified (Section 7.4), the minimum, maximum, average, and sample deviation for the test set would be reported after the model length, followed by the ratio of the average test set score to the average training set score (ideally, this value should be close to unity -- larger values may indicate overfitting of the model to the training set).

11.3 Manipulating large collections of data

The underlying SAM programs were originally written to work with single models and single databases of sequences. Many current users, including UCSC, use SAM to create vast libraries of models and to analysis many independent, small files of sequences.

If this is also appropriate for your application, you will undoubtedly find yourself designing scripts, Makefiles, and databases in a manner similar to any other large project.

Several features in SAM may help you with this design

11.4 Future Features

There are many future features we would like to include in SAM. The following list will also point out some of the things you currently cannot do using the system. The items are of varying difficulty.


11.5 Prior versions

11.5.1 Version 2.2.1

11.5.2 Version 2.2

July, 1998

11.5.3 Version 2.1.2

June, 1998.

11.5.4 Version 2.1.1

April, 1998.

11.5.5 Version 2.1

February, 1998.

11.5.6 Version 2.0

November, 1997.

11.5.7 Version 1.4

August, 1996.

11.5.8 Version 1.3

May, 1996.

11.5.9 Version 1.2

March, 1996.

11.5.10 Version 1.1

November, 1995.

11.5.11 Version 1.0

January, 1995.


next up previous contents
: 12 Parameter descriptions : SAM (Sequence Alignment and : 10 Related programs   Ìܼ¡
SAM
sam-info@cse.ucsc.edu
UCSC Computational Biology Group