CCL
List of Tables
Clozure CL is a fast, mature, open source Common Lisp implementation that runs on Linux, Mac OS X and BSD on either Intel x86-64 or PPC. Clozure CL was forked from Macintosh Common Lisp (MCL) in 1998 and the development has been entirely separate since. Ports to IA32 and Windows are under development.
When it was forked from MCL in 1998, the new Lisp was named OpenMCL. Recently, Clozure renamed its Lisp to Clozure CL, partly because its ancestor MCL has lately been released as open source. Clozure thought it might be confusing for users if there were two independent open-source projects with such similar names. The new name also reflects Clozure CL's current status as the flagship product of Clozure Associates.
Furthermore, the new name refers to Clozure CL's ancestry: in its early years, MCL was known as Coral Common Lisp, or "CCL". For years the package that contains most of Clozure CL's implementation-specific symbols has been named "CCL", an acronym that once stood for the name of the Lisp product. It seems fitting that "CCL" once again stands for the name of the product.
Some commands and source files may still refer to "OpenMCL" instead of Clozure CL.
Clozure CL compiles to native code and supports multithreading using native OS threads. It includes a foreign-function interface, and supports both Lisp code that calls external code, and external code that calls Lisp code. Clozure CL can create standalone executables on all supported platforms.
On Mac OS X, Clozure CL supports building GUI applications that use OS X's native Cocoa frameworks, and the OS X distributions include an IDE written with Cocoa, and distributed with complete sources.
On all supported platforms, Clozure CL can run as a command-line process, or as an inferior Emacs process using either SLIME or ILISP.
Features of Clozure CL include
Very fast compilation speed.
A fast, precise, compacting, generational garbage collector written in hand-optimized C. The sizes of the generations are fully configurable. Typically, a generation can be collected in a millisecond on modern systems.
Fast execution speed, competitive with other Common Lisp implementations on most benchmarks.
Robust and stable. Customers report that their CPU-intensive, multi-threaded applications run for extended periods on Clozure CL without difficulty.
Full native OS threads on all platforms. Threads are automatically distributed across multiple cores. The API includes support for shared memory, locking, and blocking for OS operations such as I/O.
Full Unicode support.
Full SLIME integration.
An IDE on Mac OS X, fully integrated with the Macintosh window system and User Interface standards.
Excellent debugging facilities. The names of all local variables are available in a backtrace.
A complete, mature foreign function interface, including a powerful bridge to Objective-C and Cocoa on Mac OS X.
Many extensions including: files mapped to Common Lisp vectors for fast file I/O; thread-local hash tables and streams to eliminate locking overhead; cons hashing support; and much more
Very efficient use of memory
Although it's an open-source project, available free of charge under a liberal license, Clozure CL is also a fully-supported product of Clozure Associates. Clozure continues to extend, improve, and develop Clozure CL in response to customer and user needs, and offers full support and development services for Clozure CL.
Version 1.3 is the latest stable release of Clozure CL as of April 2009.
Version 1.3 is available for seven platform configurations:
Linux on PowerPC (32-bit and 64-bit implementations)
Mac OS X on PowerPC (32-bit and 64-bit implementations)
Linux on x86 (32-bit and 64-bit implementations)
Mac OS X on x86 (32-bit and 64-bit implementations)
FreeBSD on x86 (32-bit and 64-bit implementations)
Solaris on x86 (32-bit and 64-bit implementations)
MS Windows XP and later on x86 (32-bit and 64-bit implementations)
A 64-bit version of Clozure CL requires a 64-bit processor running a 64-bit OS variant.
Additional platform-specific information is given in the following subsections.
Older versions are still available for downloading as tarballs. Version 1.0 was a stable version released in late 2005. Version 1.1 was under active development until late 2007. A final 1.1 release was never made. It was distributed as a series of development "snapshots" and CVS updates. 1.1 snapshots introduced support for x86-64 platforms, internal use of Unicode, and many other features, but were moving targets. Version 1.2 was a stable version released in April 2008.
Clozure CL requires version 2.2.13 (or later) of the Linux kernel and version 2.1.3 (or later) of the GNU C library (glibc) at a bare minimum.
Because of the nature of Linux distributions, it's difficult to give precise version number requirements. In general, a "fairly modern" (no more than 2 or three years old) kernel and C library are more likely to work well than older versions.
Clozure CL should run on FreeBSD 6.x and 7.x. FreeBSD 7 users will need to install the "compat6x" package in order to use the distributed Clozure CL kernel, which is built on a FreeBSD 6.x system.
Clozure CL runs under Mac OS X versions 10.4 and 10.5.
64-bit versions of Clozure CL require 64-bit processors (e.g., a G5 or Core 2 processor). Some early Intel-based Macintoshes used processors that don't support 64-bit operation, so the 64-bit Clozure CL will not run on them, although the 32-bit Clozure CL will.
Clozure CL hasn't been tested under Darwin proper, but Clozure CL doesn't intentionally use any Mac OS X features beyond the Darwin subset and therefore it seems likely that Clozure CL would run on Darwin versions that correspond to recent Mac OS X versions.
There two main ways to obtain Clozure CL. For Mac OS X, there are disk images that can be used to install Clozure CL in the usual Macintosh way. For other OSes, Subversion is the best way to obtain Clozure CL. Mac OS X users can also use Subversion if they prefer. Tarballs are available for those who prefer them, but if you have Subversion installed, it is simpler and more flexible to use Subversion than tarballs.
There are three popular ways to use Clozure CL: as a stand-alone double-clickable application (Mac OS X only), as a command-line application, or with EMACS and SLIME. The following sections describe these options.
If you are using Mac OS X then you can install and use Clozure CL in the usual Macintosh way. Download and mount a disk image, then drag the ccl folder to the Applications folder or wherever you wish. After that you can double-click the Clozure CL application found inside the ccl directory. The disk images are available at ftp://clozure.com/pub/release/1.3/
So that Clozure CL can locate its source code, and for other
reasons explained in
Section 4.4.2, “Predefined Logical Hosts”, you keep the
Clozure CL application
in the ccl directory. If you use a shell,
you can set the value of the
CCL_DEFAULT_DIRECTORY environment variable
to explicitly indicate the location of
the ccl directory. If you choose to do
that, then the ccl directory and the Clozure CL
application can each be in any location you find
convenient.
It is very easy to download, install, and build Clozure CL using Subversion. This is the preferred way to get either the latest, or a specific version of Clozure CL, unless you prefer the Mac Way. Subversion is a source code control system that is in wide usage. Most modern OSes come with Subversion pre-installed. A complete, buildable and runnable set of Clozure CL sources and binaries can be retrieved with a single Subversion command.
Day-to-day development of Clozure CL takes place in an area of the Subversion repository known as the trunk. At most times, the trunk is perfectly usable, but occasionally it can be unstable or totally broken. If you wish to live on the bleeding edge, the following command will fetch a copy of the trunk for Darwin x86 (both 32- and 64-bit versions):
svn co http://svn.clozure.com/publicsvn/openmcl/trunk/darwinx86/ccl
To get a trunk Clozure CL for another platform, replace "darwinx86" with one of the following names (all versions include both 32- and 64-bit binaries):
darwinx86
linuxx86
freebsdx86
solarisx86
windows
linuxppc
darwinppc
Release versions of Clozure CL are intended to be stable. While bugs will be fixed in the release branches, enhancements and new features will go into the trunk. To get the 1.3 release of Clozure CL type:
svn co http://svn.clozure.com/publicsvn/openmcl/release/1.3/darwinx86/ccl
The above command will fetch the complete sources and binaries for the Darwin x86 build of Clozure CL. To get a Clozure CL for another platform, replace "darwinx86" with one of the following names (all versions include both 32- and 64-bit binaries):
darwinx86
linuxx86
freebsdx86
solarisx86
windows
linuxppc
darwinppc
These distributions contain complete sources and binaries. They use Subversion's "externals" features to share common sources; the majority of source code is the same across all versions.
Once the checkout is complete you can build Clozure CL by
running the lisp kernel and executing
the rebuild-ccl function. For
example:
joe:ccl> ./dx86cl64
Welcome to Clozure Common Lisp Version 1.2 (DarwinX8664)!
? (rebuild-ccl :full t)
<lots of compilation output>
? (quit)
joe:ccl>
If you don't have a C compiler toolchain installed, the
rebuild-ccl will not work. Please
refer to Chapter 3, Building Clozure CL from its Source Code for
addtional details.
If svn co doesn't work, then make sure
that Subversion is installed on your system. Bring up a command
line shell and type:
shell> svn
If Subversion is installed, you will see something like:
Type 'svn help' for usage
If Subversion is not installed, you will see something like:
-bash: svn: command not found
If Subversion is not installed, you'll need to figure out how to install it on your OS. You can find information about obtaining and installing Subversion at the Subversion Packages page.
Tarballs are available at ftp://clozure.com/pub/release/1.3/. Download and extract one on your local disk. Then edit the Clozure CL shell script to set the value of CCL_DEFAULT_DIRECTORY and start up the appropriate Clozure CL kernel. See Section 2.3.1, “The ccl Shell Script” for more information about the Clozure CL shell scripts.
Sometimes it's convenient to use Clozure CL from a Unix shell command line. This is especially true when using Clozure CL as a way to run Common Lisp utilities.
Clozure CL needs to be able to find the
ccl directory in order to support features
such as require and
provide, access to foreign interface
information (see The
Interface Database) and the Lisp build process (see
Building Clozure CL from its Source
Code). Specifically, it needs to set up logical
pathname translations for the "ccl:"
logical host. If this logical host isn't defined (or isn't
defined correctly), some things might work, some things might
not, and it'll generally be hard to invoke and use Clozure CL
productively.
Clozure CL uses the value of the environment variable
CCL_DEFAULT_DIRECTORY to determine the
filesystem location of the ccl directory;
the ccl shell script is intended to provide a way to
invoke Clozure CL with that environment variable set
correctly.
There are two versions of the shell script:
"ccl/scripts/ccl" is used to invoke
32-bit implementations of Clozure CL and
"ccl/scripts/ccl64" is used to invoke
64-bit implementations.
To use the script:
Copy the script to a directory that is on your
PATH. This is often
/usr/local/bin or
~/bin. It is better to do this than to
add ccl/scripts to your
PATH, because the script needs to be edited,
and editing it in-place means that Subversion sees the script as
modified..
Edit the definition of
CCL_DEFAULT_DIRECTORY near the
beginning of the shell script so that it refers to
your ccl directory. Alternately, set
the value of the CCL_DEFAULT_DIRECTORY
environment variable in your .cshrc, .tcshrc,
.bashrc,.bash_profile, .MacOSX/environment.plist, or
wherever you usually set environment variables. If there
is an existing definition of the variable, the ccl
script will not override it. The shell script sets a local
variable (OPENMCL_KERNEL) to the
standard name of the Clozure CL kernel approprate for the
platform, as determined by 'uname -s'. You might prefer to
set this variable manually in the shell script.
Ensure that the shell script is executable, for example:
$ chmod +x
~/ccl/ccl/scripts/ccl64
This command grants execute permission to the named script. If you are using a 32-bit platform, substitute "ccl" in place of "ccl64".
The above command won't work if you are not the owner of the installed copy of Clozure CL. In that case, you can use the "sudo" command like this:
$ sudo chmod +x
~/ccl/ccl/scripts/ccl64
Give your password when prompted.
If the "sudo" command doesn't work, then you are not an administrator on the system you're using, and you don't have the appropriate "sudo" permissions. In that case you'll need to get help from the system's administrator.
Note that most people won't need both
ccl and ccl64 scripts.
You only need both if you sometimes run 32-bit Clozure CL and
sometimes run 64-bit Clozure CL. You can rename the script that
you use to whatever you want. For example, if you are on a
64-bit system, and you only use Clozure CL in 64-bit mode, then
you can rename ccl64 to
ccl so that you only need to type
"ccl" to run it.
Once this is done, it should be possible to invoke Clozure CL
by typing ccl
or ccl64 at a shell prompt:
> ccl [args ...]
Welcome to Clozure CL Version 1.2 (DarwinPPC32)!
?
The ccl shell script passes all of its arguments to the
Clozure CL kernel. See Section 2.3.2, “Invocation” for more
information about these arguments. When invoked this way, the
Lisp should be able to initialize the "ccl:"
logical host so that its translations refer to the
"ccl" directory. To test this, you can call
probe-file in Clozure CL's read-eval-print
loop:
? (probe-file "ccl:level-1;level-1.lisp") ;returns the physical pathname of the file
#P"/Users/alms/my_lisp_stuff/ccl/level-1/level-1.lisp"
By default Clozure CL tries to load the file
"home:ccl-init.lisp" or the compiled
"home:ccl-init.fasl" upon starting up.
Clozure CL does this by executing (load
"home:ccl-init"). If it's unable to load the file
(for example because the file doesn't exist), Clozure CL doesn't
signal an error or warning, it just completes its startup
normally.
On Unix systems, if "ccl-init.lisp" is not
present, Clozure CL will look for ".ccl-init.lisp"
(post 1.2 versions only).
The "home:" prefix to the filename is a
Common Lisp logical host, which Clozure CL initializes to refer to
your home directory. Clozure CL therefore looks for either of the
files
~/ccl-init.lisp or
~/ccl-init.fasl.
Because the init file is loaded the same way as normal Lisp code is, you can put anything you want in it. For example, you can change the working directory, and load packages that you use frequently.
To suppress the loading of this init-file, invoke Clozure CL with the
--no-init option.
When using Clozure CL from the command line, the following
options may be used to modify its behavior. The exact set of
Clozure CL command-line arguments may vary per platform and
slowly changes over time. The current set of command line
options may be retrieved by using the
--help option.
-h (or
--help). Provides a definitive (if
somewhat terse) summary of the command line options
accepted by the Clozure CL implementation and then
exits.
-V (or
--version). Prints the version of
Clozure CL then exits. The version string is the same value
that is returned by
LISP-IMPLEMENTATION-VERSION.
-K
character-encoding-name (or
--terminal-encoding
character-encoding-name).
Specifies the character encoding to use for
*TERMINAL-IO* (see Section 4.3.4, “Character Encodings”). Specifically, the
character-encoding-name string
is uppercased and interned in the KEYWORD package. If an
encoding named by that keyword exists,
CCL:*TERMINAL-CHARACTER-ENCODING-NAME* is set to the name
of that encoding. CCL:*TERMINAL-CHARACTER-ENCODING-NAME* defaults to NIL, which
is a synonym for :ISO-8859-1.
For example:
shell> ccl -K utf-8
has the effect of making the standard CL streams use
:UTF-8 as their character
encoding.
-n (or
--no-init). If this option is given, the
init file is not loaded. This is useful if Clozure CL is being
invoked by a shell script that should not be affected by
whatever customizations a user might have in place.
-e form
(or --eval). An expression is read (via
READ-FROM-STRING) from the string
form and evaluated. If
form contains shell metacharacters,
it may be necessary to escape or quote them to prevent the
shell from interpreting them.
-l path
(or --load
path). Loads file specified by
path.
-T n (or
--set-lisp-heap-gc-threshold
n). Sets the Lisp gc threshold to
n. (see Section 15.3, “GC Page reclamation policy”
-Q (or
--quiet). Suppresses printing of
heralds and prompts when the --batch
command line option is specified.
-R n (or
--heap-reserve). Reserves
n bytes for heap expansion. The
default is 549755813888. (see Section 15.1, “Heap space allocation”)
-S n (or
--stack-size n). Sets the size of the
initial control stack to n. (see Section 6.3.1, “Thread Stack Sizes”)
-Z n (or
--thread-stack-size
n). Sets the size of the first
thread's stack to n. (see Section 6.3.1, “Thread Stack Sizes”)
-b (or --batch). Execute in "batch mode". End-of-file
from *STANDARD-INPUT* causes Clozure CL to exit, as do attempts to
enter a break loop.
--no-sigtrap An obscure option for running under GDB.
-I
image-name (or
--image-name
image-name). Specifies the image
name for the kernel to load. Defaults to the kernel name
with ".image" appended.
The --load and
--eval options can each be provided
multiple times. They're executed in the order specified on
the command line, after the init file (if there is one) is
loaded and before the toplevel read-eval-print loop is
entered.
A very common way to use Clozure CL is to run it within the GNU Emacs editor, using a Lisp interface called SLIME ("Superior Lisp Interaction Mode for Emacs"). SLIME is an Emacs package designed to provide good support within Emacs for any of several Common Lisp implementations; one of the supported implementations is Clozure CL. This page describes how you can download SLIME and set it up to work with your Clozure CL installation.
Why use SLIME? With SLIME, you can do the following things from within an Emacs editing session:
run and control Lisp
evaluate, compile, and load files or expressions
macroexpand expressions
fetch documentation and source code for Lisp symbols
autocomplete symbols and package names
cross-reference function calls
examine stack traces and debug errors
For complete information about SLIME, see the SLIME home page. The SLIME home page provides up-to-date downloads, plus documentation, tutorials, and instructional screencasts.
In order to simplify these instructions, we'll make several assumptions about your system. Specifically, we assume:
You have a working installation of GNU Emacs. If you don't have a working copy of GNU Emacs, see the web page on obtaining Emacs. If you prefer to use XEmacs instead of GNU Emacs, these instructions should still work; SLIME supports XEmacs Version21. Mac OS X includes an Emacs installation. If you want to look into different versions, you can check out theEmacsWiki, which maintains a page, EmacsForMacOS, that provides much more information about using Emacs on the Mac.
A popular version of Emacs among Mac users is Aquamacs. This application is a version of GNU Emacs with a number of customizations meant to make it behave more like a standard Macintosh application, with windows, a menubar, etc. Aquamacs includes SLIME; if you like Aquamacs then you can use SLIME right away, without getting and installing it separately. You just need to tell SLIME where to find your installation of Clozure CL.
You have a working copy of Clozure CL, installed in
"~/ccl"If you prefer to install
Clozure CL in some directory other
than"~/ccl" then these
instructions still work, but you must remember to use your
path to your ccl directory instead of the one that we give
here.
You install emacs add-ons in the folder
"~/emacs/site/"If this directory
doesn't exist on your system, you can just create it.If
you prefer to install Emacs add-ons in some place other
than"~/emacs/site/" then you must
remember to use your path to Emacs add-ons in place of
ours.
You can get SLIME from the SLIME Home Page. Stable releases and CVS snapshots are available as archive files, or you can follow the instructions on the SLIME Home Page to check out the latest version from their CVS repository.
It's worth noting that stable SLIME releases happen very seldom, but the SLIME developers often make changes and improvements that are available through CVS updates. If you asked the SLIM developers, they would most likely recommend that you get SLIME from their CVS repository and update it frequently.
Whether you get it from CVS, or download and unpack one of the available archives, you should end up with a folder named "slime" that contains the SLIME distribution.
Once you have the "slime" folder described in the previous section, installation is a simple matter of copying the folder to the proper place. You can drag it into the "~/emacs/site/" folder, or you can use a terminal command to copy it there. For example, assuming your working directory contains the unpacked "slime" folder:
$ cp -R
slime ~/emacs/site/
That's all it takes.
Once SLIME and Clozure CL are installed, you just need to add a line to your "~/.emacs" file that tells SLIME where to find the script that runs Clozure CL:
(setq inferior-lisp-program "~/ccl/scripts/ccl64")
or
(setq inferior-lisp-program "~/ccl/scripts/ccl")
Aquamacs users should add this line to the file "~/Library/Preferences/Aquamacs Emacs/Preferences.el".
Once the preparations in the previous section are complete, exit Emacs and restart it, to ensure that it reads the changes you made in your ".emacs" file (alternatively, you could tell Emacs to reload the ".emacs" file). If all went well, you should now be ready to run Clozure CL using SLIME.
To run Clozure CL, execute the command "M-x slime". SLIME should start an Clozure CL session in a new buffer. (If you are unfamiliar with the Emacs notation "M-x command", see the GNU Emacs FAQ; specifically, take a look at questions 1, 2, and 128.)
Sometimes you'll get a new version of Clozure CL, set up Emacs to use it with SLIME, and SLIME will fail. Most likely what has happened is that the new version of Clozure CL has a change in the output files produced by the compiler (Clozure CL developers will say "the fasl version has changed." fasl stands for "fast load" aka compiled files). This problem is easy to fix: just delete the existing SLIME fasl files. The next time you launch Emacs and start SLIME, it will automatically recompile the Lisp files, and that should fix the problem.
SLIME's load process stores its fasl files in a hidden folder inside your home folder. The path is
~/.slime/fasl
You can use a shell command to remove the fasl files, or remove them using your system's file browser.
Note for Macintosh Users: The leading "." character in the ".slime" folder's name prevents the Finder from showing this folder to you. If you use the "Go To Folder" menu item in the Finder's "Go" menu, you can type in "~/.slime" and the Finder will show it to you. You can then drag the "fasl" folder to the trash.
SLIME has not been updated to account for recent changes made in Clozure CL to support x86-64 processors. You may run into bugs running on those platforms.
The SLIME backtrace sometimes shows incorrect information.
return-from-frame and
apply-in-frame do not work reliably. (If
they work at all, it's pure luck.)
Some versions of Emacs on the Macintosh may have trouble finding the shell script that runs Clozure CL unless you specify a full path to it. See the above section "Telling Emacs About SLIME" to learn how to specify the path to the shell script.
For more help with Clozure CL on Mac OS X, consult the Clozure CL mailing lists. You can find information about the mailing lists on the Clozure CL wiki.
A number (ok, a small number), of example programs are distributed in the "ccl:examples;" directory of the source distribution. See the README-OPENMCL-EXAMPLES text file in that directory for information about prerequisites and usage.
Some of the example programs are derived from C examples in textbooks, etc.; in those cases, the original author and work are cited in the source code.
Unless the original author or contributor claims other rights, you're free to incorporate any of this example code or derivative thereof in any of your own works without restriction. In doing so, you agree that the code was provided "as is", and that no other party is legally or otherwise responsible for any consequences of your decision to use it.
If you've developed Clozure CL examples that you'd like to see added to the distribution, please send mail to the Clozure CL mailing lists. Any such contributions would be welcome and appreciated (as would bug fixes and improvements to the existing examples.)
Clozure CL, like many other Lisp implementations, consists of a kernel and a heap image. The kernel is an ordinary C program, and is built with a C compiler. It provides very basic and fundamental facilities, such as memory management, garbage collection, and bootstrapping. All the higher-level features are written in Lisp, and compiled into the heap image. Both parts are needed to have a working Lisp implementation; neither the kernel nor the heap image can stand alone.
You may already know that, when you have a C compiler which is written in C, you need a working C compiler to build the compiler. Similarly, the Clozure CL heap image includes a Lisp compiler, which is written in Lisp. You therefore need a working Lisp compiler in order to build the Lisp heap image.
Where will you get a working Lisp compiler? No worries; you can use a precompiled copy of a (slightly older and compatible) version of Clozure CL. This section explains how to do all this.
In principle it should be possible to use another implementation of Common Lisp as the host compiler, rather than an older Clozure CL; this would be a challenging and experimental way to build, and is not described here.
The following terms are used in subsequent sections; it may be helpful to refer to these definitions.
fasl
files are the object files produced
by compile-file. fasl files store the
machine code associated with function definitions and the
external representation of other lisp objects in a compact,
machine-readable form. fasl is short for
“FASt
Loading”. Clozure CL uses different pathname
types (extensions) to name fasl files on different platforms;
see
Table 3.1, “Platform-specific filename conventions”
The Lisp kernel is a C program with a fair amount of platform-specific assembly language code. Its basic job is to map a lisp heap image into memory, transfer control to some compiled lisp code that the image contains, handle any exceptions that occur during the execution of that lisp code, and provide various other forms of runtime support for that code. Clozure CL uses different filenames to name the lisp kernel files on different platforms; see Table 3.1, “Platform-specific filename conventions”.
A heap
image is a file that can be quickly mapped into a
process's address space. Conceptually, it's not too different
from an executable file or shared library in the OS's native
format (ELF or Mach-O/dyld format); for historical reasons,
Clozure CL's own heap images are in their own (fairly simple)
format. The term full heap image refers to a
heap image file that contains all of the code and data that
comprise Clozure CL. Clozure CL uses different filenames to name the
standard full heap image files on different platforms; see
Table 3.1, “Platform-specific filename conventions”.
A bootstrapping image is a minimal heap image used in the process of building Clozure CL itself. The bootstrapping image contains just enough code to load the rest of Clozure CL from fasl files. It may help to think of the bootstrapping image as the egg and the full heap image as the chicken. Clozure CL uses different filenames to name the standard bootstrapping image files on different platforms; see Table 3.1, “Platform-specific filename conventions” .
Each supported platform (and possibly a few
as-yet-unsupported ones) has a uniquely named subdirectory of
ccl/lisp-kernel/; each such
contains a Makefile and may contain some auxiliary files (linker
scripts, etc.) that are used to build the lisp kernel on a
particular platform.The platform-specific name of the kernel
build directory is described in
Table 3.1, “Platform-specific filename conventions”.
Table 3.1. Platform-specific filename conventions
| Platform | kernel | full-image | boot-image | fasl extension | kernel-build directory |
|---|---|---|---|---|---|
| DarwinPPC32 | dppccl | dppccl.image | ppc-boot.image | .dfsl | darwinppc |
| LinuxPPC32 | ppccl | ppccl.image | ppc-boot | .pfsl | linuxppc |
| DarwinPPC64 | dppccl64 | dppccl64.image | ppc-boot64.image | .d64fsl | darwinppc64 |
| LinuxPPC64 | ppccl64 | ppccl64.image | ppc-boot64 | .p64fsl | linuxppc64 |
| LinuxX8664 | lx86cl64 | lx86cl64.image | x86-boot64 | .lx64fsl | linuxx8664 |
| LinuxX8632 | lx86cl | lx86cl.image | x86-boot32 | .lx32fsl | linuxx8632 |
| DarwinX8664 | dx86cl64 | dx86cl64.image | x86-boot64.image | .dx64fsl | darwinx8664 |
| DarwinX8632 | dx86cl | dx86cl.image | x86-boot32.image | .dx32fsl | darwinx8632 |
| FreeBSDX8664 | fx86cl64 | fx86cl64.image | fx86-boot64 | .fx64fsl | freebsdx8664 |
| FreeBSDX8632 | fx86cl | fx86cl.image | fx86-boot32 | .fx32fsl | freebsdx8632 |
| SolarisX64 | sx86cl64 | sx86cl64.image | sx86-boot64 | .sx64fsl | solarisx64 |
| SolarisX86 | sx86cl | sx86cl.image | sx86-boot32 | .sx32fsl | solarisx86 |
| Win64 | wx86cl64.exe | sx86cl64.image | wx86-boot64.image | .wx64fsl | win64 |
| Win32 | wx86cl.exe | wx86cl.image | wx86-boot32.image | .wx32fsl | win32 |
At a given time, there are generally two versions of Clozure CL that you might want to use (and therefore might want to build from source):
The released version
The development version, called the "trunk", which may contain both interesting new features and interesting new bugs
All versions are available for download from svn.clozure.com via the Subversion source control system.
For example, to get a released version (1.3 in this example), use a command like:
svn co http://svn.clozure.com/publicsvn/openmcl/release/1.3/xxx/ccl
To get the trunk version, use:
svn co http://svn.clozure.com/publicsvn/openmcl/trunk/xxx/ccl
Change the "xxx" to one of the following names:
darwinx86,
linuxx86,
freebsdx86,
solarisx86,
window,
linuxppc,
or
darwinppc.
In the case of released versions, there may also be tar archives available. See the Clozure CL Trac for details.
Subversion client programs are pre-installed on Mac OS X 10.5 and later and are typically either pre-installed or readily available on Linux and FreeBSD platforms. The Subversion web page contains links to subversion client programs for many platforms; users of Mac OS X 10.4 can also install Subversion clients via Fink or MacPorts.
Given that you now have everything you need, do the following in a running Clozure CL to bring your Lisp system completely up to date.
? (ccl:rebuild-ccl :full t)
That call to the function rebuild-ccl
performs the following steps:
Deletes all fasl files and other object files in the
ccl directory tree
Runs an external process that does a
make in the current platform's kernel
build directory to create a new kernel.
This step can only work if the C compiler and related
tools are installed; see Section 3.4.1, “Kernel build prerequisites”.
Does (compile-ccl t) in the running
lisp, to produce a set of fasl files from the “higher
level” lisp sources.
Does (xload-level-0 :force) in the
running lisp, to compile the lisp sources in the
“ccl:level-0;” directory into fasl files and
then create a bootstrapping image from those fasl
files.
Runs another external process, which causes the newly compiled lisp kernel to load the new bootstrapping image. The bootsrtrapping image then loads the “higher level” fasl files and a new copy of the platform's full heap image is then saved.
If all goes well, it'll all happen without user intervention and with some simple progress messages. If anything goes wrong during execution of either of the external processes, the process output is displayed as part of a lisp error message.
rebuild-ccl is essentially just a short
cut for running all the individual steps involved in rebuilding
the system. You can also execute these steps individually, as
described below.
The Lisp kernel is the executable that you run to use Lisp. It doesn't actually contain the entire Lisp implementation; rather, it loads a heap image which contains the specifics—the "library", as it might be called if this was a C program. The kernel also provides runtime support to the heap image, such as garbage collection, memory allocation, exception handling, and the OS interface.
The Lisp kernel file has different names on different
platforms. See
Table 3.1, “Platform-specific filename conventions”. On all
platforms the lisp kernel sources reside
in ccl/lisp-kernel.
This section gives directions on how to rebuild the Lisp kernel from its source code. Most Clozure CL users will rarely have to do this. You probably will only need to do it if you are attempting to port Clozure CL to a new architecture or extend or enhance its kernel in some way. As mentioned above, this step happens automatically when you do
? (rebuild-ccl :full t)
The Clozure CL kernel can be bult with the following widely available tools:
cc or gcc- the GNU C compiler
ld - the GNU linker
m4 or gm4- the GNU m4 macro processor
as - the GNU assembler (version 2.10.1 or later)
make - either GNU make or, on FreeBSD, the default BSD make program
In general, the more recent the versions of those
tools, the better; some versions of gcc 3.x on Linux have
difficulty compiling some of the kernel source code correctly
(so gcc 4.0 should be used, if possible.) On Mac OS X, the
versions of the tools distributed with Xcode should work fine;
on Linux, the versions of the tools installed with the OS (or
available through its package management system) should work
fine if they're "recent enough". On FreeBSD, the installed
version of the m4 program doesn't support
some features that the kernel build process depends on; the
GNU version of the m4 macroprocessor (called
gm4 on FreeBSD) should be installed.
In order to build the lisp kernel on Mac OS X 10.6 Snow Leopard, you must install the optional 10.4 support when installing Xcode.
The initial heap image is loaded by the Lisp kernel, and provides most of the language implementation The heap image captures the entire state of a running Lisp (except for external resources, such as open files and TCP sockets). After it is loaded, the contents of the new Lisp process's memory are exactly the same as those of the old Lisp process when the image was created.
The heap image is how we get around the fact that we can't run Lisp code until we have a working Lisp implementation, and we can't make our Lisp implementation work until we can run Lisp code. Since the heap image already contains a fully-working implementation, all we need to do is load it into memory and start using it.
If you're building a new version of Clozure CL, you need to build a new heap image.
(You might also wish to build a heap image if you have a
large program that is very complicated or time-consuming to
load, so that you will be able to load it once, save an image,
and thenceforth never have to load it again. At any time, a heap
image capturing the entire memory state of a running Lisp can be
created by calling the function
ccl:save-application.)
Creating a new Clozure CL full heap image consists of the following steps:
Using your existing Clozure CL, create a bootstrapping image
Using your existing Clozure CL, recompile your updated Clozure CL sources
Invoke Clozure CL with the bootstrapping image you just created (rather than with the existing full heap image).
When you invoke Clozure CL with the bootstrapping image, it starts up, loads all of the Clozure CL fasl files, and saves out a new full heap image. Voila. You've created a new heap image.
A few points worth noting:
There's a circular dependency between the full heap image and the bootstrapping image, in that each is used to build the other.
There are some minor implementation differences, but the environment in effect after the bootstrapping image has loaded its fasl files is essentially equivalent to the environment provided by the full heap image; the latter loads a lot faster and is easier to distribute, of course.
If the full heap image doesn't work (because of an OS compatibilty problem or other bug), it's very likely that the bootstrapping image will suffer the same problems.
Given a bootstrapping image and a set of up-to-date fasl files, the development cycle usually involves editing lisp sources (or updating those sources via cvs update), recompiling modified files, and using the bootstrapping image to produce a new heap image.
The bootstrapping image isn't provided in Clozure CL distributions. It can be built from the source code provided in distributions (using a lisp image and kernel provided in those distributions) using the procedure described below.
The bootstrapping image is built by invoking a special
utility inside a running Clozure CL heap image to load files
contained in the ccl/level-0 directory. The
bootstrapping image loads several dozen fasl files. After
it's done so, it saves a heap image via
save-application. This process is called
"cross-dumping".
Given a source distribution, a lisp kernel, and a heap image, one can produce a bootstrapping image by first invoking Clozure CL from the shell:
shell> ccl Welcome to Clozure CL .... ! ?
then calling ccl:xload-level-0 at the
lisp prompt
? (ccl:xload-level-0)
This function compiles the lisp sources in the ccl/level-0 directory if they're newer than the corresponding fasl files and then loads the resulting fasl files into a simulated lisp heap contained in data structures inside the running lisp. That simulated heap image is then written to disk.
xload-level-0 should be called
whenever your existing boot image is out-of-date with respect
to the source files in ccl:level-0;
:
? (ccl:xload-level-0 :force)
forces recompilation of the level-0 sources.
Calling:
? (ccl:compile-ccl)
at the lisp prompt compiles any fasl files that are
out-of-date with respect to the corresponding lisp sources;
(ccl:compile-ccl t) forces
recompilation. ccl:compile-ccl reloads
newly-compiled versions of some files;
ccl:xcompile-ccl is analogous, but skips
this reloading step.
Unless there are bootstrapping considerations involved, it usually doesn't matter whether these files are reloaded after they're recompiled.
Calling compile-ccl or
xcompile-ccl in an environment where fasl
files don't yet exist may produce warnings to that effect
whenever files are required during
compilation; those warnings can be safely ignored. Depending
on the maturity of the Clozure CL release, calling
compile-ccl or
xcompile-ccl may also produce several
warnings about undefined functions, etc. They should be
cleaned up at some point.
To build a full image from a bootstrapping image, just invoke the kernel with the bootstrapping image as an argument
$ cd ccl # wherever your ccl directory is $ ./KERNEL BOOT_IMAGE
Where KERNEL and
BOOT_IMAGE are the names of
the kernel and boot image appropriate to the platform you are
running on. See Table 3.1, “Platform-specific filename conventions”
That should load a few dozen fasl files (printing a message as each file is loaded.) If all of these files successfully load, the lisp will print a prompt. You should be able to do essentially everything in that environment that you can in the environment provided by a "real" heap image. If you're confident that things loaded OK, you can save that image.
? (ccl:save-application "image_name") ; Overwiting the existing heap image
Where image_name is the name of
the full heap image for your platform. See
Table 3.1, “Platform-specific filename conventions”.
If things go wrong in the early stages of the loading sequence, errors are often difficult to debug; until a fair amount of code (CLOS, the CL condition system, streams, the reader, the read-eval-print loop) is loaded, it's generally not possible for the lisp to report an error. Errors that occur during these early stages ("the cold load") sometimes cause the lisp kernel debugger (see ) to be invoked; it's primitive, but can sometimes help one to get oriented.
The Common Lisp standard allows considerable latitude in the details of an implementation, and each particular Common Lisp system has some idiosyncrasies. This chapter describes ordinary user-level features of Clozure CL, including features that may be part of the Common Lisp standard, but which may have quirks or details in the Clozure CL implementation that are not described by the standard. It also describes extensions to the standard; that is, features of Clozure CL that are not part of the Common Lisp standard at all.
Clozure CL's tracing facility is invoked by an extended version of the Common Lisp trace macro. Extensions allow tracing of methods, as well as finer control over tracing actions.
TRACE {keyword
global-value}* {spec |
(spec {keyword
local-value}*)}* [Macro]
The trace macro encapsulates the functions named by
specs, causing trace actions to take place on entry and
exit from each function. The default actions print a message on function entry and
exit. Keyword/value options
can be used to specify changes in the default behavior.
Invoking (trace) without arguments returns a list of functions being traced.
A spec is either a symbol that is the name of a function, or an
expression of the form (setf symbol), or a
specific method of a generic function in the form (:method
gf-name {qualifier}*
({specializer}*)), where a
specializer can be the name of a class or an EQL
specializer.
A spec can also be a string naming a package, or equivalently a
list (:package package-name), in order to
request that all functions in the package to be traced.
By default, whenever a traced function is entered or exited, a short message is
printed on *trace-output* showing the arguments on entry and
values on exit. Options specified as key/value pairs can be used to modify this
behavior. Options preceding the function specs apply to
all the functions being traced. Options specified along with a
spec apply to that spec only and override any
global options. The following options are supported:
If true, and if applied to a spec naming a generic
function, arranges to trace all the methods of the generic function in addition to the
generic function itself.
outside-spec
| ({outside-spec}*)
Inhibits all trace actions unless the current
invocation of the function being traced is inside one of the
outside-spec's, i.e. unless a function named by one of the
outside-spec's is currently on the stack.
outside-spec can name a function, a
method, or a package, as above.
form,
:condition form
Evaluates form whenever the function being traced is
about to be entered, and inhibits all trace actions if form
returns nil. The form may reference the lexical variable ccl::args,
which is a list of the arguments in this call. :condition is just a
synonym for :if, though if both are specified, both must return non-nil.
form
Evaluates form whenever the function being traced is
about to be entered, and inhibits the entry trace actions if
form returns nil. The form may reference the lexical variable
ccl::args, which is a list of the arguments in this call. If both
:if and :before-if are specified, both must return
non-nil in order for the before entry actions to happen.
form
Evaluates form whenever the function being traced has
just exited, and inhibits the exit trace actions if form
returns nil. The form may reference the lexical variable ccl::vals,
which is a list of values returned by this call. If both :if and
:after-if are specified, both must return non-nil in order for the
after exit actions to happen.
form
Evaluates form whenever the function being traced is
about to be entered, and prints the result before printing the standard entry message.
The form may reference the lexical variable ccl::args, which is a list
of the arguments in this call. To see multiple forms, use values:
:print-before (values (one-thing) (another-thing)).
form
Evaluates form whenever the function being traced has
just exited, and prints the result after printing the standard exit message. The form may
reference the lexical variable ccl::vals, which is a list of values
returned by this call. To see multiple forms, use values:
:print-after (values (one-thing) (another-thing)).
form
Equivalent to :print-before form :print-after form.
form
Evaluates form whenever the function being traced is
about to be entered. The form may reference the lexical variable
ccl::args, which is a list of the arguments in this call.
form
Evaluates form whenever the function being has just
exited. The form may reference the lexical variable ccl::vals, which
is a list of values returned by this call.
form
Equivalent to :eval-before form
:eval-after form.
form
Evaluates form whenever the function being traced is
about to be entered, and if the result is non-nil, enters a debugger break loop. The form
may reference the lexical variable ccl::args, which is a list of the
arguments in this call.
form
Evaluates form whenever the function being traced has
just exited, and if the result is non-nil, enters a debugger break loop. The form may
reference the lexical variable ccl::vals, which is a list of values
returned by this call.
form
Equivalent to :break-before form :break-after form.
form,
:backtrace form
Evaluates form whenever the function being traced is
about to be entered. The form may reference the lexical variable
ccl::args, which is a list of the arguments in this call. The value
returned by form is intepreted as follows:
does nothing
prints a detailed backtrace to *trace-output*.
integer)
prints the top integer frames of detailed
backtrace to *trace-output*.
integer
prints top integer frames of a terse
backtrace to *trace-output*.
prints a terse backtrace to *trace-output*.
Note that unlike with the other options, :backtrace is equivalent to :backtrace-before only, not both before and after, since it's usually not helpful to print the same backtrace both before and after the function call.
form
Evaluates form whenever the function being traced has
just exited. The form may reference the lexical variable ccl::vals,
which is a list of values returned by this call. The value returned by
form is intepreted as follows:
does nothing
prints a detailed backtrace to *trace-output*.
integer)
prints the top integer frames of detailed
backtrace to *trace-output*.
integer
prints top integer frames of a terse
backtrace to *trace-output*.
prints a terse backtrace to *trace-output*.
action
specifies the action to be taken just before the traced function is entered. action is one of:
The default, prints a short indented message showing the function name and the invocation arguments
Equivalent to :before :print :break-before t
Equivalent to :before :print :backtrace-before t
function
Any other value is interpreted as a function to call on entry instead of printing the standard entry message. It is called with its first argument being the name of the function being traced, the remaining arguments being all the arguments to the function being traced, and ccl:*trace-level* bound to the current nesting level of trace actions.
action
specifies the action to be taken just after the traced function exits. action is one of:
The default, prints a short indented message showing the function name and the returned values
Equivalent to :after :print :break-after t
Equivalent to :after :print :backtrace-after t
function
Any other value is interpreted as a function to call on exit instead of printing the standard exit message. It is called with its first argument being the name of the function being traced, the remaining arguments being all the values returned by the function being traced, and ccl:*trace-level* bound to the current nesting level of trace actions.
Variable bound to the current nesting level during execution of before and after trace actions. The default printing actions use it to determine the amount of indentation.
CCL:*TRACE-MAX-INDENT* [Variable]
The default before and after print actions will not indent by more than the value of ccl:*trace-max-indent* regardless of the current trace level.
CCL:TRACE-FUNCTION spec &key {keyword value}* [Function]
This is a functional version of the TRACE macro. spec and
keywords are as for TRACE, except that all arguments are evaluated.
CCL:*TRACE-PRINT-LEVEL* [Variable]
The default print actions bind CL:*PRINT-LEVEL* to this value while printing. Note that this rebinding is only in effect during the default entry and exit messages. It does not apply to printing of :print-before/:print-after forms or any explicit printing done by user code.
CCL:*TRACE-PRINT-LENGTH* [Variable]
The default print actions bind CL:*PRINT-LENGTH* to this value while printing. Note that this rebinding is only in effect during the default entry and exit messages. It does not apply to printing of :print-before/:print-after forms or any explicit printing done by user code.
CCL:*TRACE-BAR-FREQUENCY* [Variable]
By default, this is nil. If non-nil it should be a integer, and the default entry and exit messages will print a | instead of space every this many levels of indentation.
All characters and strings in Clozure CL fully support Unicode by
using UTF-32. There is only one CHARACTER type
and one STRING type in Clozure CL. There has been a
lot of discussion about this decision which can be found by
searching the openmcl-devel archives at http://clozure.com/pipermail/openmcl-devel/. Suffice it
to say that we decided that the simplicity and speed advantages of
only supporting UTF-32 outweigh the space disadvantage.
There is one CHARACTER type in Clozure CL.
All CHARACTERs are
BASE-CHARs. CHAR-CODE-LIMIT
is now #x110000, which means that all Unicode
characters can be directly represented. As of Unicode 5.0, only
about 100,000 of 1,114,112 possible CHAR-CODEs
are actually defined. The function CODE-CHAR
knows that certain ranges of code values (notably
#xd800-#xddff) will never be
valid character codes and will return NIL for
arguments in that range, but may return a
non-NIL value (an undefined/non-standard
CHARACTER object) for other unassigned code
values.
Clozure CL supports character names of the form
u+xxxx—where x is a
sequence of one or more hex digits. The value of the hex digits
denotes the code of the character. The +
character is optional, so #\u+0020,
#\U0020, and #\U+20 all
refer to the #\Space character.
Characters with codes in the range
#xa0-#x7ff also have
symbolic names These are the names from the Unicode standard with
spaces replaced by underscores. So
#\Greek_Capital_Letter_Epsilon can be used to
refer to the character whose CHAR-CODE is
#x395. To see the complete list of supported
character names, look just below the definition for
register-character-name in
ccl:level-1;l1-reader.lisp.
OPEN, LOAD, and
COMPILE-FILE all take an
:EXTERNAL-FORMAT keyword argument. The value
of :EXTERNAL-FORMAT can be
:DEFAULT (the default value), a line
termination keyword (see Section 4.3.3, “Line Termination Keywords”), a character encoding
keyword (see Section 4.3.4, “Character Encodings”), an
external-format object created using
CCL::MAKE-EXTERNAL-FORMAT (see make-external-format), or a plist with keys:
:DOMAIN, :CHARACTER-ENCODING
and :LINE-TERMINATION. If
argument is a plist, the result of
(APPLY #'MAKE-EXTERNAL-FORMAT
will be used.argument)
If :DEFAULT is specified, then the value
of CCL:*DEFAULT-EXTERNAL-FORMAT* is used. If
no line-termination is specified, then the value of
CCL:*DEFAULT-LINE-TERMINATION* is used, which
defaults to :UNIX. If no character encoding is
specified, then
CCL:*DEFAULT-FILE-CHARACTER-ENCODING* is used
for file streams and
CCL:*DEFAULT-SOCKET-CHARACTER-ENCODING* is used
for socket streams. The default, default character encoding is
NIL which is a synonym for
:ISO-8859-1.
Note that the set of keywords used to denote CHARACTER-ENCODINGs and the set of keywords used to denote line-termination conventions is disjoint: a keyword denotes at most a character encoding or a line termination convention, but never both.
EXTERNAL-FORMATs are objects (structures) with three read-only fields that can be accessed via the functions: EXTERNAL-FORMAT-DOMAIN, EXTERNAL-FORMAT-LINE-TERMINATION and EXTERNAL-FORMAT-CHARACTER-ENCODING.
domain---This is used to indicate where the external
format is to be used. Its value can be almost
anything. It defaults to NIL.
There are two domains that have a pre-defined meaning in
Clozure CL: :FILE indicates
encoding for a file in the file system and
:SOCKET indicates i/o to/from a
socket. The value of domain
affects the default values for
character-encoding and
line-termination.
character-encoding---A keyword that specifies the character encoding
for the external format. Section 4.3.4, “Character Encodings”. Defaults to
:DEFAULT which means if
domain is
:FILE use the value of the variable
CCL:*DEFAULT-FILE-CHARACTER-ENCODING*
and if domain is
:SOCKET, use the value of the
variable
CCL:*DEFAULT-SOCKET-CHARACTER-ENCODING*.
The initial value of both of these variables is
NIL, which means the
:ISO-8859-1 encoding.
line-termination---A keyword that indicates a line termination
keyword Section 4.3.3, “Line Termination Keywords”.
Defaults to :DEFAULT which means
use the value of the variable
CCL:*DEFAULT-LINE-TERMINATION*.
external-format---An external-format object as described above.
Line termination keywords indicate which characters are used
to indicate the end of a line. On input, the external line
termination characters are replaced by #\Newline
and on output, #\Newlines are converted to the
external line termination characters.
Table 4.1. Line Termination Keywords
| keyword | character(s) |
|---|---|
:UNIX
|
#\Linefeed
|
:MACOS
|
#\Return
|
:CR
|
#\Return
|
:CRLF
|
#\Return #\Linefeed
|
:CP/M
|
#\Return #\Linefeed
|
:MSDOS
|
#\Return #\Linefeed
|
:DOS
|
#\Return #\Linefeed
|
:WINDOWS
|
#\Return #\Linefeed
|
:INFERRED
|
see below |
:UNICODE
|
#\Line_Separator
|
:INFERRED means that a stream's
line-termination convention is determined by looking at the contents
of a file. It is only useful for FILE-STREAMs
that're open for :INPUT or
:IO. The first buffer full of data is examined,
and if a #\Return character occurs before any
#\Linefeed character, then the line termination
type is set to :MACOS, otherwise it is set to
:UNIX.
Internally, all characters and strings in Clozure CL are in UTF-32. Externally, files or socket streams may encode characters in a wide variety of ways. The International Organization for Standardization, widely known as ISO, defines many of these character encodings. Clozure CL implements some of these encodings as detailed below. These encodings are part of the specification of external formats Section 4.3.2, “External Formats”. When reading from a stream, characters are converted from the specified external character encoding to UTF-32. When writing to a stream, characters are converted from UTF-32 to the specified character encoding.
Internally, CHARACTER-ENCODINGs are objects (structures) that are named by character encoding keywords (:ISO-8859-1, :UTF-8, etc.). The structures contain attributes of the encoding and functions used to encode/decode external data, but unless you're trying to define or debug an encoding there's little reason to know much about the CHARACTER-ENCODING objects and it's usually preferable to refer to a character encoding by its name.
On output to streams with character encodings that can encode the full range of Unicode—and on input from any stream—"unencodable characters" are represented using the Unicode #\Replacement_Character (= #\U+fffd); the presence of such a character usually indicates that something got lost in translation. Either data wasn't encoded properly or there was a bug in the decoding process.
The endianness of a character encoding is sometimes
explicit, and sometimes not. For example,
:UTF-16BE indicates big-endian, but
:UTF-16 does not specify endianness. A byte
order mark is a special character that may appear at the
beginning of a stream of encoded characters to specify the
endianness of a multi-byte character encoding. (It may also be
used with UTF-8 character encodings, where it is simply used to
indicate that the encoding is UTF-8.)
Clozure CL writes a byte order mark as the first character of a file or socket stream when the endianness of the character encoding is not explicit. Clozure CL also expects a byte order mark on input from streams where the endianness is not explicit. If a byte order mark is missing from input data, that data is assumed to be in big-endian order.
A byte order mark from a UTF-8 encoded input stream is not treated specially and just appears as a normal character from the input stream. It is probably a good idea to skip over this character.
The set of character encodings supported by Clozure CL can be retrieved by calling CCL:DESCRIBE-CHARACTER-ENCODINGS.
The list of supported encodings is reproduced here. Most
encodings have aliases, e.g. the encoding named
:ISO-8859-1 can also be referred to by the
names :LATIN1 and :IBM819,
among others. Where possible, the keywordized name of an
encoding is equivalent to the preferred MIME charset name (and
the aliases are all registered IANA charset names.)
:ISO-8859-1
An 8-bit, fixed-width character encoding in which all character codes map to their Unicode equivalents. Intended to support most characters used in most Western European languages.
Clozure CL uses ISO-8859-1 encoding for
*TERMINAL-IO* and for all streams whose
EXTERNAL-FORMAT isn't explicitly specified. The default for
*TERMINAL-IO* can be set via the
-K command-line argument (see Section 2.5, “Command Line Options”).
ISO-8859-1 just covers the first 256 Unicode code points, where the first 128 code points are equivalent to US-ASCII. That should be pretty much equivalent to what earliers versions of Clozure CL did that only supported 8-bit characters, but it may not be optimal for users working in a particular locale.
Aliases: :ISO_8859-1, :LATIN1, :L1,
:IBM819, :CP819, :CSISOLATIN1
:ISO-8859-2
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in most languages used in Central/Eastern Europe.
Aliases: :ISO_8859-2, :LATIN-2, :L2,
:CSISOLATIN2
:ISO-8859-3
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in most languages used in Southern Europe.
Aliases: :ISO_8859-3, :LATIN,3 :L3,
:CSISOLATIN3
:ISO-8859-4
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in most languages used in Northern Europe.
Aliases: :ISO_8859-4, :LATIN4, :L4, :CSISOLATIN4
:ISO-8859-5
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in the Cyrillic alphabet.
Aliases: :ISO_8859-5, :CYRILLIC, :CSISOLATINCYRILLIC,
:ISO-IR-144
:ISO-8859-6
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in the Arabic alphabet.
Aliases: :ISO_8859-6, :ARABIC, :CSISOLATINARABIC,
:ISO-IR-127
:ISO-8859-7
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in the Greek alphabet.
Aliases: :ISO_8859-7, :GREEK, :GREEK8, :CSISOLATINGREEK,
:ISO-IR-126, :ELOT_928, :ECMA-118
:ISO-8859-8
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in the Hebrew alphabet.
Aliases: :ISO_8859-8, :HEBREW, :CSISOLATINHEBREW,
:ISO-IR-138
:ISO-8859-9
An 8-bit, fixed-width character encoding in which codes #x00-#xcf map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in the Turkish alphabet.
Aliases: :ISO_8859-9, :LATIN5, :CSISOLATIN5,
:ISO-IR-148
:ISO-8859-10
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in Nordic alphabets.
Aliases: :ISO_8859-10, :LATIN6, :CSISOLATIN6,
:ISO-IR-157
:ISO-8859-11
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found the Thai alphabet.
:ISO-8859-13
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in Baltic alphabets.
:ISO-8859-14
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in Celtic languages.
Aliases: :ISO_8859-14, :ISO-IR-199, :LATIN8, :L8,
:ISO-CELTIC
:ISO-8859-15
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in Western European languages (including the Euro sign and some other characters missing from ISO-8859-1.
Aliases: :ISO_8859-15, :LATIN9
:ISO-8859-16
An 8-bit, fixed-width character encoding in which codes #x00-#x9f map to their Unicode equivalents and other codes map to other Unicode character values. Intended to provide most characters found in Southeast European languages.
Aliases: :ISO_8859-16, :ISO-IR-199, :LATIN8, :L8,
:ISO-CELTIC
:MACINTOSH
An 8-bit, fixed-width character encoding in which codes #x00-#x7f map to their Unicode equivalents and other codes map to other Unicode character values. Traditionally used on Classic MacOS to encode characters used in western languages.
Aliases: :MACOS-ROMAN, :MACOSROMAN, :MAC-ROMAN,
:MACROMAN
:UCS-2
A 16-bit, fixed-length encoding in which characters with CHAR-CODEs less than #x10000 can be encoded in a single 16-bit word. The endianness of the encoded data is indicated by the endianness of a byte-order-mark character (#u+feff) prepended to the data; in the absence of such a character on input, the data is assumed to be in big-endian order.
:UCS-2BE
A 16-bit, fixed-length encoding in which characters with CHAR-CODEs less than #x10000 can be encoded in a single 16-bit big-endian word. The encoded data is implicitly big-endian; byte-order-mark characters are not interpreted on input or prepended to output.
:UCS-2LE
A 16-bit, fixed-length encoding in which characters with CHAR-CODEs less than #x10000 can be encoded in a single 16-bit little-endian word. The encoded data is implicitly little-endian; byte-order-mark characters are not interpreted on input or prepended to output.
:US-ASCII
An 7-bit, fixed-width character encoding in which all character codes map to their Unicode equivalents.
Aliases: :CSASCII, :CP63,7 :IBM637, :US,
:ISO646-US, :ASCII, :ISO-IR-6
:UTF-16
A 16-bit, variable-length encoding in which characters with CHAR-CODEs less than #x10000 can be encoded in a single 16-bit word and characters with larger codes can be encoded in a pair of 16-bit words. The endianness of the encoded data is indicated by the endianness of a byte-order-mark character (#u+feff) prepended to the data; in the absence of such a character on input, the data is assumed to be in big-endian order. Output is written in native byte-order with a leading byte-order mark.
:UTF-16BE
A 16-bit, variable-length encoding in which characters with CHAR-CODEs less than #x10000 can be encoded in a single 16-bit big-endian word and characters with larger codes can be encoded in a pair of 16-bit big-endian words. The endianness of the encoded data is implicit in the encoding; byte-order-mark characters are not interpreted on input or prepended to output.
:UTF-16LE
A 16-bit, variable-length encoding in which characters with CHAR-CODEs less than #x10000 can be encoded in a single 16-bit little-endian word and characters with larger codes can be encoded in a pair of 16-bit little-endian words. The endianness of the encoded data is implicit in the encoding; byte-order-mark characters are not interpreted on input or prepended to output.
:UTF-32
A 32-bit, fixed-length encoding in which all Unicode characters can be encoded in a single 32-bit word. The endianness of the encoded data is indicated by the endianness of a byte-order-mark character (#u+feff) prepended to the data; in the absence of such a character on input, input data is assumed to be in big-endian order. Output is written in native byte order with a leading byte-order mark.
Alias: :UTF-4
:UTF-32BE
A 32-bit, fixed-length encoding in which all Unicode characters encoded in a single 32-bit word. The encoded data is implicitly big-endian; byte-order-mark characters are not interpreted on input or prepended to output.
Alias: :UCS-4BE
:UTF-8
An 8-bit, variable-length character encoding in which characters with CHAR-CODEs in the range #x00-#x7f can be encoded in a single octet; characters with larger code values can be encoded in 2 to 4 bytes.
:UTF-32LE
A 32-bit, fixed-length encoding in which all Unicode characters can encoded in a single 32-bit word. The encoded data is implicitly little-endian; byte-order-mark characters are not interpreted on input or prepended to output.
Alias: :UCS-4LE
:Windows-31j
An 8-bit, variable-length character encoding in which character code points in the range #x00-#x7f can be encoded in a single octet; characters with larger code values can be encoded in 2 bytes.
Aliases: :CP932, :CSWINDOWS31J
:EUC-JP
An 8-bit, variable-length character encoding in which character code points in the range #x00-#x7f can be encoded in a single octet; characters with larger code values can be encoded in 2 bytes.
Alias: :EUCJP
Clozure CL provides functions to encode and decode strings to and from vectors of type (simple-array (unsigned-byte 8)).
Decodes the octets in vector (or the subsequence of it delimited by start and end) into a string according to external-format.
If string is supplied, output will be written into it. It must be large enough to hold the decoded characters. If string is not supplied, a new string will be allocated to hold the decoded characters.
Returns, as multiple values, the decoded string and the position in vector where the decoding ended.
encode-string-to-octets
string
&key
start
end
external-format
use-byte-order-mark
vector
vector-offset
Encodes string (or the substring delimited by start and end) into external-format and returns, as multiple values, a vector of octets containing the encoded data and an integer that specifies the offset into the vector where the encoded data ends.
When use-byte-order-mark is true, a byte-order mark will be included in the encoded data.
If vector is supplied, output will be written to it. It must be of type (simple-array (unsigned-byte 8)) and be large enough to hold the encoded data. If it is not supplied, the function will allocate a new vector.
If vector-offset is supplied, data will be written into the output vector starting at that offset.
Leading tilde (~) characters in physical pathname namestrings are expanded in the way that most shells do:
"~user/..." can be used to refer to an absolute pathname rooted
at the home directory of the user named "user".
"~/..." can be used to refer to an absolute pathname rooted at
the home directory of the current user.
Clozure CL sets up logical pathname translations for logical hosts: ccl and home
The CCL logical host should point to the
ccl directory. It is used for a variety of
purposes by Clozure CL including: locating Clozure CL source code,
require and provide, accessing
foreign function information, and the Clozure CL build process. It
is set to the value of the environment variable
CCL_DEFAULT_DIRECTORY, which is set by the
openmcl shell script Section 2.3.1, “The ccl Shell Script”. If
CCL_DEFAULT_DIRECTORY is not set, then it is set
to the directory containing the current heap image.
Pathname strings are treated as null-terminated strings coded in the encoding named by the value returned by the function CCL:PATHNAME-ENCODING-NAME. This value may be changed with SETF.
In release 1.2 and later, Clozure CL supports memory-mapped files. On operating systems that support memory-mapped files (including Mac OS X, Linux, and FreeBSD), the operating system can arrange for a range of virtual memory addresses to refer to the contents of an open file. As long as the file remains open, programs can read values from the file by reading addresses in the mapped range.
Using memory-mapped files may in some cases be more efficient than reading the contents of a file into a data structure in memory.
Clozure CL provides the functions CCL:MAP-FILE-TO-IVECTOR and CCL:MAP-FILE-TO-OCTET-VECTOR to support memory-mapping. These functions return vectors whose contents are the contents of memory-mapped files. Reading an element of such a vector returns data from the corresponding position in the file.
Without memory-mapped files, a common idiom for reading the contents of files might be something like this:
(let* ((stream (open pathname :direction :input :element-type '(unsigned-byte 8)))
(vector (make-array (file-size-to-vector-size stream)
:element-type '(unsigned-byte 8))))
(read-sequence vector stream))
Using a memory-mapped files has a result that is the same in that, like the above example, it returns a vector whose contents are the same as the contents of the file. It differs in that the above example creates a new vector in memory and copies the file's contents into it; using a memory-mapped file instead arranges for the vector's elements to point to the file's contents on disk directly, without copying them into memory first.
The vectors returned by CCL:MAP-FILE-TO-IVECTOR and CCL:MAP-FILE-TO-OCTET-VECTOR are read-only; any attempt to change an element of a vector returned by these functions results in a memory-access error. Clozure CL does not currently support writing data to memory-mapped files.
Vectors created by CCL:MAP-FILE-TO-IVECTOR and CCL:MAP-FILE-TO-OCTET-VECTOR are required to respect Clozure CL's limit on the total size of an array. That means that you cannot use these functions to create a vector longer than ARRAY-TOTAL-SIZE-LIMIT, even if the filesystem supports file sizes that are larger. The value of ARRAY-TOTAL-SIZE-LIMIT is (EXPT 2 24) on 32-but platforms; and (EXPT 2 56) on 64-bit platforms.
CCL:MAP-FILE-TO-IVECTOR
pathname
element-type
[Function]
The pathname of the file to be memory-mapped.
The element-type of the vector to be created. Specified as a type-specifier that names a subtype of either SIGNED-BYTE or UNSIGNED-BYTE.
The map-file-to-ivector function tries to
open the file at pathname for reading. If
successful, the function maps the file's contents to a range of
virtual addresses. If successful, it returns a read-only vector
whose element-type is given
by element-type, and whose contents are
the contents of the memory-mapped file.
The returned vector is a displaced-array whose element-type is (UPGRADED-ARRAY-ELEMENT-TYPE element-type). The target of the displaced array is a vector of type (SIMPLE-ARRAY element-type (*)) whose elements are the contents of the memory-mapped file.
Because of alignment issues, the mapped file's contents start a few bytes (4 bytes on 32-bit platforms, 8 bytes on 64-bit platforms) into the vector. The displaced array returned by CCL:MAP-FILE-TO-IVECTOR hides this overhead, but it's usually more efficient to operate on the underlying simple 1-dimensional array. Given a displaced array (like the value returned by CCL:MAP-FILE-TO-IVECTOR), the function ARRAY-DISPLACEMENT returns the underlying array and the displacement index in elements.
Currently, Clozure CL supports only read operations on memory-mapped files. If you try to change the contents of an array returned by map-file-to-ivector, Clozure CL signals a memory error.
CCL:UNMAP-IVECTOR
displaced-array
[Function]
If the argument is a displaced-array returned by map-file-to-ivector, and if it has not yet been unmapped by this function, then unmap-ivector undoes the memory mapping, closes the mapped file, and changes the displaced-array so that its target is an empty vector (of length zero).
CCL:MAP-FILE-TO-OCTET-VECTOR
pathname
[Function]
This function is a synonym for (CCL:MAP-FILE-TO-IVECTOR pathname '(UNSIGNED-BYTE 8)) It is provided as a convenience for the common case of memory-mapping a file as a vector of bytes.
CCL:UNMAP-OCTET-VECTOR
displaced-array
[Function]
This function is a synonym for (CCL:UNMAP-IVECTOR)
Clozure CL supports the definition of static variables, whose values are the same across threads, and which may not be dynamically bound. The value of a static variable is thus the same across all threads; changing the value in one thread changes it for all threads.
Attempting to dynamically rebind a static variable (for instance, by using LET, or using the variable name as a parameter in a LAMBDA form) signals an error. Static variables are shared global resources; a dynamic binding is private to a single thread.
Static variables therefore provide a simple way to share mutable state across threads. They also provide a simple way to introduce race conditions and obscure bugs into your code, since every thread reads and writes the same instance of a given static variable. You must take care, therefore, in how you change the values of static variables, and use normal multithreaded programming techniques, such as locks or semaphores, to protect against race conditions.
In Clozure CL, access to a static variable is usually faster than access to a special variable that has not been declared static.
DEFSTATIC
var
value
&key
doc-string
[Macro]
The name of the new static variable.
The initial value of the new static variable.
A documentation string that is assigned to the new variable.
Proclaims the variable special, assigns the variable the supplied value, and assigns the doc-string to the variable's VARIABLE documentation. Marks the variable static, preventing any attempt to dynamically rebind it. Any attempt to dynamically rebind var signals an error.
Clozure CL provides the
function CCL:SAVE-APPLICATION, which creates a file
containing an archived Lisp memory image.
Clozure CL consists of a small executable called the Lisp kernel, which implements the very lowest level features of the Lisp system, and an image, which contains the in-memory representation of most of the Lisp system, including functions, data structures, variables, and so on. When you start Clozure CL, you are launching the kernel, which then locates and reads an image file, restoring the archived image in memory. Once the image is fully restored, the Lisp system is running.
Using CCL:SAVE-APPLICATION, you can create a
file that contains a modified image, one that includes any changes
you've made to the running Lisp system. If you later pass your
image file to the Clozure CL kernel as a command-line parameter, it
then loads your image file instead of its default one, and Clozure CL
starts up with your modifications.
If this scenario seems to you like a convenient way to
create an application, that's just as intended. You can create an
application by modifying the running Lisp until it does what you
want, then use CCL:SAVE-APPLICATION to preserve your
changes and later load them for use.
In fact, you can go further than that. You can replace Clozure CL's toplevel function with your own, and then, when the image is loaded, the Lisp system immediately performs your tasks rather than the default tasks that make it a Lisp development system. If you save an image in which you have done this, the resulting Lisp system is your tool rather than a Lisp development system.
You can go a step further still. You can
tell CCL:SAVE-APPLICATION to prepend the Lisp kernel
to the image file. Doing this makes the resulting image into a
self-contained executable binary. When you run the resulting file,
the Lisp kernel immediately loads the attached image file and runs
your saved system. The Lisp system that starts up can have any
behavior you choose. It can be a Lisp development system, but with
your customizations; or it can immediately perform some task of
your design, making it a specialized tool rather than a general
development system.
In other words, you can develop any application you like by
interactively modifying Clozure CL until it does what you want, then
using CCL:SAVE-APPLICATION to preserve your changes
in an executable image.
On Mac OS X,
the application builder
uses CCL:SAVE-APPLICATION to create the executable
portion of the application
bundle. Double-clicking the application bundle runs
the executable image created
by CCL:SAVE-APPLICATION.
Also on Mac OS X, Clozure CL supports an object type
called MACPTR, which is the type of pointers into the
foreign (Mac OS) heap. Examples of
commonly-user MACPTR objects are Cocoa windows and
other dynamically-allocated Mac OS system objects.
Because a MACPTR object is a pointer into a
foreign heap that exists for the lifetime of the running Lisp
process, and because a saved image is used by loading it into a
brand new Lisp process, saved MACPTR objects cannot
be relied on to point to the same things when reconstituted from a
saved image. In fact, a restored MACPTR object might
point to anything at all—for example an arbitrary location
in the middle of a block of code, or a completely nonexistent
virtual address.
For that reason, CCL:SAVE-APPLICATION converts
all MACPTR objects to DEAD-MACPTR
objects when writing them to an image
file. A DEAD-MACPTR is functionally identical to
a MACPTR, except that code that operates
on MACPTR objects distinguishes them
from DEAD-MACPTR objects and can handle them
appropriately—signaling errors, for example.
As of Clozure CL 1.2, there is one exception to the conversion
of MACPTR to DEAD-MACPTR objects:
a MACPTR object that points to the address 0 is not
converted, because address 0 can always be relied upon to refer to
the same thing.
As of Clozure CL 1.2, the constant CCL:+NULL-PTR+
refers to a MACPTR object that points to address 0.
On all supported platforms, you can
use CCL:SAVE-APPLICATION to create a command-line
tool that runs the same way any command-line program
does. Alternatively, if you choose not to prepend the kernel, you
can save an image and then later run it by passing it as a
command-line parameter to the opencml
or opencml64 script.
SAVE-APPLICATION
filename
&key
toplevel-function
init-file
error-handler
application-class
clear-clos-caches
(purify t)
impurify
(mode #o644)
prepend-kernel
[Function]
The pathname of the file to be created when Clozure CL saves the application.
The function to be executed after startup is complete. The toplevel is a function of no arguments that performs whatever actions the lisp system should perform when launched with this image.
If this parameter is not supplied, Clozure CL uses its default toplevel. The default toplevel runs the read-eval-print loop.
The pathname of a Lisp file to be loaded when the image starts up. You can place initialization expressions in this file, and use it to customize the behavior of the Lisp system when it starts up.
The error-handling mode for the saved image. The
supplied value determines what happens when an error is not
handled by the saved image. Valid values
are :quit (Lisp exits with an error
message); :quit-quietly (Lisp exits without an
error message); or :listener (Lisp enters a
break loop, enabling you to debug the problem by interacting
in a listener). If you don't supply this parameter, the
saved image uses the default error handler
(:listener).
The CLOS class that represents the saved Lisp
application. Normally you don't need to supply this
parameter; CCL:SAVE-APPLICATION uses the
class CCL:LISP-DEVELOPMENT-SYSTEM. In some
cases you may choose to create a custom application class;
in that case, pass the name of the class as the value for
this parameter.
If true, ensures that CLOS caches are emptied before saving the image. Normally you don't need to supply this parameter, but if for some reason you want to ensure the CLOS caches are clear when the image starts up, you can pass any true value.
Multiple fasl files can be concatenated into a single file.
out-file--- Name of the file in which to store the concatenation.
fasl-files--- List of names of fasl files to concatenate.
:if-exists---
As for OPEN, defaults to
:error
Creates a fasl file which, when loaded, will have the same effect as loading the individual input fasl files in the specified order. The single file might be easier to distribute or install, and loading it may be at least a little faster than loading the individual files (since it avoids the overhead of opening and closing each file in succession.)
The PATHNAME-TYPE of the output file and of each input file defaults to the current platform's fasl file type (.dx64fsl or whatever.) If any of the input files has a different type/extension an error will be signaled, but it doesn't otherwise try too hard to verify that the input files are real fasl files for the current platform.
In Clozure CL, the Common Lisp types short-float and single-float are implemented as IEEE single precision values; double-float and long-float are IEEE double precision values. On 64-bit platforms, single-floats are immediate values (like fixnums and characters).
Floating-point exceptions are generally enabled and detected. By default, threads start up with overflow, division-by-zero, and invalid enabled, and the rounding mode is set to nearest. The functions SET-FPU-MODE and GET-FPU-MODE provide user control over floating-point behavior.
mode--- One of the keywords :rounding-mode, :overflow, :underflow, :division-by-zero, :invalid, :inexact.
If mode is supplied, returns the value of the corresponding control flag for the current thread.
Otherwise, returns a list of keyword/value pairs which describe the floating-point exception-enable and rounding-mode control flags for the current thread.
rounding-mode--- One of :nearest, :zero, :positive, :negative
overflow, underflow, division-by-zero, invalid, inexact --- If true, the floating-point exception is signaled. If NIL, it is masked.
As of release 1.4, Clozure CL provides a way for lisp objects to be watched so that a condition will be signaled when a thread attempts to write to the watched object. For a certain class of bugs (someone is changing this value, but I don't know who), this can be extremely helpful.
The WATCH function arranges for the specified object to be monitored for writes. This is accomplished by copying the object to its own set of virtual memory pages, which are then write-protected. This protection is enforced by the computer's memory-management hardware; the write-protection does not slow down reads at all.
When any write to the object is attempted, a WRITE-TO-WATCHED-OBJECT condition will be signaled.
When called with no arguments, WATCH returns a freshly-consed list of the objects currently being watched.
WATCH returns NIL if the object cannot be watched (typically because the object is in a static or pure memory area).
WATCH operates at a fairly low level; it is not possible to avoid the details of the internal representation of objects. Nevertheless, as a convenience, WATCHing a standard-instance, a hash-table, or a multi-dimensional or non-simple CL array will watch the underlying slot-vector, hash-table-vector, or data-vector, respectively.
WATCH can monitor any memory-allocated lisp object.
In Clozure CL, a memory-allocated object is either a cons cell or a uvector.
WATCH operates on cons cells, not lists. In order to watch a chain of cons cells, each cons cell must be watched individually. Because each watched cons cell takes up its own own virtual memory page (4 Kbytes), it's only feasible to watch relatively short lists.
If a memory-allocated object isn't a cons cell, then it is a vector-like object called a uvector. A uvector is a memory-allocated lisp object whose first word is a header that describes the object's type and the number of elements that it contains.
So, a hash table is a uvector, as is a string, a standard instance, a double-float, a CL array or vector, and so forth.
Some CL objects, like strings and other simple vectors, map in a straightforward way onto the uvector representation. It is easy to understand what happens in such cases. The uvector index corresponds directly to the vector index:
? (defvar *s* "xxxxx")
*S*
? (watch *s*)
"xxxxx"
? (setf (char *s* 3) #\o)
> Error: Write to watched uvector "xxxxx" at index 3
> Faulting instruction: (movl (% eax) (@ -5 (% r15) (% rcx)))
> While executing: SET-CHAR, in process listener(1).
> Type :POP to abort, :R for a list of available restarts.
> Type :? for other options.
In the case of more complicated objects (e.g., a hash-table, a standard-instance, a package, etc.), the elements of the uvector are like slots in a structure. It's necessary to know which one of those "slots" contains the data that will be changed when the object is written to.
As mentioned above, watch knows about arrays, hash-tables, and standard-instances, and will automatically watch the appropriate data-containing element.
An example might make this clearer.
? (defclass foo ()
(slot-a slot-b slot-c))
#<STANDARD-CLASS FOO>
? (defvar *a-foo* (make-instance 'foo))
*A-FOO*
? (watch *a-foo*)
#<SLOT-VECTOR #xDB00D>
;;; Note that WATCH has watched the internal slot-vector object
? (setf (slot-value *a-foo* 'slot-a) 'foo)
> Error: Write to watched uvector #<SLOT-VECTOR #xDB00D> at index 1
> Faulting instruction: (movq (% rsi) (@ -5 (% r8) (% rdi)))
> While executing: %MAYBE-STD-SETF-SLOT-VALUE-USING-CLASS, in process listener(1).
> Type :POP to abort, :R for a list of available restarts.
> Type :? for other options.
Looking at a backtrace would presumably show what object and slot name were written.
Note that even though the write was to slot-a, the uvector index was 1 (not 0). This is because the first element of a slot-vector is a pointer to the instance that owns the slots. We can retrieve that to look at the object that was modified:
1 > (uvref (write-to-watched-object-object *break-condition*) 0)
#<FOO #x30004113502D>
1 > (describe *)
#<FOO #x30004113502D>
Class: #<STANDARD-CLASS FOO>
Wrapper: #<CLASS-WRAPPER FOO #x300041135EBD>
Instance slots
SLOT-A: #<Unbound>
SLOT-B: #<Unbound>
SLOT-C: #<Unbound>
1 >
This condition is signaled when a watched object is written to. There are three slots of interest:
object--- The actual object that was the destination of the write.
offset--- The byte offset from the tagged object pointer to the address of the write.
instruction--- The disassembled machine instruction that attempted the write.
A few restarts are provided: one will skip over the faulting write instruction and proceed; another offers to unwatch the object and continue.
There is also an emulate restart. In some common cases, the faulting write instruction can be emulated, enabling the write to be performed without having to unwatch the object (and therefore let other threads potentially write to it). If the faulting instruction isn't recognized, the emulate restart will not be offered.
Although some care has been taken to minimize potential problems arising from watching and unwatching objects from multiple threads, there may well be subtle race conditions present that could cause bad behavior.
For example, suppose that a thread attempts to write to a watched object. This causes the operating system to generate an exception. The lisp kernel figures out what the exception is, and calls back into lisp to signal the write-to-watched-object condition and perhaps handle the error.
Now, as soon lisp code starts running again (for the callback), it's possible that some other thread could unwatch the very watched object that caused the exception, perhaps before we even have a chance to signal the condition, much less respond to it.
Having the object unwatched out from underneath a handler may at least confuse it, if not cause deeper trouble. Use caution with unwatch.
Here are a couple more examples in addition to the above examples of watching a string and a standard-instance.
? (defvar *f* (make-array '(2 3) :element-type 'double-float)) *F* ? (watch *f*) #(0.0D0 0.0D0 0.0D0 0.0D0 0.0D0 0.0D0) ;;; Note that the above vector is the underlying data-vector for the array ? (setf (aref *f* 1 2) pi) > Error: Write to watched uvector #<VECTOR 6 type DOUBLE-FLOAT, simple> at index 5 > Faulting instruction: (movq (% rax) (@ -5 (% r8) (% rdi))) > While executing: ASET, in process listener(1). > Type :POP to abort, :R for a list of available restarts. > Type :? for other options. 1 >
In this case, uvector index in the report is the row-major index of the element that was written to.
Hash tables are surprisingly complicated. The representation of a hash table includes an element called a hash-table-vector. The keys and values of the elements are stored pairwise in this vector.
One problem with trying to monitor hash tables for writes is that the underlying hash-table-vector is replaced with an entirely new one when the hash table is rehashed. A previously-watched hash-table-vector will not be the used by the hash table after rehashing, and writes to the new vector will not be caught.
? (defvar *h* (make-hash-table)) *H* ? (setf (gethash 'noise *h*) 'feep) FEEP ? (watch *h*) #<HASH-TABLE-VECTOR #xDD00D> ;;; underlying hash-table-vector ? (setf (gethash 'noise *h*) 'ding) > Error: Write to watched uvector #<HASH-TABLE-VECTOR #xDD00D> at index 35 > Faulting instruction: (lock) > (cmpxchgq (% rsi) (@ (% r8) (% rdx))) > While executing: %STORE-NODE-CONDITIONAL, in process listener(1). > Type :POP to abort, :R for a list of available restarts. > Type :? for other options. ;;; see what value is being replaced... 1 > (uvref (write-to-watched-object-object *break-condition*) 35) FEEP ;;; backtrace shows useful context 1 > :b *(1A109F8) : 0 (%STORE-NODE-CONDITIONAL ???) NIL (1A10A50) : 1 (LOCK-FREE-PUTHASH NOISE #<HASH-TABLE :TEST EQL size 1/60 #x30004117D47D> DING) 653 (1A10AC8) : 2 (CALL-CHECK-REGS PUTHASH NOISE #<HASH-TABLE :TEST EQL size 1/60 #x30004117D47D> DING) 229 (1A10B00) : 3 (TOPLEVEL-EVAL (SETF (GETHASH # *H*) 'DING) NIL) 709 ...
As previously mentioned, WATCH only watches individual cons cells.
? (defun watch-list (list)
(maplist #'watch list))
WATCH-LIST
? (defvar *l* (list 1 2 3))
*L*
? (watch-list *l*)
((1 2 3) (2 3) (3))
? (setf (nth 2 *l*) 'foo)
> Error: Write to the CAR of watched cons cell (3)
> Faulting instruction: (movq (% rsi) (@ 5 (% rdi)))
> While executing: %SETNTH, in process listener(1).
> Type :POP to abort, :R for a list of available restarts.
> Type :? for other options.
In Clozure CL 1.4 and later, code coverage provides information about which paths through generated code have been executed and which haven't. For each source form, it can report one of three possible outcomes:
Not covered: this form was never entered.
Partly covered: This form was entered, and some parts were executed and some weren't.
Fully covered: Every bit of code generated from this form was executed.
While the information gathered for coverage of generated code is complete and precise, the mapping back to source forms is of necessity heuristic, and depends a great deal on the behavior of macros and the path of the source forms through compiler transforms. Source information is not recorded for variables, which further limits the source mapping. In practice, there is often enough information scattered about a partially covered function to figure out which logical path through the code was taken and which wasn't. If that doesn't work, you can try disassembling to see which parts of the compiled code were not executed: in the disassembled code there will be references to #<CODE-NOTE [xxx] ...> where xxx is NIL if the code that follows was never executed and non-NIL if it was.
Sometimes the situation can be improved by modifying macros to try to preserve more of the input forms, rather than destructuring and rebuilding them.
Because the code coverage information is associated with compiled functions, load-time toplevel expressions do not get reported on. You can work around this by creating a function and calling it. I.e. instead of
(progn (do-this) (setq that ...) ...))
do:
(defun init-this-and-that () (do-this) (setq that ...) ...) (init-this-and-that)
Then you can see the coverage information in the definition of init-this-and-that.
In order to gather code coverage information, you first have to
recompile all your code to include code coverage
instrumentation. Compiling files will generate code coverage
instrumentation if CCL:*COMPILE-CODE-COVERAGE*
is true:
(setq ccl:*compile-code-coverage* t) (recompile-all-your-files)
The compilation process will be many times slower than normal, and the fasl files will be many times bigger.
When you execute function loaded from instrumented fasl files, they will record coverage information every time they are executed. The system keeps track of which instrumented files have been loaded.
The following functions can be used to manage the coverage data:
html--- If non-nil, this will generate an HTML report, consisting of an index file and one html file for each instrumented source file that has been loaded in the current session. The individual source file reports are stored in the same directory as the index file.
external-format--- Controls the external format of the html files.
statistics--- If :statistics is non-nil, a comma-separated file is also generated with the summary of statistics. You can specify a filename for the statistics argument, otherwise "statistics.csv" is created in the output directory. See documentation of ccl:coverage-statistics below for a description of the values in the statistics file.
Restores the coverage data previously saved with CCL:SAVE-COVERAGE-IN-FILE, for the set of instrumented fasls that were loaded both at save and restore time. I.e. coverage info is only restored for files that have been loaded in this session. For example if in a previous session you had loaded "foo.lx86fsl" and then saved the coverage info, in this session you must load the same "foo.lx86fsl" before calling ccl:restore-coverage-from-file in order to retrieve the stored coverage info for "foo". Equivalent to (ccl:restore-coverage (ccl:read-coverage-from-file pathname)).
Returns a sequence ccl:coverage-statistics objects, one for each source file, containing the same information as that written to the statistics file by ccl:report-coverage. The following accessors are defined for ccl:coverage-statistics objects:
the name of the source file corresponding to this information
the total number of expressions
the number of source expressions that have been entered (i.e. at least partially covered)
the number of source expressions that were fully covered
the number of conditionals with one branch taken and one not taken
the total number of code forms. A code form is an expression in the final stage of compilation, after all macroexpansion and compiler transforms and simplification
the number of code forms that have been entered
the total number of functions
the number of functions that were fully covered
the number of functions that were partly covered
the number of functions never entered
Clozure CL ships with the complete source code for an integrated development environment written using Cocoa on Mac OS X. This chapter describes how to build and use that environment, referred to hereafter simply as "the IDE".
The IDE provides a programmable text editor, listener windows, an inspector for Lisp data structures, and a means of easily building a Cocoa application in Lisp. In addition, its source code provides an example of a fairly complex Cocoa application written in Lisp.
The current version of the IDE has seen the addition of numerous features and many bugfixes. Although it's by no means a finished product, we hope it will prove more useful than previous versions, and we plan additional work on the IDE for future releases.
Building the Clozure CL IDE is now a very simple process.
In a shell session, cd to the ccl directory.
Run ccl from the shell. The easiest way to do this is generally to execute the ccl or ccl64 command.
Evaluate the form (require :cocoa-application)
For example, assuming that the Clozure CL distribution is installed in "/usr/local/ccl", the following sequence of shell interactions builds the IDE:
oshirion:ccl mikel$ ccl64
Welcome to Clozure Common Lisp Version 1.2-r9198M-trunk (DarwinX8664)!
? (require :cocoa-application)
;Loading #P"ccl:cocoa-ide;fasls;cocoa-utils.dx64fsl.newest"...
;Loading #P"ccl:cocoa-ide;fasls;cocoa-defaults.dx64fsl.newest"...
[...many lines of "Compiling" and "Loading" omitted...]
Saving application to /usr/local/ccl/Clozure CL.app/
oshirion:ccl mikel$
Clozure CL compiles and loads the various subsystems that make up the IDE, then constructs a Cocoa application bundle named "Clozure CL.app" and saves the Lisp image into it. Normally Clozure CL creates the application bundle in the root directory of the Clozure CL distribution.
After it has been built, you can run the "Clozure CL.app" application normally, by double-clicking its icon. When launched, the IDE initially displays a single listener window that you can use to interact with Lisp. You can type Lisp expressions for evaluation at the prompt in the listener window. You can also use Hemlock editing commands to edit the text of expressions in the listener window.
You can open an editor window either by choosing Open from
the File menu and then selecting a text file, or by choosing
New from the File menu. You can also evaluate the
expression (ed) in the listener window; in that
case Clozure CL creates a new window as if you had chosen New from
the File menu.
Editor windows implement Hemlock editing commands. You can use all the editing and customization features of Hemlock within any editor window (including listener windows).
The Lisp menu provides several commands for interacting with the running Lisp session, in addition to the ways you can interact with it by evaluating expressions. You can evaluate a selected range of text in any editing buffer. You can compile and load the contents of editor windows (please note that in the current version, Clozure CL compiles and loads the contents of the file associated with an editor window; that means that if you try to load or compile a window that has not been saved to a file, the result is an error).
You can interrupt computations, trigger breaks, and select restarts from the Lisp menu. You can also display a backtrace or open the Inspector window.
At the bottom of the Lisp menu is an item entitled "Check for Updates". If your copy of Clozure CL came from the Clozure Subversion server (which is the preferred source), and if your internet connection is working, then you can select this menu item to check for updates to your copy of Clozure CL.
When you select "Check for Updates", Clozure CL uses the svn program to query the Clozure Subversion repository and determine whether new updates to Clozure CL are available. (This means that on Mac OS X versions earlier than 10.5, you must ensure that the Subversion client software is installed before using the "Check for Updates" feature. See the wikiHow page on installing Subversion for more information.) If updates are available, Clozure CL automatically downloads and installs them. After a successful download, Clozure CL rebuilds itself, and then rebuilds the IDE on the newly-rebuilt Lisp. Once this process is finished, you should quit the running IDE and start the newly built one (which will be in the same place that the old one was).
Normally, Clozure CL can install updates and rebuild itself without any problems. Occasionally, an unforeseen problem (such as a network outage, or a hardware failure) might interrupt the self-rebuilding process, and leave your copy of Clozure CL unusable. If you are expecting to update your copy of Clozure CL frequently, it might be prudent to keep a backup copy of your working environment ready in case of such situtations. You can also always obtain a full, fresh copy of Clozure CL from Clozure's repository..
The tools menu provides access to the Apropos and Processes windows. The Apropos window searches the running Lisp image for symbols that match any text you enter. You can use the Apropos window to quickly find function names and other useful symbols. The Processes window lists all threads running in the current Lisp session. If you double-click a process entry, Clozure CL opens an Inspector window on that process.
The Inspector window displays information about a Lisp value. The information displayed varies from the very simple, in the case of a simple data value such as a character, to the complex, in the case of structured data such as lists or CLOS objects. The left-hand column of the window's display shows the names of the object's attributes; the righthand column shows the values associated with those attributes. You can inspect the values in the righthand column by double-clicking them.
Inspecting a value in the righthand column changes the Inspector window to display the double-clicked object. You can quickly navigate the fields of structured data this way, inspecting objects and the objects that they refer to. Navigation buttons at the top left of the window enable you to retrace your steps, backing up to return to previously-viewed objects, and going forward again to objects you navigated into previously.
You can change the contents of a structured object by evaluating expressions in a listener window. The refresh button (marked with a curved arrow) updates the display of the Inspector window, enabling you to quickly see the results of changing a data structure.
Clozure CL builds the IDE from sources in the "objc-bridge" and "cocoa-ide" directories in the Clozure CL distribution. The IDE as a whole is a relatively complicated application, and is probably not the best place to look when you are first trying to understand how to build Cocoa applications. For that, you might benefit more from the examples in the "examples/cocoa/" directory. Once you are familiar with those examples, though, and have some experience building your own application features using Cocoa and the Objective-C bridge, you might browse through the IDE sources to see how it implements its features.
The search path for Clozure CL's REQUIRE feature
includes the "objc-bridge" and "cocoa-ide" directories. You can
load features defined in these directories by
using REQUIRE. For example, if you want to use the
Cocoa features of Clozure CL from a terminal session (or from an Emacs
session using SLIME or ILISP), you can evaluate (require
:cocoa).
One important feature of the IDE currently has no Cocoa user interface: the application builder. The application builder constructs a Cocoa application bundle that runs a Lisp image when double-clicked. You can use the application builder to create Cocoa applications in Lisp. These applications are exactly like Cocoa applications created with XCode and Objective-C, except that they are written in Lisp.
To make the application builder available, evaluate the
expression (require :build-application). Clozure CL loads
the required subsystems, if necessary.
BUILD-APPLICATION &key
(name "MyApplication")
(type-string "APPL")
(creator-string "OMCL")
(directory (current-directory))
(copy-ide-resources t)
(info-plist NIL)
(nibfiles NIL)
(main-nib-name NIL)
(application-class 'GUI::COCOA-APPLICATION)
(toplevel-function NIL)
[Function]
The build-application function constructs an application bundle, populates it with the files needed to satisfy Mac OS X that the bundle is a launchable application, and saves an executable Lisp image to the proper subdirectory of the bundle. Assuming that the saved Lisp image contains correct code, a user can subsequently launch the resulting Cocoa application by double-clicking its icon in the Finder, and the saved Lisp environment runs.
The keyword arguments control various aspects of application
bundle as BUILD-APPLICATION builds it.
Specifies the application name of the
bundle. BUILD-APPLICATION creates an application
bundle whose name is given by this parameter, with the
extension ".app" appended. For example, using the default
value for this parameter results in a bundle named
"MyApplication.app".
Specifies type of bundle to create. You should normally never need to change the default value, which Mac OS X uses to identify application bundles.
Specifies the creator code, which uniquely identifies the application under Mac OS X. The default creator code is that of Clozure CL. For more information about reserving and assigning creator codes, see Apple's developer page on the topic.
The directory in which BUILD-APPLICATION
creates the application bundle. By default, it creates the
bundle in the current working directory. Unless you
use CURRENT-DIRECTORY to set the working
directory, the bundle may be created in some unexpected place,
so it's safest to specify a full pathname for this argument. A
typical value might be "/Users/foo/Desktop/"
(assuming, of course, that your username is "foo").
Whether to copy the resource files from the IDE's
application bundle. By
default, BUILD-APPLICATION copies nibfiles
and other resources from the IDE to the newly-created
application bundle. This option is often useful when you
are developing a new application, because it enables your
built application to have a fully-functional user
interface even before you have finished designing one. By
default, the application uses the application menu and
other UI elements of the IDE until you specify
otherwise. Once your application's UI is fully
implemented, you may choose to pass NIL
for the value of this parameter, in which case the IDE
resources are not copied into your application
bundle.
A user-supplied NSDictionary object that defines the
contents of the Info.plist file to be written to the
application bundle. The default value
is NIL, which specifies that the
Info.plist from the IDE is to be used
if copy-ide-resources is true,
and a new dictionary created with default values is to be
used otherwise. You can create a suitable NSDictionary
object using the
function make-info-dict. For details on
the parameters to this function, see its definition in
"ccl/cocoa-ide/builder-utilities.lisp".
A list of pathnames, where each pathname identifies
a nibfile created
with
Apple's InterfaceBuilder
application. BUILD-APPLICATION copies each
nibfile into the appropriate place in the application bundle,
enabling the application to load user-interface elements from
them as-needed. It is safest to provide full pathnames to the
nibfiles in the list. Each nibfile must be in ".nib" format,
not ".xib" format, in order that the application can load
it.
The name of the nibfile to load initially when launching. The user-interface defined in this nibfile becomes the application's main interface. You must supply the name of a suitable nibfile for this parameter, or the resulting application uses the Clozure CL user interface.
The name of the application's CLOS class. The default value is the class provided by Clozure CL for graphical applications. Supply the name of your application class if you implement one. If not, Clozure CL uses the default class.
The toplevel function that runs when the application
launches. Normally the default value, which is Clozure CL's
toplevel, works well, but in some cases you may wish to
customize the behavior of the application's toplevel. The best
source of information about writing your own toplevel is the
Clozure CL source code, especially the implementations
of TOPLEVEL-FUNCTION in
"ccl/level-1/l1-application.lisp"
BUILD-APPLICATION creates a folder named
"name.app" in the
directory directory. Inside that
folder, it creates the "Contents" folder that Mac OS X
application bundles are expected to contain, and populates it
with the "MacOS" and "Resources" folders, and the "Info.plist"
and "PkgInfo" files that must be present in a working
application bundle. It takes the contents of the "Info.plist"
and "PkgInfo" files from the parameters
to BUILD-APPLICATION. If copy-ide-resources
is true then it copies the contents of the "Resources" folder
from the "Resources" folder of the running IDE.
The work needed to produce a running Cocoa application is
very minimal. In fact, if you
supply BUILD-APPLICATION with a valid nibfile and
pathnames, it builds a running Cocoa application that displays
your UI. It doesn't need you to write any code at all to do
this. Of course, the resulting application doesn't do anything
apart from displaying the UI defined in the nibfile. If you want
your UI to accomplish anything, you need to write the code to
handle its events. But the path to a running application with your
UI in it is very short indeed.
Please note that BUILD-APPLICATION is a work in
progress. It can easily build a working Cocoa application, but it
still has limitations that may in some cases prove
inconvenient. For example, in the current version it provides no
easy way to specify an application delegate different from the
default. If you find the current limitations
of BUILD-APPLICATION too restrictive, and want to try
extending it for your use, you can find the source code for it in
"ccl/cocoa-ide/build-application.lisp". You can see the default
values used to populate the "Info.plist" file in
"ccl/cocoa-ide/builder-utilities.lisp".
For more information on how to
use BUILD-APPLICATION, see the Currency Converter
example in "ccl/examples/cocoa/currency-converter/".
It's possible to automate use of the application builder
by running a call to CCL:BUILD-APPLICATION
from the terminal command line. For example, the following
command, entered at a shell prompt in Mac OS X's Terminal
window, builds a working copy of the Clozure CL environment called
"Foo.app":
ccl -b -e "(require :cocoa)" -e "(require :build-application)" -e "(ccl::build-application :name \"Foo\")"
You can use the same method to automate building your
Lisp/Cocoa applications. Clozure CL handles each Lisp expressions
passed with a -e argument in order, so you
can simply evaluate a sequence of Lisp expressions as in the
above example to build your application, ending with a call
to CCL:BUILD-APPLICATION. The call
to CCL:BUILD-APPLICATION can process all the
same arguments as if you evaluated it in a Listener window in
the Clozure CL IDE.
Building a substantial Cocoa application (rather than just reproducing the Lisp environment using defaults, as is done in the above example) is likely to involve a relatively complicated sequence of loading source files and perhaps evaluating Lisp forms. You might be best served to place your command line in a shell script that you can more easily edit and test.
One potentially complicated issue concerns loading all
your Lisp source files in the right order. You might consider
using ASDF to define and load a system that includes all the
parts of your application before
calling CCL:BUILD-APPLICATION. ASDF is a
"another system-definition facility", a sort
of make for Lisp, and is included in the
Clozure CL distribution. You can read more about ASDF at the ASDF
home
page.
Alternatively, you could use the standard features of Common Lisp to load your application's files in the proper order.
Clozure CL provides facilities which enable multiple threads of execution (threads, sometimes called lightweight processes or just processes, though the latter term shouldn't be confused with the OS's notion of a process) within a lisp session. This document describes those facilities and issues related to multithreaded programming in Clozure CL.
Wherever possible, I'll try to use the term "thread" to denote a lisp thread, even though many of the functions in the API have the word "process" in their name. A lisp-process is a lisp object (of type CCL:PROCESS) which is used to control and communicate with an underlying native thread. Sometimes, the distinction between these two (quite different) objects can be blurred; other times, it's important to maintain.
Lisp threads share the same address space, but maintain their own execution context (stacks and registers) and their own dynamic binding context.
Traditionally, Clozure CL's threads have been cooperatively scheduled: through a combination of compiler and runtime support, the currently executing lisp thread arranged to be interrupted at certain discrete points in its execution (typically on entry to a function and at the beginning of any looping construct). This interrupt occurred several dozen times per second; in response, a handler function might observe that the current thread had used up its time slice and another function (the lisp scheduler) would be called to find some other thread that was in a runnable state, suspend execution of the current thread, and resume execution of the newly executed thread. The process of switching contexts between the outgoing and incoming threads happened in some mixture of Lisp and assembly language code; as far as the OS was concerned, there was one native thread running in the Lisp image and its stack pointer and other registers just happened to change from time to time.
Under Clozure CL's cooperative scheduling model, it was possible (via the use of the CCL:WITHOUT-INTERRUPTS construct) to defer handling of the periodic interrupt that invoked the lisp scheduler; it was not uncommon to use WITHOUT-INTERRUPTS to gain safe, exclusive access to global data structures. In some code (including much of Clozure CL itself) this idiom was very common: it was (justifiably) believed to be an efficient way of inhibiting the execution of other threads for a short period of time.
The timer interrupt that drove the cooperative scheduler was only able to (pseudo-)preempt lisp code: if any thread called a blocking OS I/O function, no other thread could be scheduled until that thread resumed execution of lisp code. Lisp library functions were generally attuned to this constraint, and did a complicated mixture of polling and "timed blocking" in an attempt to work around it. Needless to say, this code is complicated and less efficient than it might be; it meant that the lisp was a little busier than it should have been when it was "doing nothing" (waiting for I/O to be possible.)
For a variety of reasons - better utilization of CPU resources on single and multiprocessor systems and better integration with the OS in general - threads in Clozure CL 0.14 and later are preemptively scheduled. In this model, lisp threads are native threads and all scheduling decisions involving them are made by the OS kernel. (Those decisions might involve scheduling multiple lisp threads simultaneously on multiple processors on SMP systems.) This change has a number of subtle effects:
it is possible for two (or more) lisp threads to be executing simultaneously, possibly trying to access and/or modify the same data structures. Such access really should have been coordinated through the use of synchronization objects regardless of the scheduling modeling effect; preemptively scheduled threads increase the chance of things going wrong at the wrong time and do not offer lightweight alternatives to the use of those synchronization objects.
even on a single-processor system, a context switch can happen on any instruction boundary. Since (in general) other threads might allocate memory, this means that a GC can effectively take place at any instruction boundary. That's mostly an issue for the compiler and runtime system to be aware of, but it means that certain practices(such as trying to pass the address of a lisp object to foreign code)that were always discouraged are now discouraged ... vehemently.
there is no simple and efficient way to "inhibit the scheduler"or otherwise gain exclusive access to the entire CPU.
There are a variety of simple and efficient ways to synchronize access to particular data structures.
As a broad generalization: code that's been aggressively tuned to the constraints of the cooperative scheduler may need to be redesigned to work well with the preemptive scheduler (and code written to run under Clozure CL's interface to the native scheduler may be less portable to other CL implementations, many of which offer a cooperative scheduler and an API similar to Clozure CL (< 0.14) 's.) At the same time, there's a large overlap in functionality in the two scheduling models, and it'll hopefully be possible to write interesting and useful MP code that's largely independent of the underlying scheduling details.
The keyword :OPENMCL-NATIVE-THREADS is on *FEATURES* in 0.14 and later and can be used for conditionalization where required.
Much of the functionality described above is similar to that provided by Clozure CL's cooperative scheduler, some other parts of which make no sense in a native threads implementation.
PROCESS-RUN-REASONS and PROCESS-ARREST-REASONS were SETFable process attributes; each was just a list of arbitrary tokens. A thread was eligible for scheduling (roughly equivalent to being "enabled") if its arrest-reasons list was empty and its run-reasons list was not. I don't think that it's appropriate to encourage a programming style in which otherwise runnable threads are enabled and disabled on a regular basis (it's preferable for threads to wait for some sort of synchronization event to occur if they can't occupy their time productively.)
There were a number of primitives for maintaining process queues;that's now the OS's job.
Cooperative threads were based on coroutining primitives associated with objects of type STACK-GROUP. STACK-GROUPs no longerexist.
When you use MAKE-PROCESS to create a thread, you can specify a stack size. Clozure CL does not impose a limit on the stack size you choose, but there is some evidence that choosing a stack size larger than the operating system's limit can cause excessive paging activity, at least on some operating systems.
The maximum stack size is operating-system-dependent. You can use shell commands to determine what it is on your platform. In bash, use "ulimit -s -H" to find the limit; in tcsh, use "limit -h s".
This issue does not affect programs that create threads using the default stack size, which you can do either by specifying no value for the :stack-size argument to MAKE-PROCESS, or by specifying the value CCL::*default-control-stack-size*.
If your program creates threads with a specified stack size, and that size is larger than the OS-specified limit, you may want to consider reducing the stack size in order to avoid possible excessive paging activity.
It's not clear that exposing PROCESS-SUSPEND/PROCESS-RESUME is a good idea: it's not clear that they offer ways to win, and it's clear that they offer ways to lose.
It has traditionally been possible to reset and enable a process that's "exhausted" . (As used here, the term"exhausted" means that the process's initial function has run and returned and the underlying native thread has been deallocated.) One of the principal uses of PROCESS-RESET is to "recycle" threads; enabling an exhausted process involves creating a new native thread (and stacks and synchronization objects and ...),and this is the sort of overhead that such a recycling scheme is seeking to avoid. It might be worth trying to tighten things up and declare that it's an error to apply PROCESS-ENABLE to an exhausted thread (and to make PROCESS-ENABLE detect this error.)
When native threads that aren't created by Clozure CL first call into lisp, a "foreign process" is created, and that process is given its own set of initial bindings and set up to look mostly like a process that had been created by MAKE-PROCESS. The life cycle of a foreign process is certainly different from that of a lisp-created one: it doesn't make sense to reset/preset/enable a foreign process, and attempts to perform these operations should be detected and treated as errors.
Older versions of Clozure CL used what are often called "user-mode threads", a less versatile threading model which does not require specific support from the operating system. This section discusses how to port code which was written for that mode.
It's hard to give step-by-step instructions; there are certainly a few things that one should look at carefully:
It's wise to be suspicious of most uses of WITHOUT-INTERRUPTS; there may be exceptions, but WITHOUT-INTERRUPTS is often used as shorthand for WITH-APPROPRIATE-LOCKING. Determining what type of locking is appropriate and writing the code to implement it is likely to be straightforward and simple most of the time.
I've only seen one case where a process's "run reasons" were used to communicate information as well as to control execution; I don't think that this is a common idiom, but may be mistaken about that.
It's certainly possible that programs written for cooperatively scheduled lisps that have run reliably for a long time have done so by accident: resource-contention issues tend to be timing-sensitive, and decoupling thread scheduling from lisp program execution affects timing. I know that there is or was code in both Clozure CL and commercial MCL that was written under the explicit assumption that certain sequences of open-coded operations were uninterruptable; it's certainly possible that the same assumptions have been made (explicitly or otherwise) by application developers.
Unless and until Clozure CL provides alternatives (via window streams, telnet streams, or some other mechanism) all lisp processes share a common *TERMINAL-IO* stream (and therefore share *DEBUG-IO*, *QUERY-IO*, and other standard and internal interactive streams.)
It's anticipated that most lisp processes other than the "Initial" process run mostly in the background. If a background process writes to the output side of *TERMINAL-IO*, that may be a little messy and a little confusing to the user, but it shouldn't really be catastrophic. All I/O to Clozure CL's buffered streams goes thru a locking mechanism that prevents the worst kinds of resource-contention problems.
Although the problems associated with terminal output from multiple processes may be mostly cosmetic, the question of which process receives input from the terminal is likely to be a great deal more important. The stream locking mechanisms can make a confusing situation even worse: competing processes may "steal" terminal input from each other unless locks are held longer than they otherwise need to be, and locks can be held longer than they need to be (as when a process is merely waiting for input to become available on an underlying file descriptor).
Even if background processes rarely need to intentionally read input from the terminal, they may still need to do so in response to errors or other unanticipated situations. There are tradeoffs involved in any solution to this problem. The protocol described below allows background processes which follow it to reliably prompt for and receive terminal input. Background processes which attempt to receive terminal input without following this protocol will likely hang indefinitely while attempting to do so. That's certainly a harsh tradeoff, but since attempts to read terminal input without following this protocol only worked some of the time anyway, it doesn't seem to be an unreasonable one.
In the solution described here (and introduced in Clozure CL 0.9), the internal stream used to provide terminal input is always locked by some process (the "owning" process.) The initial process (the process that typically runs the read-eval-print loop) owns that stream when it's first created. By using the macro WITH-TERMINAL-INPUT, background processes can temporarily obtain ownership of the terminal and relinquish ownership to the previous owner when they're done with it.
In Clozure CL, BREAK, ERROR, CERROR, Y-OR-N-P, YES-OR-NO-P, and CCL:GET-STRING- FROM-USER are all defined in terms of WITH-TERMINAL-INPUT, as are the :TTY user-interfaces to STEP and INSPECT.
? Welcome to Clozure CL Version (Beta: linux) 0.9!
?
? (process-run-function "sleeper" #'(lambda () (sleep 5) (break "broken")))
#<PROCESS sleeper(1) [Enabled] #x3063B33E>
?
;;
;; Process sleeper(1) needs access to terminal input.
;;
This example was run under ILISP; ILISP often gets confused if one tries to enter input and "point" doesn't follow a prompt. Entering a "simple" expression at this point gets it back in synch; that's otherwise not relevant to this example.
()
NIL
? (:y 1)
;;
;; process sleeper(1) now controls terminal input
;;
> Break in process sleeper(1): broken
> While executing: #<Anonymous Function #x3063B276>
> Type :GO to continue, :POP to abort.
> If continued: Return from BREAK.
Type :? for other options.
1 > :b
(30C38E30) : 0 "Anonymous Function #x3063B276" 52
(30C38E40) : 1 "Anonymous Function #x304984A6" 376
(30C38E90) : 2 "RUN-PROCESS-INITIAL-FORM" 340
(30C38EE0) : 3 "%RUN-STACK-GROUP-FUNCTION" 768
1 > :pop
;;
;; control of terminal input restored to process Initial(0)
;;
?
If a background process ("A") needs access to the terminal input stream and that stream is owned by another background process ("B"), process "A" announces that fact, then waits until the initial process regains control.
? Welcome to Clozure CL Version (Beta: linux) 0.9!
?
? (process-run-function "sleep-60" #'(lambda () (sleep 60) (break "Huh?")))
#<PROCESS sleep-60(1) [Enabled] #x3063BF26>
? (process-run-function "sleep-5" #'(lambda () (sleep 5) (break "quicker")))
#<PROCESS sleep-5(2) [Enabled] #x3063D0A6>
? ;;
;; Process sleep-5(2) needs access to terminal input.
;;
()
NIL
? (:y 2)
;;
;; process sleep-5(2) now controls terminal input
;;
> Break in process sleep-5(2): quicker
> While executing: #x3063CFDE>
> Type :GO to continue, :POP to abort.
> If continued: Return from BREAK.
Type :? for other options.
1 > ;; Process sleep-60(1) will need terminal access when
;; the initial process regains control of it.
;;
()
NIL
1 > :pop
;;
;; Process sleep-60(1) needs access to terminal input.
;;
;;
;; control of terminal input restored to process Initial(0)
;;
? (:y 1)
;;
;; process sleep-60(1) now controls terminal input
;;
> Break in process sleep-60(1): Huh?
> While executing: #x3063BE5E>
> Type :GO to continue, :POP to abort.
> If continued: Return from BREAK.
Type :? for other options.
1 > :pop
;;
;; control of terminal input restored to process Initial(0)
;;
?
This scheme is certainly not bulletproof: imaginative use of PROCESS-INTERRUPT and similar functions might be able to defeat it and deadlock the lisp, and any scenario where several background processes are clamoring for access to the shared terminal input stream at the same time is likely to be confusing and chaotic. (An alternate scheme, where the input focus was magically granted to whatever thread the user was thinking about, was considered and rejected due to technical limitations.)
The longer-term fix would probably involve using network or window-system streams to give each process unique instances of *TERMINAL-IO*.
Existing code that attempts to read from *TERMINAL-IO* from a background process will need to be changed to use WITH-TERMINAL-INPUT. Since that code was probably not working reliably in previous versions of Clozure CL, this requirement doesn't seem to be too onerous.
Note that WITH-TERMINAL-INPUT both requests ownership of the terminal input stream and promises to restore that ownership to the initial process when it's done with it. An ad hoc use of READ or READ-CHAR doesn't make this promise; this is the rationale for the restriction on the :Y command.
In the "tty world", Clozure CL starts out with 2 lisp-level threads:
? :proc
1 : -> listener [Active]
0 : Initial [Active]
If you look at a running Clozure CL with a debugging tool, such as GDB, or Apple's Thread Viewer.app, you'll see an additional kernel-level thread on Darwin; this is used by the Mach exception-handling mechanism.
The initial thread, conveniently named "initial", is the one that was created by the operating system when it launched Clozure CL. It maps the heap image into memory, does some Lisp-level initialization, and, when the Cocoa IDE isn't being used, creates the thread "listener", which runs the top-level loop that reads input, evaluates it, and prints the result.
After the listener thread is created, the initial thread does "housekeeping": it sits in a loop, sleeping most of the time and waking up occasionally to do "periodic tasks". These tasks include forcing output on specified interactive streams, checking for and handling control-C interrupts, etc. Currently, those tasks also include polling for the exit status of external processes and handling some kinds of I/O to and from those processes.
In this environment, the initial thread does these
"housekeeping" activities as necessary, until
ccl:quit is called;
quitting interrupts the initial thread, which
then ends all other threads in as orderly a fashion as possible
and calls the C function #_exit.
The short-term plan is to handle each external-process in a dedicated thread; the worst-case behavior of the current scheme can involve busy-waiting and excessive CPU utilization while waiting for an external process to terminate in some cases.
The Cocoa features use more threads. Adding a Cocoa listener creates two threads:
? :proc
3 : -> Listener [Active]
2 : housekeeping [Active]
1 : listener [Active]
0 : Initial [Active]
The Cocoa event loop has to run in the initial thread; when the event loop starts up, it creates a new thread to do the "housekeeping" tasks which the initial thread would do in the terminal-only mode. The initial thread then becomes the one to receive all Cocoa events from the window server; it's the only thread which can.
It also creates one "Listener" (capital-L) thread for each listener window, with a lifetime that lasts as long as the thread does. So, if you open a second listener, you'll see five threads all together:
? :proc
4 : -> Listener-2 [Active]
3 : Listener [Active]
2 : housekeeping [Active]
1 : listener [Active]
0 : Initial [Active]
Unix signals, such as SIGINT (control-C), invoke a handler installed by the Lisp kernel. Although the OS doesn't make any specific guarantee about which thread will receive the signal, in practice, it seems to be the initial thread. The handler just sets a flag and returns; the housekeeping thread (which may be the initial thread, if Cocoa's not being used) will check for the flag and take whatever action is appropriate to the signal.
In the case of SIGINT, the action is to enter a break
loop, by calling on the thread being interrupted. When there's
more than one Lisp listener active, it's not always clear what
thread that should be, since it really depends on the user's
intentions, which there's no way to divine programmatically. To
make its best guess, the handler first checks whether the value
of ccl:*interactive-abort-process* is a
thread, and, if so, uses it. If that fails, it chooses the
thread which currently "owns" the default terminal input stream;
see .
In the bleeding-edge version of the Cocoa support which is based on Hemlock, an Emacs-like editor, each editor window has a dedicated thread associated with it. When a keypress event comes in which affects that specific window the initial thread sends it to the window's dedicated thread. The dedicated thread is responsible for trying to interpret keypresses as Hemlock commands, applying those commands to the active buffer; it repeats this in a loop, until the window closes. The initial thread handles all other events, such as mouse clicks and drags.
This thread-per-window scheme makes many things simpler, including the process of entering a "recursive command loop" in commands like "Incremental Search Forward", etc. (It might be possible to handle all Hemlock commands in the Cocoa event thread, but these "recursive command loops" would have to maintain a lot of context/state information; threads are a straightforward way of maintaining that information.)
Currently (August 2004), when a dedicated thread needs to alter the contents of the buffer or the selection, it does so by invoking methods in the initial thread, for synchronization purposes, but this is probably overkill and will likely be replaced by a more efficient scheme in the future.
The per-window thread could probably take more responsibility for drawing and handling the screen than it currently does; -something- needs to be done to buffer screen updates a bit better in some cases: you don't need to see everything that happens during something like indentation; you do need to see the results...
When Hemlock is being used, listener windows are editor windows, so in addition to each "Listener" thread, you should also see a thread which handles Hemlock command processing.
The Cocoa runtime may make additional threads in certain special situations; these threads usually don't run lisp code, and rarely if ever run much of it.
Returns a list of all lisp processes (threads) known to Clozure CL as of the precise instant it's called. It's safe to traverse this list and to modify the cons cells that comprise that list (it's freshly consed.) Since other threads can create and kill threads at any time, there's generally no way to get an "accurate" list of all threads, and (generally) no sense in which such a list can be accurate.
make-process
name &key
persistent priority class stack-size vstack-size
tstack-size initial-bindings use-standard-initial-bindings
=> process
name---a string, used to identify the process.
persistent---if true, requests that information about the process be retained by SAVE-APPLICATION so that an equivalent process can be restarted when a saved image is run. The default is nil.
priority---ignored. It shouldn't be ignored of course, but there are complications on some platforms. The default is 0.
class---the class of process object to create; should be a subclass of CCL:PROCESS. The default is CCL:PROCESS.
stack-size---the size, in bytes, of the newly-created process's control stack; used for foreign function calls and to save function return address context. The default is CCL:*DEFAULT-CONTROL-STACK-SIZE*.
vstack-size---the size, in bytes, of the newly-created process's value stack; used for lisp function arguments, local variables, and other stack-allocated lisp objects. The default is CCL:*DEFAULT-VALUE-STACK-SIZE*.
tstack-size---the size, in bytes, of the newly-created process's temp stack; used for the allocation of dynamic-extent objects. The default is CCL:*DEFAULT-TEMP-STACK-SIZE*.
use-standard-initial-bindings---when true, the global "standard initial bindings" are put into effect in the new thread before. See DEF-STANDARD-INITIAL-BINDING. "standard" initial bindings are put into effect before any bindings specified by :initial-bindings are. The default is t.
initial-bindings---an alist of (symbol . valueform) pairs, which can be used to initialize special variable bindings in the new thread. Each valueform is used to compute the value of a new binding of symbol in the execution environment of the newly-created thread. The default is nil.
process---the newly-created process.
Creates and returns a new lisp process (thread) with the specified attributes. process will not begin execution immediately; it will need to be preset (given an initial function to run, as by process-preset) and enabled (allowed to execute, as by process-enable) before it's able to actually do anything.
If valueform is a function, it is called, with no arguments, in the execution environment of the newly-created thread; the primary value it returns is used for the binding of the corresponding symbol.
Otherwise, valueform is evaluated in the execution environment of the newly-created thread, and the resulting value is used.
process---a lisp process (thread).
result---T if process had been runnable and is now suspended; NIL otherwise. That is, T if process's process-suspend-count transitioned from 0 to 1.
Suspends process, preventing it from running, and stopping it if it was already running. This is a fairly expensive operation, because it involves a few calls to the OS. It also risks creating deadlock if used improperly, for instance, if the process being suspended owns a lock or other resource which another process will wait for.
Each call to process-suspend must be reversed by a matching call to process-resume before process is able to run. What process-suspend actually does is increment the process-suspend-count of process.
A process can't suspend itself, though this once worked and this documentation claimed has claimed that it did.
process-suspend was previously called process-disable. process-enable now names a function for which there is no obvious inverse, so process-disable is no longer defined.
process---a lisp process (thread).
result---T if process had been suspended and is now runnable; NIL otherwise. That is, T if process's process-suspend-count transitioned from to 0.
Undoes the effect of a previous call to process-suspend; if all such calls are undone, makes the process runnable. Has no effect if the process is not suspended. What process-resume actually does is decrement the process-suspend-count of process, to a minimum of 0.
This was previously called PROCESS-ENABLE; process-enable now does something slightly different.
process---a lisp process (thread).
result---The number of "outstanding" process-suspend calls on process, or NIL if process has expired.
An "outstanding" process-suspend call is one which has not yet been reversed by a call to process-resume. A process expires when its initial function returns, although it may later be reset.
A process is runnable when it has a process-suspend-count of 0, has been preset as by process-preset, and has been enabled as by process-enable. Newly-created processes have a process-suspend-count of 0.
process---a lisp process (thread).
function---a function, designated by itself or by a symbol which names it.
args---a list of values, appropriate as arguments to function.
result---undefined.
Typically used to initialize a newly-created or newly-reset process, setting things up so that when process becomes enabled, it will begin execution by applying function to args. process-preset does not enable process, although a process must be process-preset before it can be enabled. Processes are normally enabled by process-enable.
process---a lisp process (thread).
timeout---a time interval in seconds. May be any non-negative real number the floor of which fits in 32 bits. The default is 1.
result---undefined.
Tries to begin the execution of process. An error is signaled if process has never been process-preset. Otherwise, process invokes its initial function.
process-enable attempts to synchronize with process, which is presumed to be reset or in the act of resetting itself. If this attempt is not successful within the time interval specified by timeout, a continuable error is signaled, which offers the opportunity to continue waiting.
A process cannot meaningfully attempt to enable itself.
name---a string, used to identify the process. Passed to make-process.
function---a function, designated by itself or by a symbol which names it. Passed to preset-process.
persistent---a boolean, passed to make-process.
priority---ignored.
class---a subclass of CCL:PROCESS. Passed to make-process.
stack-size---a size, in bytes. Passed to make-process.
vstack-size---a size, in bytes. Passed to make-process.
tstack-size---a size, in bytes. Passed to make-process.
process---the newly-created process.
Creates a lisp process (thread) via make-process, presets it via process-preset, and enables it via process-enable. This means that process will immediately begin to execute. process-run-function is the simplest way to create and run a process.
process---a lisp process (thread).
function---a function.
args---a list of values, appropriate as arguments to function.
result---the result of applying function to args if process is the current-process, otherwise NIL.
Arranges for process to apply function to args at some point in the near future (interrupting whatever process was doing.) If function returns normally, process resumes execution at the point at which it was interrupted.
process must be in an enabled state in order to respond to a process-interrupt request. It's perfectly legal for a process to call process-interrupt on itself.
process-interrupt uses asynchronous POSIX signals to interrupt threads. If the thread being interrupted is executing lisp code, it can respond to the interrupt almost immediately (as soon as it has finished pseudo-atomic operations like consing and stack-frame initialization.)
If the interrupted thread is blocking in a system call, that system call is aborted by the signal and the interrupt is handled on return.
It is still difficult to reliably interrupt arbitrary foreign code (that may be stateful or otherwise non-reentrant); the interrupt request is handled when such foreign code returns to or enters lisp.
It would probably be better for result to always be NIL, since the present behavior is inconsistent.
Process-interrupt works by sending signals between threads, via the C function #_pthread_signal. It could be argued that it should be done in one of several possible other ways under Darwin, to make it practical to asynchronously interrupt things which make heavy use of the Mach nanokernel.
process---a lisp process (thread).
kill-option---an internal argument, must be nil.
result---undefined.
Causes process to cleanly exit from any ongoing computation and enter a state where it can be process-preset. This is implemented by signaling a condition of type PROCESS-RESET; user-defined condition handlers should generally refrain from attempting to handle conditions of this type.
The kill-option argument is for internal use only and should not be specified by user code
A process can meaningfully reset itself.
There is in general no way to know precisely when process has completed the act of resetting or killing itself; a process which has either entered the limbo of the reset state or exited has few ways of communicating either fact. process-enable can reliably determine when a process has entered the "limbo of the reset state", but can't predict how long the clean exit from ongoing computation might take: that depends on the behavior of unwind-protect cleanup forms, and of the OS scheduler.
Resetting a process other than *current-process* involves the use of process-interrupt.
process---a lisp process (thread).
condition---a lisp condition. The default is NIL.
Entirely equivalent to calling (process-interrupt process (lambda () (abort condition))). Causes process to transfer control to the applicable handler or restart for abort.
If condition is non-NIL, process-abort does not consider any handlers which are explicitly bound to conditions other than condition.
The clock resolution of the OS scheduler. Currently, both LinuxPPC and DarwinPPC yield an initial value of 100.
This information is primarily for the benefit of debugging tools. whostate is a terse report on what process is doing, or not doing, and why.
If the process is currently waiting in a call to process-wait or process-wait-with-timeout, its process-whostate will be the value which was passed to that function as whostate.
Advises the OS scheduler that the current thread has nothing useful to do and that it should try to find some other thread to schedule in its place. There's almost always a better alternative, such as waiting for some specific event to occur. For example, you could use a lock or semaphore.
whostate---a string, which will be the value of process-whostate while the process is waiting.
function---a function, designated by itself or by a symbol which names it.
args---a list of values, appropriate as arguments to function.
result---NIL.
Causes the current lisp process (thread) to repeatedly apply function to args until the call returns a true result, then returns NIL. After each failed call, yields the CPU as if by process-allow-schedule.
As with process-allow-schedule, it's almost always more efficient to wait for some specific event to occur; this isn't exactly busy-waiting, but the OS scheduler can do a better job of scheduling if it's given the relevant information. For example, you could use a lock or semaphore.
whostate---a string, which will be the value of process-whostate while the process is waiting.
ticks---either a positive integer expressing a duration in "ticks" (see *ticks-per-second*), or NIL.
function---a function, designated by itself or by a symbol which names it.
args---a list of values, appropriate as arguments to function.
result---T if process-wait-with-timeout returned because its function returned true, or NIL if it returned because the duration ticks has been exceeded.
If ticks is NIL, behaves exactly like process-wait, except for returning T. Otherwise, function will be tested repeatedly, in the same kind of test/yield loop as in process-wait until either function returns true, or the duration ticks has been exceeded.
Having already read the descriptions of process-allow-schedule and process-wait, the astute reader has no doubt anticipated the observation that better alternatives should be used whenever possible.
Executes body in an environment in which process-interrupt requests are deferred. As noted in the description of process-interrupt, this has nothing to do with the scheduling of other threads; it may be necessary to inhibit process-interrupt handling when (for instance) modifying some data structure (for which the current thread holds an appropriate lock) in some manner that's not reentrant.
name---any lisp object; saved as part of lock. Typically a string or symbol which may appear in the process-whostates of threads which are waiting for lock.
lock---a newly-allocated object of type CCL:LOCK.
lock---an object of type CCL:LOCK.
body---an implicit progn.
result---the primary value returned by body.
Blocks until lock is owned by the calling thread.
The macro with-lock-grabbed could be defined in terms of grab-lock and release-lock, but it is actually implemented at a slightly lower level.
lock---an object of type CCL:LOCK.
result---T if lock has been obtained, or NIL if it has not.
Tests whether lock can be obtained without blocking - that is, either lock is already free, or it is already owned by *current-process*. If it can, causes it to be owned by the calling lisp process (thread) and returns T. Otherwise, the lock is already owned by another thread and cannot be obtained without blocking; NIL is returned in this case.
Creates and returns an object of type CCL::READ-WRITE-LOCK. A read-write lock may, at any given time, belong to any number of lisp processes (threads) which act as "readers"; or, it may belong to at most one process which acts as a "writer". A read-write lock may never be held by a reader at the same time as a writer. Initially, read-write-lock has no readers and no writers.
read-write-lock---an object of type CCL:READ-WRITE-LOCK.
body---an implicit progn.
result---the primary value returned by body.
Waits until read-write-lock has no writer, ensures that *current-process* is a reader of it, then executes body.
After executing body, if *current-process* was not a reader of read-write-lock before with-read-lock was called, the lock is released. If it was already a reader, it remains one.
read-write-lock---an object of type CCL:READ-WRITE-LOCK.
body---an implicit progn.
result---the primary value returned by body.
Waits until read-write-lock has no readers and no writer other than *current-process*, then ensures that *current-process* is the writer of it. With the lock held, executes body.
After executing body, if *current-process* was not the writer of read-write-lock before with-write-lock was called, the lock is released. If it was already the writer, it remains the writer.
semaphore---an object of type CCL:SEMAPHORE.
result---an integer representing an error identifier which was returned by the underlying OS call.
Atomically increments semaphore's "count" by 1; this may enable a waiting thread to resume execution.
semaphore---an object of type CCL:SEMAPHORE.
result---an integer representing an error identifier which was returned by the underlying OS call.
Waits until semaphore has a positive count that can be atomically decremented; this will succeed exactly once for each corresponding call to SIGNAL-SEMAPHORE.
semaphore---An object of type CCL:SEMAPHORE.
timeout---a time interval in seconds. May be any non-negative real number the floor of which fits in 32 bits. The default is 1.
result---T if timed-wait-on-semaphore returned because it was able to decrement the count of semaphore; NIL if it returned because the duration timeout has been exceeded.
fd---a file descriptor, which is a non-negative integer used by the OS to refer to an open file, socket, or similar I/O connection. See ccl::stream-device.
timeout---either NIL or a time interval in milliseconds. Must be a non-negative integer. The default is NIL.
Wait until input is available on fd. This uses the select() system call, and is generally a fairly efficient way of blocking while waiting for input. More accurately, process-input-wait waits until it's possible to read from fd without blocking, or until timeout, if it is not NIL, has been exceeded.
Note that it's possible to read without blocking if the file is at its end - although, of course, the read will return zero bytes.
process-input-wait has a timeout parameter, and process-output-wait does not. This inconsistency should probably be corrected.
fd---a file descriptor, which is a non-negative integer used by the OS to refer to an open file, socket, or similar I/O connection. See ccl::stream-device.
timeout---either NIL or a time interval in milliseconds. Must be a non-negative integer. The default is NIL.
Wait until output is possible on fd or until timeout, if it is not NIL, has been exceeded. This uses the select() system call, and is generally a fairly efficient way of blocking while waiting to output.
If process-output-wait is called on a network socket which has not yet established a connection, it will wait until the connection is established. This is an important use, often overlooked.
process-input-wait has a timeout parameter, and process-output-wait does not. This inconsistency should probably be corrected.
Controls how attempts to obtain ownership of terminal input are made. When NIL, a message is printed on *TERMINAL-IO*; it's expected that the user will later yield control of the terminal via the :Y toplevel command. When T, a BREAK condition is signaled in the owning process; continuing from the break loop will yield the terminal to the requesting process (unless the :Y command was already used to do so in the break loop.)
p---a lisp process (thread), designated either by an integer which matches its process-serial-number, or by a string which is equal to its process-name.
:Y is a toplevel command, not a function. As such, it can only be used interactively, and only from the initial process.
The command yields control of terminal input to the process p, which must have used with-terminal-input to request access to the terminal input stream.
process---a process, typically created by process-run-function or by make-process
default---A default value to be returned if the specified process doesn't exit normally.
values---The values returned by the specified process's initial function if that function returns, or the value of the default argument, otherwise.
Waits for the specified process to terminate. If the process terminates "normally" (if its initial function returns), returns the values that that initial function returnes. If the process does not terminate normally (e.g., if it's terminated via process-kill and a default argument is provided, returns the value of that default argument. If the process doesn't terminate normally and no default argument is provided, signals an error.
A process can't successfully join itself, and only one process can successfully receive notification of another process's termination.
Clozure CL supports the socket abstraction for interprocess communication. A socket represents a connection to another process, typically (but not necessarily) a TCP/IP network connection to a client or server running on some other machine on the network.
All symbols mentioned in this chapter are exported from the CCL package. As of version 0.13, these symbols are additionally exported from the OPENMCL-SOCKET package.
Clozure CL supports three types of sockets: TCP sockets, UDP sockets, and Unix-domain sockets. This should be enough for all but the most esoteric network situations. All sockets are created by make-socket. The type of socket depends on the arguments to it, as follows:
A buffered bi-directional stream over a TCP/IP connection. tcp-stream is a subclass of stream, and you can read and write to it using all the usual stream functions. Created by (make-socket :address-family :internet :type :stream :connect :active ...) or by (accept-connection ...).
A buffered bi-directional stream over a "UNIX domain" connection. file-socket-stream is a subclass of stream, and you can read and write to it using all the usual stream functions. Created by (make-socket :address-family :file :type :stream :connect :active ...) or by (accept-connection ...),
A passive socket used to listen for incoming TCP/IP connections on a particular port. A listener-socket is not a stream. It doesn't support I/O. It can only be used to create new tcp-streams by accept-connection. Created by (make-socket :type :stream :connect :passive ...)
A passive socket used to listen for incoming UNIX domain connections named by a file in the local filesystem. A listener-socket is not a stream. It doesn't support I/O. It can only be used to create new file-socket-streams by accept-connection. Created by (make-socket :address-family :file :type :stream :connect :passive ...)
A socket representing a packet-based UDP/IP connection. A udp-socket supports I/O but it is not a stream. Instead, you must use the special functions send-to and receive-from to read and write to it. Created by (make-socket :type :datagram ...)
make-socket &key
address-family type connect eol format remote-host
remote-port local-host local-port local-filename
remote-filename keepalive reuse-address nodelay broadcast
linger backlog input-timeout output-timeout connect-timeout
auto-close deadline
address-family---The address/protocol family of this socket. Currently only :internet (the default), meaning IP, and :file, referring to UNIX domain addresses, are supported.
type---One of :stream (the default) to request a connection-oriented socket, or :datagram to request a connectionless socket. The default is :stream.
connect---This argument is only relevant to sockets of type :stream. One of :active (the default) to request a :passive to request a file or TCP listener socket.
eol---This argument is currently ignored (it is accepted for compatibility with Franz Allegro).
format---One of :text (the default), :binary, or :bivalent. This argument is ignored for :stream sockets for now, as :stream sockets are currently always bivalent (i.e. they support both character and byte I/O). For :datagram sockets, the format determines the type of buffer created by receive-from.
remote-host---Required for TCP streams, it specifies the host to connect to (in any format acceptable to lookup-hostname). Ignored for listener sockets. For UDP sockets, it can be used to specify a default host for subsequent calls to send-to or receive-from.
remote-port---Required for TCP streams, it specifies the port to connect to (in any format acceptable to lookup-port). Ignored for listener sockets. For UDP sockets, it can be used to specify a default port for subsequent calls to for subsequent calls to send-to or receive-from.
remote-filename---Required for file-socket streams, it specifies the name of a file in the local filesystem (e.g., NOT mounted via NFS, AFP, SMB, ...) which names and controls access to a UNIX-domain socket.
local-host---Allows you to specify a local host address for a listener or UDP socket, for the rare case where you want to restrict connections to those coming to a specific local address for security reasons.
local-port---Specify a local port for a socket. Most useful for listener sockets, where it is the port on which the socket will listen for connections.
local-filename---Required for file-listener-sockets. Specifies the name of a file in the local filesystem which is used to name a UNIX-domain socket. The actual filesystem file should not previously exist when the file-listener-socket is created; its parent directory should exist and be writable by the caller. The file used to name the socket will be deleted when the file-listener-socket is closed.
keepalive---If true, enables the periodic transmission of "keepalive" messages.
reuse-address---If true, allows the reuse of local ports in listener sockets, overriding some TCP/IP protocol specifications. You will need this if you are debugging a server..
nodelay---If true, disables Nagle's algorithm, which tries to minimize TCP packet fragmentation by introducing transmission delays in the absence of replies. Try setting this if you are using a protocol which involves sending a steady stream of data with no replies and are seeing significant degradations in throughput.
broadcast---If true, requests permission to broadcast datagrams on a UDP socket.
linger---If specified and non-nil, should be the number of seconds the OS is allowed to wait for data to be pushed through when a close is done. Only relevant for TCP sockets.
backlog---For a listener socket, specifies the number of connections which can be pending but not accepted. The default is 5, which is also the maximum on some operating systems.
input-timeout---The number of seconds before an input operation
times out. Must be a real number between zero and one
million. If an input operation takes longer than the
specified number of seconds, an
input-timeout error is signalled.
(see Section 9.1.4, “Stream Timeouts and Deadlines”)
output-timeout---The number of seconds before an output operation
times out. Must be a real number between zero and one
million. If an output operation takes longer than the
specified number of seconds, an
output-timeout error is signalled.
(see Section 9.1.4, “Stream Timeouts and Deadlines”)
connect-timeout---The number of seconds before a connection
attempt times out. [TODO: what are acceptable values?]
If a connection attempt takes longer than the
specified number of seconds, a
socket-error is signalled. This
can be useful if the specified interval is shorter
than the interval that the OS's socket layer imposes,
which is sometimes a minute or two.
auto-close---When non-nil, any resulting socket stream will be closed when the GC can prove that the stream is unreferenced. This is done via CCL's termination mechanism [TODO add xref].
deadline---Specifies an absolute time in
internal-time-units. If an I/O operation on the
stream does not complete before the deadline then a
COMMUNICATION-DEADLINE-EXPIRED
error is signalled. A deadline takes precedence over
any input/output timeouts that may be set. (see Section 9.1.4, “Stream Timeouts and Deadlines”)
socket---The listener-socket to listen on.
wait---If true (the default), and there are no connections waiting to be accepted, waits until one arrives. If false, returns NIL immediately.
Extracts the first connection on the queue of pending connections, accepts it (i.e. completes the connection startup protocol) and returns a new tcp-stream or file-socket-stream representing the newly established connection. The tcp stream inherits any properties of the listener socket that are relevant (e.g. :keepalive, :nodelay, etc.) The original listener socket continues to be open listening for more connections, so you can call accept-connection on it again.
socket---The socket to read from
size---Maximum number of bytes to read. If the packet is larger than this, any extra bytes are discarded.
buffer---If specified, must be either a string or a byte vector which will be used to read in the data. If not specified, a new buffer will be created (of type determined by socket-format).
extract---If true, the subsequence of the buffer corresponding only to the data read in is extracted and returned as the first value. If false (the default) the original buffer is returned even if it is only partially filled.
offset---Specifies the start offset into the buffer at which data is to be stored. The default is 0.
socket---The socket to write to
buffer---A vector containing the data to send. It must be either a string or a byte vector (either one is acceptable regardless of the stream format).
size---Number of bytes to send
remote-host---The host to send the packet to, in any format acceptable to lookup-hostname. The default is the remote host specified in the call to make-socket.
remote-port---The port to send the packet to, in any format acceptable to lookup-port. The default is the remote port specified in the call to make-socket.
offset---The offset in the buffer where the packet data starts
Returns the native OS's representation of the socket, or NIL if the socket is closed. On Unix, this is the Unix 'file descriptor', a small non-negative integer. Note that it is rather dangerous to mess around with tcp-stream fd's, as there is all sorts of buffering and asynchronous I/O going on above the OS level. listener-socket and udp-socket fd's are safer to mess with directly as there is less magic going on.