doc/manual/compiler.texinfo

   1 @node Compiler
   2 @comment  node-name,  next,  previous,  up
   3 @chapter Compiler
   4
   5 This chapter will discuss most compiler issues other than efficiency,
   6 including compiler error messages, the SBCL compiler's unusual
   7 approach to type safety in the presence of type declarations, the
   8 effects of various compiler optimization policies, and the way that
   9 inlining and open coding may cause optimized code to differ from a
  10 naive translation. Efficiency issues are sufficiently varied and
  11 separate that they have their own chapter, @ref{Efficiency}.
  12
  13 @menu
  14 * Diagnostic Messages::
  15 * Handling of Types::
  16 * Compiler Policy::
  17 * Compiler Errors::
  18 * Open Coding and Inline Expansion::
  19 * Interpreter::
  20 @end menu
  21
  22 @node  Diagnostic Messages
  23 @comment  node-name,  next,  previous,  up
  24 @section Diagnostic Messages
  25 @cindex Messages, Compiler
  26 @cindex Compiler messages
  27
  28 @menu
  29 * Controlling Verbosity::
  30 * Diagnostic Severity::
  31 * Understanding Compiler Diagnostics::
  32 @end menu
  33
  34 @node Controlling Verbosity
  35 @comment  node-name,  next,  previous,  up
  36 @subsection Controlling Verbosity
  37
  38 The compiler can be quite verbose in its diagnostic reporting, rather
  39 more then some users would prefer -- the amount of noise emitted can
  40 be controlled, however.
  41
  42 To control emission of compiler diagnostics (of any severity other
  43 than @code{error}: @pxref{Diagnostic Severity}) use the
  44 @code{sb-ext:muffle-conditions} and @code{sb-ext:unmuffle-conditions}
  45 declarations, specifying the type of condition that is to be muffled
  46 (the muffling is done using an associated @code{muffle-warning} restart).
  47
  48 Global control:
  49 @lisp
  50 ;;; Muffle compiler-notes globally
  51 (declaim (sb-ext:muffle-conditions sb-ext:compiler-note))
  52 @end lisp
  53
  54 Local control:
  55 @lisp
  56 ;;; Muffle compiler-notes based on lexical scope
  57 (defun foo (x)
  58   (declare (optimize speed) (fixnum x)
  59            (sb-ext:muffle-conditions sb-ext:compiler-note))
  60   (values (* x 5) ; no compiler note from this
  61     (locally
  62       (declare (sb-ext:unmuffle-conditions sb-ext:compiler-note))
  63       ;; this one gives a compiler note
  64       (* x -5))))
  65 @end lisp
  66
  67 @deffn {Declaration} sb-ext:muffle-conditions
  68 Syntax: type*
  69
  70 Muffles the diagnostic messages that would be caused by compile-time
  71 signals of given types.
  72 @end deffn
  73
  74 @deffn {Declaration} sb-ext:unmuffle-conditions
  75 Syntax: type*
  76
  77 Cancels the effect of a previous @code{sb-ext:muffle-condition}
  78 declaration.
  79 @end deffn
  80
  81 Various details of @emph{how} the compiler messages are printed can be
  82 controlled via the alist
  83 @code{sb-ext:*compiler-print-variable-alist*}.
  84
  85 @include var-sb-ext-star-compiler-print-variable-alist-star.texinfo
  86
  87 For information about muffling warnings signaled outside of the
  88 compiler, see @ref{Customization Hooks for Users}.
  89
  90 @c <!-- FIXME: How much control over error messages is in SBCL?
  91 @c      _     How much should be? How much of this documentation should
  92 @c      _     we save or adapt?
  93 @c      _
  94 @c      _ %%\node Error Message Parameterization,  , Read Errors, Interpreting Error Messages
  95 @c      _ \subsection{Error Message Parameterization}
  96 @c      _ \cpsubindex{error messages}{verbosity}
  97 @c      _ \cpsubindex{verbosity}{of error messages}
  98 @c      _
  99 @c      _ There is some control over the verbosity of error messages.  See also
 100 @c      _ \varref{undefined-warning-limit}, \code{*efficiency-note-limit*} and
 101 @c      _ \varref{efficiency-note-cost-threshold}.
 102 @c      _
 103 @c      _ \begin{defvar}{}{enclosing-source-cutoff}
 104 @c      _
 105 @c      _   This variable specifies the number of enclosing actual source forms
 106 @c      _   that are printed in full, rather than in the abbreviated processing
 107 @c      _   path format.  Increasing the value from its default of \code{1}
 108 @c      _   allows you to see more of the guts of the macroexpanded source,
 109 @c      _   which is useful when debugging macros.
 110 @c      _ \end{defvar}
 111 @c      _
 112 @c      _ \begin{defmac}{extensions:}{define-source-context}{%
 113 @c      _     \args{\var{name} \var{lambda-list} \mstar{form}}}
 114 @c      _
 115 @c      _   This macro defines how to extract an abbreviated source context from
 116 @c      _   the \var{name}d form when it appears in the compiler input.
 117 @c      _   \var{lambda-list} is a \code{defmacro} style lambda-list used to
 118 @c      _   parse the arguments.  The \var{body} should return a list of
 119 @c      _   subforms that can be printed on about one line.  There are
 120 @c      _   predefined methods for \code{defstruct}, \code{defmethod}, etc.  If
 121 @c      _   no method is defined, then the first two subforms are returned.
 122 @c      _   Note that this facility implicitly determines the string name
 123 @c      _   associated with anonymous functions.
 124 @c      _ \end{defmac}
 125 @c      _
 126 @c      _ -->
 127
 128 @node  Diagnostic Severity
 129 @comment  node-name,  next,  previous,  up
 130 @subsection Diagnostic Severity
 131 @cindex Severity of compiler messages
 132 @cindex Compiler Diagnostic Severity
 133 @tindex error
 134 @tindex warning
 135 @tindex style-warning
 136 @tindex compiler-note
 137 @tindex code-deletion-note
 138
 139 There are four levels of compiler diagnostic severity:
 140
 141 @enumerate 1
 142 @item error
 143 @item warning
 144 @item style warning
 145 @item note
 146 @end enumerate
 147
 148 The first three levels correspond to condition classes which are
 149 defined in the ANSI standard for Common Lisp and which have special
 150 significance to the @code{compile} and @code{compile-file} functions.
 151 These levels of compiler error severity occur when the compiler
 152 handles conditions of these classes.
 153
 154 The fourth level of compiler error severity, @emph{note}, corresponds
 155 to the @code{sb-ext:compiler-note}, and is used for problems which are
 156 too mild for the standard condition classes, typically hints about how
 157 efficiency might be improved. The @code{sb-ext:code-deletion-note}, a
 158 subtype of @code{compiler-note}, is signalled when the compiler
 159 deletes user-supplied code, usually after proving that the code in
 160 question is unreachable.
 161
 162 Future work for SBCL includes expanding this hierarchy of types to
 163 allow more fine-grained control over emission of diagnostic messages.
 164
 165 @include condition-sb-ext-compiler-note.texinfo
 166 @include condition-sb-ext-code-deletion-note.texinfo
 167
 168
 169 @node Understanding Compiler Diagnostics
 170 @comment  node-name,  next,  previous,  up
 171 @subsection Understanding Compile Diagnostics
 172
 173 The messages emitted by the compiler contain a lot of detail in a
 174 terse format, so they may be confusing at first. The messages will be
 175 illustrated using this example program:
 176
 177 @lisp
 178 (defmacro zoq (x)
 179   `(roq (ploq (+ ,x 3))))
 180
 181 (defun foo (y)
 182   (declare (symbol y))
 183   (zoq y))
 184 @end lisp
 185
 186 The main problem with this program is that it is trying to add
 187 @code{3} to a symbol. Note also that the functions @code{roq} and
 188 @code{ploq} aren't defined anywhere.
 189
 190 @menu
 191 * The Parts of a Compiler Diagnostic::
 192 * The Original and Actual Source::
 193 * The Processing Path::
 194 @end menu
 195
 196 @node  The Parts of a Compiler Diagnostic
 197 @comment  node-name,  next,  previous,  up
 198 @subsubsection The Parts of a Compiler Diagnostic
 199
 200 When processing this program, the compiler will produce this warning:
 201
 202 @example
 203 ; file: /tmp/foo.lisp
 204 ; in: DEFUN FOO
 205 ;     (ZOQ Y)
 206 ; --> ROQ PLOQ
 207 ; ==>
 208 ;   (+ Y 3)
 209 ;
 210 ; caught WARNING:
 211 ;   Asserted type NUMBER conflicts with derived type (VALUES SYMBOL &OPTIONAL).
 212 @end example
 213
 214 In this example we see each of the six possible parts of a compiler
 215 diagnostic:
 216
 217 @enumerate
 218
 219 @item
 220 @findex with-compilation-unit
 221 @samp{file: /tmp/foo.lisp} This is the name of the file that the
 222 compiler read the relevant code from.  The file name is displayed
 223 because it may not be immediately obvious when there is an error
 224 during compilation of a large system, especially when
 225 @code{with-compilation-unit} is used to delay undefined warnings.
 226
 227 @item
 228 @samp{in: DEFUN FOO} This is the definition top level form responsible
 229 for the diagnostic. It is obtained by taking the first two elements of
 230 the enclosing form whose first element is a symbol beginning with
 231 ``@samp{def}''. If there is no such enclosing ``@samp{def}'' form,
 232 then the outermost form is used. If there are multiple @samp{def}
 233 forms, then they are all printed from the outside in, separated by
 234 @samp{=>}'s. In this example, the problem was in the @code{defun} for
 235 @code{foo}.
 236
 237 @item
 238 @cindex Original Source
 239 @samp{(ZOQ Y)} This is the @dfn{original source} form responsible for
 240 the diagnostic. Original source means that the form directly appeared
 241 in the original input to the compiler, i.e. in the lambda passed to
 242 @code{compile} or in the top level form read from the source file. In
 243 this example, the expansion of the @code{zoq} macro was responsible
 244 for the message.
 245
 246 @item
 247 @cindex Processing Path
 248 @samp{--> ROQ PLOQ} This is the @dfn{processing path} that the
 249 compiler used to produce the code that caused the message to be
 250 emitted. The processing path is a representation of the evaluated
 251 forms enclosing the actual source that the compiler encountered when
 252 processing the original source. The path is the first element of each
 253 form, or the form itself if the form is not a list. These forms result
 254 from the expansion of macros or source-to-source transformation done
 255 by the compiler. In this example, the enclosing evaluated forms are
 256 the calls to @code{roq} and @code{ploq}. These calls resulted from the
 257 expansion of the @code{zoq} macro.
 258
 259 @item
 260 @cindex Actual Source
 261 @samp{==> (+ Y 3)} This is the @dfn{actual source} responsible for the
 262 diagnostic. If the actual source appears in the explanation, then we
 263 print the next enclosing evaluated form, instead of printing the
 264 actual source twice. (This is the form that would otherwise have been
 265 the last form of the processing path.) In this example, the problem is
 266 with the evaluation of the reference to the variable @code{y}.
 267
 268 @item
 269 @samp{caught WARNING: Asserted type NUMBER conflicts with derived type
 270 (VALUES SYMBOL &OPTIONAL).}  This is the @dfn{explanation} of the
 271 problem. In this example, the problem is that, while the call to
 272 @code{+} requires that its arguments are all of type @code{number},
 273 the compiler has derived that @code{y} will evaluate to a
 274 @code{symbol}.  Note that @samp{(VALUES SYMBOL &OPTIONAL)} expresses
 275 that @code{y} evaluates to precisely one value.
 276
 277 @end enumerate
 278
 279 Note that each part of the message is distinctively marked:
 280
 281 @itemize
 282
 283 @item
 284  @samp{file:} and @samp{in:} mark the file and definition,
 285 respectively.
 286
 287 @item
 288 The original source is an indented form with no prefix.
 289
 290 @item
 291 Each line of the processing path is prefixed with @samp{-->}
 292
 293 @item
 294 The actual source form is indented like the original source, but is
 295 marked by a preceding @samp{==>} line.
 296 @comment no it isn't.
 297
 298 @item
 299 The explanation is prefixed with the diagnostic severity, which can be
 300 @samp{caught ERROR:}, @samp{caught WARNING:}, @samp{caught
 301 STYLE-WARNING:}, or @samp{note:}.
 302
 303 @end itemize
 304
 305 Each part of the message is more specific than the preceding one. If
 306 consecutive messages are for nearby locations, then the front part of
 307 the messages would be the same. In this case, the compiler omits as
 308 much of the second message as in common with the first. For example:
 309
 310 @example
 311 ; file: /tmp/foo.lisp
 312 ; in: DEFUN FOO
 313 ;     (ZOQ Y)
 314 ; --> ROQ
 315 ; ==>
 316 ;   (PLOQ (+ Y 3))
 317 ;
 318 ; caught STYLE-WARNING:
 319 ;   undefined function: PLOQ
 320
 321 ; ==>
 322 ;   (ROQ (PLOQ (+ Y 3)))
 323 ;
 324 ; caught STYLE-WARNING:
 325 ;   undefined function: ROQ
 326 @end example
 327 @comment fixing that weird blank line might be good
 328
 329 In this example, the file, definition and original source are
 330 identical for the two messages, so the compiler omits them in the
 331 second message. If consecutive messages are entirely identical, then
 332 the compiler prints only the first message, followed by: @samp{[Last
 333 message occurs @var{repeats} times]} where @var{repeats} is the number
 334 of times the message was given.
 335
 336 If the source was not from a file, then no file line is printed. If
 337 the actual source is the same as the original source, then the
 338 processing path and actual source will be omitted. If no forms
 339 intervene between the original source and the actual source, then the
 340 processing path will also be omitted.
 341
 342
 343 @node  The Original and Actual Source
 344 @comment  node-name,  next,  previous,  up
 345 @subsubsection The Original and Actual Source
 346 @cindex Original Source
 347 @cindex Actual Source
 348
 349 The @emph{original source} displayed will almost always be a list. If
 350 the actual source for an message is a symbol, the original source will
 351 be the immediately enclosing evaluated list form. So even if the
 352 offending symbol does appear in the original source, the compiler will
 353 print the enclosing list and then print the symbol as the actual
 354 source (as though the symbol were introduced by a macro.)
 355
 356 When the @emph{actual source} is displayed (and is not a symbol), it
 357 will always be code that resulted from the expansion of a macro or a
 358 source-to-source compiler optimization. This is code that did not
 359 appear in the original source program; it was introduced by the
 360 compiler.
 361
 362 Keep in mind that when the compiler displays a source form in an
 363 diagnostic message, it always displays the most specific (innermost)
 364 responsible form. For example, compiling this function
 365
 366 @lisp
 367 (defun bar (x)
 368   (let (a)
 369     (declare (fixnum a))
 370     (setq a (foo x))
 371     a))
 372 @end lisp
 373
 374 gives this error message
 375
 376 @example
 377 ; file: /tmp/foo.lisp
 378 ; in: DEFUN BAR
 379 ;     (LET (A)
 380 ;     (DECLARE (FIXNUM A))
 381 ;     (SETQ A (FOO X))
 382 ;     A)
 383 ;
 384 ; caught WARNING:
 385 ;   Asserted type FIXNUM conflicts with derived type (VALUES NULL &OPTIONAL).
 386 @end example
 387
 388 This message is not saying ``there is a problem somewhere in this
 389 @code{let}'' -- it is saying that there is a problem with the
 390 @code{let} itself. In this example, the problem is that @code{a}'s
 391 @code{nil} initial value is not a @code{fixnum}.
 392
 393 @node The Processing Path
 394 @comment  node-name,  next,  previous,  up
 395 @subsubsection The Processing Path
 396 @cindex Processing Path
 397 @cindex Macroexpansion
 398 @cindex Source-to-source transformation
 399
 400 The processing path is mainly useful for debugging macros, so if you
 401 don't write macros, you can probably ignore it. Consider this example:
 402
 403 @lisp
 404 (defun foo (n)
 405   (dotimes (i n *undefined*)))
 406 @end lisp
 407
 408 Compiling results in this error message:
 409
 410 @example
 411 ; in: DEFUN FOO
 412 ;     (DOTIMES (I N *UNDEFINED*))
 413 ; --> DO BLOCK LET TAGBODY RETURN-FROM
 414 ; ==>
 415 ;   (PROGN *UNDEFINED*)
 416 ;
 417 ; caught WARNING:
 418 ;   undefined variable: *UNDEFINED*
 419 @end example
 420
 421 Note that @code{do} appears in the processing path. This is because
 422 @code{dotimes} expands into:
 423
 424 @lisp
 425 (do ((i 0 (1+ i)) (#:g1 n))
 426     ((>= i #:g1) *undefined*)
 427   (declare (type unsigned-byte i)))
 428 @end lisp
 429
 430 The rest of the processing path results from the expansion of
 431 @code{do}:
 432
 433 @lisp
 434 (block nil
 435   (let ((i 0) (#:g1 n))
 436     (declare (type unsigned-byte i))
 437     (tagbody (go #:g3)
 438       #:g2    (psetq i (1+ i))
 439       #:g3    (unless (>= i #:g1) (go #:g2))
 440       (return-from nil (progn *undefined*)))))
 441 @end lisp
 442
 443 In this example, the compiler descended into the @code{block},
 444 @code{let}, @code{tagbody} and @code{return-from} to reach the
 445 @code{progn} printed as the actual source. This is a place where the
 446 ``actual source appears in explanation'' rule was applied. The
 447 innermost actual source form was the symbol @code{*undefined*} itself,
 448 but that also appeared in the explanation, so the compiler backed out
 449 one level.
 450
 451
 452
 453
 454
 455 @node  Handling of Types
 456 @comment  node-name,  next,  previous,  up
 457 @section Handling of Types
 458
 459 One of the most important features of the SBCL compiler (similar to
 460 the original CMUCL compiler, also known as @dfn{Python}) is its fairly
 461 sophisticated understanding of the Common Lisp type system and its
 462 conservative approach to the implementation of type declarations.
 463
 464 These two features reward the use of type declarations throughout
 465 development, even when high performance is not a concern. Also, as
 466 discussed in the chapter on performance (@pxref{Efficiency}), the use
 467 of appropriate type declarations can be very important for performance
 468 as well.
 469
 470 @findex satisfies
 471 The SBCL compiler also has a greater knowledge of the Common Lisp
 472 type system than other compilers. Support is incomplete only for types
 473 involving the @code{satisfies} type specifier.
 474
 475 @c <!-- FIXME: See also sections \ref{advanced-type-stuff}
 476 @c      and \ref{type-inference}, once we snarf them from the
 477 @c      CMU CL manual. -->
 478 @c
 479 @c Also see my paper on improving Baker, when I get round to it.
 480 @c
 481 @c Whose paper?
 482
 483 @menu
 484 * Declarations as Assertions::
 485 * Precise Type Checking::
 486 * Getting Existing Programs to Run::
 487 * Implementation Limitations::
 488 @end menu
 489
 490 @node Declarations as Assertions
 491 @comment  node-name,  next,  previous,  up
 492 @subsection Declarations as Assertions
 493 @findex safety
 494
 495 The SBCL compiler treats type declarations differently from most other
 496 Lisp compilers. Under default compilation policy the compiler doesn't
 497 blindly believe type declarations, but considers them assertions about
 498 the program that should be checked: all type declarations that have
 499 not been proven to always hold are asserted at runtime.
 500
 501 @quotation
 502 @emph{Remaining bugs in the compiler's handling of types unfortunately
 503 provide some exceptions to this rule, see @ref{Implementation
 504 Limitations}.}
 505 @end quotation
 506
 507 CLOS slot types form a notable exception. Types declared using the
 508 @code{:type} slot option in @code{defclass} are asserted if and only
 509 if the class was defined in @emph{safe code} and the slot access
 510 location is in @emph{safe code} as well. This laxness does not pose
 511 any internal consistency issues, as the CLOS slot types are not
 512 available for the type inferencer, nor do CLOS slot types provide any
 513 efficiency benefits.
 514
 515 There are three type checking policies available in SBCL, selectable
 516 via @code{optimize} declarations.
 517
 518 @table @strong
 519
 520 @c FIXME: This should be properly integrated with general policy
 521 @c stuff, once that gets cleaned up.
 522
 523 @item Full Type Checks
 524 All declarations are considered assertions to be checked at runtime,
 525 and all type checks are precise. The default compilation policy
 526 provides full type checks.
 527
 528 Used when @code{(or (>= safety 2) (>= safety speed 1))}.
 529
 530 @item Weak Type Checks
 531 Declared types may be simplified into faster to check supertypes: for example,
 532 @code{(and unsigned-byte fixnum)} is simplified into @code{fixnum}.
 533
 534 @strong{Note}: it is relatively easy to corrupt the heap when weak
 535 type checks are used if the program contains type-errors.
 536
 537 Used when @code{(and (< safety 2) (< safety speed))}
 538
 539 @item No Type Checks
 540 All declarations are believed without assertions. Also disables
 541 argument count and array bounds checking.
 542
 543 @strong{Note}: any type errors in code where type checks are not
 544 performed are liable to corrupt the heap.
 545
 546 Used when @code{(= safety 0)}.
 547
 548 @end table
 549
 550 @node  Precise Type Checking
 551 @comment  node-name,  next,  previous,  up
 552 @subsection Precise Type Checking
 553 @cindex Precise type checking
 554 @cindex Type checking, precise
 555
 556 Precise checking means that the check is done as though @code{typep}
 557 had been called with the exact type specifier that appeared in the
 558 declaration.
 559
 560 If a variable is declared to be @code{(integer 3 17)} then its value
 561 must always be an integer between @code{3} and @code{17}. If multiple
 562 type declarations apply to a single variable, then all the
 563 declarations must be correct; it is as though all the types were
 564 intersected producing a single @code{and} type specifier.
 565
 566 To gain maximum benefit from the compiler's type checking, you should
 567 always declare the types of function arguments and structure slots as
 568 precisely as possible. This often involves the use of @code{or},
 569 @code{member}, and other list-style type specifiers.
 570
 571
 572 @node  Getting Existing Programs to Run
 573 @comment  node-name,  next,  previous,  up
 574 @subsection Getting Existing Programs to Run
 575 @cindex Existing programs, to run
 576 @cindex Types, portability
 577 @cindex Compatibility with other Lisps
 578 @c     (should also have an entry in the non-ANSI-isms section)-->
 579
 580 Since SBCL's compiler does much more comprehensive type checking than
 581 most Lisp compilers, SBCL may detect type errors in programs that have
 582 been debugged using other compilers. These errors are mostly incorrect
 583 declarations, although compile-time type errors can find actual bugs
 584 if parts of the program have never been tested.
 585
 586 Some incorrect declarations can only be detected by run-time type
 587 checking. It is very important to initially compile a program with
 588 full type checks (high @code{safety} optimization) and then test this
 589 safe version. After the checking version has been tested, then you can
 590 consider weakening or eliminating type checks.  @emph{This applies
 591 even to previously debugged programs,} because the SBCL compiler does
 592 much more type inference than other Common Lisp compilers, so an
 593 incorrect declaration can do more damage.
 594
 595 The most common problem is with variables whose constant initial value
 596 doesn't match the type declaration. Incorrect constant initial values
 597 will always be flagged by a compile-time type error, and they are
 598 simple to fix once located. Consider this code fragment:
 599
 600 @lisp
 601 (prog (foo)
 602   (declare (fixnum foo))
 603   (setq foo ...)
 604   ...)
 605 @end lisp
 606
 607 Here @code{foo} is given an initial value of @code{nil}, but is
 608 declared to be a @code{fixnum}.  Even if it is never read, the initial
 609 value of a variable must match the declared type.  There are two ways
 610 to fix this problem. Change the declaration
 611
 612 @lisp
 613 (prog (foo)
 614   (declare (type (or fixnum null) foo))
 615   (setq foo ...)
 616   ...)
 617 @end lisp
 618
 619 or change the initial value
 620
 621 @lisp
 622 (prog ((foo 0))
 623   (declare (fixnum foo))
 624   (setq foo ...)
 625   ...)
 626 @end lisp
 627
 628 It is generally preferable to change to a legal initial value rather
 629 than to weaken the declaration, but sometimes it is simpler to weaken
 630 the declaration than to try to make an initial value of the
 631 appropriate type.
 632
 633 Another declaration problem occasionally encountered is incorrect
 634 declarations on @code{defmacro} arguments. This can happen when a
 635 function is converted into a macro. Consider this macro:
 636
 637 @lisp
 638 (defmacro my-1+ (x)
 639   (declare (fixnum x))
 640   `(the fixnum (1+ ,x)))
 641 @end lisp
 642
 643 Although legal and well-defined Common Lisp code, this meaning of this
 644 definition is almost certainly not what the writer intended. For
 645 example, this call is illegal:
 646
 647 @lisp
 648 (my-1+ (+ 4 5))
 649 @end lisp
 650
 651 This call is illegal because the argument to the macro is @code{(+ 4
 652 5)}, which is a @code{list}, not a @code{fixnum}.  Because of macro
 653 semantics, it is hardly ever useful to declare the types of macro
 654 arguments.  If you really want to assert something about the type of
 655 the result of evaluating a macro argument, then put a @code{the} in
 656 the expansion:
 657
 658 @lisp
 659 (defmacro my-1+ (x)
 660   `(the fixnum (1+ (the fixnum ,x))))
 661 @end lisp
 662
 663 In this case, it would be stylistically preferable to change this
 664 macro back to a function and declare it inline.
 665 @c <!--FIXME: <xref>inline-expansion, once we crib the
 666 @c      relevant text from the CMU CL manual.-->
 667
 668 Some more subtle problems are caused by incorrect declarations that
 669 can't be detected at compile time.  Consider this code:
 670
 671 @lisp
 672 (do ((pos 0 (position #\a string :start (1+ pos))))
 673   ((null pos))
 674   (declare (fixnum pos))
 675   ...)
 676 @end lisp
 677
 678 Although @code{pos} is almost always a @code{fixnum}, it is @code{nil}
 679 at the end of the loop. If this example is compiled with full type
 680 checks (the default), then running it will signal a type error at the
 681 end of the loop. If compiled without type checks, the program will go
 682 into an infinite loop (or perhaps @code{position} will complain
 683 because @code{(1+ nil)} isn't a sensible start.) Why? Because if you
 684 compile without type checks, the compiler just quietly believes the
 685 type declaration. Since the compiler believes that @code{pos} is
 686 always a @code{fixnum}, it believes that @code{pos} is never
 687 @code{nil}, so @code{(null pos)} is never true, and the loop exit test
 688 is optimized away. Such errors are sometimes flagged by unreachable
 689 code notes, but it is still important to initially compile and test
 690 any system with full type checks, even if the system works fine when
 691 compiled using other compilers.
 692
 693 In this case, the fix is to weaken the type declaration to @code{(or
 694 fixnum null)} @footnote{Actually, this declaration is unnecessary in
 695 SBCL, since it already knows that @code{position} returns a
 696 non-negative @code{fixnum} or @code{nil}.}.
 697
 698 Note that there is usually little performance penalty for weakening a
 699 declaration in this way. Any numeric operations in the body can still
 700 assume that the variable is a @code{fixnum}, since @code{nil} is not a
 701 legal numeric argument. Another possible fix would be to say:
 702
 703 @lisp
 704 (do ((pos 0 (position #\a string :start (1+ pos))))
 705     ((null pos))
 706   (let ((pos pos))
 707     (declare (fixnum pos))
 708     ...))
 709 @end lisp
 710
 711 This would be preferable in some circumstances, since it would allow a
 712 non-standard representation to be used for the local @code{pos}
 713 variable in the loop body.
 714 @c <!-- FIXME: <xref>ND-variables, once we crib the text from the
 715 @c      CMU CL manual. -->
 716
 717 @node  Implementation Limitations
 718 @comment  node-name,  next,  previous,  up
 719 @subsection Implementation Limitations
 720
 721 Ideally, the compiler would consider @emph{all} type declarations to
 722 be assertions, so that adding type declarations to a program, no
 723 matter how incorrect they might be, would @emph{never} cause undefined
 724 behavior. However, the compiler is known to fall short of this goal in
 725 two areas:
 726
 727 @itemize
 728
 729 @item
 730 @emph{Proclaimed} constraints on argument and result types of a
 731 function are supposed to be checked by the function. If the function
 732 type is proclaimed before function definition, type checks are
 733 inserted by the compiler, but the standard allows the reversed order,
 734 in which case the compiler will trust the declaration.
 735
 736 @item
 737 The compiler cannot check types of an unknown number of values; if the
 738 number of generated values is unknown, but the number of consumed is
 739 known, only consumed values are checked.
 740
 741 For example,
 742
 743 @lisp
 744 (defun foo (x)
 745   (the integer (bar x)))
 746 @end lisp
 747
 748 causes the following compiler diagnostic to be emitted:
 749
 750 @example
 751 ; note: type assertion too complex to check:
 752 ;  (VALUES INTEGER &REST T).
 753 @end example
 754
 755 A partial workaround is instead write:
 756
 757 @lisp
 758 (defun foo (x)
 759   (the (values integer &optional) (bar x)))
 760 @end lisp
 761
 762 @end itemize
 763
 764 These are important issues, but are not necessarily easy to fix, so
 765 they may, alas, remain in the system for a while.
 766
 767 @node Compiler Policy
 768 @comment  node-name,  next,  previous,  up
 769 @section Compiler Policy
 770
 771 Compiler policy is controlled by the @code{optimize} declaration,
 772 supporting all ANSI optimization qualities (@code{debug},
 773 @code{safety}, @code{space}, and @code{speed}).@footnote{A deprecated
 774 extension @code{sb-ext:inhibit-warnings} is still supported, but
 775 liable to go away at any time.}
 776
 777 For effects of various optimization qualities on type-safety and
 778 debuggability see @ref{Declarations as Assertions} and @ref{Debugger
 779 Policy Control}.
 780
 781 Ordinarily, when the @code{speed} quality is high, the compiler emits
 782 notes to notify the programmer about its inability to apply various
 783 optimizations. For selective muffling of these notes @xref{Controlling
 784 Verbosity}.
 785
 786 The value of @code{space} mostly influences the compiler's decision
 787 whether to inline operations, which tend to increase the size of
 788 programs. Use the value @code{0} with caution, since it can cause the
 789 compiler to inline operations so indiscriminately that the net effect
 790 is to slow the program by causing cache misses or even swapping.
 791
 792 @c <!-- FIXME: old CMU CL compiler policy, should perhaps be adapted
 793 @c      _    for SBCL. (Unfortunately, the CMU CL docs are out of sync with the
 794 @c      _    CMU CL code, so adapting this requires not only reformatting
 795 @c      _    the documentation, but rooting out code rot.)
 796 @c      _
 797 @c      _<sect2 id="compiler-policy"><title>Compiler Policy</1000
 798 @c      _  INDEX {policy}{compiler}
 799 @c      _  INDEX compiler policy
 800 @c      _
 801 @c      _<para>The policy is what tells the compiler <emphasis>how</emphasis> to
 802 @c      _compile a program. This is logically (and often textually) distinct
 803 @c      _from the program itself. Broad control of policy is provided by the
 804 @c      _<parameter>optimize</parameter> declaration; other declarations and variables
 805 @c      _control more specific aspects of compilation.
 806 @c      _
 807 @c      _\begin{comment}
 808 @c      _* The Optimize Declaration::
 809 @c      _* The Optimize-Interface Declaration::
 810 @c      _\end{comment}
 811 @c      _
 812 @c      _%%\node The Optimize Declaration, The Optimize-Interface Declaration, Compiler Policy, Compiler Policy
 813 @c      _\subsection{The Optimize Declaration}
 814 @c      _\label{optimize-declaration}
 815 @c      _\cindex{optimize declaration}
 816 @c      _\cpsubindex{declarations}{\code{optimize}}
 817 @c      _
 818 @c      _The \code{optimize} declaration recognizes six different
 819 @c      _\var{qualities}.  The qualities are conceptually independent aspects
 820 @c      _of program performance.  In reality, increasing one quality tends to
 821 @c      _have adverse effects on other qualities.  The compiler compares the
 822 @c      _relative values of qualities when it needs to make a trade-off; i.e.,
 823 @c      _if \code{speed} is greater than \code{safety}, then improve speed at
 824 @c      _the cost of safety.
 825 @c      _
 826 @c      _The default for all qualities (except \code{debug}) is \code{1}.
 827 @c      _Whenever qualities are equal, ties are broken according to a broad
 828 @c      _idea of what a good default environment is supposed to be.  Generally
 829 @c      _this downplays \code{speed}, \code{compile-speed} and \code{space} in
 830 @c      _favor of \code{safety} and \code{debug}.  Novice and casual users
 831 @c      _should stick to the default policy.  Advanced users often want to
 832 @c      _improve speed and memory usage at the cost of safety and
 833 @c      _debuggability.
 834 @c      _
 835 @c      _If the value for a quality is \code{0} or \code{3}, then it may have a
 836 @c      _special interpretation.  A value of \code{0} means ``totally
 837 @c      _unimportant'', and a \code{3} means ``ultimately important.''  These
 838 @c      _extreme optimization values enable ``heroic'' compilation strategies
 839 @c      _that are not always desirable and sometimes self-defeating.
 840 @c      _Specifying more than one quality as \code{3} is not desirable, since
 841 @c      _it doesn't tell the compiler which quality is most important.
 842 @c      _
 843 @c      _
 844 @c      _These are the optimization qualities:
 845 @c      _\begin{Lentry}
 846 @c      _
 847 @c      _\item[\code{speed}] \cindex{speed optimization quality}How fast the
 848 @c      _  program should is run.  \code{speed 3} enables some optimizations
 849 @c      _  that hurt debuggability.
 850 @c      _
 851 @c      _\item[\code{compilation-speed}] \cindex{compilation-speed optimization
 852 @c      _    quality}How fast the compiler should run.  Note that increasing
 853 @c      _  this above \code{safety} weakens type checking.
 854 @c      _
 855 @c      _\item[\code{space}] \cindex{space optimization quality}How much space
 856 @c      _  the compiled code should take up.  Inline expansion is mostly
 857 @c      _  inhibited when \code{space} is greater than \code{speed}.  A value
 858 @c      _  of \code{0} enables indiscriminate inline expansion.  Wide use of a
 859 @c      _  \code{0} value is not recommended, as it may waste so much space
 860 @c      _  that run time is slowed.  \xlref{inline-expansion} for a discussion
 861 @c      _  of inline expansion.
 862 @c      _
 863 @c      _\item[\code{debug}] \cindex{debug optimization quality}How debuggable
 864 @c      _  the program should be.  The quality is treated differently from the
 865 @c      _  other qualities: each value indicates a particular level of debugger
 866 @c      _  information; it is not compared with the other qualities.
 867 @c      _  \xlref{debugger-policy} for more details.
 868 @c      _
 869 @c      _\item[\code{safety}] \cindex{safety optimization quality}How much
 870 @c      _  error checking should be done.  If \code{speed}, \code{space} or
 871 @c      _  \code{compilation-speed} is more important than \code{safety}, then
 872 @c      _  type checking is weakened (\pxlref{weakened-type-checks}).  If
 873 @c      _  \code{safety} if \code{0}, then no run time error checking is done.
 874 @c      _  In addition to suppressing type checks, \code{0} also suppresses
 875 @c      _  argument count checking, unbound-symbol checking and array bounds
 876 @c      _  checks.
 877 @c      _  ... and checking of tag existence in RETURN-FROM and GO.
 878 @c      _
 879 @c      _\item[\code{extensions:inhibit-warnings}] \cindex{inhibit-warnings
 880 @c      _    optimization quality}This is a CMU extension that determines how
 881 @c      _  little (or how much) diagnostic output should be printed during
 882 @c      _  compilation.  This quality is compared to other qualities to
 883 @c      _  determine whether to print style notes and warnings concerning those
 884 @c      _  qualities.  If \code{speed} is greater than \code{inhibit-warnings},
 885 @c      _  then notes about how to improve speed will be printed, etc.  The
 886 @c      _  default value is \code{1}, so raising the value for any standard
 887 @c      _  quality above its default enables notes for that quality.  If
 888 @c      _  \code{inhibit-warnings} is \code{3}, then all notes and most
 889 @c      _  non-serious warnings are inhibited.  This is useful with
 890 @c      _  \code{declare} to suppress warnings about unavoidable problems.
 891 @c      _\end{Lentry}
 892 @c      _
 893 @c      _%%\node The Optimize-Interface Declaration,  , The Optimize Declaration, Compiler Policy
 894 @c      _\subsection{The Optimize-Interface Declaration}
 895 @c      _\label{optimize-interface-declaration}
 896 @c      _\cindex{optimize-interface declaration}
 897 @c      _\cpsubindex{declarations}{\code{optimize-interface}}
 898 @c      _
 899 @c      _The \code{extensions:optimize-interface} declaration is identical in
 900 @c      _syntax to the \code{optimize} declaration, but it specifies the policy
 901 @c      _used during compilation of code the compiler automatically generates
 902 @c      _to check the number and type of arguments supplied to a function.  It
 903 @c      _is useful to specify this policy separately, since even thoroughly
 904 @c      _debugged functions are vulnerable to being passed the wrong arguments.
 905 @c      _The \code{optimize-interface} declaration can specify that arguments
 906 @c      _should be checked even when the general \code{optimize} policy is
 907 @c      _unsafe.
 908 @c      _
 909 @c      _Note that this argument checking is the checking of user-supplied
 910 @c      _arguments to any functions defined within the scope of the
 911 @c      _declaration, \code{not} the checking of arguments to \llisp{}
 912 @c      _primitives that appear in those definitions.
 913 @c      _
 914 @c      _The idea behind this declaration is that it allows the definition of
 915 @c      _functions that appear fully safe to other callers, but that do no
 916 @c      _internal error checking.  Of course, it is possible that arguments may
 917 @c      _be invalid in ways other than having incorrect type.  Functions
 918 @c      _compiled unsafely must still protect themselves against things like
 919 @c      _user-supplied array indices that are out of bounds and improper lists.
 920 @c      _See also the \kwd{context-declarations} option to
 921 @c      _\macref{with-compilation-unit}.
 922 @c      _
 923 @c      _(end of section on compiler policy)
 924 @c      _-->
 925
 926 @node Compiler Errors
 927 @comment  node-name,  next,  previous,  up
 928 @section Compiler Errors
 929
 930 @menu
 931 * Type Errors at Compile Time::
 932 * Errors During Macroexpansion::
 933 * Read Errors::
 934 @end menu
 935
 936 @node  Type Errors at Compile Time
 937 @comment  node-name,  next,  previous,  up
 938 @subsection Type Errors at Compile Time
 939 @cindex Compile time type errors
 940 @cindex Type checking, at compile time
 941
 942 If the compiler can prove at compile time that some portion of the
 943 program cannot be executed without a type error, then it will give a
 944 warning at compile time.
 945
 946 It is possible that the offending code would never actually be
 947 executed at run-time due to some higher level consistency constraint
 948 unknown to the compiler, so a type warning doesn't always indicate an
 949 incorrect program.
 950
 951 For example, consider this code fragment:
 952
 953 @lisp
 954 (defun raz (foo)
 955   (let ((x (case foo
 956               (:this 13)
 957               (:that 9)
 958               (:the-other 42))))
 959     (declare (fixnum x))
 960     (foo x)))
 961 @end lisp
 962
 963 Compilation produces this warning:
 964
 965 @example
 966 ; in: DEFUN RAZ
 967 ;     (CASE FOO (:THIS 13) (:THAT 9) (:THE-OTHER 42))
 968 ; --> LET COND IF COND IF COND IF
 969 ; ==>
 970 ;   (COND)
 971 ;
 972 ; caught WARNING:
 973 ;   This is not a FIXNUM:
 974 ;   NIL
 975 @end example
 976
 977 In this case, the warning means that if @code{foo} isn't any of
 978 @code{:this}, @code{:that} or @code{:the-other}, then @code{x} will be
 979 initialized to @code{nil}, which the @code{fixnum} declaration makes
 980 illegal. The warning will go away if @code{ecase} is used instead of
 981 @code{case}, or if @code{:the-other} is changed to @code{t}.
 982
 983 This sort of spurious type warning happens moderately often in the
 984 expansion of complex macros and in inline functions. In such cases,
 985 there may be dead code that is impossible to correctly execute. The
 986 compiler can't always prove this code is dead (could never be
 987 executed), so it compiles the erroneous code (which will always signal
 988 an error if it is executed) and gives a warning.
 989
 990 @node  Errors During Macroexpansion
 991 @comment  node-name,  next,  previous,  up
 992 @subsection Errors During Macroexpansion
 993 @cindex Macroexpansion, errors during
 994
 995 The compiler handles errors that happen during macroexpansion, turning
 996 them into compiler errors. If you want to debug the error (to debug a
 997 macro), you can set @code{*break-on-signals*} to @code{error}. For
 998 example, this definition:
 999
1000 @lisp
1001 (defun foo (e l)
1002   (do ((current l (cdr current))
1003        ((atom current) nil))
1004       (when (eq (car current) e) (return current))))
1005 @end lisp
1006
1007 gives this error:
1008
1009 @example
1010 ; in: DEFUN FOO
1011 ;     (DO ((CURRENT L (CDR CURRENT))
1012 ;        ((ATOM CURRENT) NIL))
1013 ;       (WHEN (EQ (CAR CURRENT) E) (RETURN CURRENT)))
1014 ;
1015 ; caught ERROR:
1016 ;   (in macroexpansion of (DO # #))
1017 ;   (hint: For more precise location, try *BREAK-ON-SIGNALS*.)
1018 ;   DO step variable is not a symbol: (ATOM CURRENT)
1019 @end example
1020
1021
1022 @node  Read Errors
1023 @comment  node-name,  next,  previous,  up
1024 @subsection Read Errors
1025 @cindex Read errors, compiler
1026
1027 SBCL's compiler does not attempt to recover from read errors when
1028 reading a source file, but instead just reports the offending
1029 character position and gives up on the entire source file.
1030
1031 @node  Open Coding and Inline Expansion
1032 @comment  node-name,  next,  previous,  up
1033 @section Open Coding and Inline Expansion
1034 @cindex Open-coding
1035 @cindex Inline expansion
1036 @cindex Static functions
1037
1038 Since Common Lisp forbids the redefinition of standard functions, the
1039 compiler can have special knowledge of these standard functions
1040 embedded in it. This special knowledge is used in various ways (open
1041 coding, inline expansion, source transformation), but the implications
1042 to the user are basically the same:
1043
1044 @itemize
1045
1046 @item
1047 Attempts to redefine standard functions may be frustrated, since the
1048 function may never be called. Although it is technically illegal to
1049 redefine standard functions, users sometimes want to implicitly
1050 redefine these functions when they are debugging using the
1051 @code{trace} macro.  Special-casing of standard functions can be
1052 inhibited using the @code{notinline} declaration, but even then some
1053 phases of analysis such as type inferencing are applied by the
1054 compiler.
1055
1056 @item
1057 The compiler can have multiple alternate implementations of standard
1058 functions that implement different trade-offs of speed, space and
1059 safety.  This selection is based on the compiler policy, @ref{Compiler
1060 Policy}.
1061
1062 @end itemize
1063
1064 When a function call is @emph{open coded}, inline code whose effect is
1065 equivalent to the function call is substituted for that function
1066 call. When a function call is @emph{closed coded}, it is usually left
1067 as is, although it might be turned into a call to a different function
1068 with different arguments. As an example, if @code{nthcdr} were to be
1069 open coded, then
1070
1071 @lisp
1072 (nthcdr 4 foobar)
1073 @end lisp
1074
1075 might turn into
1076
1077 @lisp
1078 (cdr (cdr (cdr (cdr foobar))))
1079 @end lisp
1080
1081 or even
1082
1083 @lisp
1084 (do ((i 0 (1+ i))
1085   (list foobar (cdr foobar)))
1086   ((= i 4) list))
1087 @end lisp
1088
1089 If @code{nth} is closed coded, then
1090
1091 @lisp
1092 (nth x l)
1093 @end lisp
1094
1095 might stay the same, or turn into something like
1096
1097 @lisp
1098 (car (nthcdr x l))
1099 @end lisp
1100
1101 In general, open coding sacrifices space for speed, but some functions
1102 (such as @code{car}) are so simple that they are always
1103 open-coded. Even when not open-coded, a call to a standard function
1104 may be transformed into a different function call (as in the last
1105 example) or compiled as @emph{static call}. Static function call uses
1106 a more efficient calling convention that forbids redefinition.
1107
1108 @node  Interpreter
1109 @comment  node-name,  next,  previous,  up
1110 @section Interpreter
1111 @cindex Interpreter
1112 @vindex sb-ext:*evaluator-mode*
1113
1114 By default SBCL implements @code{eval} by calling the native code
1115 compiler. SBCL also includes an interpreter for use in special cases
1116 where using the compiler is undesirable, for example due to compilation
1117 overhead. Unlike in some other Lisp implementations, in SBCL interpreted
1118 code is not safer or more debuggable than compiled code.
1119
1120 Switching between the compiler and the interpreter is done using the
1121 special variable @code{sb-ext:*evaluator-mode*}. As of 0.9.17, valid
1122 values for @code{sb-ext:*evaluator-mode*} are @code{:compile} and
1123 @code{:interpret}.