00:20:19 bfulgham [~brent@wsip-72-215-191-226.sb.sd.cox.net] has joined #ccl
00:48:39 -!- smolyn [~smolyn@76.77.66.100] has quit [Ping timeout: 260 seconds]
03:25:26 bfulgham_ [~brent@adsl-69-234-130-16.dsl.irvnca.pacbell.net] has joined #ccl
03:32:48 -!- bfulgham_ [~brent@adsl-69-234-130-16.dsl.irvnca.pacbell.net] has quit [Read error: Connection reset by peer]
04:41:58 -!- rme [rme@clozure-70A3CE26.chi.dsl-w.verizon.net] has quit [Quit: rme]
04:41:58 -!- rme [~rme@pool-70-105-119-125.chi.dsl-w.verizon.net] has quit [Quit: rme]
05:01:05 smolyn [~smolyn@S0106002500fe5c20.vc.shawcable.net] has joined #ccl
05:01:23 -!- smolyn [~smolyn@S0106002500fe5c20.vc.shawcable.net] has quit [Remote host closed the connection]
05:24:16 leo2007 [~leo@cpc1-cmbg13-0-0-cust596.cmbg.cable.ntl.com] has joined #ccl
05:32:23 -!- billstclair [~billstcla@unaffiliated/billstclair] has quit [Ping timeout: 260 seconds]
06:06:15 billstclair [~billstcla@unaffiliated/billstclair] has joined #ccl
07:01:08 -!- billstclair [~billstcla@unaffiliated/billstclair] has quit [Ping timeout: 265 seconds]
07:21:13 billstclair [~billstcla@unaffiliated/billstclair] has joined #ccl
09:48:45 -!- billstclair [~billstcla@unaffiliated/billstclair] has quit [Ping timeout: 240 seconds]
09:49:52 billstclair [~billstcla@unaffiliated/billstclair] has joined #ccl
10:25:45 roffe [~roffe@vpn-client134.uio.no] has joined #ccl
11:41:10 tfb [~tfb@212.183.140.60] has joined #ccl
12:52:40 alms [~alms@mobile-166-137-138-135.mycingular.net] has joined #ccl
14:11:39 roffe_ [~roffe@195.159.90.93] has joined #ccl
14:13:45 -!- roffe [~roffe@vpn-client134.uio.no] has quit [Ping timeout: 240 seconds]
14:13:45 -!- roffe_ is now known as roffe
14:27:26 LiamH [~none@pdp8.nrl.navy.mil] has joined #ccl
14:37:52 -!- clop [~jared@moat3.centtech.com] has quit [Quit: Leaving]
14:40:27 clop [~jared@moat3.centtech.com] has joined #ccl
14:59:28 -!- roffe [~roffe@195.159.90.93] has quit [Remote host closed the connection]
14:59:46 roffe [~roffe@vpn-client373.uio.no] has joined #ccl
15:11:52 -!- alms [alms@clozure-F047E6A0.mycingular.net] has quit [Ping timeout]
15:12:47 -!- alms [~alms@mobile-166-137-138-135.mycingular.net] has quit [Ping timeout: 258 seconds]
15:31:21 Is there a good way to debug a glibc error that causes CCL to crash?
15:32:26 the crash is linux-only?
15:34:05 milanj [~milanj_@109.93.23.80] has joined #ccl
15:37:02 hypno: I only run on linux, so hard to tell.
15:37:22 leo2007: you are running OS X, aren't you?
15:37:43 ok. do you have some code that will reproduce the crash?
15:37:54 LiamH: yes
15:38:03 LiamH: crash here too I think.
15:38:05 hypno: large system, yes
15:38:18 leo2007: try this
15:38:40 In a fresh GSLL, (lisp-unit:run-tests mathieu) twice
15:39:45 LiamH: no crash.
15:40:30 leo2007: you got good results both times?
15:40:52 yeah
15:40:55 same results
15:41:03 i'm running the whole test suite
15:41:11 hypno: yes, so linux only
15:41:16 LiamH: TOTAL: 4028 assertions passed, 39 failed, 1 execution errors.
15:41:36 leo2007: no, mathieu only
15:41:50 (lisp-unit:run-tests mathieu)
15:41:56 do it twice in a row
15:42:45 LiamH: no crash. even using the test suite.
15:44:19 LiamH: I am pretty sure osx is immune to the crash.
15:47:02 http://paste.lisp.org/+2E04
15:47:48 It actually completes the second set of tests, and then crashes.
15:52:25 alms [~alms@mobile-166-137-139-089.mycingular.net] has joined #ccl
16:31:28 rme [~rme@pool-70-105-119-125.chi.dsl-w.verizon.net] has joined #ccl
16:44:21 -!- alms [alms@clozure-A021EBC8.mycingular.net] has quit [Quit: alms]
16:44:21 -!- alms [~alms@mobile-166-137-139-089.mycingular.net] has quit [Quit: alms]
17:06:44 -!- tfb [~tfb@212.183.140.60] has quit [Ping timeout: 258 seconds]
17:12:38 alms [~alms@mobile-166-137-136-181.mycingular.net] has joined #ccl
17:29:35 -!- alms [alms@clozure-1B60B4D1.mycingular.net] has quit [Quit: alms]
17:29:35 -!- alms [~alms@mobile-166-137-136-181.mycingular.net] has quit [Quit: alms]
17:33:00 -!- milanj [~milanj_@109.93.23.80] has quit [Ping timeout: 240 seconds]
17:38:15 milanj [~milanj_@178.223.145.236] has joined #ccl
19:09:11 LiamH: that seems to be malloc heap corruption. The general approach to debugging it is to try to get malloc/free et al to do more consistency checks in hopes of catching the cause of the problem as soon as possible (e.g., the call to free() in the backtrace might be correct, but something may have scrambled things earlier.)
19:42:34 gbyers: I agree the message is not appearing when the error happens, but later.
19:43:08 Which makes it very hard to debug.
19:43:29 So how do I get malloc/free to do consistency checks?
19:44:50 Linux historically supported something called 'mtrace', which logs info about malloc/free calls. There's an interface to it in ccl:library;leaks.lisp. I looked briefly at the malloc man page on a Fedora 13 system and it didn't mention mtrace, so it may not be supported.
19:45:32 (or whether it is supported or not may depend on libc version; I'm not sure.)
19:46:09 LiamH: I think there's an environment variable "MALLOC_CHECK_" or something. See the malloc man page.
19:46:13 ? (ccl::external "mtrace")
20:18:26 export MALLOC_CHECK_=3 gives me the exact same result, so there's no heap corruption anyway
20:25:07 gbyers: I find that some errors could cause ccl to crash but on sbcl just bring up the lisp debugger.
20:38:38 LiamH: heap corruption is what the crash is complaining about; setting MALLOC_CHECK_ didn't find it, but that's not really conclusive.
20:46:03 gbyers: Right, man malloc says " Not all such errors can be protected against, however" so I assume heap corruption is still possible.
21:02:02 lispm [~lispm@g224126237.adsl.alicedsl.de] has joined #ccl
21:21:22 -!- lispm [~lispm@g224126237.adsl.alicedsl.de] has quit [Remote host closed the connection]
21:49:55 -!- LiamH [~none@pdp8.nrl.navy.mil] has quit [Quit: Leaving.]
21:54:47 -!- milanj [~milanj_@178.223.145.236] has quit [Ping timeout: 245 seconds]
21:57:14 milanj [~milanj_@178.223.145.236] has joined #ccl
22:02:59 -!- milanj [~milanj_@178.223.145.236] has quit [Quit: Leaving]
22:25:40 palter [~palter@2002:4b44:b1e1:0:5ab0:35ff:fe78:6749] has joined #ccl
22:25:54 -!- palter [~palter@2002:4b44:b1e1:0:5ab0:35ff:fe78:6749] has quit [Client Quit]
22:36:36 -!- bfulgham [~brent@wsip-72-215-191-226.sb.sd.cox.net] has left #ccl
22:39:39 palter [~palter@c-75-68-177-225.hsd1.ma.comcast.net] has joined #ccl
22:39:57 -!- palter [~palter@c-75-68-177-225.hsd1.ma.comcast.net] has quit [Client Quit]
23:05:40 bfulgham [~brent@wsip-72-215-191-226.sb.sd.cox.net] has joined #ccl
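
Note on the MALLOC_CHECK_ suggestion from 19:46:09 and 20:18:26: a minimal shell sketch of scoping that setting to a single run instead of exporting it for the whole session. The helper name `run_with_malloc_check` and the `ccl --load run-tests.lisp` invocation in the comment are hypothetical and not taken from the log; only the `MALLOC_CHECK_=3` value is. As noted in the log, a clean run under this setting is not conclusive, and depending on the glibc version the checks may also need the separate malloc debugging library preloaded.

```shell
# Hypothetical helper: run one command with glibc's malloc consistency
# checks enabled, without changing the environment of the calling shell.
# Value 3 asks glibc to print a diagnostic and abort on a detected error.
run_with_malloc_check() {
    env MALLOC_CHECK_=3 "$@"
}

# Example use (assumed command line, adjust to your own setup):
#   run_with_malloc_check ccl --load run-tests.lisp
```

Using `env` rather than `export` keeps the variable out of later, unrelated commands, which makes it easier to compare a checked run against an unchecked one.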