Questions and Answers : Unix/Linux : restart results in errors
Message board moderation

To post messages, you must log in.

AuthorMessage
tcp-ip

Send message
Joined: 2 Dec 05
Posts: 2
Credit: 297,949
RAC: 0
Message 23582 - Posted: 24 Oct 2011, 21:09:46 UTC

I'm running lhc@home on Ubuntu 11.10 64-bit, BOINC client 6.12.33. Today my system was working on four Sixtrack workunits: 429643, 429649, 434249, 435448. After a system reboot I found out that all the four workunits had been aborted with the "error while computing" message. They were progressing with no troubles before the reboot, and the other BOINC projects stopped and resumed correctly.
What can I do lest it happens again?
ID: 23582 · Report as offensive     Reply Quote
Profile jujube

Send message
Joined: 25 Jan 11
Posts: 179
Credit: 83,858
RAC: 0
Message 23583 - Posted: 24 Oct 2011, 22:49:19 UTC - in response to Message 23582.  

Did you shutdown BOINC client before you rebooted? Maybe the Sixtrack application doesn't like being shutdown by the operating system. Click Advanced -> Shutdown Connected Client -> OK -> Cancel to be sure the client and science applications are shutdown and then reboot.
ID: 23583 · Report as offensive     Reply Quote
tcp-ip

Send message
Joined: 2 Dec 05
Posts: 2
Credit: 297,949
RAC: 0
Message 23584 - Posted: 25 Oct 2011, 9:09:46 UTC - in response to Message 23583.  
Last modified: 25 Oct 2011, 9:10:29 UTC

Today I tried a reboot in this way but It didn't work. I lost 4 more workunits in the same way. I think that there is some serious bug.

This is a complete log after the reboot:



Tue 25 Oct 2011 10:55:02 AM CEST | | Starting BOINC client version 6.12.33 for x86_64-pc-linux-gnu
Tue 25 Oct 2011 10:55:02 AM CEST | | Config: GUI RPC allowed from:
Tue 25 Oct 2011 10:55:02 AM CEST | | log flags: file_xfer, sched_ops, task
Tue 25 Oct 2011 10:55:02 AM CEST | | Libraries: libcurl/7.21.6 OpenSSL/1.0.0e zlib/1.2.3.4 libidn/1.22 librtmp/2.3
Tue 25 Oct 2011 10:55:02 AM CEST | | Data directory: /var/lib/boinc-client
Tue 25 Oct 2011 10:55:02 AM CEST | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-2630QM CPU @ 2.00GHz [Family 6 Model 42 Stepping 7]
Tue 25 Oct 2011 10:55:02 AM CEST | | Processor: 6.00 MB cache
Tue 25 Oct 2011 10:55:02 AM CEST | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf pni pclmulqd
Tue 25 Oct 2011 10:55:02 AM CEST | | OS: Linux: 3.0.0-12-generic
Tue 25 Oct 2011 10:55:02 AM CEST | | Memory: 5.75 GB physical, 3.73 GB virtual
Tue 25 Oct 2011 10:55:02 AM CEST | | Disk: 570.02 GB total, 445.78 GB free
Tue 25 Oct 2011 10:55:02 AM CEST | | Local time is UTC +2 hours
Tue 25 Oct 2011 10:55:02 AM CEST | | No usable GPUs found
Tue 25 Oct 2011 10:55:02 AM CEST | | A new version of BOINC is available. <a href=http://boinc.berkeley.edu/download.php>Download it.</a>
Tue 25 Oct 2011 10:55:02 AM CEST | Einstein@Home | URL http://einstein.phys.uwm.edu/; Computer ID 4232364; resource share 30
Tue 25 Oct 2011 10:55:02 AM CEST | FreeHAL@home | URL http://www.freehal.net/freehal_at_home/; Computer ID 60645; resource share 60
Tue 25 Oct 2011 10:55:02 AM CEST | rosetta@home | URL http://boinc.bakerlab.org/rosetta/; Computer ID 1483415; resource share 30
Tue 25 Oct 2011 10:55:02 AM CEST | LHC@home 1.0 | URL http://lhcathomeclassic.cern.ch/sixtrack/; Computer ID 9934485; resource share 100
Tue 25 Oct 2011 10:55:02 AM CEST | climateprediction.net | URL http://climateprediction.net/; Computer ID 1172856; resource share 60
Tue 25 Oct 2011 10:55:02 AM CEST | Einstein@Home | General prefs: from Einstein@Home (last modified 21-Aug-2007 11:23:47)
Tue 25 Oct 2011 10:55:02 AM CEST | Einstein@Home | Computer location: home
Tue 25 Oct 2011 10:55:02 AM CEST | Einstein@Home | General prefs: no separate prefs for home; using your defaults
Tue 25 Oct 2011 10:55:02 AM CEST | | Reading preferences override file
Tue 25 Oct 2011 10:55:02 AM CEST | | Preferences:
Tue 25 Oct 2011 10:55:02 AM CEST | | max memory usage when active: 2942.94MB
Tue 25 Oct 2011 10:55:02 AM CEST | | max memory usage when idle: 5297.30MB
Tue 25 Oct 2011 10:55:02 AM CEST | | max disk usage: 10.00GB
Tue 25 Oct 2011 10:55:02 AM CEST | | (to change preferences, visit the web site of an attached project, or select Preferences in the Manager)
Tue 25 Oct 2011 10:55:02 AM CEST | | Not using a proxy
Tue 25 Oct 2011 10:55:16 AM CEST | climateprediction.net | Restarting task hadcm3n_u2ry_1980_40_007459100_4 using hadcm3n version 607
Tue 25 Oct 2011 10:55:16 AM CEST | climateprediction.net | Restarting task hadam3p_eu_60jb_2002_1_007481858_0 using hadam3p_eu version 609
Tue 25 Oct 2011 10:55:16 AM CEST | LHC@home 1.0 | Restarting task w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__10.5_1_sixvf_boinc62046_1 using sixtrack version 53010
Tue 25 Oct 2011 10:55:16 AM CEST | LHC@home 1.0 | Restarting task w3_weak5a_collision_err_bb__24__s__64.31_59.32__8_10__6__87_1_sixvf_boinc61979_0 using sixtrack version 53010
Tue 25 Oct 2011 10:55:16 AM CEST | LHC@home 1.0 | Restarting task w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__1.5_1_sixvf_boinc62040_0 using sixtrack version 53010
Tue 25 Oct 2011 10:55:16 AM CEST | LHC@home 1.0 | Restarting task w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__45_1_sixvf_boinc62069_1 using sixtrack version 53010
Tue 25 Oct 2011 10:55:16 AM CEST | FreeHAL@home | Restarting task fh_nci_0_42210995_82_0 using newFreeHAL version 193
Tue 25 Oct 2011 10:55:16 AM CEST | rosetta@home | Restarting task rlx_jsr_decoys_3fk8_SAVE_ALL_OUT_34558_569_0 using minirosetta version 314
Tue 25 Oct 2011 10:56:19 AM CEST | climateprediction.net | Task hadam3p_eu_60jb_2002_1_007481858_0 exited with zero status but no 'finished' file
Tue 25 Oct 2011 10:56:19 AM CEST | climateprediction.net | If this happens repeatedly you may need to reset the project.
Tue 25 Oct 2011 10:56:19 AM CEST | climateprediction.net | Restarting task hadam3p_eu_60jb_2002_1_007481858_0 using hadam3p_eu version 609
Tue 25 Oct 2011 10:56:19 AM CEST | rosetta@home | Starting task ploop2x3_design_22_abinitio_SAVE_ALL_OUT_34432_56277_0 using minirosetta version 314
Tue 25 Oct 2011 10:56:20 AM CEST | LHC@home 1.0 | Computation for task w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__10.5_1_sixvf_boinc62046_1 finished
Tue 25 Oct 2011 10:56:20 AM CEST | LHC@home 1.0 | Restarting task w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__39_1_sixvf_boinc62065_0 using sixtrack version 53010
Tue 25 Oct 2011 10:56:22 AM CEST | climateprediction.net | Task hadcm3n_u2ry_1980_40_007459100_4 exited with zero status but no 'finished' file
Tue 25 Oct 2011 10:56:22 AM CEST | climateprediction.net | If this happens repeatedly you may need to reset the project.
Tue 25 Oct 2011 10:56:22 AM CEST | climateprediction.net | Restarting task hadcm3n_u2ry_1980_40_007459100_4 using hadcm3n version 607
Tue 25 Oct 2011 10:56:23 AM CEST | LHC@home 1.0 | Started upload of w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__10.5_1_sixvf_boinc62046_1_0
Tue 25 Oct 2011 10:56:23 AM CEST | LHC@home 1.0 | Computation for task w3_weak5a_collision_err_bb__24__s__64.31_59.32__8_10__6__87_1_sixvf_boinc61979_0 finished
Tue 25 Oct 2011 10:56:23 AM CEST | LHC@home 1.0 | Starting task w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__31.5_1_sixvf_boinc62060_1 using sixtrack version 53010
Tue 25 Oct 2011 10:56:25 AM CEST | LHC@home 1.0 | Finished upload of w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__10.5_1_sixvf_boinc62046_1_0
Tue 25 Oct 2011 10:56:25 AM CEST | LHC@home 1.0 | Computation for task w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__1.5_1_sixvf_boinc62040_0 finished
Tue 25 Oct 2011 10:56:25 AM CEST | LHC@home 1.0 | Starting task w3_weak5a_collision_err_bb__29__s__64.31_59.32__4_6__6__22.5_1_sixvf_boinc63588_0 using sixtrack version 53010
Tue 25 Oct 2011 10:56:25 AM CEST | rosetta@home | Sending scheduler request: To fetch work.
Tue 25 Oct 2011 10:56:25 AM CEST | rosetta@home | Requesting new tasks for CPU
Tue 25 Oct 2011 10:56:26 AM CEST | LHC@home 1.0 | Started upload of w3_weak5a_collision_err_bb__24__s__64.31_59.32__8_10__6__87_1_sixvf_boinc61979_0_0
Tue 25 Oct 2011 10:56:26 AM CEST | LHC@home 1.0 | Computation for task w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__45_1_sixvf_boinc62069_1 finished
Tue 25 Oct 2011 10:56:26 AM CEST | LHC@home 1.0 | Starting task w3_weak6a_collision_err_bb__2__s__64.31_59.32__6_8__6__88.5_1_sixvf_boinc65585_0 using sixtrack version 53010
Tue 25 Oct 2011 10:56:26 AM CEST | rosetta@home | Scheduler request completed: got 1 new tasks
Tue 25 Oct 2011 10:56:27 AM CEST | LHC@home 1.0 | Finished upload of w3_weak5a_collision_err_bb__24__s__64.31_59.32__8_10__6__87_1_sixvf_boinc61979_0_0
Tue 25 Oct 2011 10:56:27 AM CEST | LHC@home 1.0 | Started upload of w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__1.5_1_sixvf_boinc62040_0_0
Tue 25 Oct 2011 10:56:29 AM CEST | LHC@home 1.0 | Started upload of w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__45_1_sixvf_boinc62069_1_0
Tue 25 Oct 2011 10:56:29 AM CEST | rosetta@home | Started download of rlx_2qsk.zip
Tue 25 Oct 2011 10:56:33 AM CEST | LHC@home 1.0 | Finished upload of w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__1.5_1_sixvf_boinc62040_0_0
Tue 25 Oct 2011 10:56:33 AM CEST | LHC@home 1.0 | Finished upload of w3_weak5a_collision_err_bb__24__s__64.31_59.32__12_14__6__45_1_sixvf_boinc62069_1_0
Tue 25 Oct 2011 10:56:58 AM CEST | rosetta@home | Finished download of rlx_2qsk.zip
ID: 23584 · Report as offensive     Reply Quote
Profile jujube

Send message
Joined: 25 Jan 11
Posts: 179
Credit: 83,858
RAC: 0
Message 23585 - Posted: 25 Oct 2011, 19:20:25 UTC - in response to Message 23584.  

I run Ubuntu 10.10 and Sixtrack project. I can leave BOINC client running and reboot the OS and when I restart BOINC the Sixtrack tasks do not crash. It seems like there is not a bug in Sixtrack but perhaps a problem on your end.

What happens when you just shutdown BOINC client (click Advanced -> Shutdown Attached Client -> OK -> Cancel), no OS reboot, and restart BOINC client? Do the Sixtrack tasks crash then?

I am using BOINC client 6.12.34, you are 6.12.33. I wonder if the problem is in 6.12.33? Or maybe Ubuntu 11.10 is the problem?

Sometimes detaching and reattaching to the project fixes a problem.
ID: 23585 · Report as offensive     Reply Quote

Questions and Answers : Unix/Linux : restart results in errors


©2024 CERN