41) Message boards : News : Server Intervention 10-Feb-2014 (Message 27140)
Posted 11 Feb 2015 by Richard Haselgrove
Post:
None of the Windows applications previously used by my hosts (win32 or win64, _gen, _sse2 or _pni) currently appear to be available for download from the server.
42) Message boards : News : Server Intervention 10-Feb-2014 (Message 27138)
Posted 11 Feb 2015 by Richard Haselgrove
Post:
Likewise,

11/02/2015 11:59:02 | LHC@home 1.0 | Giving up on download of sixtrack_win64_4517_gen.exe: permanent HTTP error

The data files for the six tasks allocated all downloaded OK, but without the program file they couldn't run.

Edit - note that this appears to be a generic (non-optimised) application. I normally get allocated the PNI version for SSE3 CPUs.
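
As a rough sketch, the executable name in that message can be split into its parts to see which build was allocated. The naming pattern is only inferred from the file names quoted in these posts, and parse_app_name is a hypothetical helper, not something BOINC provides.

```python
import re

# Pattern inferred from names such as sixtrack_win64_4517_gen.exe (an assumption,
# not a documented naming scheme). Variants _gen, _sse2 and _pni are the ones
# mentioned in these posts.
APP_NAME = re.compile(
    r"^(?P<app>[a-z]+)_(?P<platform>win(?:32|64))_(?P<version>\d+)"
    r"_(?P<variant>gen|sse2|pni)\.exe$"
)

def parse_app_name(filename):
    """Return the name parts as a dict, or None if the name does not fit the pattern."""
    match = APP_NAME.match(filename)
    return match.groupdict() if match else None

info = parse_app_name("sixtrack_win64_4517_gen.exe")
# info["variant"] is "gen", i.e. the generic build rather than the PNI one.
```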
43) Message boards : News : Server Intervention 10-Feb-2014 (Message 27136)
Posted 11 Feb 2015 by Richard Haselgrove
Post:
That looks better, thanks.

Meanwhile, on another machine, I've just got one new task (task 58122173), but only one - and it was a resend because somebody else missed their deadline (WU 27029491). No sign of any tasks from the 'new work' pool:

11/02/2015 11:32:53 | LHC@home 1.0 | Requesting new tasks for CPU
11/02/2015 11:32:53 | LHC@home 1.0 | [sched_op] CPU work request: 17401.94 seconds; 3.00 devices
11/02/2015 11:32:55 | LHC@home 1.0 | Scheduler request completed: got 1 new tasks
...
11/02/2015 11:33:06 | LHC@home 1.0 | Requesting new tasks for CPU
11/02/2015 11:33:06 | LHC@home 1.0 | [sched_op] CPU work request: 4454.46 seconds; 2.00 devices
11/02/2015 11:33:08 | LHC@home 1.0 | Scheduler request completed: got 0 new tasks
11/02/2015 11:33:08 | LHC@home 1.0 | No tasks sent
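
For anyone wanting to tally this from a longer log, here is a minimal sketch that pairs each CPU work request with the scheduler's reply. It assumes the three-field "time | project | message" layout shown above; summarise is a hypothetical helper and the regular expressions are written only against the lines in this post.

```python
import re

REQUEST = re.compile(r"\[sched_op\] CPU work request: ([\d.]+) seconds; ([\d.]+) devices")
REPLY = re.compile(r"Scheduler request completed: got (\d+) new tasks")

def summarise(log_text):
    """Return (seconds_requested, devices, tasks_received) for each request/reply pair."""
    results = []
    pending = None
    for line in log_text.splitlines():
        parts = [p.strip() for p in line.split("|", 2)]
        if len(parts) != 3:
            continue  # skip "..." and any other line that is not a log entry
        message = parts[2]
        request = REQUEST.search(message)
        if request:
            pending = (float(request.group(1)), float(request.group(2)))
            continue
        reply = REPLY.search(message)
        if reply and pending is not None:
            results.append((pending[0], pending[1], int(reply.group(1))))
            pending = None
    return results
```

Fed the snippet above, it should report one task for the first request and none for the second.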
44) Message boards : News : Server Intervention 10-Feb-2014 (Message 27135)
Posted 11 Feb 2015 by Richard Haselgrove
Post:
None available here either:

11/02/2015 11:07:30 | LHC@home 1.0 | [sched_op] CPU work request: 27112.20 seconds; 6.00 devices
11/02/2015 11:07:32 | LHC@home 1.0 | Scheduler request completed: got 0 new tasks
11/02/2015 11:07:32 | LHC@home 1.0 | Project has no tasks available

Meanwhile, you just happened to update the server software while there was an untested word-wrap modification in the style sheet. David reverted that last night, because it made these log snippets unreadable: could you possibly apply

http://boinc.berkeley.edu/gitweb/?p=boinc-v2.git;a=commit;h=8f2eb6a5ca95d20db83d85ae9e9fcb299c5decb6

Thanks.
45) Message boards : News : Server Intervention 10-Feb-2014 (Message 27129)
Posted 10 Feb 2015 by Richard Haselgrove
Post:
You might have a problem (like access permissions?) with some download files:

10/02/2015 16:46:50 | LHC@home 1.0 | Giving up on download of w22_job_base_bb_np_nt_fset_240214_1k_2_x100_cc__47__s__62.31_60.32__10_12__6__49.5_1_sixvf_boinc16553.zip: permanent HTTP error
10/02/2015 16:46:50 | LHC@home 1.0 | Giving up on download of w22_job_base_bb_np_nt_fset_240214_1k_2_x100_cc__47__s__62.31_60.32__10_12__6__4.5_1_sixvf_boinc16523.zip: permanent HTTP error

Doesn't apply to all files - this was allocated in the same request.

10/02/2015 16:46:52 | LHC@home 1.0 | Finished download of w8_job_tracking_bb_np_nt_dq-3_20kHz_2__18__s__62.31_60.32__4_6__6__51_1_sixvf_boinc6111.zip

Edit: or maybe the files are simply missing. Trying manually got

The requested URL /sixtrack/download/76/w22_job_base_bb_np_nt_fset_240214_1k_2_x100_cc__47__s__62.31_60.32__10_12__6__4.5_1_sixvf_boinc16523.zip was not found on this server.
46) Message boards : LHC@home Science : Hide host info (Message 27126)
Posted 8 Feb 2015 by Richard Haselgrove
Post:
I want to hide information about my hosts running LHC@Home. What can I do?

Although you are perfectly at liberty to hide your computer details, please pause and consider for a second before you do so.

Look at Uffe's computers or my computers. You will see that very much less detail is shown, compared to what is available when you look at your own computer. That automatic removal of detail may be enough to satisfy your perfectly understandable desire for privacy.

If at any point in the future you encounter difficulties running LHC, you are always welcome to ask for help (though the Number Crunching forum is the traditional place to do this). It greatly helps people trying to assist you if we can find your computers, and view details of the tasks you have processed - all of this is lost if you hide your computer entirely from other participants.
47) Message boards : Number crunching : Host messing up tons of results (Message 27122)
Posted 1 Feb 2015 by Richard Haselgrove
Post:
Perhaps, because we've had so much work recently, those inconclusives hung around on the disk for longer than usual - the resends wait at the end of the queue behind all the submitted first-run work. It all contributes to the server and database load.
48) Message boards : Number crunching : Error reported by file upload server: Server is out of disk space !? (Message 27110)
Posted 29 Jan 2015 by Richard Haselgrove
Post:
Oh dear, we're full again:

29/01/2015 21:03:56 | LHC@home 1.0 | Started upload of w200_HLLHC_RFcav_scana3_50000.BOINC__4__s__62.31_60.32__12_14__5__88.5_1_sixvf_boinc826_1_0
29/01/2015 21:03:57 | LHC@home 1.0 | [error] Error reported by file upload server: Server is out of disk space
49) Message boards : Number crunching : Error reported by file upload server: Server is out of disk space !? (Message 27109)
Posted 29 Jan 2015 by Richard Haselgrove
Post:
And, I'd venture to suggest, the results of our work won't be available to the researchers/designers who submitted it in the first place.
50) Message boards : Number crunching : Error reported by file upload server: Server is out of disk space !? (Message 27106)
Posted 29 Jan 2015 by Richard Haselgrove
Post:
And the ones I got back in return all seem to be short-running, so I'm already generating new uploads to fill up the new disk (or cluster filesystem quota, as I suspect it may be). I hope the CERN admins are in a position to keep an eye on it overnight.
51) Message boards : Number crunching : Error reported by file upload server: Server is out of disk space !? (Message 27100)
Posted 29 Jan 2015 by Richard Haselgrove
Post:
I've had a PM reply from Eric McIntosh:

I have reported to CERN BOINC support.
52) Message boards : Number crunching : Error reported by file upload server: Server is out of disk space !? (Message 27095)
Posted 29 Jan 2015 by Richard Haselgrove
Post:
I fear that we may already have reached catch-22...

Most of the work I've succeeded in uploading this morning has gone into 'pending', and is waiting for a wingmate to upload their work so it can validate and move on.

And I've got 55 completed tasks backed up, waiting to upload so they can validate somebody else's work.

But how can we ever get the two queues to meet each other?
53) Message boards : Number crunching : Error reported by file upload server: Server is out of disk space !? (Message 27091)
Posted 29 Jan 2015 by Richard Haselgrove
Post:
Uploads are stalling - possibly due to more smaller files being returned and the server can't keep up in processing?

Unfortunately, the files seem to be all the same size, whether the task runs for seconds or several hours. We just fill up the space much more quickly if a lot of short-running tasks are passing through the system.

It probably helps (marginally) if we report any tasks which have successfully made it through the uploading stage, so that the next stage of processing can take place and the space can be freed up.
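
To put rough numbers on that, here is a back-of-envelope sketch. The file size and run times in the example are invented for illustration; only the assumption that every task returns a file of the same size comes from the observation above.

```python
def upload_bytes_per_hour(file_size_bytes, task_runtime_seconds, concurrent_tasks=1):
    """Bytes uploaded per hour by a host when every task returns a same-sized file."""
    tasks_per_hour = concurrent_tasks * 3600 / task_runtime_seconds
    return file_size_bytes * tasks_per_hour

# Hypothetical figures: a 100 kB result file, one task running at a time.
short_tasks = upload_bytes_per_hour(100_000, 60)        # one-minute tasks
long_tasks = upload_bytes_per_hour(100_000, 4 * 3600)   # four-hour tasks
ratio = short_tasks / long_tasks                         # 240.0
```

With those made-up figures the same host fills the upload area 240 times faster on one-minute tasks than on four-hour ones.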

The Einstein project had a problem with uploads - think that was due to the number of files.

The problem at Einstein was that the server disk file system became incredibly slow - it was taking up to 8 seconds to locate a free 'inode' so that uploaded files could be stored and indexed. Given the thousands of files that a server needs to process, that slowed everything down to a crawl.
54) Message boards : Number crunching : Error reported by file upload server: Server is out of disk space !? (Message 27088)
Posted 29 Jan 2015 by Richard Haselgrove
Post:
Getting two variants of the same thing:

29/01/2015 08:48:47 | LHC@home 1.0 | [error] Error reported by file upload server: Server is out of disk space
29/01/2015 08:50:56 | LHC@home 1.0 | [error] Error reported by file upload server: can't write file /data/boinc/project/sixtrack/upload/ea/w200_HLLHC_RFcav_scanb3_50000.BOINC__3__s__62.31_60.32__16_18__5__28.5_1_sixvf_boinc668_1_0: No space left on server

Just a side effect of the large volume of work we've been doing recently. Some uploads are being accepted, presumably as older tasks are processed and files are deleted.
55) Message boards : Number crunching : Stats Export MIA (Message 27082)
Posted 25 Jan 2015 by Richard Haselgrove
Post:
Same here. I don't know if it's BOINCstats not picking up the stats or if it's LHC@Home.

It's LHC@Home. The files in http://lhcathomeclassic.cern.ch/sixtrack/stats/ are four days old.
56) Message boards : Number crunching : Avast antivirus - false positive (Message 27071)
Posted 21 Jan 2015 by Richard Haselgrove
Post:
One of my machines threw four errors this morning. Turned out that Avast had developed an aversion to 'sixtrack_win32_4517_pni.exe' overnight - an application which we've all been using perfectly happily since May last year, and which no other antivirus program finds any fault with.

The problem shows as

Exit status -185 (0xffffffffffffff47) ERR_RESULT_START
couldn't start CreateProcess() failed - The system cannot find the file specified. (0x2): -148

It can be worked round by temporarily disabling Avast's shields while a new copy downloads, and by excluding the BOINC data directory from scanning (Settings - Active protection - File system shield - customise - exclusions, in my copy of Avast Free Antivirus 2015).

This is now the third BOINC project which has suffered from a false positive detection by Avast since I installed this version for testing (the other two are Climate Prediction dot Net and GPUGrid) - in all cases, like this one, a detection on a perfectly good file which has already been in use for an extended period. I'll report it myself, but it's reaching the point where BOINC itself should make a protest.
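
In case the two forms of the exit status quoted above look unrelated: the long hex value is just -185 written as an unsigned 64-bit number (two's complement). A small sketch of the conversion, with hypothetical helper names:

```python
def as_unsigned_hex(value, bits=64):
    """Show a (possibly negative) integer as the hex of its two's-complement bit pattern."""
    return hex(value & ((1 << bits) - 1))

def as_signed(value, bits=64):
    """Interpret an unsigned bit pattern as a signed two's-complement integer."""
    if value >= 1 << (bits - 1):
        value -= 1 << bits
    return value

print(as_unsigned_hex(-185))            # 0xffffffffffffff47
print(as_signed(0xffffffffffffff47))    # -185
```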
57) Message boards : Number crunching : Lots of Work! (Message 27070)
Posted 20 Jan 2015 by Richard Haselgrove
Post:
I do hope that the researchers responsible for submitting jobs for each of the various teams are aware of what they're doing, delaying resends and hence the completion of their cohort by trampling all over each other's toes?
58) Message boards : Number crunching : Sixtrack 32 bit vs 64 bit program for Windows (Message 26917)
Posted 20 Oct 2014 by Richard Haselgrove
Post:
'PNI' stands for Prescott New Instructions, and was Intel's trade name for what became SSE3 (http://en.wikipedia.org/wiki/SSE3). So the instruction sets are the same, and the same application has indeed been deployed twice so that BOINC's feature detection works correctly on both Intel and AMD CPUs.

I don't know the thinking on why a separate 64-bit deployment was necessary too.
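
As an illustration of why both names matter, a feature check keyed on flag names has to accept either spelling. This is only a sketch: it assumes the CPU features arrive as a space-separated string of flag names, the sample strings are made up, and has_sse3 is a hypothetical helper rather than BOINC's own code.

```python
# "pni" and "sse3" name the same instruction set; "ssse3" is a different, later one.
SSE3_ALIASES = {"pni", "sse3"}

def has_sse3(feature_string):
    """True if the space-separated feature list names SSE3 under either spelling."""
    return bool(SSE3_ALIASES & set(feature_string.lower().split()))

reports_pni = has_sse3("fpu sse sse2 pni")      # True
reports_sse3 = has_sse3("fpu sse sse2 sse3")    # True
only_ssse3 = has_sse3("fpu sse sse2 ssse3")     # False
```

Matching whole flag names, rather than searching for a substring, is what keeps "ssse3" from being mistaken for "sse3".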
59) Message boards : News : CERN AFS problems (Message 26890)
Posted 13 Oct 2014 by Richard Haselgrove
Post:
Think it is OK now (but we are still looking at AFS). Eric.

Sorry, no. Still getting

13/10/2014 22:16:06 | LHC@home 1.0 | [error] Error reported by file upload server: can't open log file '../log_boinc05/file_upload_handler.log' (errno: 9)
13/10/2014 22:16:06 | LHC@home 1.0 | Temporarily failed upload of sd_HL_15_440_2.2_6D_cc_err_Q5__14__s__62.31_60.32__8_10__6__85_1_sixvf_boinc3727_1_0: transient upload error

when retesting after your message.
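
For reference, the errno value in that message can be looked up locally. This sketch assumes the server uses the usual POSIX/Linux errno numbering, where 9 is EBADF (bad file descriptor); it reads the table of whatever machine runs it, and describe_errno is a hypothetical helper.

```python
import errno
import os

def describe_errno(code):
    """Return (symbolic name, system message) for an errno value on this machine."""
    name = errno.errorcode.get(code, "unknown")
    return name, os.strerror(code)

name, message = describe_errno(9)
# On Linux, name is "EBADF".
```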
60) Message boards : News : CERN AFS problems (Message 26885)
Posted 13 Oct 2014 by Richard Haselgrove
Post:
Scheduler contacts (requesting new work and potentially reporting completed work) are running smoothly:

13/10/2014 17:54:56 | LHC@home 1.0 | Sending scheduler request: Requested by user.
13/10/2014 17:54:56 | LHC@home 1.0 | Requesting new tasks for CPU
13/10/2014 17:54:56 | LHC@home 1.0 | [sched_op] CPU work request: 1875.28 seconds; 0.00 devices
13/10/2014 17:54:56 | LHC@home 1.0 | [sched_op] NVIDIA GPU work request: 0.00 seconds; 0.00 devices
13/10/2014 17:54:56 | LHC@home 1.0 | [sched_op] Intel GPU work request: 0.00 seconds; 0.00 devices
13/10/2014 17:54:58 | LHC@home 1.0 | Scheduler request completed: got 1 new tasks
13/10/2014 17:54:58 | LHC@home 1.0 | [sched_op] estimated total CPU task duration: 18441 seconds
13/10/2014 17:55:00 | LHC@home 1.0 | Started download of sd_HL_15_440_2.2_6D_cc_err_D2__13__s__62.31_60.32__6_8__6__60_1_sixvf_boinc3342.zip
13/10/2014 17:55:02 | LHC@home 1.0 | Finished download of sd_HL_15_440_2.2_6D_cc_err_D2__13__s__62.31_60.32__6_8__6__60_1_sixvf_boinc3342.zip

but it's the uploading of data from completed tasks which is failing:

13/10/2014 17:52:58 | LHC@home 1.0 | Started upload of sd_HL_15_440_2.2_6D_cc_err_IT__26__s__62.31_60.32__10_12__6__60_1_sixvf_boinc2137_1_0
13/10/2014 17:52:58 | LHC@home 1.0 | [http] [ID#1458] Info: About to connect() to lhcathomeclassic.cern.ch port 80 (#1619)
13/10/2014 17:52:58 | LHC@home 1.0 | [http] [ID#1458] Info: Trying 128.142.138.22...
13/10/2014 17:52:58 | LHC@home 1.0 | [http] [ID#1458] Info: Connected to lhcathomeclassic.cern.ch (128.142.138.22) port 80 (#1619)
13/10/2014 17:52:59 | LHC@home 1.0 | [http] [ID#1458] Received header from server: Date: Mon, 13 Oct 2014 16:53:00 GMT
13/10/2014 17:52:59 | LHC@home 1.0 | [http] [ID#1458] Received header from server: Connection: close
13/10/2014 17:52:59 | LHC@home 1.0 | [error] Error reported by file upload server: can't open log file '../log_boinc05/file_upload_handler.log' (errno: 9)
13/10/2014 17:52:59 | LHC@home 1.0 | Temporarily failed upload of sd_HL_15_440_2.2_6D_cc_err_IT__26__s__62.31_60.32__10_12__6__60_1_sixvf_boinc2137_1_0: transient upload error

