1) Message boards : Number crunching : sixtrack back on track (Message 24330)
Posted 13 Jul 2012 by alephnull
Post:
looks like the project has more or less stabilized over the last two days or so. looks like several people may still be having some problems but overall things are looking good from what i can see. ive been getting steady work and all the wu seem to be running fine. in the past two days, i think ive only had one computation error, no download errors or invalid tasks. the few computers i have running lhc have lhc work caches filled properly so all seems to be in order.

as such, i hope those individuals that are still having problems here and there are able to get everything sorted. hopefully most others are stable now too.

just wanted to say thanks to all the folks at lhc for the great work getting the project back on track. its nice to be getting steady work from this project again.

thanks again.

rob
2) Message boards : Number crunching : Is there an upload server-side issue? Permissions failure? (Message 24209)
Posted 9 Jul 2012 by alephnull
Post:
If some users are still experiencing problems then please post the main error messages. The filesystem full issue has been resolved and so have the write permissions. Any details would be helpful.

this machine has pending tasks that are of the 1-2 second variety (about 84 of them).

heres an example of the message from one of them (task 3965742):

Stderr output

<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
[../../projects/lhcathomeclassic.cern.ch_sixtrack/w1jul_niebb1d__27__s__64.28_59.31__10.2_10.4__6__11_1_sixvf_boinc47270.zip]
End-of-central-directory signature not found. Either this file is not
a zipfile, or it constitutes one disk of a multi-part archive. In the
latter case the central directory and zipfile comment will be found on
the last disk(s) of this archive.
unzip: cannot find zipfile directory in ../../projects/lhcathomeclassic.cern.ch_sixtrack/w1jul_niebb1d__27__s__64.28_59.31__10.2_10.4__6__11_1_sixvf_boinc47270.zip,
and cannot find ../../projects/lhcathomeclassic.cern.ch_sixtrack/w1jul_niebb1d__27__s__64.28_59.31__10.2_10.4__6__11_1_sixvf_boinc47270.zip.zip, period.
17:05:40 (3796): called boinc_finish

</stderr_txt>
]]>

im pretty sure some of these tasks occurred prior to the disk full problem, but some of them are also from a few minutes ago, which puts them after the point where you say this issue was corrected.
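for anyone who wants to check their own machine: the unzip message above means the end-of-central-directory record is missing, which is what you would see with a truncated or empty download. below is a minimal sketch, assuming python is available on the host, that lists which .zip files in a project folder fail that check. the function name check_input_archives is made up for illustration and is not part of boinc or sixtrack.

```python
import zipfile
from pathlib import Path


def check_input_archives(project_dir):
    """Map each .zip file name in project_dir to True (looks valid) or False."""
    results = {}
    for path in sorted(Path(project_dir).glob("*.zip")):
        # is_zipfile searches for the end-of-central-directory record,
        # the same structure the unzip error says it could not find.
        results[path.name] = zipfile.is_zipfile(path)
    return results


if __name__ == "__main__":
    # Hypothetical invocation: point this at the project folder of your
    # own BOINC data directory.
    for name, ok in check_input_archives(".").items():
        print(name, "ok" if ok else "NOT a valid zip")
```

a file reported as not valid here would be expected to produce the same 1-2 second failure, though this only tests the archive structure and says nothing about why the server sent it that way.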

i do not know what other information you would like regarding this but ill provide any further information you may find helpful, just let me know.

i got the same issues today on several other boxes as well. just posting the results from one machine.

thanks.

rob

EDIT:

currently, my main machines do not have any lhc work, their caches were depleted and when they request more work, the server does not send any tasks. this occurred on machines with no cpu work at the time as they are set up to only crunch lhc so i know its not a full work cache issue. these machines are set up with a 2.0/0.1 work cache.
3) Message boards : Number crunching : is this an error? (Message 24161)
Posted 8 Jul 2012 by alephnull
Post:
Also ran in to this. Made a forum post for it already:

http://lhcathomeclassic.cern.ch/sixtrack/forum_thread.php?id=3460

Maybe it has something to do with the servers being down.

right. i was wondering about this issue after the servers came back up. no matter though, it all seems to be working well now.

thanks.

rob
4) Message boards : Number crunching : is this an error? (Message 24156)
Posted 8 Jul 2012 by alephnull
Post:
since the servers came back up i've been getting a lot of "short" wu. they are quickly returned as successful and all execute in about 2 seconds or so.

these tasks have the following message:

<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
[../../projects/lhcathomeclassic.cern.ch_sixtrack/w1jul_niebb0d__30__s__64.28_59.31__8_8.2__6__85_1_sixvf_boinc51705.zip]
End-of-central-directory signature not found. Either this file is not
a zipfile, or it constitutes one disk of a multi-part archive. In the
latter case the central directory and zipfile comment will be found on
the last disk(s) of this archive.
unzip: cannot find zipfile directory in ../../projects/lhcathomeclassic.cern.ch_sixtrack/w1jul_niebb0d__30__s__64.28_59.31__8_8.2__6__85_1_sixvf_boinc51705.zip,
and cannot find ../../projects/lhcathomeclassic.cern.ch_sixtrack/w1jul_niebb0d__30__s__64.28_59.31__8_8.2__6__85_1_sixvf_boinc51705.zip.zip, period.
11:39:30 (3704): called boinc_finish

</stderr_txt>
]]>

is this correct?

thanks.

rob
5) Message boards : Number crunching : Mac OS X Errors (Message 24125)
Posted 7 Jul 2012 by alephnull
Post:
All our executables are 32-bit but they run on 64-bit
systems OK. Well at least on some....
More news soonest. Eric.

thanks for the explanation on the executables. will wait to hear more before i attempt lhc on those machines again.

if theres any other information you would like to know about these machines to help diagnose any problems, please let me know.

thanks for the update.

rob
6) Message boards : Number crunching : No Tasks ??? (Message 24124)
Posted 7 Jul 2012 by alephnull
Post:
Situation might be better now; see News.
Update on Monday latest. Eric.

looking forward to the news eric. it has to be good cause im getting work to that machine now, thanks.

rob
7) Message boards : Number crunching : No Tasks ??? (Message 24113)
Posted 7 Jul 2012 by alephnull
Post:
Using 7.0.28, under Projects-Properties I see a -0.00 scheduling priority for this project. My other two running BOINC projects have non-zero scheduling priorities. So perhaps that is a source of the problem although it is a mystery why some people are fortunate enough to receive work units while others are not.

i have this host thats running boinc 7.0.25.

it has many projects attached but only three are set to get new work. these are the three, with their scheduling priorities:

lhc (-0.00)
test4theory (-0.00)
gpugrid (-3.03)

that host hasnt been able to get work for lhc for the last two days or so. it gets steady work for t4t and gpugrid though. its inability to get work for lhc is not cache related: i only have one instance of t4t running on it, and i stopped all work from other projects to see if it would eventually get some lhc work, so the cpu has been essentially idle for the last two days on that machine. i was gonna leave it free for the remainder of the weekend to see if there are any changes, and on monday ill set cpu work fetch back on for another project if it doesnt get anything for lhc by then.

ive been reading other threads about work quotas. im not sure if i understand how lhc enforces it but if my math is correct, there should be nothing preventing that machine from getting new tasks.

the other machines i have set to retrieve new work for lhc are getting work regularly. those have lhc scheduling priorities of:

-2.39, -0.05, -0.04, -0.04, -1.61, -0.11

i think youre right, its sorta a mystery to me too. i get work from non lhc projects with -0.00 (t4t) scheduling priorities though.

rob
8) Message boards : Number crunching : Mac OS X Errors (Message 24112)
Posted 7 Jul 2012 by alephnull
Post:
I just discovered that SixTrack can run on a 64-bit mac. I have two, an i7 and Core2 Duo. It is too soon to see if I have any crashes.

I hope I can be of some help.

Philip

hopefully youll have success. according to the applications page there should be a 64 bit mac version. according to the errors i got it seemed to me like my mac was using a 32 bit version but im not sure:



...
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
SIGBUS: bus error

Crashed executable name: sixtrack_apple_gen
Machine type Intel 80486 (32-bit executable)

System version: Macintosh OS 10.7.4 build 11E53
Tue Jul 3 17:45:00 2012
...

that error was on a mac mini with an i7 processor so i dont know if that machine may have downloaded the wrong application for some reason. i would assume it should have downloaded the 64 bit app? the boinc client runs in 32 bit mode on that mac but for other projects that have 64 bit apps, they run fine.
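one way to double-check which kind of binary actually got downloaded is to read the first four bytes of the file. mach-o files start with a magic number: 0xfeedface for 32-bit, 0xfeedfacf for 64-bit, and 0xcafebabe for a universal (fat) binary. the sketch below is only an illustration, assuming python is available on the mac; macho_kind and macho_kind_of_file are names i made up, and the file to test would be the sixtrack_apple_gen binary in the project folder shown in the crash log.

```python
import struct

# Well-known Mach-O magic numbers.
MACHO_KINDS = {
    0xFEEDFACE: "32-bit Mach-O",
    0xFEEDFACF: "64-bit Mach-O",
    0xCAFEBABE: "universal (fat) binary",
}


def macho_kind(header_bytes):
    """Classify a binary from its first four bytes."""
    if len(header_bytes) < 4:
        return "unknown"
    # Intel Mach-O headers are stored little-endian, fat headers big-endian,
    # so try both byte orders.
    for fmt in ("<I", ">I"):
        (magic,) = struct.unpack(fmt, header_bytes[:4])
        if magic in MACHO_KINDS:
            return MACHO_KINDS[magic]
    return "unknown"


def macho_kind_of_file(path):
    """Read the magic number from the file at path and classify it."""
    with open(path, "rb") as f:
        return macho_kind(f.read(4))
```

if this reports a 32-bit mach-o, that would match the "32-bit executable" line in the crash report. it would not explain the bus error by itself, it would only confirm which build the client fetched.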

rob
9) Message boards : Number crunching : Errors with new APP v444.01 sse3 (Message 24055)
Posted 4 Jul 2012 by alephnull
Post:
Now getting the new work units but they all don't even run now.
All failing on download with "couldn't get input files"
Both Linux 64 and Win 32.

Example 3828522

Conan

i have seventeen of the same errors here as well. they are not all sse3 wu though.
10) Message boards : Number crunching : Mac OS X Errors (Message 24045)
Posted 3 Jul 2012 by alephnull
Post:
thank you for the warning.

The MAC version was indeed intended for Apple machines with Intel processors.
I think we need to analyze this one to see why it is crashing.

Sorry about the difficulties.

Igor.

i dont think the difficulties are based on the processor. the mac mini has an i7 and the macbook air has an i5, both of which are 64 bit i believe. the two machines are no more than a year and a half old.

if there is more information you would like me to provide regarding this, please let me know.

thanks.

rob
11) Message boards : Number crunching : Mac OS X Errors (Message 24043)
Posted 3 Jul 2012 by alephnull
Post:
apologies in advance if im posting this question in the wrong place. i looked at the q&a section for unix/linux related issues and the threads there all seem to be quite old.

i got pretty excited recently when lhc started having work again so i wanted to crunch on my mac too. attempts today to do this have all ended in errors for

this host's tasks and
this host's tasks

i see on the applications page there is a mac app for sixtrack:

Intel 64-bit Mac OS 10.5 or later
444.01
3 Jul 2012 | 19:18:48 UTC

all the wu seemed to get the same error. here's an example:

<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
SIGBUS: bus error

Crashed executable name: sixtrack_apple_gen
Machine type Intel 80486 (32-bit executable)
System version: Macintosh OS 10.7.4 build 11E53
Tue Jul 3 17:45:00 2012

0 sixtrack_apple_gen 0x0020df9e PrintBacktrace (in sixtrack_apple_gen) + 1022

Thread 0 crashed with X86 Thread State (32-bit):
eax: 0xffffffe1 ebx: 0x00000003 ecx: 0xbfffa61c edx: 0x92166c22
edi: 0xbfffa678 esi: 0x00000003 ebp: 0xbfffa648 esp: 0xbfffa61c
ss: 0x00000023 efl: 0x00000206 eip: 0x92166c22 cs: 0x0000000b
ds: 0x00000023 es: 0x00000023 fs: 0x00000000 gs: 0x0000000f

Binary Images Description:
0x1000 - 0x31ffff /Library/Application Support/BOINC Data/slots/8/../../projects/lhcathomeclassic.cern.ch_sixtrack/sixtrack_apple_gen
0x90626000 - 0x90627fff /usr/lib/system/libunc.dylib
0x90783000 - 0x90786fff /usr/lib/system/libmathCommon.A.dylib
0x91085000 - 0x9108dfff /usr/lib/system/libcopyfile.dylib
0x911f1000 - 0x911f1fff /usr/lib/system/libdnsinfo.dylib
0x91de2000 - 0x91de9fff /usr/lib/system/libsystem_notify.dylib
0x92150000 - 0x9216efff /usr/lib/system/libsystem_kernel.dylib
0x92209000 - 0x9220efff /usr/lib/system/libmacho.dylib
0x940bb000 - 0x94186fff /usr/lib/system/libsystem_c.dylib
0x94eb6000 - 0x94eccfff /usr/lib/system/libxpc.dylib
0x96040000 - 0x96044fff /usr/lib/system/libsystem_network.dylib
0x9631a000 - 0x96349fff /usr/lib/system/libsystem_info.dylib
0x97944000 - 0x97952fff /usr/lib/system/libdispatch.dylib
0x9829f000 - 0x982a3fff /usr/lib/system/libcache.dylib
0x982a4000 - 0x982d2fff /usr/lib/libSystem.B.dylib
0x9838b000 - 0x983cefff /usr/lib/system/libcommonCrypto.dylib
0x983cf000 - 0x983d1fff /usr/lib/system/libdyld.dylib
0x983d2000 - 0x983d3fff /usr/lib/system/libquarantine.dylib
0x9a47e000 - 0x9a486fff /usr/lib/system/liblaunch.dylib
0x9aa2e000 - 0x9aa31fff /usr/lib/system/libcompiler_rt.dylib
0x9ab22000 - 0x9ab2afff /usr/lib/system/libunwind.dylib
0x9aeae000 - 0x9aeaffff /usr/lib/system/libremovefile.dylib
0x9bcc2000 - 0x9bcc2fff /usr/lib/system/libkeymgr.dylib
0x9bcc3000 - 0x9bccafff /usr/lib/system/libsystem_dnssd.dylib
0x9bd3c000 - 0x9bd3dfff /usr/lib/system/libsystem_sandbox.dylib
0x9be6f000 - 0x9be70fff /usr/lib/system/libsystem_blocks.dylib


Exiting...

</stderr_txt>
]]>
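
a small note on the "code 193 (0xc1, -63)" line in the message above: as far as i can tell the three numbers are the same value written as decimal, hex, and a signed 8-bit number. that reading is my assumption from the arithmetic, not something taken from the boinc documentation. a minimal sketch of the conversion, with a made-up function name:

```python
def describe_exit_code(code):
    """Format an exit code as decimal, hex, and signed 8-bit."""
    low_byte = code & 0xFF
    # Values of 128 and above wrap to negative in a signed 8-bit view.
    signed = low_byte - 256 if low_byte >= 128 else low_byte
    return f"{low_byte} (0x{low_byte:x}, {signed})"


print(describe_exit_code(193))  # prints "193 (0xc1, -63)"
```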

---------------------

of the two macs, one is a mac mini and the other is a macbook air. both are running os x lion 10.7.4. im not too familiar with mac or unix/linux, so with any information you can provide youll have to treat me like a new guy.

this thread seems to indicate working towards some apps for mac. i dont know if the sixtrack app being used is the correct version. it would be nice to get these two machines running sixtrack. they have had boinc installed and have been running other projects for quite a while now so i know they are stable. just having difficulty right now trying to get sixtrack working.

thanks for any help or information you can provide.

rob


