Message boards :
ATLAS application :
queue is empty
Message board moderation
Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 13 · Next
Author | Message |
---|---|
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
If something changes so the task is pointless to continue, would you rather waste even more time or abort? It's not their fault they realised what they asked you to do is now incorrect. |
Send message Joined: 28 Sep 04 Posts: 732 Credit: 49,336,437 RAC: 25,953 |
They could have announced it on the forum and leave the decision to abort to the crunchers. They could also have tested them on the dev-site first. But it is what it is, there's no point in crying over spilled milk. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
They could have announced it on the forum and leave the decision to abort to the crunchers. They could also have tested them on the dev-site first. But it is what it is, there's no point in crying over spilled milk.I wasn't aware Boinc had the facility to remotely abort a started task. Perhaps it's just other projects don't like doing so. I'm not sure why anyone would want to finish a task which is just going to be discarded at the project end. |
Send message Joined: 17 Sep 04 Posts: 105 Credit: 32,824,862 RAC: 131 |
They could have announced it on the forum and leave the decision to abort to the crunchers. They could also have tested them on the dev-site first. But it is what it is, there's no point in crying over spilled milk.I wasn't aware Boinc had the facility to remotely abort a started task. Perhaps it's just other projects don't like doing so. I'm not sure why anyone would want to finish a task which is just going to be discarded at the project end. Is the way to successfully complete these long units is to use more processors to accelerate completion time of the work unit? When does the "kill switch" appear, after about a day and a half of processing? I just answered my own question. Both units were cancelled by the server at precisely 112,671.63 seconds. Regards, Bob P. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
Is the way to successfully complete these long units is to use more processors to accelerate completion time of the work unit? When does the "kill switch" appear, after about a day and a half of processing?I don't understand using anything less than the 8 threads the program will use. Why do twice as many at once in double the time? |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 677 |
What is, when this timelimit from one and a half day, is a test in production, to check something? Last Thursday and this Thursday, Atlas was running Tasks. |
Send message Joined: 18 Dec 15 Posts: 1821 Credit: 118,923,693 RAC: 32,358 |
the server status page suddenly shows about 1.700 unsent tasks. The question is: will they also be aborted by the server after many hours, and hence it would not make sense to stark crunching them ??? Also: where can one see how many events these tasks have? |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
I'm just going to run them. I've grabbed a bunch on my 5 fastest computers. Might aswell help out with whatever they're trying to do/test/fix. |
Send message Joined: 4 Mar 17 Posts: 25 Credit: 10,262,043 RAC: 1,885 |
Also: where can one see how many events these tasks have? This is line 172 in my current /var/lib/boinc/slots/0/pilotlog.txt (not 100% sure as i did not have a shorter one of this run yet) payload execution command: export ATHENA_CORE_NUMBER=12;export ATHENA_PROC_NUMBER=12;export PANDA_RESOURCE='BOINC_MCORE';export FRONTIER_ID=//...cutted out..// --maxEvents=2000 --.......... 2023-09-22 11:37:52,557 | WARNING | container name not defined in CRIC 2023-09-22 11:37:48,914 | INFO | executing command: export ATHENA_CORE_NUMBER=12;export ATHENA_PROC_NUMBER=12;export PANDA_RESOURCE='BOINC_MCORE';export FRONTIER_ID= //did cut quite some stuff out// --inputEVNTFile=EVNT.123456789._000123.pool.root.1 --maxEvents=2000 and when you have one running you can look at /var/lib/boinc/slots/0/PanDA_Pilot-123456789/eventLoopHeartBeat.txt there you can see how many events of that workunit are already finished. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
If the percentage complete in Boinc is correct I'm doing 3 tasks at once on my Ryzen 9 3900XT, in 3 hours. They don't look excessively large to me. Atlas has always taken about that long. They're at 50%, so Boinc could of course be insanely wrong. Edit: one finished in 2h11m (9h37 CPU). It's been accepted as valid, although I notice the one I did 6 days ago on the same machine was 10 times longer, maybe this is a one off: https://lhcathome.cern.ch/lhcathome/results.php?userid=55945&offset=0&show_names=0&state=4&appid=14 |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
Is it my imagination or has the memory requirement dropped? All running Atlases are showing 4.4-4.8GB in Boinc. And one of them is a 4 core computer, the others are 8 cores per task of the computer's 24 cores. I thought Atlas used to be 2GB + 1GB per core, making 10GB for 8 cores and 6GB for 4 cores? |
Send message Joined: 4 Sep 22 Posts: 91 Credit: 16,008,656 RAC: 17,897 |
What gives? For one Atlas task, running on 4 threads, I get a credit of nearly 13,000. For another task, running on 8 threads, I get credit of only 3200. Both tasks required nearly the same amount of CPU time, over 530,000 secs. |
Send message Joined: 18 Dec 15 Posts: 1821 Credit: 118,923,693 RAC: 32,358 |
thanks for the information. Please tell me where I can find these filesAlso: where can one see how many events these tasks have? |
Send message Joined: 4 Mar 17 Posts: 25 Credit: 10,262,043 RAC: 1,885 |
The credit system that it AFAIK uses is a bit weird. https://boinc.berkeley.edu/trac/wiki/CreditNew If i remember right did the credit per Workunit move around quite a bit when a new version was released. Guess right now with more users that come back cause of new ATLAS work and the different long tasks(2000events and it sounds like the new are shorter) is the change bigger. But AFIAK did it smooth out after a few days of Atlas running with constant flow of work. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
What gives? For one Atlas task, running on 4 threads, I get a credit of nearly 13,000. For another task, running on 8 threads, I get credit of only 3200. Both tasks required nearly the same amount of CPU time, over 530,000 secs.Credit is never accurate. LHC is in fact banned from Gridcoin because you can cheat the credit here very easily. On Asteroids at the moment, due to a GPU version which is slower than the CPU version, if my GPU task is compared to a CPU wingman, I get normal credit. But if my wingman is also on GPU, the credit is calculated as time multiplied by GPU speed, and I get 100 times more. Who cares anyway, credit is just a toy, as long as you get the science done. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
thanks for the information. Please tell me where I can find these filesThe Windows equivalent is presumably C:\ProgramData\BOINC\slots\2\boinc_133f818bb0d17e70\Logs but I've no idea how you find the correct slot out of hundreds to look in. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
The credit system that it AFAIK uses is a bit weird. https://boinc.berkeley.edu/trac/wiki/CreditNewYou'd think they'd just give you 1 credit per 1000 calculations or something. It's not particle physics. |
Send message Joined: 4 Sep 22 Posts: 91 Credit: 16,008,656 RAC: 17,897 |
Who cares anyway, credit is just a toy, as long as you get the science done. Well, for one thing, if the average credit displayed on the boinc manager statistics screen is chugging away at a constant level, then suddenly drops unexpectedly, then it is time to go looking for signs of a problem -- such as tasks which suddenly start failing in droves. Furthermore, it helps if one's contribution to the effort is appreciated by the people who run the projects. The only way I can see for such appreciation to be seen on an ongoing basis is through the credit system. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,591,167 RAC: 1,044 |
Furthermore, it helps if one's contribution to the effort is appreciated by the people who run the projects. The only way I can see for such appreciation to be seen on an ongoing basis is through the credit system.Cash would be a show of appreciation. Then again there are those who ran Collatz for credit and didn't believe there was a point to the maths. "It takes all sorts to make a world" as my gran used to say. |
Send message Joined: 28 Sep 04 Posts: 732 Credit: 49,336,437 RAC: 25,953 |
Is it my imagination or has the memory requirement dropped? All running Atlases are showing 4.4-4.8GB in Boinc. And one of them is a 4 core computer, the others are 8 cores per task of the computer's 24 cores. I thought Atlas used to be 2GB + 1GB per core, making 10GB for 8 cores and 6GB for 4 cores? The memory requirements dropped when the current version 3.01 was released in May. I remember it was talked then that 4 GB is enough for any number of cores (1...8). But I see the same reporting in Boinc as you see. |
©2024 CERN