21) Message boards : CMS Application : no new WUs available (Message 49526)
Posted 12 Feb 2024 by Erich56
Post:
CMS jobs inside BOINC-VM available again . . .
credentials working now?
22) Message boards : CMS Application : Could not get X509 credentials (Message 49516)
Posted 11 Feb 2024 by Erich56
Post:
Can this problem with getting a proxy credential from LHC be avoided by installing a local proxy server?
obviously not; I am using a local proxy, but I experience the credential problem, too.

As the log shows, the credential is requested directly from LHC@home(-dev)

(1648): Guest Log: [INFO] Requesting an X509 credential from LHC@home
(1648): Guest Log: [INFO] Requesting an X509 credential from vLHC@home-dev
23) Message boards : CMS Application : Could not get X509 credentials (Message 49513)
Posted 11 Feb 2024 by Erich56
Post:
I set to NNT, no point to hammer the server for no work. Maybe it will be back on Monday
you say it!
Still, the automatic stop of task submission does not work, and according to the server status page, 13.565 tasks are "in process", none of them of any value for the science, due to the lack of credentials and jobs :-(
What a waste :-(
24) Message boards : CMS Application : Could not get X509 credentials (Message 49510)
Posted 10 Feb 2024 by Erich56
Post:
Ivan, any idea what's going on with CMS?

No credentials, no jobs :-(
25) Message boards : CMS Application : Could not get X509 credentials (Message 49508)
Posted 10 Feb 2024 by Erich56
Post:
now I also had a problem getting credentials:

...
2024-02-10 19:11:15 (30552): Guest Log: ERROR: Couldn't read proxy from: /tmp/x509up_u0
2024-02-10 19:11:15 (30552): Guest Log: globus_credential: Error reading proxy credential
2024-02-10 19:11:15 (30552): Guest Log: globus_credential: Error reading proxy credential: Couldn't read PEM from bio
2024-02-10 19:11:15 (30552): Guest Log: OpenSSL Error: pem_lib.c:707: in library: PEM routines, function PEM_read_bio: no start line
2024-02-10 19:11:15 (30552): Guest Log: Use -debug for further information.
2024-02-10 19:11:15 (30552): Guest Log: [ERROR] Could not get an x509 credential
2024-02-10 19:11:15 (30552): Guest Log: [ERROR] The x509 proxy creation failed.
2024-02-10 19:11:15 (30552): Guest Log: [DEBUG] Volunteer: Erich56 (389876)
2024-02-10 19:11:15 (30552): Guest Log: [INFO] Shutting Down.
2024-02-10 19:11:45 (30552): VM Completion File Detected.
2024-02-10 19:11:45 (30552): VM Completion Message: The x509 proxy creation failed.
...
https://lhcathome.cern.ch/lhcathome/result.php?resultid=405986026
26) Message boards : CMS Application : Could not get X509 credentials (Message 49506)
Posted 10 Feb 2024 by Erich56
Post:
I imagine there is some CERN maintenance over the weekend as all the other tasks dried up as well.
besides, I have noticed that the CERN websites sometimes have longer response times than usual.
27) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49494)
Posted 10 Feb 2024 by Erich56
Post:
From the looks of the rest of it, you were having some network connection problems. The result output is the concatenation of several logs at the end of the task, so the output is not necessarily in chronological order.
thank you, Ivan, for the clarification.
I should have read the complete stderr text, and not just the first few lines - sorry for that :-(

no idea what happened to the network, I'll need to watch this more closely.
28) Message boards : Theory Application : Website showing list with "bad" sherpa tasks (Message 49492)
Posted 10 Feb 2024 by Erich56
Post:
on one of my hosts, a Sherpa [Boinc pp winclusive 7000 10 - sherpa 2.2.9 default 6000 223] has been running for more than 2 days.
Console f2 says "lean back and enjoy" ... but no events are being shown since then. CPU is active with 99+ % in console f3.

Since this task is shown in the list linked in the posting above, I guess that I should kill the task, right?
29) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49490)
Posted 10 Feb 2024 by Erich56
Post:
Last night, on some of my hosts numerous tasks failed after a few minutes - see here:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=405976178

stderr starts with:

<core_client_version>7.24.1</core_client_version>
<![CDATA[
<message>
The filename or extension is too long.
(0xce) - exit code 206 (0xce)
</message>

how come?
30) Message boards : Theory Application : Website showing list with "bad" sherpa tasks (Message 49485)
Posted 9 Feb 2024 by Erich56
Post:
I remember that a few years ago, someone here in the forum posted a URL to a website showing the names of "bad" (faulty) sherpa tasks (so that one could check after a task download whether that task will succeed or probably fail).
Unfortunately, I don't remember this URL and I can't find it anywhere.
Could someone please post it here?
31) Message boards : CMS Application : no new WUs available (Message 49475)
Posted 9 Feb 2024 by Erich56
Post:
...so I will just let them run and see what happens.
the tasks will finish eventually. See my posting from this morning:
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6104&postid=49474#49474
32) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49474)
Posted 9 Feb 2024 by Erich56
Post:
my observation this morning is as follows:

the tasks finally finish after about 4 to 5 times the previous runtime, with credit points about double compared to before.
33) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49465)
Posted 8 Feb 2024 by Erich56
Post:
Again I am noticing the same phenomenon as referred to in the title of this thread:
the new tasks have now been running longer than the ones from before (several days ago), still not finished, and no longer using CPU for quite some time.
Is this a coincidence, or is the same problem from the day before yesterday back?
still none of the tasks which started on all of my machines this early afternoon has finished yet. All running without CPU usage meanwhile.
Hence, the same problem as 2 days ago still exists. Something is definitely wrong with these tasks :-(
34) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49464)
Posted 8 Feb 2024 by Erich56
Post:
Again I am noticing the same phenomenon as referred to in the title of this thread:
the new tasks have now been running longer than the ones from before (several days ago), still not finished, and no longer using CPU for quite some time.
Is this a coincidence, or is the same problem from the day before yesterday back?
35) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49459)
Posted 8 Feb 2024 by Erich56
Post:
OK, we're running again, after a few glitches (and one error in a script, memory given in GiB where MiB was expected!). I think I'll resist any attempt at further tests until after the weekend...
thanks, Ivan, for the information. The tasks now work fine.
Could you please make sure that the automatic "tasks-sending-stop-tool" (sorry for this strange name I invented for it) is also working again, just in case the next problem with job submission comes up at some point.
36) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49448)
Posted 8 Feb 2024 by Erich56
Post:
This is the header:
tasks now running unusual long time without CPU usage
the currently downloaded tasks do NOT run unusually long. They run only for about 25 minutes, using almost no CPU, because they do not receive jobs.
I put my comment from earlier this morning in here just because the current problems with CMS have been discussed here within the past few days. So I did not want to open a new thread.
37) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49446)
Posted 8 Feb 2024 by Erich56
Post:
as a test 10 minutes ago shows, there are still no jobs available, but tasks are still being distributed - the automatic stop of task distribution again does not work.

The result looks like this:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=405292196

On the project status page one can see 345 users within the past 24 hours (and almost 13.000 tasks being processed), so that many users have been crunching CMS tasks which run only in the "envelope" and finish after some 25 minutes, with NO VALUE at all for the science :-(

Hopefully this nonsense will be stopped ASAP.
38) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49438)
Posted 7 Feb 2024 by Erich56
Post:
Task is running 25 minutes and cmsRun inside VM 13 minutes now (95% Cpu)
You were luckier than I was: The task I started did not error out like the others before, but obviously there were no jobs available. The task finished after 29 minutes with just about 1 minute CPU:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=405220625
in view of the fact that obviously no jobs are available, wouldn't it make sense to stop distribution of tasks?
I tested again this morning, and still the tasks finished after about 25 minutes because no jobs could be downloaded. What puzzles me is that this time the automatic stop of task submission does not work.
39) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49433)
Posted 6 Feb 2024 by Erich56
Post:
Task is running 25 minutes and cmsRun inside VM 13 minutes now (95% Cpu)
You were luckier than I was: The task I started did not error out like the others before, but obviously there were no jobs available. The task finished after 29 minutes with just about 1 minute CPU:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=405220625
in view of the fact that obviously no jobs are available, wouldn't it make sense to stop distribution of tasks?
40) Message boards : CMS Application : tasks now running unusual long time without CPU usage (Message 49432)
Posted 6 Feb 2024 by Erich56
Post:
Task is running 25 minutes and cmsRun inside VM 13 minutes now (95% Cpu)
You were luckier than I was: The task I started did not error out like the others before, but obviously there were no jobs available. The task finished after 29 minutes with just about 1 minute CPU:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=405220625



