Message boards : ATLAS application : Download failures
CloverField

Joined: 17 Oct 06
Posts: 88
Credit: 57,162,236
RAC: 7,635
Message 50259 - Posted: 27 May 2024, 14:47:54 UTC - in response to Message 50257.  

Do you think I should just reset LHC@home? All of my tasks are still failing. I do have a Squid cache, so maybe that's the source of my issues?

maeax
Joined: 2 May 07
Posts: 2244
Credit: 173,902,375
RAC: 677
Message 50260 - Posted: 27 May 2024, 14:57:25 UTC - in response to Message 50259.  

Win11 Pro: all ATLAS tasks are running well, with Squid.
https://lhcathome.cern.ch/lhcathome/results.php?hostid=10795955

computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2541
Credit: 254,608,838
RAC: 56,545
Message 50261 - Posted: 27 May 2024, 17:34:22 UTC - in response to Message 50259.  

Do you think I should just reset LHC@home? All of my tasks are still failing. I do have a Squid cache, so maybe that's the source of my issues?

ATLAS EVNT files are very large and Squid does not cache them.
The BOINC client decides which files need to be downloaded, based on the WU data it gets from the server.

Hence, I don't think your Squid is the problem.
Nonetheless, feel free to purge its cache - it will rebuild automatically.

A project reset looks more promising (no 100% guarantee).
Finish all tasks in your work buffer first.

CloverField
Joined: 17 Oct 06
Posts: 88
Credit: 57,162,236
RAC: 7,635
Message 50262 - Posted: 27 May 2024, 19:06:12 UTC - in response to Message 50261.  

Do you think I should just reset LHC@home? All of my tasks are still failing. I do have a Squid cache, so maybe that's the source of my issues?

ATLAS EVNT files are very large and Squid does not cache them.
The BOINC client decides which files need to be downloaded, based on the WU data it gets from the server.

Hence, I don't think your Squid is the problem.
Nonetheless, feel free to purge its cache - it will rebuild automatically.

A project reset looks more promising (no 100% guarantee).
Finish all tasks in your work buffer first.


Yeah, the Squid purge didn't work. I'll set "no new tasks" and go for the project reset.

Crystal Pellet
Volunteer moderator
Volunteer tester
Joined: 14 Jan 10
Posts: 1422
Credit: 9,484,585
RAC: 1,882
Message 50263 - Posted: 27 May 2024, 19:42:11 UTC - in response to Message 50261.  

A project reset looks more promising (no 100% guarantee).
Finish all tasks in your work buffer first.
Two days ago I tried a project reset first, but the task's state on the server stayed 'in progress', so the server would have resent the 'lost' task as soon as I requested work again.
Removing the project and then adding LHC again, however, put the task into the 'abandoned' state.
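
For anyone who prefers the command line, the sequence discussed here (stop fetching new work, then remove the project and add it again) can be sketched with `boinccmd`, the command-line tool that ships with the BOINC client. This is only a sketch under assumptions: the project URL is derived from the results link earlier in the thread, so check the URL your own client shows. The helper names are made up for illustration, `boinccmd` must be on the PATH, and the account key placeholder has to be replaced with your own. Nothing is executed unless you call `run()` yourself.

```python
import subprocess

# Assumption: master URL derived from the results link in this thread;
# check the exact URL your client shows for the project.
PROJECT_URL = "https://lhcathome.cern.ch/lhcathome/"

# Project operations passed as `boinccmd --project URL <op>`.
PROJECT_OPS = {"nomorework", "allowmorework", "update", "reset", "detach"}


def project_cmd(op, url=PROJECT_URL):
    """Build the argv for `boinccmd --project URL op` (hypothetical helper)."""
    if op not in PROJECT_OPS:
        raise ValueError(f"unsupported project operation: {op}")
    return ["boinccmd", "--project", url, op]


def attach_cmd(account_key, url=PROJECT_URL):
    """Build the argv for attaching to the project with your account key."""
    return ["boinccmd", "--project_attach", url, account_key]


def run(argv):
    """Execute one command; raises CalledProcessError if boinccmd fails."""
    subprocess.run(argv, check=True)


def remove_and_readd_plan(account_key):
    """Return the commands in order; nothing is executed here."""
    return [
        project_cmd("nomorework"),  # stop fetching tasks so the buffer can drain
        project_cmd("detach"),      # remove the project
        attach_cmd(account_key),    # add it again
    ]


if __name__ == "__main__":
    for argv in remove_and_readd_plan("YOUR_ACCOUNT_KEY"):
        print(" ".join(argv))
```

Building the commands separately from running them makes it possible to review the plan first, and to wait for running tasks to finish between the first and second step.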

maeax
Joined: 2 May 07
Posts: 2244
Credit: 173,902,375
RAC: 677
Message 50264 - Posted: 27 May 2024, 21:13:43 UTC - in response to Message 50263.  

I have Squid set up in a CentOS 9 VM,
so it's no problem to stop and start Squid.
While Squid is stopping or starting, BOINC Manager on Win11 Pro simply waits.

greg_be
Joined: 28 Dec 08
Posts: 339
Credit: 4,865,275
RAC: 191
Message 50265 - Posted: 27 May 2024, 21:43:30 UTC

I've lost 2 tasks (they show as in progress on the LHC server but are not physically on my system) due to this series of errors.

What's going on?

5/27/2024 11:35:23 PM | LHC@home | Started download of boinc_job_script.YrIbh8
5/27/2024 11:35:23 PM | LHC@home | [error] File lyiMDm8yGT5n7Olcko1bjSoqABFKDmABFKDmOtvXDmdQbKDmgBmKKm_input.tar.gz has wrong size: expected 480010, got 0
5/27/2024 11:35:23 PM | LHC@home | [error] Checksum or signature error for lyiMDm8yGT5n7Olcko1bjSoqABFKDmABFKDmOtvXDmdQbKDmgBmKKm_input.tar.gz
5/27/2024 11:35:24 PM | LHC@home | Incomplete read of 520.000000 < 5KB for boinc_job_script.YrIbh8 - truncating
5/27/2024 11:35:24 PM | LHC@home | Finished download of boinc_job_script.YrIbh8 (17537 bytes)
5/27/2024 11:35:24 PM | LHC@home | [error] File boinc_job_script.YrIbh8 has wrong size: expected 17537, got 0
5/27/2024 11:35:24 PM | LHC@home | [error] Checksum or signature error for boinc_job_script.YrIbh8
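
To see at a glance which files are affected, the `[error] File ... has wrong size` lines can be pulled out of the event log. A minimal sketch, assuming the log has been saved as text in the `timestamp | project | message` layout shown above. The function name and the regular expression are illustrative and not part of BOINC.

```python
import re

# Matches the message part of lines like:
#   ... | LHC@home | [error] File NAME has wrong size: expected 17537, got 0
WRONG_SIZE = re.compile(
    r"\[error\] File (?P<name>\S+) has wrong size: "
    r"expected (?P<expected>\d+), got (?P<got>\d+)"
)


def wrong_size_files(log_text):
    """Return (file name, expected bytes, received bytes) per matching line."""
    hits = []
    for line in log_text.splitlines():
        # The message is the third '|'-separated field.
        parts = line.split("|", 2)
        if len(parts) < 3:
            continue
        m = WRONG_SIZE.search(parts[2])
        if m:
            hits.append((m["name"], int(m["expected"]), int(m["got"])))
    return hits


if __name__ == "__main__":
    sample = (
        "5/27/2024 11:35:24 PM | LHC@home | Finished download of "
        "boinc_job_script.YrIbh8 (17537 bytes)\n"
        "5/27/2024 11:35:24 PM | LHC@home | [error] File "
        "boinc_job_script.YrIbh8 has wrong size: expected 17537, got 0\n"
    )
    for name, expected, got in wrong_size_files(sample):
        print(f"{name}: expected {expected} bytes, got {got}")
```

A result where every entry has `got` equal to 0 would match the pattern in the log above, where the client ends up with empty files.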

NOGOOD
Joined: 18 Nov 17
Posts: 131
Credit: 55,791,999
RAC: 6,041
Message 50266 - Posted: 28 May 2024, 11:02:47 UTC

I've got this problem:
28.05.2024 13:50:00 | LHC@home | Giving up on download of EifKDmyCrS5nsSi4apGgGQJmABFKDmABFKDm4ySLDmOyTKDmB07kJn_EVNT.38776190._000013.pool.root.1: permanent HTTP error

greg_be
Joined: 28 Dec 08
Posts: 339
Credit: 4,865,275
RAC: 191
Message 50270 - Posted: 28 May 2024, 18:25:55 UTC - in response to Message 50265.  

I did a project reset.
We'll see what happens.
Otherwise it just keeps trying to download the same file, which refuses to download because it's 'lost'.

Erich56
Joined: 18 Dec 15
Posts: 1821
Credit: 118,921,996
RAC: 33,985
Message 50272 - Posted: 28 May 2024, 19:03:30 UTC - in response to Message 50270.  

I did a project reset.
We'll see what happens.
Otherwise it just keeps trying to download the same file, which refuses to download because it's 'lost'.
greg_be: did the project reset solve your problem?

greg_be
Joined: 28 Dec 08
Posts: 339
Credit: 4,865,275
RAC: 191
Message 50273 - Posted: 28 May 2024, 21:26:13 UTC - in response to Message 50272.  

I did a project reset.
We'll see what happens.
Otherwise it just keeps trying to download the same file, which refuses to download because it's 'lost'.
greg_be: did the project reset solve your problem?

No. It sent the 'lost' task again and that failed again.
It just repeats itself.
I don't know how to shake off a 'lost' task.

Any ideas? Should I clear the LHC data out of the BOINC folder?
At this rate I will not get any new work until I can get rid of the lost task.

CloverField
Joined: 17 Oct 06
Posts: 88
Credit: 57,162,236
RAC: 7,635
Message 50274 - Posted: 28 May 2024, 22:26:02 UTC - in response to Message 50262.  

Do you think I should just reset LHC@home? All of my tasks are still failing. I do have a Squid cache, so maybe that's the source of my issues?

ATLAS EVNT files are very large and Squid does not cache them.
The BOINC client decides which files need to be downloaded, based on the WU data it gets from the server.

Hence, I don't think your Squid is the problem.
Nonetheless, feel free to purge its cache - it will rebuild automatically.

A project reset looks more promising (no 100% guarantee).
Finish all tasks in your work buffer first.


Yeah, the Squid purge didn't work. I'll set "no new tasks" and go for the project reset.

And no dice for me. I'll just run Theory until all my ATLAS tasks expire.

Crystal Pellet
Volunteer moderator
Volunteer tester
Joined: 14 Jan 10
Posts: 1422
Credit: 9,484,585
RAC: 1,882
Message 50277 - Posted: 29 May 2024, 6:57:26 UTC - in response to Message 50273.  

I don't know how to shake off a 'lost' task.

Any ideas? Should I clear the LHC data out of the BOINC folder?
At this rate I will not get any new work until I can get rid of the lost task.
Start reading before writing :)

NOGOOD
Joined: 18 Nov 17
Posts: 131
Credit: 55,791,999
RAC: 6,041
Message 50279 - Posted: 29 May 2024, 8:18:25 UTC - in response to Message 50277.  


greg_be
Joined: 28 Dec 08
Posts: 339
Credit: 4,865,275
RAC: 191
Message 50289 - Posted: 29 May 2024, 18:48:29 UTC - in response to Message 50279.  

Start reading before writing :)


It works! :-)



Your smart writing is actually a repeat of what someone else wrote.
So do you have a brain?

I had no time this morning to scroll through everything, and here I am 12 hours later. About the only thing I see that MIGHT be of use is what Crystal wrote a while back. Otherwise I don't see anything that applies to my simple setup in Windows.

Squid? I have no idea what they are talking about.

maeax
Joined: 2 May 07
Posts: 2244
Credit: 173,902,375
RAC: 677
Message 50290 - Posted: 29 May 2024, 18:54:54 UTC - in response to Message 50289.  

Squid is a proxy that caches the data on your PC.
You can install a proxy or not.
ATLAS now works as before, after the problems around midnight last night.

greg_be
Joined: 28 Dec 08
Posts: 339
Credit: 4,865,275
RAC: 191
Message 50291 - Posted: 29 May 2024, 19:00:28 UTC - in response to Message 50290.  

Squid is a proxy that caches the data on your PC.
You can install a proxy or not.
ATLAS now works as before, after the problems around midnight last night.


Thank you for the information on Squid.

I am downloading the massive file for the task.
7 minutes to download with DSL! That's a huge file!

Toby Broom
Volunteer moderator
Joined: 27 Sep 08
Posts: 850
Credit: 692,696,974
RAC: 98,025
Message 50294 - Posted: 30 May 2024, 16:40:33 UTC - in response to Message 50291.  

For me, on one computer, resetting the project does not fix the problems. I will monitor further.

greg_be
Joined: 28 Dec 08
Posts: 339
Credit: 4,865,275
RAC: 191
Message 50297 - Posted: 30 May 2024, 18:29:44 UTC - in response to Message 50294.  

For me, on one computer, resetting the project does not fix the problems. I will monitor further.


I did what Crystal did: I removed the project and then attached to it again, and the problem was solved.

rbpeake
Joined: 17 Sep 04
Posts: 105
Credit: 32,824,862
RAC: 131
Message 50298 - Posted: 30 May 2024, 18:49:33 UTC - in response to Message 50297.  

For me, on one computer, resetting the project does not fix the problems. I will monitor further.


I did what Crystal did: I removed the project and then attached to it again, and the problem was solved.


Agreed, resetting the project does not work. You have to remove the project, and then reattach.
Regards,
Bob P.


©2024 CERN