Message boards :
ATLAS application :
Unable to upload an Atlas task
Message board moderation
Previous · 1 · 2 · 3 · 4 · Next
Author | Message |
---|---|
Send message Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 852 |
The upload problems for ATLAS tasks are back. The number of running jobs is over 10,000 again. Almost 11,000, maybe therefore the upload problems again? |
Send message Joined: 15 Jun 08 Posts: 2541 Credit: 254,608,838 RAC: 23,290 |
A couple of my ATLAS WUs are now trying to upload for nearly 30h. Is anybody working on a (short term) solution? |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,270,405 RAC: 5,338 |
|
Send message Joined: 1 May 07 Posts: 27 Credit: 2,339,393 RAC: 63 |
I am getting a 100% transfer of file and then fails. It then tries to upload the entire file again and fails at 100%.... frustrating. |
Send message Joined: 13 May 14 Posts: 387 Credit: 15,314,184 RAC: 0 |
Is it the same "HTTP transient error" that has been seen before? I see there is increased load on our file servers but I'm not sure what is causing it. However some results are getting through ok. |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 307 |
File Upload server error..... file_upload_handler PID=-1 Some WU's are uploaded - one 30 min. ago, but a lot are waiting...... |
Send message Joined: 22 Mar 17 Posts: 64 Credit: 14,576,403 RAC: 858 |
1 for me as well: 3570k 6313 LHC@home 3/6/2018 5:53:12 PM Temporarily failed upload of j70KDm0NUCsnDDn7oo6G73TpABFKDmABFKDmSwKKDmABFKDmohi1Vn_2_r1676589227_ATLAS_result: transient upload error |
Send message Joined: 17 Sep 04 Posts: 105 Credit: 32,824,862 RAC: 59 |
I have 10 waiting, some from as long as 2 days ago. Here is the BOINC log: 3/6/2018 6:41:17 PM | LHC@home | Started upload of qzJODmxEjCsnSu7Ccp2YYBZmABFKDmABFKDmyQMKDmABFKDmUGlfOn_2_r1113909317_ATLAS_result 3/6/2018 6:41:17 PM | LHC@home | Started upload of vUaKDmlmrCsnSu7Ccp2YYBZmABFKDmABFKDmfhGKDmABFKDmEH2fzm_0_r963091061_ATLAS_result 3/6/2018 6:41:28 PM | LHC@home | [error] Error reported by file upload server: [vUaKDmlmrCsnSu7Ccp2YYBZmABFKDmABFKDmfhGKDmABFKDmEH2fzm_0_r963091061_ATLAS_result] locked by file_upload_handler PID=-1 3/6/2018 6:41:28 PM | LHC@home | Temporarily failed upload of vUaKDmlmrCsnSu7Ccp2YYBZmABFKDmABFKDmfhGKDmABFKDmEH2fzm_0_r963091061_ATLAS_result: transient upload error 3/6/2018 6:41:28 PM | LHC@home | Backing off 03:18:54 on upload of vUaKDmlmrCsnSu7Ccp2YYBZmABFKDmABFKDmfhGKDmABFKDmEH2fzm_0_r963091061_ATLAS_result 3/6/2018 6:41:29 PM | LHC@home | [error] Error reported by file upload server: [qzJODmxEjCsnSu7Ccp2YYBZmABFKDmABFKDmyQMKDmABFKDmUGlfOn_2_r1113909317_ATLAS_result] locked by file_upload_handler PID=-1 3/6/2018 6:41:29 PM | LHC@home | Temporarily failed upload of qzJODmxEjCsnSu7Ccp2YYBZmABFKDmABFKDmyQMKDmABFKDmUGlfOn_2_r1113909317_ATLAS_result: transient upload error 3/6/2018 6:41:29 PM | LHC@home | Backing off 05:35:19 on upload of qzJODmxEjCsnSu7Ccp2YYBZmABFKDmABFKDmyQMKDmABFKDmUGlfOn_2_r1113909317_ATLAS_result Regards, Bob P. |
Send message Joined: 13 May 14 Posts: 387 Credit: 15,314,184 RAC: 0 |
Are things better now? The file server load has gone back to normal overnight and the half-uploaded files which cause the "locked by file_upload_handler" messages should have been cleaned. |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 307 |
Since Yesterday morning only ONE Atlas-Task was downloading. Upload is always waiting. After a manual activating, it fall back to waiting. Edit: -dev Atlas run normally with the new storage backend! |
Send message Joined: 15 Jun 08 Posts: 2541 Credit: 254,608,838 RAC: 23,290 |
David Cameron wrote: Are things better now? Partly. David Cameron wrote: ... and the half-uploaded files which cause the "locked by file_upload_handler" messages should have been cleaned. Unfortunately not all of them: Mi 07 Mär 2018 09:25:46 CET | LHC@home | Started upload of fkHKDmW16CsnDDn7oo6G73TpABFKDmABFKDmXfLKDmABFKDmcutGYo_0_r629916232_ATLAS_result Mi 07 Mär 2018 09:25:48 CET | LHC@home | [error] Error reported by file upload server: [fkHKDmW16CsnDDn7oo6G73TpABFKDmABFKDmXfLKDmABFKDmcutGYo_0_r629916232_ATLAS_result] locked by file_upload_handler PID=-1 Mi 07 Mär 2018 09:25:48 CET | LHC@home | Temporarily failed upload of fkHKDmW16CsnDDn7oo6G73TpABFKDmABFKDmXfLKDmABFKDmcutGYo_0_r629916232_ATLAS_result: transient upload error Mi 07 Mär 2018 09:25:48 CET | LHC@home | Backing off 03:22:08 on upload of fkHKDmW16CsnDDn7oo6G73TpABFKDmABFKDmXfLKDmABFKDmcutGYo_0_r629916232_ATLAS_result Also affected by the same type of error: GeoNDmDJOCsnSu7Ccp2YYBZmABFKDmABFKDmvALKDmABFKDmuGxLLo_0_r931142306_ATLAS_result h4eLDmFDLDsnSu7Ccp2YYBZmABFKDmABFKDm0aIKDmABFKDm5bw40n_0_r12622427_ATLAS_result sGvNDmXMqCsnDDn7oo6G73TpABFKDmABFKDm3eFKDmABFKDmaYr87m_0_r1115052517_ATLAS_result 79MLDm9DJDsnDDn7oo6G73TpABFKDmABFKDm71HKDmABFKDmmbpShm_2_r233222736_ATLAS_result It may be necessary to run the cleanup script more frequently. When do you expect the new storage backend will be activated? Thanks for taking care. |
Send message Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 852 |
Are things better now? I had and have only one in Upload pending status. Did not request more ATLAS tasks until it's fixed, but my retry upload does not function: LHC@home 07 Mar 12:47:56 [error] Error reported by file upload server: [lFvLDmDP1CsnDDn7oo6G73TpABFKDmABFKDmsUJKDmABFKDmwnH64m_0_r894275464_ATLAS_result] locked by file_upload_handler PID=-1 LHC@home 07 Mar 12:47:56 Temporarily failed upload of lFvLDmDP1CsnDDn7oo6G73TpABFKDmABFKDmsUJKDmABFKDmwnH64m_0_r894275464_ATLAS_result: transient upload error |
Send message Joined: 22 May 17 Posts: 15 Credit: 1,226,011 RAC: 4 |
Yes, better now. I had two tasks stuck for over 24 hours, and this morning when I booted the machine, both of them immediately completed their upload and transferred. Thank you! |
Send message Joined: 1 Sep 04 Posts: 57 Credit: 2,835,005 RAC: 0 |
2018-03-07 5:40:17 PM | LHC@home | Temporarily failed upload of su3MDmEYCCsnSu7Ccp2YYBZmABFKDmABFKDmnbGKDmABFKDm1owStm_0_r1889291022_ATLAS_result: transient upload error |
Send message Joined: 15 Jun 08 Posts: 2541 Credit: 254,608,838 RAC: 23,290 |
Finally all pending uploads could be finished and reported. |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 307 |
+1 :-) |
Send message Joined: 22 Mar 17 Posts: 64 Credit: 14,576,403 RAC: 858 |
Another locked: 316316 LHC@home 3/12/2018 6:41:14 AM [error] Error reported by file upload server: [E8dMDmPJSFsnDDn7oo6G73TpABFKDmABFKDmquKKDmABFKDmkyA17m_0_r860623117_ATLAS_result] locked by file_upload_handler PID=-1 |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 307 |
same here. |
Send message Joined: 15 Jun 08 Posts: 2541 Credit: 254,608,838 RAC: 23,290 |
Same here. |
Send message Joined: 15 Jun 08 Posts: 2541 Credit: 254,608,838 RAC: 23,290 |
My uploads are back to normal. May have been just a glitch. |
©2024 CERN