Message boards : ATLAS application : Validation Error
Message board moderation

To post messages, you must log in.

AuthorMessage
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 15 Jun 08
Posts: 1966
Credit: 139,950,386
RAC: 88,895
Message 37561 - Posted: 7 Dec 2018, 11:18:48 UTC

Got a strange validation error for this task:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=211509698

Logfile snippet:
****************The last 100 lines of the pilot log******************
2018-12-07 10:40:23|66033|MessageInter| pika module does not exist - ARGO interface will fail
2018-12-07 10:40:23|66033|pUtil.py    | PanDA Pilot, version PICARD 73.7
2018-12-07 10:40:23|66033|pUtil.py    | Version tag = PR
2018-12-07 10:40:23|66033|pUtil.py    | PilotId = xtestP001, jobSchedulerId = xtestJS001
2018-12-07 10:40:23|66033|pUtil.py    | Current time: 2018-12-07T11:40:23+-100
2018-12-07 10:40:23|66033|pUtil.py    | Run by Python 2.6.6 (r266:84292, Aug 18 2016, 15:13:37) 
[GCC 4.4.7 20120313 (Red Hat 4.4.7-17)]
2018-12-07 10:40:23|66033|pUtil.py    | 64 bit OS
2018-12-07 10:40:23|66033|pUtil.py    | Pilot init dir: /home/boinc1/BOINC_ATLAS/slots/3
2018-12-07 10:40:23|66033|pUtil.py    | All output written to file: pilotlog.txt
2018-12-07 10:40:23|66033|pUtil.py    | Pilot executed by: boinc1
2018-12-07 10:40:23|66033|pilot.py    | argParser arguments: ['-h', 'BOINC_MCORE', '-s', 'BOINC_MCORE', '-f', 'false', '-u', 'managed', '-F', 'Nordugrid-ATLAS', '-d', '{HOME}', '-j', 'false', '-z', 'true', '-b', '2', '-t', 'false']
2018-12-07 10:40:23|66033|pUtil.py    | Processing queuedata
2018-12-07 10:40:23|66033|pUtil.py    | getSiteInformation: got experiment=Nordugrid-ATLAS
2018-12-07 10:40:23|66033|SiteInformat| Executing command: curl --connect-timeout 20 --max-time 120 --cacert /tmp/x509up_u2001 -sS "http://pandaserver.cern.ch:25085/cache/schedconfig/BOINC_MCORE.all.json" > /home/boinc1/BOINC_ATLAS/slots/3/queuedata.json
2018-12-07 10:40:23|66033|SiteInformat| !!WARNING!!1999!! curl command exited with code 1792
2018-12-07 10:40:23|66033|SiteInformat| Executing command: curl --connect-timeout 20 --max-time 120 --cacert /tmp/x509up_u2001 -sS "http://pandaserver.cern.ch:25085/cache/schedconfig/BOINC_MCORE.all.json" > /home/boinc1/BOINC_ATLAS/slots/3/queuedata.json
2018-12-07 10:40:23|66033|SiteInformat| !!WARNING!!1999!! curl command exited with code 1792
2018-12-07 10:40:23|66033|SiteInformat| Executing command: curl --connect-timeout 20 --max-time 120 --cacert /tmp/x509up_u2001 -sS "http://pandaserver.cern.ch:25085/cache/schedconfig/BOINC_MCORE.all.json" > /home/boinc1/BOINC_ATLAS/slots/3/queuedata.json
2018-12-07 10:40:23|66033|SiteInformat| !!WARNING!!1999!! curl command exited with code 1792
2018-12-07 10:40:23|66033|ATLASSiteInf| !!FAILED!!1999!! Found no valid queuedata - aborting pilot
***************diag file************
ID: 37561 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 1513
Credit: 49,305,827
RAC: 150,592
Message 37562 - Posted: 7 Dec 2018, 13:54:42 UTC

This is yours:
Checking for CVMFS
CVMFS is installed
OS:cat: /etc/redhat-release: Datei oder Verzeichnis nicht gefunden
This is not SLC6, need to run with Singularity....
Checking Singularity...
Singularity is installed

This is when SL69:
Checking for CVMFS
CVMFS is installed
OS:Scientific Linux release 6.10 (Carbon)
This is SLC or CentOS release 6, run the atlas job without Singularity
ID: 37562 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 15 Jun 08
Posts: 1966
Credit: 139,950,386
RAC: 88,895
Message 37563 - Posted: 7 Dec 2018, 14:30:17 UTC - in response to Message 37562.  

I don't run SLC6, so this lines are important:
This is not SLC6, need to run with Singularity....
Checking Singularity...
Singularity is installed


See the logfiles from my successful tasks.

Beside that CVMFS- or Singularity-errors would have stopped the task after a few seconds.
This tasks had a runtime of a bit more than 10 minutes.
ID: 37563 · Report as offensive     Reply Quote
David Cameron
Project administrator
Project developer
Project scientist

Send message
Joined: 13 May 14
Posts: 362
Credit: 13,000,837
RAC: 5,413
Message 37583 - Posted: 10 Dec 2018, 9:53:47 UTC - in response to Message 37563.  

We were running a few ATLAS tasks with a new wrapper script last week and it seems there is a bug in it. One of the expected files used by the tasks was not present when the task started so it failed over to downloading it from pandaserver.cern.ch:25085 and I suppose you have a firewall blocking that server or port number. We will fix this bug for the next tasks.
ID: 37583 · Report as offensive     Reply Quote

Message boards : ATLAS application : Validation Error


©2022 CERN