log in

ATLAS affected by DSL Outage


Advanced search

Message boards : ATLAS application : ATLAS affected by DSL Outage

Author Message
computezrmle
Send message
Joined: 15 Jun 08
Posts: 347
Credit: 3,501,271
RAC: 1,830
Message 31192 - Posted: 30 Jun 2017, 10:32:50 UTC

The following ATLAS job was affected by a 1 h DSL outage caused by my ISP and finished later without manual intervention.
You (mainly the project team) may examine the logs to see if the VM behaves as expected.

https://lhcathome.cern.ch/lhcathome/result.php?resultid=150145411

David Cameron
Project administrator
Project developer
Project scientist
Send message
Joined: 13 May 14
Posts: 139
Credit: 3,159,531
RAC: 6,484
Message 31193 - Posted: 30 Jun 2017, 11:23:21 UTC - in response to Message 31192.

From the output:

Input/output error: '/cvmfs/atlas.cern.ch/repo/sw/software/21.0/AtlasOffline/21.0.15/InstallArea/x86_64-slc6-gcc49-opt/jobOptions/SimuJobTransforms/skeleton.HITSMerge.py'

Data in /cvmfs is read over the network so your outage caused the task to fail. You still got the credits because you used significant CPU but our systems down the chain will see the error and retry the task automatically.

computezrmle
Send message
Joined: 15 Jun 08
Posts: 347
Credit: 3,501,271
RAC: 1,830
Message 31197 - Posted: 30 Jun 2017, 12:00:22 UTC - in response to Message 31193.

Thank you David.

It may at least help to harden the WUs against outages or - if this situations are rare - simply reschedule the job as you stated.

Message boards : ATLAS application : ATLAS affected by DSL Outage