Message boards : Number crunching : invalid results
Message board moderation

To post messages, you must log in.

AuthorMessage
kcharuso

Send message
Joined: 19 Feb 22
Posts: 1
Credit: 286,703
RAC: 0
Message 47659 - Posted: 12 Jan 2023, 7:29:09 UTC

hi all,

i seems to have lots of invalid results but i do not know why and i have limited knowledge regarding what the log file say but below is the report from one of the invalid work unit. please assist, thank you :)

<core_client_version>7.20.2</core_client_version>
<![CDATA[
<stderr_txt>
2023-01-12 13:37:19 (17220): Detected: vboxwrapper 26206
2023-01-12 13:37:19 (17220): Detected: BOINC client v7.20.2
2023-01-12 13:37:20 (17220): Detected: VirtualBox VboxManage Interface (Version: 7.0.4)
2023-01-12 13:37:20 (17220): Successfully copied 'init_data.xml' to the shared directory.
2023-01-12 13:37:21 (17220): Create VM. (boinc_5deb979876a75273, slot#4)
2023-01-12 13:37:21 (17220): Setting Memory Size for VM. (10200MB)
2023-01-12 13:37:22 (17220): Setting CPU Count for VM. (8)
2023-01-12 13:37:22 (17220): Setting Chipset Options for VM.
2023-01-12 13:37:22 (17220): Setting Graphics Controller Options for VM.
2023-01-12 13:37:22 (17220): Setting Boot Options for VM.
2023-01-12 13:37:23 (17220): Setting Network Configuration for NAT.
2023-01-12 13:37:23 (17220): Enabling VM Network Access.
2023-01-12 13:37:23 (17220): Disabling USB Support for VM.
2023-01-12 13:37:23 (17220): Disabling COM Port Support for VM.
2023-01-12 13:37:24 (17220): Disabling LPT Port Support for VM.
2023-01-12 13:37:24 (17220): Disabling Audio Support for VM.
2023-01-12 13:37:24 (17220): Disabling Clipboard Support for VM.
2023-01-12 13:37:24 (17220): Disabling Drag and Drop Support for VM.
2023-01-12 13:37:25 (17220): Adding storage controller(s) to VM.
2023-01-12 13:37:25 (17220): Adding virtual disk drive to VM. (ATLAS_vbox_2.03_image.vdi)
2023-01-12 13:37:25 (17220): Adding VirtualBox Guest Additions to VM.
2023-01-12 13:37:26 (17220): Adding network bandwidth throttle group to VM. (Defaulting to 1024GB)
2023-01-12 13:37:26 (17220): forwarding host port 50809 to guest port 80
2023-01-12 13:37:26 (17220): Enabling remote desktop for VM.
2023-01-12 13:37:27 (17220): Enabling shared directory for VM.
2023-01-12 13:37:27 (17220): Starting VM using VBoxManage interface. (boinc_5deb979876a75273, slot#4)
2023-01-12 13:37:33 (17220): Successfully started VM. (PID = '9592')
2023-01-12 13:37:33 (17220): Reporting VM Process ID to BOINC.
2023-01-12 13:37:33 (17220): Guest Log: BIOS: VirtualBox 7.0.4
2023-01-12 13:37:33 (17220): Guest Log: CPUID EDX: 0x178bfbff
2023-01-12 13:37:33 (17220): Guest Log: BIOS: No PCI IDE controller, not probing IDE
2023-01-12 13:37:33 (17220): Guest Log: BIOS: AHCI 0-P#0: PCHS=16383/16/63 LCHS=1024/255/63 0x0000000002800000 sectors
2023-01-12 13:37:33 (17220): VM state change detected. (old = 'poweredoff', new = 'running')
2023-01-12 13:37:33 (17220): Detected: Web Application Enabled (http://localhost:50809)
2023-01-12 13:37:33 (17220): Detected: Remote Desktop Enabled (localhost:50810)
2023-01-12 13:37:33 (17220): Preference change detected
2023-01-12 13:37:33 (17220): Setting CPU throttle for VM. (100%)
2023-01-12 13:37:33 (17220): Setting checkpoint interval to 900 seconds. (Higher value of (Preference: 300 seconds) or (Vbox_job.xml: 900 seconds))
2023-01-12 13:37:35 (17220): Guest Log: BIOS: Boot : bseqnr=1, bootseq=0032
2023-01-12 13:37:35 (17220): Guest Log: BIOS: Booting from Hard Disk...
2023-01-12 13:37:38 (17220): Guest Log: BIOS: KBD: unsupported int 16h function 03
2023-01-12 13:37:38 (17220): Guest Log: BIOS: AX=0305 BX=0000 CX=0000 DX=0000
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=81
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=81
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=82
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=82
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=83
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=83
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=84
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=84
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=85
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=85
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=86
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=86
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=87
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=87
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=88
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=88
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=89
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=89
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=8a
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=8a
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=8b
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=8b
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=8c
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=8c
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=8d
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=8d
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=8e
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=8e
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk_ext: function 41, unmapped device for ELDL=8f
2023-01-12 13:37:38 (17220): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=8f
2023-01-12 13:37:42 (17220): Guest Log: vgdrvHeartbeatInit: Setting up heartbeat to trigger every 2000 milliseconds
2023-01-12 13:37:42 (17220): Guest Log: vboxguest: misc device minor 58, IRQ 20, I/O port d020, MMIO at 00000000f0400000 (size 0x400000)
2023-01-12 13:37:46 (17220): Guest Log: VBoxService 5.2.32 r132073 (verbosity: 0) linux.amd64 (Jul 12 2019 10:32:28) release log
2023-01-12 13:37:46 (17220): Guest Log: 00:00:00.000125 main Log opened 2023-01-12T13:37:43.778629000Z
2023-01-12 13:37:46 (17220): Guest Log: 00:00:00.000196 main OS Product: Linux
2023-01-12 13:37:46 (17220): Guest Log: 00:00:00.000215 main OS Release: 3.10.0-957.27.2.el7.x86_64
2023-01-12 13:37:46 (17220): Guest Log: 00:00:00.000231 main OS Version: #1 SMP Mon Jul 29 17:46:05 UTC 2019
2023-01-12 13:37:46 (17220): Guest Log: 00:00:00.000247 main Executable: /opt/VBoxGuestAdditions-5.2.32/sbin/VBoxService
2023-01-12 13:37:46 (17220): Guest Log: 00:00:00.000247 main Process ID: 1535
2023-01-12 13:37:46 (17220): Guest Log: 00:00:00.000248 main Package type: LINUX_64BITS_GENERIC
2023-01-12 13:37:46 (17220): Guest Log: 00:00:00.000993 main 5.2.32 r132073 started. Verbose level = 0
2023-01-12 13:37:46 (17220): Guest Log: [INFO] Probing /cvmfs/atlas.cern.ch...
2023-01-12 13:37:46 (17220): Guest Log: [INFO] Mounting shared directory
2023-01-12 13:37:46 (17220): Guest Log: [INFO] Checking for init_data.xml
2023-01-12 13:37:48 (17220): Guest Log: [INFO] Probing /cvmfs/atlas.cern.ch... OK
2023-01-12 13:37:48 (17220): Guest Log: [INFO] Detected branch: prod
2023-01-12 13:37:49 (17220): Guest Log: This is the prod version of the ATLAS job wrapper
2023-01-12 13:37:49 (17220): Guest Log: Copying input files
2023-01-12 13:37:50 (17220): Guest Log: Copied input files into RunAtlas.
2023-01-12 13:37:50 (17220): Guest Log: This VM did not configure a local http proxy via BOINC.
2023-01-12 13:37:50 (17220): Guest Log: Small home clusters do not require a local http proxy but it is suggested if
2023-01-12 13:37:50 (17220): Guest Log: more than 10 cores throughout the same LAN segment are regularly running ATLAS like tasks.
2023-01-12 13:37:50 (17220): Guest Log: Further information can be found at the LHC@home message board.
2023-01-12 13:37:56 (17220): Guest Log: 00:00:10.023476 timesync vgsvcTimeSyncWorker: Radical guest time change: -25 188 215 977 000ns (GuestNow=1 673 505 475 564 484 000 ns GuestLast=1 673 530 663 780 461 000 ns fSetTimeLastLoop=true )
2023-01-12 13:37:58 (17220): Guest Log: Running cvmfs_config stat atlas.cern.ch
2023-01-12 13:37:58 (17220): Guest Log: VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
2023-01-12 13:37:58 (17220): Guest Log: 2.6.3.0 1811 0 32192 114452 4 1 1522681 4096000 0 65024 0 0 n/a 0 0 http://s1cern-cvmfs.openhtc.io/cvmfs/atlas.cern.ch DIRECT 1
2023-01-12 13:37:59 (17220): Guest Log: copied the webapp to /var/www
2023-01-12 13:37:59 (17220): Guest Log: ATHENA_PROC_NUMBER=8
2023-01-12 13:37:59 (17220): Guest Log: *** Starting ATLAS job. (PandaID=5714143078 taskID=31766465) ***
2023-01-12 13:39:28 (17220): Preference change detected
2023-01-12 13:39:28 (17220): Setting CPU throttle for VM. (100%)
2023-01-12 13:39:28 (17220): Setting checkpoint interval to 900 seconds. (Higher value of (Preference: 300 seconds) or (Vbox_job.xml: 900 seconds))
2023-01-12 13:43:14 (17220): Guest Log: *** Job finished ***
2023-01-12 13:43:14 (17220): Guest Log: *** The last 20 lines of the pilot log: ***
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:05,434 | WARNING | data:queue_monitoring:received graceful stop - abort after this iteration
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:07,780 | WARNING | no jobs in monitored_payloads queue (waited for 61 s)
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:07,781 | INFO | [job] job monitor thread has finished
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:08,451 | INFO | [data] queue_monitor thread has finished
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,597 | INFO | job.realtimelogging is not enabled
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,597 | INFO | [payload] run_realtimelog thread has finished
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,800 | INFO | end of generic workflow (traces error code: 0)
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,800 | INFO | traces error code: 0
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,800 | INFO | pilot has finished
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,838 [wrapper] ==== pilot stdout END ====
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,843 [wrapper] ==== wrapper stdout RESUME ====
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,845 [wrapper] pilotpid: 7163
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,846 [wrapper] Pilot exit status: 0
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,851 [wrapper] pandaids: 5714143078
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,853 [wrapper] apfmon messages muted
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,854 [wrapper] Test setup, not cleaning
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,855 [wrapper] ==== wrapper stdout END ====
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,856 [wrapper] ==== wrapper stderr END ====
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,858 [wrapper] wrapperexiting ec=0, duration=315
2023-01-12 13:43:14 (17220): Guest Log: 2023-01-12 06:43:13,859 [wrapper] apfmon messages muted
2023-01-12 13:43:14 (17220): Guest Log: *** Error codes and diagnostics ***
2023-01-12 13:43:14 (17220): Guest Log: "exeErrorCode": 65,
2023-01-12 13:43:14 (17220): Guest Log: "exeErrorDiag": "Non-zero return code from EVNTtoHITS (33); Logfile error in log.EVNTtoHITS: \"DetectorStore FATAL in sysInitialize(): standard std::exception is caught\"",
2023-01-12 13:43:14 (17220): Guest Log: "pilotErrorCode": 1165,
2023-01-12 13:43:14 (17220): Guest Log: "pilotErrorDiag": "Local output file is missing",
2023-01-12 13:43:14 (17220): Guest Log: *** Listing of results directory ***
2023-01-12 13:43:14 (17220): Guest Log: total 3808
2023-01-12 13:43:14 (17220): Guest Log: -rw-r--r--. 1 atlas atlas 379728 Jan 11 14:32 pilot3.tar.gz
2023-01-12 13:43:14 (17220): Guest Log: -rw-r--r--. 1 atlas atlas 6329 Jan 11 15:30 queuedata.json
2023-01-12 13:43:14 (17220): Guest Log: -rwx------. 1 atlas atlas 26318 Jan 11 15:32 runpilot2-wrapper.sh
2023-01-12 13:43:14 (17220): Guest Log: -rwxr-xr-x. 1 atlas atlas 11913 Jan 12 06:37 init_data.xml
2023-01-12 13:43:14 (17220): Guest Log: -rwxr-xr-x. 1 atlas atlas 391450 Jan 12 06:37 input.tar.gz
2023-01-12 13:43:14 (17220): Guest Log: -rwxr-xr-x. 1 atlas atlas 17681 Jan 12 06:37 start_atlas.sh
2023-01-12 13:43:14 (17220): Guest Log: lrwxrwxrwx. 1 atlas atlas 20 Jan 12 06:37 EVNT.31550708._000932.pool.root.1 -> /data/./ATLAS.root_0
2023-01-12 13:43:14 (17220): Guest Log: -rw-r--r--. 1 atlas atlas 2854 Jan 12 06:37 pandaJob.out
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 416 Jan 12 06:37 setup.sh.local
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 1036205 Jan 12 06:38 agis_schedconf.cvmfs.json
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 1704033 Jan 12 06:38 agis_ddmendpoints.agis.ALL.json
2023-01-12 13:43:14 (17220): Guest Log: drwx------. 4 atlas atlas 245 Jan 12 06:38 pilot3
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 1003 Jan 12 06:40 memory_monitor_summary.json
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 51007 Jan 12 06:41 log.31766465._047094.job.log.tgz.1
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 9229 Jan 12 06:42 heartbeat.json
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 2876 Jan 12 06:43 pilotlog.txt
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 66274 Jan 12 06:43 log.31766465._047094.job.log.1
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 186 Jan 12 06:43 output.list
2023-01-12 13:43:14 (17220): Guest Log: -rw-r--r--. 1 atlas atlas 620 Jan 12 06:43 runtime_log
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 133120 Jan 12 06:43 result.tar.gz
2023-01-12 13:43:14 (17220): Guest Log: -rw-r--r--. 1 atlas atlas 9487 Jan 12 06:43 runtime_log.err
2023-01-12 13:43:14 (17220): Guest Log: -rw-------. 1 atlas atlas 625 Jan 12 06:43 CRANDmtGsZ2nsSi4apGgGQJmABFKDmABFKDmZ3vUDmvipKDm1WfhXn.diag
2023-01-12 13:43:14 (17220): Guest Log: Looking for outputfile HITS.31766465._047094.pool.root.1
2023-01-12 13:43:14 (17220): Guest Log: No HITS file was produced
2023-01-12 13:43:14 (17220): Guest Log: Successfully finished the ATLAS job!
2023-01-12 13:43:14 (17220): Guest Log: Copying the results back to the shared directory!
2023-01-12 13:43:14 (17220): Guest Log: *** Contents of shared directory: ***
2023-01-12 13:43:14 (17220): Guest Log: total 452452
2023-01-12 13:43:14 (17220): Guest Log: -rwxrwxrwx. 1 root root 462748742 Jan 12 06:37 ATLAS.root_0
2023-01-12 13:43:14 (17220): Guest Log: -rwxrwxrwx. 1 root root 11913 Jan 12 06:37 init_data.xml
2023-01-12 13:43:14 (17220): Guest Log: -rwxrwxrwx. 1 root root 391450 Jan 12 06:31 input.tar.gz
2023-01-12 13:43:14 (17220): Guest Log: -rwxrwxrwx. 1 root root 133120 Jan 12 2023 result.tar.gz
2023-01-12 13:43:14 (17220): Guest Log: -rwxrwxrwx. 1 root root 17681 Jan 12 06:31 start_atlas.sh
2023-01-12 13:43:14 (17220): Guest Log: *** Success! Shutting down the machine. ***
2023-01-12 13:43:14 (17220): VM Completion File Detected.
2023-01-12 13:43:14 (17220): Powering off VM.
2023-01-12 13:43:14 (17220): Successfully stopped VM.
2023-01-12 13:43:14 (17220): Deregistering VM. (boinc_5deb979876a75273, slot#4)
2023-01-12 13:43:14 (17220): Removing network bandwidth throttle group from VM.
2023-01-12 13:43:14 (17220): Removing VM from VirtualBox.
13:43:20 (17220): called boinc_finish(0)

</stderr_txt>
]]>
ID: 47659 · Report as offensive     Reply Quote
Jonathan

Send message
Joined: 25 Sep 17
Posts: 99
Credit: 3,425,566
RAC: 0
Message 47660 - Posted: 12 Jan 2023, 9:14:17 UTC - in response to Message 47659.  

The one posted doesn't show anything obvious. Have you tried working through the Yeti checklist in the forums?
We would need to be able to see the individual computer stats and look at some of the logs on the work units and get more details.

https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4161#29359
ID: 47660 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2244
Credit: 173,902,375
RAC: 748
Message 47662 - Posted: 12 Jan 2023, 11:47:35 UTC

Do you have checked Virtualbox media manager?
You have to upgrade Virtualbox extension pack to 7.0.4.
ID: 47662 · Report as offensive     Reply Quote

Message boards : Number crunching : invalid results


©2024 CERN