Jobs in the Queue / VirtualBox 5.1.14
Author | Message |
---|---|
Joined: 2 Sep 04 Posts: 455 Credit: 202,097,372 RAC: 39,123 |
As part of the race I'm running CMS tasks. On some machines they appear to be working, but one machine produces only errors. Are you out of jobs at the moment? The machine had VirtualBox 5.0.x installed, and I have upgraded it to 5.1.14. Theory tasks are running fine on this box. So the question is: what is the reason, out of jobs or VirtualBox 5.1.14? Any ideas? EDIT: This is the box: https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=10338055 Supporting BOINC, a great concept! |
Joined: 15 Jun 08 Posts: 2563 Credit: 257,112,605 RAC: 112,922 |
The new process (wmagent) communicates via TCP port 4080. Is it open? |
Joined: 15 Jun 08 Posts: 2563 Credit: 257,112,605 RAC: 112,922 |
Quote: "The new process (wmagent) communicates via TCP port 4080." My firewall currently shows open connections between the VMs and 188.184.82.11 on TCP port 4080. Must be either CMS or ATLAS. |
Joined: 29 Aug 05 Posts: 1065 Credit: 8,134,418 RAC: 13,358 |
Quote: "The new process (wmagent) communicates via TCP port 4080." vocms0159.cern.ch: vocms0159 (Special condor pool, CMS@HOME project) YLSNED! Perhaps we should be testing this link at startup too? BTW, I updated to 5.1.14 on one box and the tasks resumed nicely. Hmm, that box doesn't have any connections to vocms0159 but the other one does. Perhaps it's something needed at start-up that can then go away if the task pauses. OK, there it is in StarterLog, seemingly at the start of each job: `01/19/17 09:00:15 (pid:4142) Submitting machine is "vocms0159.cern.ch"` `01/19/17 09:00:15 (pid:4142) setting the orig job name in starter` |
Joined: 2 Sep 04 Posts: 455 Credit: 202,097,372 RAC: 39,123 |
Quote: "The new process (wmagent) communicates via TCP port 4080." No, it was not actually open. When I last configured my firewall, this port was not on the official list. How could other PCs on my network crunch CMS successfully while this port was closed? Supporting BOINC, a great concept! |
Joined: 15 Jun 08 Posts: 2563 Credit: 257,112,605 RAC: 112,922 |
I also noticed the HTTP 404 temporarily during the upload of a job result. When a new job starts the console shows the log of this new job. TCP port 4080 was introduced with CMS version 47.80 in the dev-project. See: https://lhcathomedev.cern.ch/lhcathome-dev/forum_thread.php?id=329&postid=4542 |
Joined: 2 Sep 04 Posts: 455 Credit: 202,097,372 RAC: 39,123 |
Quote: "TCP port 4080 was introduced with CMS version 47.80 in the dev-project." Here we are running 47.60, so it should not be this port. I have made an enhancement request on the forum regarding this port. At the moment my box mentioned below crunches Theory without problems, so it would be interesting to find out what the problem really is. Supporting BOINC, a great concept! |
Joined: 15 Jun 08 Posts: 2563 Credit: 257,112,605 RAC: 112,922 |
Quote: "TCP port 4080 was introduced with CMS version 47.80 in the dev-project." As I understand it, the WU-internal job distribution process changed from CRAB to WMAgent. This also affects version 47.60. WMAgent needs port 4080. The dev-project has used WMAgent since version 47.80. Theory uses a different job distribution process and is therefore not affected. The port has been listed on the FAQ page since 2016-12-20. |
Joined: 16 Jul 05 Posts: 24 Credit: 35,251,537 RAC: 0 |
Quote: "At the moment my box mentioned below crunches Theory without problems, so it would be interesting to find out what the problem really is." I have that too: 4 identical boxes, all 4 crunch Theory, but only 3 will crunch CMS. The last one again simply gives the impression that Condor has no jobs available, with no concrete error reported anywhere. |
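
Several posts in this thread come down to whether outbound TCP port 4080 is open toward the CMS job server. A minimal Python sketch of such a check follows. It assumes the host name (vocms0159.cern.ch) and port (4080) quoted in the thread are still the ones in use; the function name `tcp_port_open` is purely illustrative and is not part of BOINC, VirtualBox, or the CMS application. It only tests whether the machine it runs on can open a TCP connection, which may not be exactly the path the VM takes.

```python
import socket


def tcp_port_open(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Any OSError (refused connection, timeout, DNS failure) is treated as
    "not reachable"; the function does not distinguish between these cases.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Example call (host and port taken from the thread, assumed still valid):
#   tcp_port_open("vocms0159.cern.ch", 4080)
```

A `False` result here would be consistent with a firewall blocking the port, but it could equally mean the server is down or the name does not resolve, so it is best read as a hint rather than a diagnosis.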