Welcome to MilkyWay@home

Admin Updates Discussion

Message boards : News : Admin Updates Discussion
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7

AuthorMessage
JohnDK
Avatar

Send message
Joined: 18 Feb 10
Posts: 58
Credit: 223,296,189
RAC: 4,326
Message 77225 - Posted: 6 Sep 2024, 20:29:45 UTC

Hello everyone,

Announcement
The server needs to be rebooted. This will happen on 6 September 2024 at 14:00 UTC. If this fixes the issue, the server will be down for less than an hour.

Thanks,
Kevin
Seems something is wrong, I'm almost out of work and there's no new work available.
ID: 77225 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bryan Price

Send message
Joined: 19 Apr 09
Posts: 4
Credit: 1,799,023
RAC: 5,008
Message 77230 - Posted: 10 Sep 2024, 22:11:54 UTC - in response to Message 77225.  

Hello everyone,

Announcement
The server needs to be rebooted. This will happen on 6 September 2024 at 14:00 UTC. If this fixes the issue, the server will be down for less than an hour.

Thanks,
Kevin
Seems something is wrong, I'm almost out of work and there's no new work available.


Same thing here. :/
ID: 77230 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bill F
Avatar

Send message
Joined: 4 Jul 09
Posts: 101
Credit: 17,832,339
RAC: 3,199
Message 77231 - Posted: 12 Sep 2024, 1:51:58 UTC

I am not seeing any shortage of available tasks. The server status page is showing the normal 1000+ tasks available.

Bill F
ID: 77231 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bryan Price

Send message
Joined: 19 Apr 09
Posts: 4
Credit: 1,799,023
RAC: 5,008
Message 77233 - Posted: 13 Sep 2024, 21:51:10 UTC - in response to Message 77231.  

I am not seeing any shortage of available tasks. The server status page is showing the normal 1000+ tasks available.


I'm receiving tasks now.
ID: 77233 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Link
Avatar

Send message
Joined: 19 Jul 10
Posts: 672
Credit: 19,581,077
RAC: 1,407
Message 77363 - Posted: 31 Mar 2025, 9:11:17 UTC
Last modified: 31 Mar 2025, 9:18:53 UTC

@Kevin: you might need to check the disk limit for the current WUs, it might be necessary to increase it. I got EXIT_DISK_LIMIT_EXCEEDED errors on some of the de_nbody_orbit_fitting_03_25_2025_v186_OCS__data__33_1740880091_* tasks, the wingmen got them too. The slot directories of the running WUs seem to be quite a bit larger than what they used to be in the past as far as I remember, some of them close to or even over 30MB, the errored out tasks went over 50MB. Perhaps the current WU set needs 100-200MB as limit.

Examples: 1001559783, 1001630955.
ID: 77363 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Keith Myers
Avatar

Send message
Joined: 24 Jan 11
Posts: 725
Credit: 562,574,684
RAC: 25,050
Message 77369 - Posted: 6 Apr 2025, 16:04:41 UTC - in response to Message 77363.  

Yes, the admins need to look into the task generation template and correct this error. I'm getting half a dozen a day of these errors on known good hardware and Boinc setups.
ID: 77369 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Link
Avatar

Send message
Joined: 19 Jul 10
Posts: 672
Credit: 19,581,077
RAC: 1,407
Message 77401 - Posted: 26 Apr 2025, 9:32:06 UTC

Still not fixed:
1002361354 Too many errors (may have bug)
1002146987 Stil in progress on other machines, EXIT_DISK_LIMIT_EXCEEDED on my computer.

This one is interesting, it errored out for me, but two other computers could finish it with 40.72 MB and 44.48 MB peak disk usage: 1002419239
ID: 77401 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Kevin Roux
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 9 Aug 22
Posts: 85
Credit: 4,148,498
RAC: 4,653
Message 77411 - Posted: 5 May 2025, 14:15:25 UTC - in response to Message 77363.  

@Kevin: you might need to check the disk limit for the current WUs, it might be necessary to increase it. I got EXIT_DISK_LIMIT_EXCEEDED errors on some of the de_nbody_orbit_fitting_03_25_2025_v186_OCS__data__33_1740880091_* tasks, the wingmen got them too. The slot directories of the running WUs seem to be quite a bit larger than what they used to be in the past as far as I remember, some of them close to or even over 30MB, the errored out tasks went over 50MB. Perhaps the current WU set needs 100-200MB as limit.

Examples: 1001559783, 1001630955.


I set the limit to 100MB.
Let me know if the issue persists.
ID: 77411 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Keith Myers
Avatar

Send message
Joined: 24 Jan 11
Posts: 725
Credit: 562,574,684
RAC: 25,050
Message 77412 - Posted: 6 May 2025, 14:03:16 UTC - in response to Message 77411.  

Still seeing the errors. I have very low caches on each host but the older resends are still coming through with the misconfiguration. It will probably take a fairly long while to clear all the bad work with 8 possible resends for each task.
ID: 77412 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GWGeorge007
Avatar

Send message
Joined: 6 Jan 18
Posts: 10
Credit: 88,795,644
RAC: 46,724
Message 77413 - Posted: 6 May 2025, 14:53:17 UTC - in response to Message 77412.  
Last modified: 6 May 2025, 14:54:02 UTC

Still seeing the errors. I have very low caches on each host but the older resends are still coming through with the misconfiguration. It will probably take a fairly long while to clear all the bad work with 8 possible resends for each task.

The same applies to me. I have nearly 2,200 errors.
George
ID: 77413 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Link
Avatar

Send message
Joined: 19 Jul 10
Posts: 672
Credit: 19,581,077
RAC: 1,407
Message 77414 - Posted: 6 May 2025, 17:25:35 UTC - in response to Message 77413.  
Last modified: 6 May 2025, 17:32:35 UTC

Still seeing the errors. I have very low caches on each host but the older resends are still coming through with the misconfiguration.
This was not a resend, it was _0 created more than 12 hours after Kevin's post.


The same applies to me. I have nearly 2,200 errors.
Nearly all of your errors are Aborted by user, 203 (0x000000CB) EXIT_ABORTED_VIA_GUI. Users aborting 2000 WUs is not an issue we are talking here about.
ID: 77414 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GWGeorge007
Avatar

Send message
Joined: 6 Jan 18
Posts: 10
Credit: 88,795,644
RAC: 46,724
Message 77416 - Posted: 6 May 2025, 17:31:29 UTC - in response to Message 77414.  

The same applies to me. I have nearly 2,200 errors.
Nearly all of your errors are Aborted by user, 203 (0x000000CB) EXIT_ABORTED_VIA_GUI. Users aborting 2000 WUs is not an issue we are talking here about.

Ooops! Sorry! I forgot that I did abort literally all of those in an effort to get Orbit Fitting to not give me single tasks when I have x4 CPUs in my app_config.xml file.
George
ID: 77416 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Link
Avatar

Send message
Joined: 19 Jul 10
Posts: 672
Credit: 19,581,077
RAC: 1,407
Message 77417 - Posted: 6 May 2025, 17:36:49 UTC - in response to Message 77416.  
Last modified: 6 May 2025, 17:39:39 UTC

Ooops! Sorry! I forgot that I did abort literally all of those in an effort to get Orbit Fitting to not give me single tasks when I have x4 CPUs in my app_config.xml file.
Don't use app_config.xml anymore for setting the number of threads per WU, use MilkyWay@home preferences instead, than you will get what you want.

All that's left of my app_config.xml is this (to get better run time estimates):
<app_config>
 <app>
  <name>milkyway_nbody</name>
  <fraction_done_exact/>
 </app>
 <app>
  <name>milkyway_nbody_orbit_fitting</name>
  <fraction_done_exact/>
 </app>
</app_config>

ID: 77417 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GWGeorge007
Avatar

Send message
Joined: 6 Jan 18
Posts: 10
Credit: 88,795,644
RAC: 46,724
Message 77418 - Posted: 6 May 2025, 17:42:13 UTC - in response to Message 77417.  

Ooops! Sorry! I forgot that I did abort literally all of those in an effort to get Orbit Fitting to not give me single tasks when I have x4 CPUs in my app_config.xml file.
Don't use app_config.xml anymore for setting the number of threads per WU, use MilkyWay@home preferences instead, than you will get what you want.

All that's left of my app_config.xml is this (to get better run time estimates):
<app_config>
 <app>
  <name>milkyway_nbody</name>
  <fraction_done_exact/>
 </app>
 <app>
  <name>milkyway_nbody_orbit_fitting</name>
  <fraction_done_exact/>
 </app>
</app_config>

I already do use the Milkyway preferences for just that.[/img]
George
ID: 77418 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Link
Avatar

Send message
Joined: 19 Jul 10
Posts: 672
Credit: 19,581,077
RAC: 1,407
Message 77420 - Posted: 6 May 2025, 20:23:33 UTC - in response to Message 77418.  

I already do use the Milkyway preferences for just that.
What settings do you have there? If you want 4-thread tasks, set "Max # of threads for each MilkyWay@home task" to 4, remove anything regarding number of threads from your app_config.xml and the server should not send you any single thread tasks. It used to work in the past at least. Do not use "no limit", that will result in a mix of MT and ST tasks.
ID: 77420 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 . . . 4 · 5 · 6 · 7

Message boards : News : Admin Updates Discussion

©2025 Astroinformatics Group