Message boards :
News :
Work should be flowing
Message board moderation
Author | Message |
---|---|
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
As usual, I go out of town and the server crashes. Work should be flowing, just give the server some time to catch up. --Travis |
Send message Joined: 10 May 10 Posts: 9 Credit: 13,403,734 RAC: 0 |
Yeah, it is running. But only half... My scheduler is only asking for CPU tasks. |
Send message Joined: 14 Feb 09 Posts: 999 Credit: 74,932,619 RAC: 0 |
You crashed, Collatz kept dropping off the internet and SETI was only allowing 20 units at a time. I had to get work from DNETC just to keep the GPU warm this weekend. |
Send message Joined: 10 May 10 Posts: 9 Credit: 13,403,734 RAC: 0 |
Yeah, it is running. But only half... Scratch that...Just gave the scheduler a Pwnt, sorted it out. |
Send message Joined: 4 Oct 08 Posts: 1734 Credit: 64,228,409 RAC: 0 |
Thanks Travis. All seems to be running well ATM Go away, I was asleep |
Send message Joined: 14 Dec 09 Posts: 161 Credit: 589,318,064 RAC: 0 |
Recently, i have been receiving one task per gpu. has the limitation been reduced again? Also, the server crashes more frequently in last several weeks. How about upgrading the server? |
Send message Joined: 12 Nov 07 Posts: 2425 Credit: 524,164 RAC: 0 |
Recently, i have been receiving one task per gpu. has the limitation been reduced again? Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. |
Send message Joined: 14 Dec 09 Posts: 161 Credit: 589,318,064 RAC: 0 |
i set it to 5 and it solved the problem. thank you |
Send message Joined: 30 Oct 09 Posts: 4 Credit: 150,544 RAC: 0 |
I keep having problems with Milkyway. I don't seem to be getting credit for completed work, and after completion sometimes there is no work available. 7/14/2010 6:11:25 AM Milkyway@home Requesting new tasks 7/14/2010 6:11:26 AM Milkyway@home Scheduler request completed: got 0 new tasks 7/14/2010 6:11:26 AM Milkyway@home Message from server: No work available 7/14/2010 6:19:31 AM Milkyway@home Sending scheduler request: To fetch work. 7/14/2010 6:19:31 AM Milkyway@home Requesting new tasks 7/14/2010 6:19:32 AM Milkyway@home Scheduler request completed: got 0 new tasks 7/14/2010 6:19:32 AM Milkyway@home Message from server: No work available 7/14/2010 6:41:39 AM Milkyway@home Sending scheduler request: To fetch work. It seems to be a waste of resources to work on this project. |
Send message Joined: 8 Feb 08 Posts: 261 Credit: 104,050,322 RAC: 0 |
a) no need to post the same question in multiple threads b) your WU's are waiting for validation see http://milkyway.cs.rpi.edu/milkyway/results.php?hostid=116980 c) you could run an optimized app for your CPU (see number crunching section) |
Send message Joined: 1 Sep 08 Posts: 520 Credit: 302,528,262 RAC: 263 |
The title of the thread makes a good statement -- work should be flowing. Unfortunately, it hasn't been nor has the validator been functioning for the past 12+ hours. I take it that Travis is out of town again as it seems that 1) Problems happen when he is not around to babysit the servers and 2) No one else at RPI is empowered or trained to resolve the problems. |
Send message Joined: 11 Jul 10 Posts: 1 Credit: 475,666 RAC: 0 |
Can't get ny new work. anybody know why. Same for SETI Thanks R. Mengel |
Send message Joined: 24 Dec 07 Posts: 1947 Credit: 240,884,648 RAC: 0 |
Can't get ny new work. anybody know why. Same for SETI No problem for me on either project. |
Send message Joined: 19 Feb 08 Posts: 350 Credit: 141,284,369 RAC: 0 |
Can't get ny new work. anybody know why. Same for SETI Sorry, no problem for me. |
Send message Joined: 4 Oct 08 Posts: 1734 Credit: 64,228,409 RAC: 0 |
I was thinking how well Milkyway was doing up until yesterday, then the validator jammed up last night. Was it my thoughts that jinxed the system? Going to take a day or so before the servers are kicked I think. Go away, I was asleep |
Send message Joined: 1 Sep 08 Posts: 520 Credit: 302,528,262 RAC: 263 |
|
Send message Joined: 2 May 10 Posts: 57 Credit: 2,138 RAC: 0 |
I sent Travis an email regarding the issue with the validator. Any additional information you could give on the problem would be helpful. Urgent issues can be emailed to astro@cs.rpi.edu to alert all the project developers (there are currently 11). |
Send message Joined: 1 Sep 08 Posts: 520 Credit: 302,528,262 RAC: 263 |
OK -- thanks for the reply. The problem appears identical to the server issues that occurred about two weeks ago. No new work is available and work units are not getting validated. I suspect resolving this (temporarily) would need a server (or process) restart. As to resolving this at the root cause (as it is at least somewhat repetitive) I've no clue (aside from having Travis live onsite with the server 24/7 <smile>). Again, my suspicion as to why there has been no fix (temporary or otherwise) or reply to posts over in the number crunching message board is that Travis is not around at the moment, and, frankly, when he's not around, things go to automatic pilot with less direct attention. It seems that the server is 'sensitive' to Travis not being around to keep it company and tends to cajole Travis (and us) by becoming problematic when he is not around. |
Send message Joined: 2 May 10 Posts: 57 Credit: 2,138 RAC: 0 |
I had a brief conversation with Travis and he says the validator had crashed and should be up and running now. Thank you for the heads up. |
Send message Joined: 1 Sep 08 Posts: 520 Credit: 302,528,262 RAC: 263 |
The validator has crashed again. In fact, until the root cause issue underneath the validator (and work generator) crashes is recognized and dealt with, you can expect to see this as very much a recurring problem (it has been a recurring problem now for months). As noted elsewhere, the workaround is to set up a automatic process which stop/starts the various processes or does a full server down/restart to clear out the problems (temporarily) -- but this is only a workaround, as it is fairly clear that there is an underlying root cause problem which needs attention. |
©2024 Astroinformatics Group