Welcome to MilkyWay@home

Server Trouble

Message boards : News : Server Trouble
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 13 · 14 · 15 · 16 · 17 · 18 · 19 . . . 22 · Next

AuthorMessage
Spatzthecat

Send message
Joined: 1 Dec 10
Posts: 82
Credit: 15,452,009,012
RAC: 0
Message 72919 - Posted: 17 Apr 2022, 18:50:58 UTC
Last modified: 17 Apr 2022, 18:53:38 UTC

Hello Tom,
Things are blocking up again after a couple of days of it working quite well.
Not able to get work and validation pending on the rise.

Maybe consider a fund raiser to get things that are needed so that the project doesn't need to be such "low priority".
ID: 72919 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 72920 - Posted: 17 Apr 2022, 19:04:11 UTC - in response to Message 72919.  

Hello Tom,
Things are blocking up again after a couple of days of it working quite well.
Not able to get work and validation pending on the rise.

Maybe consider a fund raiser to get things that are needed so that the project doesn't need to be such "low priority".
+1

I've just got hold of a couple of R9 Nano GPUs (for an absurdly low price in an auction), which are twice as fast at single and half as fast at double precision as my 280Xs. I intend to leave the six 280Xs on Milkyway continuously as double seems to be getting so rare. I will use very rude words if they aren't kept busy.
ID: 72920 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Tom Donlon
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 10 Apr 19
Posts: 408
Credit: 120,203,200
RAC: 0
Message 72921 - Posted: 17 Apr 2022, 19:16:09 UTC - in response to Message 72919.  

Hello Tom,
Things are blocking up again after a couple of days of it working quite well.
Not able to get work and validation pending on the rise.

Maybe consider a fund raiser to get things that are needed so that the project doesn't need to be such "low priority".


When you say "thinks are blocking up again," what do you mean? Things all appear fine on my end.
ID: 72921 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 72922 - Posted: 17 Apr 2022, 19:18:42 UTC - in response to Message 72921.  
Last modified: 17 Apr 2022, 19:20:35 UTC

Hello Tom,
Things are blocking up again after a couple of days of it working quite well.
Not able to get work and validation pending on the rise.

Maybe consider a fund raiser to get things that are needed so that the project doesn't need to be such "low priority".


When you say "thinks are blocking up again," what do you mean? Things all appear fine on my end.
I'm getting "no new tasks" when requesting GPU work, as of 6:44pm Zulu (or possibly earlier, I wasn't asking then).
ID: 72922 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Tom Donlon
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 10 Apr 19
Posts: 408
Credit: 120,203,200
RAC: 0
Message 72923 - Posted: 17 Apr 2022, 19:26:39 UTC
Last modified: 17 Apr 2022, 19:26:59 UTC

I wonder if the feeder is just running out of tasks at this point because people are requesting lots of work. The number of WUs in the pool is set to 10k jobs, but I can double that and see if it improves things.

We also have the max WUs to send at one time limit set at 600, but it seems like people crunch through that fairly quickly. Would changing that to 1000 help you, so you don't have to request work as often?
ID: 72923 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Tom Donlon
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 10 Apr 19
Posts: 408
Credit: 120,203,200
RAC: 0
Message 72924 - Posted: 17 Apr 2022, 19:28:13 UTC - in response to Message 72923.  
Last modified: 17 Apr 2022, 19:28:56 UTC

At any rate, it looks like we are sitting around ~10k unsent jobs for separation at any given time now, which is the expected behavior.
ID: 72924 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 72925 - Posted: 17 Apr 2022, 19:38:24 UTC - in response to Message 72923.  
Last modified: 17 Apr 2022, 19:42:26 UTC

I wonder if the feeder is just running out of tasks at this point because people are requesting lots of work. The number of WUs in the pool is set to 10k jobs, but I can double that and see if it improves things.

We also have the max WUs to send at one time limit set at 600, but it seems like people crunch through that fairly quickly. Would changing that to 1000 help you, so you don't have to request work as often?
Yes it would help greatly, since I have machines with up to 4 cards. I thought it was set to 300 per card, 900 per host, but at any rate, please turn both of those up.

Every time I check server status it always says around 10,000, so I'm not sure how it's ever running out. I guess you could try increasing that since it only takes a dozen people to empty it.

I just made another request and got 79. Earlier in the day I was getting about 250.

Of course everything would be wonderful if we could fetch and receive on the same communication, but I'm guessing nobody ever worked out what was wrong there?
ID: 72925 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
unixchick
Avatar

Send message
Joined: 21 Feb 22
Posts: 66
Credit: 817,008
RAC: 0
Message 72927 - Posted: 17 Apr 2022, 20:24:55 UTC

Things have been running smoothly for a few days, so I don't know what changed, but I'm not getting separation tasks even though the status page says there are plenty.

hopefully it is just a lag as the system does something else (generate work? backup?) and it will hand out tasks again.
ID: 72927 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Spatzthecat

Send message
Joined: 1 Dec 10
Posts: 82
Credit: 15,452,009,012
RAC: 0
Message 72928 - Posted: 17 Apr 2022, 21:31:16 UTC - in response to Message 72923.  

I wonder if the feeder is just running out of tasks at this point because people are requesting lots of work. The number of WUs in the pool is set to 10k jobs, but I can double that and see if it improves things.

We also have the max WUs to send at one time limit set at 600, but it seems like people crunch through that fairly quickly. Would changing that to 1000 help you, so you don't have to request work as often?


Still not getting any work.

Can it be set so that you simply replenish the units you report after getting the initial batch, which can be set lower?
ID: 72928 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 72929 - Posted: 17 Apr 2022, 21:42:18 UTC - in response to Message 72928.  

I wonder if the feeder is just running out of tasks at this point because people are requesting lots of work. The number of WUs in the pool is set to 10k jobs, but I can double that and see if it improves things.

We also have the max WUs to send at one time limit set at 600, but it seems like people crunch through that fairly quickly. Would changing that to 1000 help you, so you don't have to request work as often?
Still not getting any work.

Can it be set so that you simply replenish the units you report after getting the initial batch, which can be set lower?
There's some kind of bug nobody's ever worked out where you can't send and receive seperation tasks at once. Since a fast GPU will complete a task in under the minimum server contact time, there will always be some to report and you'll never get tasks until you run out. And Boinc doesn't try again for 10 minutes as it assumes there are none available.
ID: 72929 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Kiska

Send message
Joined: 31 Mar 12
Posts: 96
Credit: 152,502,225
RAC: 0
Message 72930 - Posted: 17 Apr 2022, 21:47:56 UTC

Seems like the feeder is sleeping too long to feed the buffer. I would suggest reducing the amount of time the feeder sleeps for.


Either that or someone has just dumped a ton of tasks onto the server, and its working through validating all of them
ID: 72930 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 72931 - Posted: 17 Apr 2022, 21:49:52 UTC - in response to Message 72930.  
Last modified: 17 Apr 2022, 21:50:41 UTC

Seems like the feeder is sleeping too long to feed the buffer. I would suggest reducing the amount of time the feeder sleeps for.

Either that or someone has just dumped a ton of tasks onto the server, and its working through validating all of them
The Boinc server software needs a complete rewrite. Looks like at the moment you have to fiddle with lots of numbers to make it work right. Kinda like a machine from the 50s. Please use your expertise and go to github and kick those "programmers" where the sun doesn't shine.
ID: 72931 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Kiska

Send message
Joined: 31 Mar 12
Posts: 96
Credit: 152,502,225
RAC: 0
Message 72932 - Posted: 17 Apr 2022, 21:53:20 UTC - in response to Message 72931.  

Seems like the feeder is sleeping too long to feed the buffer. I would suggest reducing the amount of time the feeder sleeps for.

Either that or someone has just dumped a ton of tasks onto the server, and its working through validating all of them
The Boinc server software needs a complete rewrite. Looks like at the moment you have to fiddle with lots of numbers to make it work right. Kinda like a machine from the 50s. Please use your expertise and go to github and kick those "programmers" where the sun doesn't shine.


And why would I want to do that? Every one of the BOINC devs are volunteers since they've lost NSF funding, so all they are doing is maintaining the repo.

For the GPU thing, I think one of the previous server admins made a change and Tom can't find it.
ID: 72932 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 72933 - Posted: 17 Apr 2022, 21:56:37 UTC - in response to Message 72932.  

Seems like the feeder is sleeping too long to feed the buffer. I would suggest reducing the amount of time the feeder sleeps for.

Either that or someone has just dumped a ton of tasks onto the server, and its working through validating all of them
The Boinc server software needs a complete rewrite. Looks like at the moment you have to fiddle with lots of numbers to make it work right. Kinda like a machine from the 50s. Please use your expertise and go to github and kick those "programmers" where the sun doesn't shine.
And why would I want to do that? Every one of the BOINC devs are volunteers since they've lost NSF funding, so all they are doing is maintaining the repo.
Then help them out. You seem like someone who knows their way around the software. It's ridiculous the server doesn't just work.

For the GPU thing, I think one of the previous server admins made a change and Tom can't find it.
Assuming you mean send and receive at once, I asked Eric and then Tom, they both tried, but there's some "misconfiguration" that nobody can find a fix for. I've mentioned it to several people on github in Boinc but nobody knows.
ID: 72933 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
unixchick
Avatar

Send message
Joined: 21 Feb 22
Posts: 66
Credit: 817,008
RAC: 0
Message 72934 - Posted: 17 Apr 2022, 22:50:01 UTC
Last modified: 17 Apr 2022, 22:51:10 UTC

I'm getting tasks again, so it must have been like stated... an influx of completed WUs that needed validation.

edit: WCG friends... hopefully we will be back at WCG on Friday April 22.
ID: 72934 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 72935 - Posted: 17 Apr 2022, 22:54:38 UTC - in response to Message 72934.  
Last modified: 17 Apr 2022, 22:54:45 UTC

I'm getting tasks again, so it must have been like stated... an influx of completed WUs that needed validation.
I thought that, one computer grabbed a few hundred. But then I told the other machines to check and they got nothing.

edit: WCG friends... hopefully we will be back at WCG on Friday April 22.
Cool! I couldn't find any info on that, other than "ready for final testing" a week or so ago. I have one phone checking every day for it, unfortunately the rest of my machines need to be re-attached since I changed from pool to private Gridcoin mining.
ID: 72935 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HRFMguy

Send message
Joined: 12 Nov 21
Posts: 236
Credit: 575,038,236
RAC: 0
Message 72936 - Posted: 17 Apr 2022, 23:11:31 UTC - in response to Message 72923.  

I wonder if the feeder is just running out of tasks at this point because people are requesting lots of work. The number of WUs in the pool is set to 10k jobs, but I can double that and see if it improves things.

We also have the max WUs to send at one time limit set at 600, but it seems like people crunch through that fairly quickly. Would changing that to 1000 help you, so you don't have to request work as often?

Yes! please do! Just installed an R9 280X last night, and it has run dry several times already!
ID: 72936 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,143,149
RAC: 0
Message 72937 - Posted: 17 Apr 2022, 23:15:05 UTC - in response to Message 72936.  
Last modified: 17 Apr 2022, 23:15:14 UTC

Just installed an R9 280X last night, and it has run dry several times already!
Those cards are wonderful, I have 6. Not sure what MW is going to do when they've all expired and everyone uses the modern pitiful ones without much DP.
ID: 72937 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HRFMguy

Send message
Joined: 12 Nov 21
Posts: 236
Credit: 575,038,236
RAC: 0
Message 72938 - Posted: 17 Apr 2022, 23:18:59 UTC - in response to Message 72923.  
Last modified: 17 Apr 2022, 23:20:02 UTC

deleted duplicate post
ID: 72938 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HRFMguy

Send message
Joined: 12 Nov 21
Posts: 236
Credit: 575,038,236
RAC: 0
Message 72939 - Posted: 17 Apr 2022, 23:21:08 UTC - in response to Message 72937.  

Just installed an R9 280X last night, and it has run dry several times already!
Those cards are wonderful, I have 6. Not sure what MW is going to do when they've all expired and everyone uses the modern pitiful ones without much DP.

yep
ID: 72939 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 . . . 13 · 14 · 15 · 16 · 17 · 18 · 19 . . . 22 · Next

Message boards : News : Server Trouble

©2024 Astroinformatics Group