problem with checkpoints 2
| Author | Message |
|---|---|
Travis Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
I'm pretty sure the application is checkpointing correctly but we're still getting the odd bad workunit. Not quite sure what's causing it, but I'm going to keep looking into the problem.
|
Travis Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
This is happening in all the 0.14 compiled apps, including the Gipsel app (which I think has a slightly different fix for the checkpointing problem). So I'm not quite sure if the issue is still checkpointing or if it's something else.
|
Travis Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
Pretty sure I found the issue. At the end of calculate integrals there is:
```c
#ifdef GMLE_BOINC
    int retval = write_checkpoint(es);
    if (retval) {
        fprintf(stderr, "APP: astronomy checkpoint failed %d\n", retval);
        return retval;
    }
#endif
```
So in the rare case that this is the last checkpoint written (and there hasn't been a newer one from the next integral calculation or likelihood calculation), the app will recalculate an integral when it resumes. I'm going to do another update, because I think this should put the last nail in the coffin of this problem.
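To make the failure mode concrete, here is a minimal sketch of one way a checkpoint written at the end of an integral calculation could lead to a repeated integral after a restart. It is not the actual application code: `eval_state`, `checkpoint`, `save_checkpoint`, the two `integral_*` functions and `run_with_restart` are all hypothetical names, and the assumed mechanism (the checkpoint is saved before the state records that the integral is finished) is a guess at the cause, not something confirmed above.

```c
#include <assert.h>

/* Hypothetical, simplified stand-in for the app's evaluation state. */
typedef struct {
    int next_integral;       /* index of the next integral still to compute */
    int integrals_computed;  /* total integrals evaluated, to expose repeats */
} eval_state;

/* Hypothetical stand-in for the checkpoint file: a saved copy of the index. */
typedef struct {
    int next_integral;
} checkpoint;

static void save_checkpoint(checkpoint *cp, const eval_state *es) {
    cp->next_integral = es->next_integral;
}

/* Assumed buggy order: checkpoint is saved while the state still
 * points at the integral that was just computed. */
static void integral_checkpoint_early(eval_state *es, checkpoint *cp) {
    es->integrals_computed++;  /* "compute" the integral */
    save_checkpoint(cp, es);   /* saved index is one behind */
    es->next_integral++;
}

/* Assumed fixed order: advance the state first, then save it. */
static void integral_checkpoint_late(eval_state *es, checkpoint *cp) {
    es->integrals_computed++;
    es->next_integral++;
    save_checkpoint(cp, es);
}

/* Compute `stop_after` of `n` integrals, simulate a restart from the
 * last checkpoint, finish the run, and return how many integrals were
 * evaluated in total. `late` selects the checkpoint ordering. */
static int run_with_restart(int n, int stop_after, int late) {
    eval_state es = {0, 0};
    checkpoint cp = {0};
    assert(stop_after >= 0 && stop_after <= n);

    for (int i = 0; i < stop_after; i++) {
        if (late) integral_checkpoint_late(&es, &cp);
        else      integral_checkpoint_early(&es, &cp);
    }

    /* Restart: only the checkpointed index survives. The work counter
     * is kept so repeated work shows up in the total. */
    es.next_integral = cp.next_integral;

    while (es.next_integral < n) {
        if (late) integral_checkpoint_late(&es, &cp);
        else      integral_checkpoint_early(&es, &cp);
    }
    return es.integrals_computed;
}
```

In this toy model, `run_with_restart(3, 3, 0)` evaluates four integrals for a three-integral job because the restart repeats the last one, while `run_with_restart(3, 3, 1)` evaluates exactly three. The real app's state and restart logic are more involved, so treat this only as an illustration of why the ordering around `write_checkpoint(es)` matters.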
|
©2025 Astroinformatics Group