Advanced search

Message boards : News : [dna,wildlife] status update

Author Message
Travis Desell
Volunteer moderator
Project administrator
Project developer
Project scientist
Send message
Joined: 16 Jan 12
Posts: 1813
Combined Credit: 23,514,257
DNA@Home: 293,563
SubsetSum@Home: 349,212
Wildlife@Home: 22,871,482
Wildlife@Home Watched: 212,926s
Wildlife@Home Events: 51
Climate Tweets: 21
Images Observed: 774

              
Message 5032 - Posted: 20 Mar 2015, 16:03:45 UTC

I've started up two new runs for DNA@Home, which should add a few more workunits. I think I might have figured out the issue with checkpointing as well. I think it might actually be on the server end. I'll be having the validator stop when there's a mismatch so I can debug better. Let me know if any of the these workunits have any issues.

I'm also putting the finishing touches on debugging checkpointing for the new wildlife@home app, which will have the additional bonus of not having limits on how many WUs it can send out. I'm currently traveling in NY, but I hope to at least test the new app this weekend for OSX, and then when I get back do some testing on linux, and then windows.

Travis Desell
Volunteer moderator
Project administrator
Project developer
Project scientist
Send message
Joined: 16 Jan 12
Posts: 1813
Combined Credit: 23,514,257
DNA@Home: 293,563
SubsetSum@Home: 349,212
Wildlife@Home: 22,871,482
Wildlife@Home Watched: 212,926s
Wildlife@Home Events: 51
Climate Tweets: 21
Images Observed: 774

              
Message 5035 - Posted: 20 Mar 2015, 17:56:11 UTC - in response to Message 5032.

Back to the drawing board on the DNA@Home checkpointing issue. Looks like it's not server side. Back to trying to reproduce the issue locally (my current binaries can restart from different checkpoints and still get the same final result...).

Profile Henk Haneveld
Send message
Joined: 25 Dec 14
Posts: 8
Combined Credit: 626,885
DNA@Home: 17,297
SubsetSum@Home: 40,264
Wildlife@Home: 569,324
Wildlife@Home Watched: 0s
Wildlife@Home Events: 0
Climate Tweets: 0
Images Observed: 0

      
Message 5036 - Posted: 20 Mar 2015, 20:41:58 UTC

Travis,

Have you considered the possibility that residual values in internal computer memory may have a effect.

I noticed that results that have a restart from checkpoint because of a Boinc heartbeat loss will mostly be valid.

However a start from checkpoint after a shutdown and restart of the computer will to the best of my knowledge always end in invalid.
____________

Travis Desell
Volunteer moderator
Project administrator
Project developer
Project scientist
Send message
Joined: 16 Jan 12
Posts: 1813
Combined Credit: 23,514,257
DNA@Home: 293,563
SubsetSum@Home: 349,212
Wildlife@Home: 22,871,482
Wildlife@Home Watched: 212,926s
Wildlife@Home Events: 51
Climate Tweets: 21
Images Observed: 774

              
Message 5037 - Posted: 21 Mar 2015, 0:20:06 UTC - in response to Message 5036.

Travis,

Have you considered the possibility that residual values in internal computer memory may have a effect.

I noticed that results that have a restart from checkpoint because of a Boinc heartbeat loss will mostly be valid.

However a start from checkpoint after a shutdown and restart of the computer will to the best of my knowledge always end in invalid.


At this point it's a possibility. Which would be very nasty as it would mean somewhere the program is reading from uninitialized memory...


Post to thread

Message boards : News : [dna,wildlife] status update