@JainTwinkle, This is the same as: Please let me know if this works with MANA/Open MPI, and if not, then how many of the earlier commits will work. Then I will push into 'main' whichever commits seem to work with MANA/Open MPI. As I explained, I'm trying to clean up the ugly code from the academic prototype. Then I'll push the cleaner code into 'main'.
94e527e to fe890c8
@JainTwinkle, This is the same as: Please let me know if this works with MANA/Open MPI, and if not, then how many of the earlier commits will work. Then I will push into 'main' whichever commits seem to work with MANA/Open MPI. For some reason, I don't see these bugs when testing against wave_mpi. So, your test with MANA/Open MPI is very valuable. Thanks!
fe890c8 to 47d5323
@JainTwinkle, The code to test is exactly the same. You can continue to work with your old version, or do a Thanks again for this testing!!!
47d5323 to f6952b6
* restart_plugin/mtcp_restart.c:reserveUpperHalfMemoryRegionsForCkptImgs() requires start1, end1, start2, end2.
* start2 is roughly the low end of the stack, and end2 is 8 MB higher.
* But because of address space randomization, the stacks of different ranks can be 1 GB apart or more. So, choosing [minHighMemStart, minHighMemStart + 8 MB] is not sufficient to reserve all possible stack regions.
* So, this adds maxHighMemEnd (the maximum of highMemStart + 8 MB, taken over the highMemStart (stack) of each rank).
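
The range computation described in this commit message can be sketched roughly as follows. This is only an illustration, not the code in mtcp_restart.c: the struct `HighMemRange`, the function `computeHighMemRange()`, the macro `STACK_RESERVE_SIZE`, and the assumption that the per-rank highMemStart values are already gathered in an array are all hypothetical. Only the names minHighMemStart, maxHighMemEnd, and highMemStart and the 8 MB figure come from the message itself.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* 8 MB above each rank's highMemStart, as in the commit message. */
#define STACK_RESERVE_SIZE ((uintptr_t)8 * 1024 * 1024)

/* Hypothetical container for the second range (start2, end2). */
typedef struct {
  uintptr_t minHighMemStart; /* lowest highMemStart over all ranks */
  uintptr_t maxHighMemEnd;   /* highest (highMemStart + 8 MB) over all ranks */
} HighMemRange;

/* Hypothetical helper: highMemStart[i] is the stack start seen in rank i. */
static HighMemRange
computeHighMemRange(const uintptr_t *highMemStart, size_t numRanks)
{
  assert(highMemStart != NULL && numRanks > 0);

  HighMemRange r = { highMemStart[0], highMemStart[0] + STACK_RESERVE_SIZE };
  for (size_t i = 1; i < numRanks; i++) {
    uintptr_t end = highMemStart[i] + STACK_RESERVE_SIZE;
    if (highMemStart[i] < r.minHighMemStart) {
      r.minHighMemStart = highMemStart[i];
    }
    if (end > r.maxHighMemEnd) {
      r.maxHighMemEnd = end;
    }
  }
  return r;
}
```

With stacks that are 1 GB or more apart, [minHighMemStart, maxHighMemEnd] covers every rank's stack region, whereas [minHighMemStart, minHighMemStart + 8 MB] would cover only the lowest one.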
f6952b6 to d1b5043
This has two remaining commits that may or may not be useful. Most of the original commits disappeared when we pushed a lot of dev/gdc0/simplifyCopyBits into main. I'll come back later and see if these commits are useful.
This has various bug fixes from dev/gdc0/simplifyCopyBits on origin, as well as some code cleanup. I've tried to reorganize the order of the commits so that the obviously important commits come first. There was a report by @JainTwinkle that some of the commits caused problems when experimenting with Open MPI. I'm hoping that at least up to 31c9538, everything is vanilla and should work.
@JainTwinkle, could you test up to this commit (and maybe to the last commit), and tell us what works and what doesn't work for Open MPI?
Thanks.