help
help > RE: Non random permutations
Oct 19, 2020 11:10 AM | Andrew Zalesky
RE: Non random permutations
Hi Rob,
The random number generator is seeded with the same seed for each run, and thus the same set of permutations will be generated across different runs. This was purposefully included to ensure reproducibility for testing purposes and explains your observation. You would need to dig into the code and remove the commands related to seeding the random number generator to avoid this behaviour.
I cannot remember the specific function in which the generator is seeded, but it should only be a single line that would need to be commented out.
Andrew
Originally posted by Rob McCutcheon:
The random number generator is seeded with the same seed for each run, and thus the same set of permutations will be generated across different runs. This was purposefully included to ensure reproducibility for testing purposes and explains your observation. You would need to dig into the code and remove the commands related to seeding the random number generator to avoid this behaviour.
I cannot remember the specific function in which the generator is seeded, but it should only be a single line that would need to be commented out.
Andrew
Originally posted by Rob McCutcheon:
Dear Prof Zalesky,
I have noticed that when analysing relatively large networks (e.g >300 nodes) the time taken for analysis does not increase linearly e.g. time for 1000 perms>> time for 10*100 perms. I therefore planned to parallelise in a very basic manner by just e.g. running 50*100 permutations rather than a one off 5000 permutation run, and averaging the pvalues.
However it looks as though there may be a problem with this as when I run 100 permutations several times the pvalues are identical, suggesting to me that it is the same permutations occurring each time rather than being chosen at random. This also occurs even if I try setting different random seeds.
Do you know what might be going on here, both why there is the nonlinear increase in time taken to run permutations, and why the permutations appear to be identical when ran several times?
Thank you!
Rob
I have noticed that when analysing relatively large networks (e.g >300 nodes) the time taken for analysis does not increase linearly e.g. time for 1000 perms>> time for 10*100 perms. I therefore planned to parallelise in a very basic manner by just e.g. running 50*100 permutations rather than a one off 5000 permutation run, and averaging the pvalues.
However it looks as though there may be a problem with this as when I run 100 permutations several times the pvalues are identical, suggesting to me that it is the same permutations occurring each time rather than being chosen at random. This also occurs even if I try setting different random seeds.
Do you know what might be going on here, both why there is the nonlinear increase in time taken to run permutations, and why the permutations appear to be identical when ran several times?
Thank you!
Rob
Threaded View
Title | Author | Date |
---|---|---|
Rob McCutcheon | Oct 19, 2020 | |
Andrew Zalesky | Oct 19, 2020 | |