parallel sequentialfs, reproducibility, and substreams

8 Ansichten (letzte 30 Tage)
gagan sidhu
gagan sidhu am 22 Feb. 2024
hi again,
i have a question about the use of SubStreams for reproducibility.
theoretically speaking, sequentialfs should always output the same result, assuming the same folds/partitions and data are used on different runs.
i mean, the first feature that produces the highest accuracy for the given data/folds is not going to change because in order to make that determination, the remaining n-1 features must also be evaluated. i don't see how the random number seed would change the selection of these features (in a forward manner, at least).
so what does SubStreams do exactly? i'm corresponding with billy on figuring out why my macpro5,1 is using a crazy amount of ram for sequentialfs, and i'm wondering if this structure has something to do with it.
it's using a STUPID amount of memory. activity monitor in mac reports like ~50 gigs, but >100 gb is apparently being used and it's swapping out :/

Antworten (0)

Kategorien

Mehr zu Loops and Conditional Statements finden Sie in Help Center und File Exchange

Produkte


Version

R2020a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by