How to perform a merge between two large datasets using mapreduce
3 Ansichten (letzte 30 Tage)
Ältere Kommentare anzeigen
I'm comfortable the mapreduce paradigm and would like to use datasets and matlab's mapreduce functionality to merge two modestly large datasets (10's of gigabytes).
After reading through the documentation though, it's not obvious to me how to map 2 different datasets that require different map functions, into one dataset.
That sure seems like a canonical use case, I'm surprised it's not mentioned clearly in the documentation.
Is there a straight forward way of approaching a large merge?
0 Kommentare
Antworten (0)
Siehe auch
Kategorien
Mehr zu Large Files and Big Data finden Sie in Help Center und File Exchange
Produkte
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!