Slide 30 of 39
Notes:
Modified sort: sort locally only by position in final partition and then exchange data guaranteeing a balanced data flow.
Local/global steps: local step removes duplicates. Then partitions data to find duplicates in different sites (global step).
Note: the functions may differ for aggregation.