Slide 9 of 19
Notes:
I - run continuously at background (possible latency hit)
run only when disk space is exhausted <-------
II- the more segments cleaned at a time, the more opportunity to reorganize data
III- segments that are most fragmented (not a best choice)
IV- attempt to increase locality of future reads (combine files in a directory together) or sort the blocks by time and group the ones with similar age