Applications that process extremely large data sets require large-scale movement of data across multiple levels of the memory hierarchy: first from storage to main memory, then from main memory to on-chip caches, and finally from the caches to the CPU registers, where the data can be processed. By most estimates, this data movement incurs large...