By Lars Hofhansl A quick note about data locality. In distributed systems data locality is an important property. Data locality, here, refers to the ability to move the computation close to where the data is. This is one of the key concepts of MapReduce in Hadoop. The distributed file system provides the framework with locality information, which is then used to split the computation into chunks w