An Efficient Foundation For Big Data Processing On Modern Clusters