On my hardware, the C++ sort implementation is roughly 2.5 times faster than the one in Mac OS X's libc (11 seconds versus 28 seconds):
    λ ~/tmp/5448488/ master* make clean && make test
    rm -f sort_cpp sort_c
    ${CXX:-c++} -std=c++11 -O3 sort.cpp -o sort_cpp
    ${CC:-cc} -O3 sort.c -o sort_c
    ./sort_cpp
    Time: 11 seconds.
    ./sort_c
    Time: 28 seconds.
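
The sources of sort.cpp and sort.c are not shown here, so the following is only a sketch of how such a comparison could be set up, not the actual benchmark. It assumes the test sorts an array of ints; the names make_input, time_std_sort, time_qsort and compare_ints, as well as the seed and element type, are made up for illustration. A commonly given explanation for a gap like this is that std::sort is a template and can inline the comparison, while qsort has to call the comparator through a function pointer for every comparison.

```cpp
#include <algorithm>
#include <cassert>
#include <chrono>
#include <cstdlib>
#include <random>
#include <vector>

// Comparator for qsort: invoked through a function pointer on every comparison.
static int compare_ints(const void* a, const void* b) {
    const int x = *static_cast<const int*>(a);
    const int y = *static_cast<const int*>(b);
    return (x > y) - (x < y);  // avoids the overflow that x - y could cause
}

// Hypothetical helper: n pseudo-random ints from a fixed seed, so that
// both sorts can be given identical input.
std::vector<int> make_input(std::size_t n, unsigned seed = 42) {
    std::mt19937 gen(seed);
    std::uniform_int_distribution<int> dist;
    std::vector<int> v(n);
    for (auto& x : v) x = dist(gen);
    return v;
}

// Sorts a private copy with std::sort and returns the elapsed seconds.
double time_std_sort(std::vector<int> v) {
    const auto start = std::chrono::steady_clock::now();
    std::sort(v.begin(), v.end());
    const auto stop = std::chrono::steady_clock::now();
    assert(std::is_sorted(v.begin(), v.end()));  // sanity check, outside the timed region
    return std::chrono::duration<double>(stop - start).count();
}

// Sorts a private copy with the C library's qsort and returns the elapsed seconds.
double time_qsort(std::vector<int> v) {
    const auto start = std::chrono::steady_clock::now();
    std::qsort(v.data(), v.size(), sizeof(int), compare_ints);
    const auto stop = std::chrono::steady_clock::now();
    assert(std::is_sorted(v.begin(), v.end()));  // sanity check, outside the timed region
    return std::chrono::duration<double>(stop - start).count();
}
```

To compare the two, build one input with make_input, pass it to both timing functions (each takes its argument by value, so neither sees already-sorted data), and print the two results. The input size needed to reach run times in the tens of seconds depends on the machine, so treat any number you pick as a starting point.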