On my hardware C++ sort implementation is almost three times faster than Mac OS X libc's:
λ ~/tmp/5448488/ master* make clean && make test
rm -f sort_cpp sort_c
${CXX:-c++} -std=c++11 -O3 sort.cpp -o sort_cpp
${CC:-cc} -O3 sort.c -o sort_c
./sort_cpp
Time: 11 seconds.
./sort_c
Time: 28 seconds.