So with an off-the-shelf Linux box, you can write simple (but parallel) Haskell will outperform gcc's best efforts by a good margin -- today! Multicore programming just got a lot easier.
Now, to be clear: this is a very simplistic sample, but it does show the power of building code in a functional way.
Now, my question becomes, can Stackless Python beat it in cases where we aren't dealing with the global interpreter lock?