The traditional model of analysing performance of Python programs has been "run the profiler, find your bottlenecks, optimize them or move them to C". I personally find this approach grossly insufficient in many cases, especially in the context of PyPy. Particular problems are: In many large applications, the profile is flat: PyPy's own translation toolchain, Twisted or any modern web servers are