I started developing tracemalloc at Wyplay, but I rewrote all of the code for the version included in Python 3.4: completely different API, different data collected, new implementation, etc. The old code:
https://github.com/wyplay/pytracemalloc/
PEP 445 (malloc API) also makes it possible to plug hooks into the memory allocators at runtime, instead of having to recompile Python for that.
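
As a rough illustration of what "at runtime" means in practice, here is a minimal sketch using the tracemalloc module from Python 3.4 or newer. Calling tracemalloc.start() installs the tracing hooks in an already running, stock interpreter. The workload (a list of byte strings) and the variable names are made up for the example.

```python
import tracemalloc

# Install the tracing hooks in the running interpreter.
# No special build or recompilation of Python is required.
tracemalloc.start()

before = tracemalloc.take_snapshot()

# Hypothetical workload: 1000 byte strings of 1000 bytes each.
data = [bytes(1000) for _ in range(1000)]

after = tracemalloc.take_snapshot()

# Compare the two snapshots, grouping allocations by source line.
top_stats = after.compare_to(before, "lineno")
for stat in top_stats[:3]:
    print(stat)

# Remove the hooks and discard the collected traces.
tracemalloc.stop()
```

The first entries printed should point at the line that builds the list, since that is where most of the new memory was allocated.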