For inputs of <= 256 bytes, how much of the time is overhead from parameter checking?<p>I ask because I'm tempted to start using cffi for my code, but years ago I timed ctypes plus Python-side type checking against a C extension. As I recall, the extension was noticeably faster, enough to justify the extra time it took to implement.
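<p>One rough way to check this yourself is to time a ctypes call at several sizes, including zero bytes, and see how much of the small-input time is fixed per-call cost. This is only a sketch under assumptions: `ctypes.memmove` is a stand-in for whatever function is actually being benchmarked, the helper names `time_memmove` and `overhead_fraction` are mine, and the timing lambda adds a little Python call overhead of its own.

```python
import ctypes
import timeit


def time_memmove(nbytes, number=20_000):
    """Return the average seconds per ctypes.memmove call copying nbytes."""
    # Allocate at least one byte so the buffers are valid even for nbytes == 0.
    src = ctypes.create_string_buffer(max(nbytes, 1))
    dst = ctypes.create_string_buffer(max(nbytes, 1))
    timer = timeit.Timer(lambda: ctypes.memmove(dst, src, nbytes))
    # Take the best of several repeats to reduce scheduling noise.
    return min(timer.repeat(repeat=3, number=number)) / number


def overhead_fraction(nbytes, number=20_000):
    """Estimate the share of call time that is fixed per-call overhead."""
    fixed = time_memmove(0, number)       # zero bytes copied: call overhead only
    total = time_memmove(nbytes, number)  # call overhead plus the actual copy
    # Timing noise can make fixed exceed total for tiny sizes, so cap at 1.
    return min(fixed / total, 1.0)


if __name__ == "__main__":
    for n in (16, 64, 256, 4096, 65536):
        print(f"{n:>6} bytes: {time_memmove(n) * 1e9:8.1f} ns/call, "
              f"~{overhead_fraction(n):.0%} fixed overhead")
```

<p>If the ns/call figure barely moves between 16 and 256 bytes, the call machinery rather than the work itself is what dominates at those sizes. The same harness could be pointed at a cffi or C-extension binding to compare.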