Personal plug, because the article mentions snakebite; hadoofus[1] is a C library that natively speaks to HDFS (v1), with a Python wrapper. No Java involved.<p>Pros (vs Snakebite):<p>* C, not Python. More portable to non-Python languages; probably faster<p>* Speaks v1 of the protocol (AFAIK, nothing else does this except via JNI to hadoop's Java client)<p>* Kerberos authentication supported (but not datanode tokens or the digest auth they employ)<p>Cons:<p>* Sort of abandonware. I don't personally use HDFS anymore, so it has stagnated.<p>* As an extension of the above, doesn't support the v2 (Protobuf-based) protocol<p>* C, not Python. All the usual potential issues with C that Python shields you from.<p>[1]: <a href="https://github.com/cemeyer/hadoofus" rel="nofollow">https://github.com/cemeyer/hadoofus</a>
Yeah, this is a place where punctuation in the title makes a big difference. Can we put the exclamation point (or substitute a period) back into the title?