TechEcho

13 comments

new299over 8 years ago

I'm a bit shocked to see this on the front page. That's possibly a comment on me as much as anything.But this is just really simple bit packing. Is this something many engineers (maybe frontend/JS) don't know about now?

评论 #13093595 未加载

评论 #13093566 未加载

评论 #13093802 未加载

评论 #13093787 未加载

评论 #13093740 未加载

bitcharmerover 8 years ago

This is simple binary packing approach to storing small-type data in wider types. Delta compression is a natural next step from here. I once published a blog post on that topic: <a href="http://bitcharmer.blogspot.co.uk/2013/12/how-to-serialise-array-of-doubles-with.html" rel="nofollow">http://bitcharmer.blogspot.co.uk/2013/12/how-to-serialise-ar...</a>Mind you, the reference implementation from the blog is really poor in terms of code quality and performance because I was not allowed to open-source what I actually developed for my client, but the ideas still stand. Delta compression can make a huge difference in some cases; because cpu cycles are cheap and bitwise operations are very fast it has the potential to bring serious benefits in some scenarios, ie. lower latency of transmitting (numeric) data over networks.

BurningFrogover 8 years ago

Since 3⁵ (243) is less than 256, you can actually fit 5 "trinary" numbers into a byte.The code gets more complex and slow, but not really by much.

评论 #13093874 未加载

评论 #13093869 未加载

评论 #13093583 未加载

gefhover 8 years ago

Alternatively, just store it naively and gzip. Or use a sparse array. Or, best, don't do anything until storage size is an actual, measured bottleneck

pjscottover 8 years ago

See also bitfields, which make the compiler generate the shifting and masking code for you:<a href="https://en.wikipedia.org/wiki/Bit_field" rel="nofollow">https://en.wikipedia.org/wiki/Bit_field</a>

评论 #13093409 未加载

Walkmanover 8 years ago

Either I don't understand something or the Resulting bytes after c should be '11100100' and after d should be '11100110'.

评论 #13094778 未加载

smaddoxover 8 years ago

And if you need to store values that are not powers of 2, you can use the following kind of encoding/decoding. In python3:<pre><code> from math import * def number_to_trits(n): # little-endian encoding # (i.e. smallest precision trit first) trits = [] v = n for i in range(1,9): v, d = divmod(v, 3**i) trits.append(d) return trits print(number_to_trits(8)) # [2, 2, 0, 0, 0, 0, 0, 0] def trits_to_number(trits): n = 0 for i, trit in enumerate(trits): n += trit*3**i return n print(trits_to_number([2,2])) # 8</code></pre>

mamispover 8 years ago

I also really liked this page on bitpacking, which also offers a technique to implement them using only math operations.<a href="http://number-none.com/product/Packing%20Integers/index.html" rel="nofollow">http://number-none.com/product/Packing%20Integers/index.html</a>I recently implemented a bitpacker in a game VM that didn't have bitwise operations, and the math-based version worked perfectly.

strikingover 8 years ago

Isn't it possible to use SQLite or something for something like this? That would greatly simplify most of these tasks, making your dataset reachable through a fast declarative language.As many games of Go as 100000 is, it's not so big you couldn't use a regular, untuned database.

评论 #13093737 未加载

bill45over 8 years ago

This method is only optimal if your numbers have ranges that fit nicely into powers of two. See here for optimal bit packing: <a href="https://codeplea.com/optimal-bit-packing" rel="nofollow">https://codeplea.com/optimal-bit-packing</a>

deutroniumover 8 years ago

Neat :)I wrote some very slow code to set/get the Nth bit in an array of bytes.I was planning on using it for Chess bitboards - <a href="https://en.wikipedia.org/wiki/Bitboard" rel="nofollow">https://en.wikipedia.org/wiki/Bitboard</a>, but it could be used for say 8 booleans to the byte.unsigned char getbit(unsigned char * bits,unsigned long n){<pre><code> return (bits[n/8] & (unsigned char)pow(2,n%8)) >> n%8; </code></pre> }void setbit(unsigned char * bits,unsigned long n, unsigned char val){<pre><code> bits[n/8] = (bits[n/8] & ~(unsigned char)pow(2,n%8)) | ((unsigned char)pow(2,n%8) * val); }</code></pre>

评论 #13093465 未加载

amingilaniover 8 years ago

Great read, but you should really move the motivation up to the top before you start the explanation.

ChicagoDaveover 8 years ago

"a lot" is two words.

评论 #13093606 未加载

13 comments

new299over 8 years ago

评论 #13093595 未加载

评论 #13093566 未加载

评论 #13093802 未加载

评论 #13093787 未加载

评论 #13093740 未加载

bitcharmerover 8 years ago

BurningFrogover 8 years ago

Since 3⁵ (243) is less than 256, you can actually fit 5 "trinary" numbers into a byte.The code gets more complex and slow, but not really by much.

评论 #13093874 未加载

评论 #13093869 未加载

评论 #13093583 未加载

gefhover 8 years ago

Alternatively, just store it naively and gzip. Or use a sparse array. Or, best, don't do anything until storage size is an actual, measured bottleneck

pjscottover 8 years ago

评论 #13093409 未加载

Walkmanover 8 years ago

Either I don't understand something or the Resulting bytes after c should be '11100100' and after d should be '11100110'.

评论 #13094778 未加载

smaddoxover 8 years ago

mamispover 8 years ago

strikingover 8 years ago

评论 #13093737 未加载

bill45over 8 years ago

deutroniumover 8 years ago

评论 #13093465 未加载

amingilaniover 8 years ago

Great read, but you should really move the motivation up to the top before you start the explanation.

ChicagoDaveover 8 years ago

"a lot" is two words.

评论 #13093606 未加载

How to save multiple numbers into one byte

13 comments

How to save multiple numbers into one byte

13 comments