科技回声

8 条评论

jamii超过 6 年前

My experience has been that the vast majority of papers on data-structures are at best misleading, and at worst deliberately biased.For example:> The hash table used by the authors of ART in their study was a chained hash table, but this kind of hash tables can be suboptimal in terms of space and performance due to their potentially high use of pointers.> Our experiments strongly indicate that neither ART nor Judy are competitive to the aforementioned hashing schemes in terms of performance, and, in the case of ART, sometimes not even in terms of space.<a href="https://www.victoralvarez.net/papers/A%20Comparison%20of%20Adaptive%20Radix%20Trees%20and%20Hash%20Tables%20-%20ICDE%202015.pdf" rel="nofollow">https://www.victoralvarez.net/papers/A%20Comparison%20of%20A...</a>

评论 #18422542 未加载

评论 #18422440 未加载

评论 #18422142 未加载

acidx超过 6 年前

Nice article and analysis! I'm actually considering scrapping the trie used in my project to something based off of this one, with some modifications:For instance, find_node(c, Node48) could avoid the branch if a non-existing index points to an additional entry in child_ptrs that's always NULL. Lookup would be comparable to the Node256 version.Another thing that could be done, is to scrap the Node48 entirely, and implement two new structs to replace it: Node32 and Node64, and use respectively AVX2 and AVX512. These can be based off of the Node16 version. It remains to be seen if these will yield better performance than the branchless Node48 above, especially if power management kicks in when mixing AVX512 with older SIMD generations.The trie in Lwan (<a href="https://lwan.ws" rel="nofollow">https://lwan.ws</a>) does an interesting trick to reduce the amount of memory used in the equivalent of a Node256: instead of 256 pointers to a node, it has only 8 pointers. Characters are hashed (MOD 8). The leaf node contains a linked list of key/value pair, and an actual string comparison is performed at the end. (Lwan cheats here by avoiding a string comparison if the linked list contains only 1 element.) Works pretty well, as it's part of the URL routing mechanism.One other experiment I've been making with tries, is to use the idea of key compression and use it in a different way: slice it every 4 or 8 bytes, consider those bytes as an arbitrary integer, and add every chunk of it to a hashmap<int, some_struct>, building a chain for the next lookup in some_struct. The prototype I wrote works pretty well.

评论 #18424067 未加载

评论 #18421637 未加载

faragon超过 6 年前

A good point for both RB trees and linear-addressing hash tables is that they can be implemented with vectors( [1], [2]), allowing the case of initial reservation for N elements, so with a tricky implementation you could even have the data structure with one or zero allocations (e. g. allocate the tree or the hash table in the stack). For tries you could use many memory pools for the different node sizes, apply path compression, and even a LUT accelerator for reaching the Nth level, but hardly could be implemented using a vector.[1] <a href="https://github.com/faragon/libsrt/blob/master/src/saux/stree.c" rel="nofollow">https://github.com/faragon/libsrt/blob/master/src/saux/stree...</a>[2] I'm implementing a key-value hash table that will be added to the same library as [1] with "srt_hmap" type, in one continuous allocation. Being able to use hash tables allocated both in the heap and in the stack (e.g. you could use a int32-int32 hash table allocated in the stack for computing the color frequency of a bitmap image). Being the HT performance 4 to 5x the performance of the RB-trees, including cost of rehashing - rehash implementation using techniques for avoiding moving all the data- (rehashing only available for the heap case).

评论 #18420400 未加载

namibj超过 6 年前

Just a friendly reminder that B-trees are often faster on modern microprocessors than RB-trees. See kbtree.h for a simple, yet fast example. I didn't test it, but I'd assume B-tries would be rather efficient.

评论 #18421188 未加载

评论 #18422154 未加载

marknadal超过 6 年前

Radix trees are one of the most under utilized data structures.They are great and have fantastic performance!I implemented a custom on disk storage engine with a Radix format, and am getting on a low end MacBook Air 2015 about 3K/acked writes to disk/second! It is now the default at <a href="https://github.com/amark/gun" rel="nofollow">https://github.com/amark/gun</a> the code is pretty short too.

repsilat超过 6 年前

> This is superior to binary-search: no branches (except for the test when bitfield is 0), and all the comparisons are done in parallelBranchless binary search isn't hard to implement if you know (or can bound) the number of elements statically. You just use the comparison result arithmetically instead of branching on it, and you unroll the loop.Obviously a binary search can't do comparisons in parallel, though.

评论 #18423182 未加载

laxk超过 6 年前

If somebody need a good/fast/well optimized Go implementation of the ART, check this out: <a href="https://github.com/plar/go-adaptive-radix-tree" rel="nofollow">https://github.com/plar/go-adaptive-radix-tree</a> (disclosure: I'm the author)

saagarjha超过 6 年前

I had to implement a trie for an Aho-Corasick implementation a while back, and I just used a std::unordered_set<unichar, std::unique_ptr<trie_node>> to store the children (this was Objective-C++, so I was using UTF-16 characters taken from an NSString). Worked well enough for the effort I put into it.

Beating hash tables with trees? The ART-ful radix trie

8 条评论

Beating hash tables with trees? The ART-ful radix trie

8 条评论