TechEcho

8 comments

shimonalmost 15 years ago

I once heard of a system for representing threaded comments in a relational database, wherein the ID of each comment was a slash-separated list of its ancestors, e.g.:1 Initial comment1/1 Re: Initial comment1/2 Re: Initial comment1/2/1 Re: Re: Initial comment1/2/2 ...These sorts of IDs have the wonderful property that you can efficiently select any subtree using a LIKE query, e.g. "SELECT * FROM comments WHERE path LIKE '1/%'". These queries are efficient because they use string btree indexes. And the results may even be sorted the right way, provided you have monotonically increasing IDs within each level.I think I read about this technique being used on Slashdot a few years ago, but I can't remember the name or find any reference. Still, it seems like a way to achieve the same benefits that the OP cites for comments with PostgreSQL's ltree support on any database with reasonable string indexing.Anyone know the name of that technique?

评论 #1390474 未加载

评论 #1390486 未加载

评论 #1390912 未加载

评论 #1390510 未加载

评论 #1391461 未加载

评论 #1391810 未加载

评论 #1391039 未加载

megaman821almost 15 years ago

If you are using Django, <a href="http://bitbucket.org/tabo/django-treebeard/overview" rel="nofollow">http://bitbucket.org/tabo/django-treebeard/overview</a>, is a great app to use. It implements several different tree storage methods.

bacarteralmost 15 years ago

I used these for a commenting system as well, they're really quite amazing. There's basically three options when storing a tree in a relational database - adjacency lists which are slow to read, nested sets which are slow for writes, and materialized path, which is slow for moving around nodes. It's good to have a fourth option which is straightforward to implement and has excellent performance.

zeegalmost 15 years ago

Just wanted to throw an update over here. The examples in my post are specifically geared towards a basic threaded comment structure, which Disqus is not. Our usage is much more complex than this example, but we are able to handle sorting by dynamic data (which we use in our "Most Popular" sort algorithm).In other words, recursion still works, and in our tests has proven faster than doing it at the application layer (no matter how you sugar coat it).

csytanalmost 15 years ago

I tackled this problem on appengine a while ago.Turns out that the average discussion thread has few enough comments that they can be fetched from the db in one call (e.g. by the thread_id). This makes even deeply nested comments fairly simple to query for, and then it is quite easy to rebuild the tree in memory.I think reddit uses this method too.

评论 #1391469 未加载

seldoalmost 15 years ago

The Oracle equivalent of this functionality if called CONNECT BY PRIOR:<a href="http://ss64.com/ora/connectby.html" rel="nofollow">http://ss64.com/ora/connectby.html</a>I used it back in 2002 to build some messageboards and it was pretty effing magical. Glad to see equivalent functionality finally make it to OSS.

petervandijckalmost 15 years ago

Cool. I thought I was the only one that couldn't figure out a nice way to do threaded comments, turns out even reddit orders stuff in memory.

评论 #1390476 未加载

评论 #1390415 未加载

papersmithalmost 15 years ago

Personally I just sort everything in memory like reddit, since I need to traverse all branches to calculate weighed scores, which is a balance of vote counts and age. I think reddit/HN do something similar? I wonder how Disqus handles this.I'm not really sure how to do that in SQL.

8 comments

shimonalmost 15 years ago

评论 #1390474 未加载

评论 #1390486 未加载

评论 #1390912 未加载

评论 #1390510 未加载

评论 #1391461 未加载

评论 #1391810 未加载

评论 #1391039 未加载

megaman821almost 15 years ago

bacarteralmost 15 years ago

zeegalmost 15 years ago

csytanalmost 15 years ago

评论 #1391469 未加载

seldoalmost 15 years ago

petervandijckalmost 15 years ago

Cool. I thought I was the only one that couldn't figure out a nice way to do threaded comments, turns out even reddit orders stuff in memory.

评论 #1390476 未加载

评论 #1390415 未加载

papersmithalmost 15 years ago

Postgresql's new recursive queries at Disqus: 500% speedup

8 comments

Postgresql's new recursive queries at Disqus: 500% speedup

8 comments