TechEcho

8 comments

ignoramousabout 9 years ago

Wikidata did a comprehensive analysis of Graph DBs [0], and settled on BlazeGraph with TitanDB coming a close second.Notably, there are quite a few omissions. DGraph and Cayley [1] being two of those. Interestingly, both are developed by Googlers. Cayley is used by Kythe.io [2], a Google project that kind of competes with srclib [3] by SourceGraph.Cayley has native JavaScript interface, which makes it an interesting choice for Node JS based apps.At work, we settled on TitanDB, primarily because it supports DynamoDB/Cassandra for storage and ElasticSearch. Most of the graph DBs rely on some storage engine or the other underneath-- Cayley supports LevelDB, for instance; whereas TitanDB supports BerkeleyDB apart from aforementioned DyanmoDB and Cassandra.[0] <a href="https://docs.google.com/a/wikimedia.org/spreadsheets/d/1MXikljoSUVP77w7JKf9EXN40OB-ZkMqT8Y5b2NYVKbU/edit#gid=0" rel="nofollow">https://docs.google.com/a/wikimedia.org/spreadsheets/d/1MXik...</a>[1] <a href="https://github.com/google/cayley" rel="nofollow">https://github.com/google/cayley</a>[2] <a href="https://kythe.io" rel="nofollow">https://kythe.io</a>[3] <a href="https://srclib.org" rel="nofollow">https://srclib.org</a>

评论 #11325896 未加载

评论 #11325898 未加载

评论 #11329295 未加载

simonwabout 9 years ago

This is really exciting. I've been hoping for a robust, distributed open source Graph database ever since I first played with Freebase (which clearly had some amazing secret sauce, long-since purchased by Google). The engineer behind DGraph has worked on Google Knowledge Graph, the spiritual successor to Freebase, and obviously understands the space incredibly well: <a href="https://twitter.com/manishrjain" rel="nofollow">https://twitter.com/manishrjain</a>

nlabout 9 years ago

This looks excellent!Some questions because I need something like this:What does "distributed" mean in this context? Can the graph size be larger than the storage on a single node? If so, how is it partitioned (I think Titan was randomly partitioned)?Has any thought been given to in-graph processing (PageRank etc)?

评论 #11326299 未加载

pbowyerabout 9 years ago

All the graph database traversals I've seen are fairly simple (Friend of a friend, Movies starring X).Are they a good choice for turn-by-turn navigation, and answering questions (given a traffic dataset) like: "What has been the quickest route between A and P, departing at 8am on a Monday morning?"

评论 #11326787 未加载

jervenabout 9 years ago

Your landing page is missing any kind of "evidence" that it is scaleable, low-latency or high throughput.Also if you are sharing on predicate you will end up in big trouble. Predicates in most RDF datasets are not at all evenly distributed, tending more towards extreme value distributions. e.g. in UniProt the most common predicate has 2,419,000,171 occurrences, the least 1!Also if you are going to benchmark can I suggest the rather good LDBC ones[1]. Even if for marketing reasons you don't want them public they are good to show where you can improve.[1]<a href="http://www.ldbcouncil.org/" rel="nofollow">http://www.ldbcouncil.org/</a>

评论 #11330430 未加载

bikamonkiabout 9 years ago

Is a Graph DB suitable for use cases like products/homes/cars etc where users mostly do "and" queries to narrow down the results set? If so, is it faster than traditional SQL DB?

评论 #11325266 未加载

lobster_johnsonabout 9 years ago

This looks very promising!Are you planning to add filtering, supported by indexes? Seems a bit useless for production use if you can't filter a query by predicate, or even sort/limit. You could layer something like Elasticsearch on top of it, but then you lose all the graph support.Any thoughts on enforcing schemas?

评论 #11325915 未加载

edddabout 9 years ago

It has been a while since I've seen so many buzzwords in one HN topic.

8 comments

ignoramousabout 9 years ago

评论 #11325896 未加载

评论 #11325898 未加载

评论 #11329295 未加载

simonwabout 9 years ago

nlabout 9 years ago

评论 #11326299 未加载

pbowyerabout 9 years ago

评论 #11326787 未加载

jervenabout 9 years ago

评论 #11330430 未加载

bikamonkiabout 9 years ago

Is a Graph DB suitable for use cases like products/homes/cars etc where users mostly do "and" queries to narrow down the results set? If so, is it faster than traditional SQL DB?

DGraph – Scalable, Distributed, Low-Latency, High-Throughput Graph Database

8 comments

DGraph – Scalable, Distributed, Low-Latency, High-Throughput Graph Database

8 comments