Does anyone have a good list of books, tutorials, howtos and whatnot about graph databases in general, preferably with python examples (but any language is good really...)?<p>I've used graphdbs in the past but a nice collection of patterns and best practices would be nice - upping my game on this topic is a current interest of mine!
Could anyone explain to me what it means "native" versus "non-native" graph processing in that slide show? Ditto for "native" versus "non-native" graph storage. I simply have no idea what I'm supposed to picture when I see that.<p>Also, on the neo4j.org page, the claim that "graph data model['s] expressiveness supersedes the relational model" seems a little bit spurious, seeing as, as I understand it, the relational model and graph data are both anchored in first-order predicate logic, and therefore should be able to do the same things essentially (although Codd-style RDBMS with a little bit more fuss regarding the necessary schemas).
I just joined a project using neo4j. They're using the latest version so we've had to build our own python tooling. It's still very immature but hopefully we'll get it into an opensourcable state.<p>Modelling in graphs is new to me so I was wondering if anyone had any tips or pointers.
I'm working on a project that requires a tree datastructure (basically a graph but with only single-direction parent-child relationships) and the number of nodes will stay under a thousand. I could have chosen a graph database, but for my level of complexity I just used postgres and a table that has foriegn-key relationships to itself.<p>Then I made a rails front end using the acts-as-sane-tree gem, which is designed to use this postgres data model and recursive queries: <a href="https://github.com/chrisroberts/acts_as_sane_tree" rel="nofollow">https://github.com/chrisroberts/acts_as_sane_tree</a>
I had a look into python client library for neo4j a year ago, and couldn't find a way to perform multiple graph writes in a single transaction, because the only API available was the http one. Has that changed since ?
Many things in the graphdb space are broken.<p>Tinkerpop people are pushing too hard Gremlin DSL/API/whatever which is AFAIK only useful in some situation somewhat complex and more or less a nice way to write some common queries. But in simple situations any language with the raw Graph API can do the job. And there is still no drivers for Python in Rexster. I tried, but it was too complicated. Rexster itself is too complicated.<p>Neo4J with their own query language made things even more complicated. Instead of a “Graph that can be queried with your preferred language” you get a “Graph that can be queried with something that looks like SQL but is not“<p>ArangoDB is nice for people that want to do JavaScript full stack. Which is not the case of people doing Python.<p>Also, there is nobody marketing graphdbs just saying “it solve the general problem“. period.<p>The only thing that may hold you back from using graphdbs are performances but in <i>a lot of situtations</i> you don't care especially in situations where you want to be flexible and to move fast. That's where GraphDBs shine a lot. Of course there is also the graph/tree problem solving space but this is taken for granted.<p>GraphDB actors market a lot the <i>specialized database</i> aspect of graphdbs, nonetheless graphdbs are good even for solving generic webdev problems.<p>Also if you are looking for a Graph Database server that does just that, and where you can query the graph in Python 2.7 (or Scheme) have a look at: <a href="https://github.com/python-graph-lovestory/Java-GraphitiDB" rel="nofollow">https://github.com/python-graph-lovestory/Java-GraphitiDB</a>
The last graph database that I used for a large project was Virtuoso, which wasn't mentioned in the slides. It's worth a look:<p><a href="http://www.openlinksw.com/dataspace/doc/dav/wiki/Main/" rel="nofollow">http://www.openlinksw.com/dataspace/doc/dav/wiki/Main/</a>
graph-tool is a Python library for graph handling:<p><a href="http://jugad2.blogspot.com/2013/01/graph-tool-python-module-for-graph.html" rel="nofollow">http://jugad2.blogspot.com/2013/01/graph-tool-python-module-...</a>
The one thing I thought was missing from the python tooling for Neo4J was that because *.cyp files are so new they aren't yet handled by the standard documentation toolchain.
Tried neo4j , and I find it handy using python and py2neo . Since my laptop is limited in memory , I couldn't visualize graphs properly from web interface.
What is the secret ingredient of graph databases? The presentation linked from the presentation mentions physical addresses instead of IDs. I get that that would be a speedup, but I would expect it to be more like a constant factor?<p>Then maybe you can save all links from a node in the node, so you can get all the links with one read access. Fine. But as soon as you get to the second or third level, I would expect the magic to be gone. Say every node has 100 links. OK, so the first 100 links you get in constant time c. But to get the second level, you already need 100 requests (one for each node and it's attached link list). So 100c time. For the third level you need 10000 reads, 10000c time. The next level would be 1000000 requests.<p>Just saying I'd expect things to get ugly with a graph database pretty fast, too (not as fast as with a relational db, but still).<p>I haven't really coded a big graph based app, but my expectation would be that get really good performance, a hand coded solution would always be required. For example trying to squeeze as much of the relevant data into memory in a compressed way. Am I wrong?<p>Oh and also I am not sure how good relational DBs are at query optimization. Just because the visible model is "one row per link" doesn't mean the db couldn't do some intelligent caching internally.