Sophia – An embeddable key-value database

134 pointsby isaacbover 11 years ago

21 comments

rgbrgbover 11 years ago

I know it seems superficial but beautiful docs are one of my most trusted heuristics when I'm considering using a library. If the author cares about the aesthetics of the docs, it often means they care about the aesthetics of the code, which really does matter a lot. We can make ugly things or we can make beautiful things. I really respect people who take the time to make beautiful tools.

评论 #6396967 未加载

评论 #6397098 未加载

saurikover 11 years ago

Anyone know anything about how this compares in practice to Lightning MDB (which uses a memory mapped B-tree, I think, and is apparently insanely faster than most of the other enbedded key-value stores people normally examine)?

评论 #6398469 未加载

评论 #6398060 未加载

apendletonover 11 years ago

Having just gone through the exercise of picking an embedded key-value store for a project, some things that would be nice: how does it compare to other things besides leveldb (which, to be frank, isn't a stellar performer)? In particular, how does it compare to Tokyo/Kyoto Cabinet, Lightning MDB, or Sqlite4's LSM? Does it support data compression (either with a single pre-selected algorithm like LevelDB and Snappy, or in pluggable fashion like LSM)? How does it deal with concurrent access by multiple processes?

评论 #6398064 未加载

clumsysmurfover 11 years ago

For Java / Android, I've been using H2's MVStore, which is log structured and uses counted B+-trees. It's nice not having to go through JNI for good performance in Java.<a href="http://www.h2database.com/html/mvstore.html" rel="nofollow">http://www.h2database.com/html/mvstore.html</a>

评论 #6397227 未加载

评论 #6398621 未加载

dfischerover 11 years ago

Typography is hard to read.

评论 #6396926 未加载

i_have_to_speakover 11 years ago

Cute website. Some random thoughts:Concurrency:- No mention of it. There appear to be spin locks in the source. No multi-threaded tests.Stability and data safety:- Github has 2 days of history, and 4kLoC of test code. Why should I trust my data to you?"high load environment":- So what exactly does it do in a "high load environment"? How do you define "high load" in the first place? CPU load? I/O load from other processes? What shortcomings of the competition under a "high load environment" are you trying overcome?Backup:- How do I do hot backup?Benchmark:- LevelDB is not a fair comparison as it offers additional non-trivial functionality (snapshots) that cannot be built up on top of Sophia. LevelDB APIs are also safe for concurrent use, which adds overhead. Kyoto Cabinet would have been more suitable as a peer to benchmark with.- 3 million records with 16-byte keys and 100-byte values is not really an interesting benchmark dataset.- Iteration over a static database is not interesting, either. Is there any alternative other than locking an entire mutating database for the duration of iteration?

pwpwpover 11 years ago

I, for one, wouldn't trust my data to a library by somebody who uses the same text decoration for hyperlinks as for plain text.

评论 #6398699 未加载

oscargrouchover 11 years ago

Really guys, can you give more constructive or at least more (not based in bullshit assumptions) comments? if not, just shut up..This is a non-trivial effort, and all people do is to complain about the font face or if the punctuation was right?First, in the benchmarks it just crush leveldb, this is already by itself a great achievement. can you confront the benchmarks? you do it one yourself with a different configuration? no?Second, if you are not a database expert and can create proper critics (constructive or not), just keep it to yourself.. i wonder how so many people get up with all of this conclusions so fast, without a proper look at the source code and to have a reasonable amount of time to know what are they talking about.its very hard to create things like this, but very easy to critisize without any background.. dont forget about itif you have something to say about a small thing, that do not have a direct relation to the product or thing itself, if theres already one comment about it, that enough! do not spam, answering it, or creating new comments about it, this is just so rude and unrespectful..really, things are getting creepy on HN.. and its not only in this thread

评论 #6402145 未加载

Negitivefragsover 11 years ago

It says that the benchmark source is on github, but I can't find it.It doesn't appear to be in their primary repo.I would like to try and do my own test against another embedded data store like Berkeley DB but I want to know more about the conditions on the test. How many threads were used, that kind of thing.

评论 #6397302 未加载

Goopplesoftover 11 years ago

Very cool. As a suggestion, increase the link size under the main title, wasn't clear to me what the next step was at first after reading the introduction text.

laichzeit0over 11 years ago

In case any of the devs read this:1. Can multiple processes use the same database concurrently? (Separate address space processes, not fork()'d)2. Have you tested this with uClib/cross compiler? (I would like to use it on a MIPs embedded router)The reason I ask this is because I recently had the displeasure of having to hack a non-volatile RAM library to work with shared memory / thread safe and something small like this would be a perfect replacement with a lot less pain.

dkhenryover 11 years ago

Was this man's computer use being charged by the key stroke? I mean I understand using a few abbreviations here and there, but i.c ? At least name your files descriptively.I would avoid using this for realzies if only for the fact that if something broke trying to fix it in that code base would be prohibitive

hosay123over 11 years ago

Looks nice, but note this doesn't appear to support consistent reads (unlike LevelDB snapshots)

conductorover 11 years ago

I love the simplicity of the site and the C code. I will definitely use it, thank you.

MichaelGGover 11 years ago

I've got a need for something like this, but would like to have the keys and values delta encoded to achieve simple, yet effective, compression.

评论 #6398878 未加载

acron0over 11 years ago

Had a quick, 30 min bash at a win32 port using msinttypes and pthread-win32 but no luck yet :( Would love to see one though...

ksecover 11 years ago

Something for Mozilla to consider using inside Firefox inplace of LevelDB ( If that was ever landed )

busterover 11 years ago

I'd be far more interested in benchmarks versus BDB (and maybe even sqlite).

maakuover 11 years ago

Snapshots? I could find it in the documentation..

jgalt212over 11 years ago

any word on support for unicode keys?

评论 #6398059 未加载

luisbebopover 11 years ago

Awesome work, congratulations!