Always nice to see more alternatives to Elasticsearch. That project could be so much better with some proper planning and focus.<p>There's also Toshi: <a href="https://github.com/toshi-search/Toshi" rel="nofollow">https://github.com/toshi-search/Toshi</a> which is built on top of Tantivy: <a href="https://github.com/tantivy-search/tantivy" rel="nofollow">https://github.com/tantivy-search/tantivy</a><p>And for C++, there's Xapiland: <a href="https://github.com/Kronuz/Xapiand" rel="nofollow">https://github.com/Kronuz/Xapiand</a><p>And for Go, there's Blast: <a href="https://github.com/mosuka/blast" rel="nofollow">https://github.com/mosuka/blast</a> built on Bleve: <a href="https://github.com/blevesearch/bleve" rel="nofollow">https://github.com/blevesearch/bleve</a>
Huh. When this was initially posted it had a weird and commercially restrictive license, but it looks like that's been reverted, possibly after (polite) discussion on /r/rust. It's MPL 2 now.<p><a href="https://github.com/valeriansaliou/sonic/issues/52#issuecomment-475911660" rel="nofollow">https://github.com/valeriansaliou/sonic/issues/52#issuecomme...</a>
Performance figures are awesome. Also language support is great.<p>> Sonic only keeps the N most recently pushed results for a given word<p>This index discards old entries. This is fine for messages, in which aging items lose relevance. Yet the developer uses it for a help desk, which I think should give equal importance to all items.<p>In this area I would say the main comperitor is Groonga. It can be integrated into PostgreSQL with the PGroonga extension, and it indexes all of the data. However it consumes way more ram.
Wow. I’ve spent the last three weeks building a custom search solution in Kotlin - in my case I’m using tokenizers from Lucene and using a radix trie as an index. I actually looked at using Bleve (another rust search lib) initially but it didn’t have the right language support<p>Glancing over this it looks like a nearly perfect fit for my use case I just wish I had seen this a couple of weeks earlier!
Talk about perfect timing!<p>I was looking for something just like this for a project in my team. We had been using this setup where a huge chunk of the data was being stored in triplicate: some of it in ES, some more of it in another database and finally the whole dataset in our data warehouse.<p>Hopefully I can use this to only provide the index + full text capability and just use the warehouse itself as the main db because the query performance is similar enough and the warehouse is criminally underused for what we pay for it.