We have open-sourced Thoth ML, the machine learning module of project Thoth (https://github.com/trulia/thoth). Thoth is a real-time Solr monitor and search analysis engine. The Thoth ML module applies machine learning algorithms to the data collected by Thoth to extract useful insights. Currently, the module consists of two tools: the query time predictor and the query pattern recognition tool.<p>The query time predictor receives a Solr search request and predicts in real time (1-3 milliseconds) whether the query will be slow or fast. The search infrastructure can use this prediction in multiple ways. At Trulia, it is used to route some requests to either a slow pool or a fast pool, so that critical user queries don't have to compete for resources with complex, time-consuming queries. This keeps critical queries fast and, in turn, the user experience pleasant.<p>The query pattern recognition tool uses probabilistic topic modeling to find commonly recurring patterns in Solr search requests. These patterns can be quite useful, as the example here shows: https://github.com/trulia/thoth-ml/wiki/Query-Pattern-Recognition<p>Here's the Thoth Machine Learning repository link: https://github.com/trulia/thoth-ml
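<p>To make the slow/fast routing idea concrete, here is a minimal sketch of how a prediction could drive pool selection. It is not Thoth ML's code or API: the `SolrRequest` type, the `route` function, the pool URLs, and the `toy_predictor` heuristic are all hypothetical stand-ins. In practice the predictor would be a trained model rather than a hand-written rule.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class SolrRequest:
    """Hypothetical, simplified view of a Solr search request."""
    query: str
    rows: int = 10


# Hypothetical pool endpoints, used only for illustration.
FAST_POOL = "http://solr-fast.example:8983/solr"
SLOW_POOL = "http://solr-slow.example:8983/solr"


def route(request: SolrRequest,
          predict_slow: Callable[[SolrRequest], bool],
          fast_pool: str = FAST_POOL,
          slow_pool: str = SLOW_POOL) -> str:
    """Return the pool a request should go to, given a slow/fast predictor."""
    return slow_pool if predict_slow(request) else fast_pool


def toy_predictor(request: SolrRequest) -> bool:
    """Placeholder heuristic (an assumption, NOT the Thoth ML model):
    treat wildcard queries and very large result pages as slow."""
    return "*" in request.query or request.rows > 100


if __name__ == "__main__":
    for req in (SolrRequest("city:Austin"),
                SolrRequest("description:*pool*", rows=500)):
        print(req.query, "->", route(req, toy_predictor))
```

Passing the predictor in as a function keeps the routing logic independent of the model, so a trained classifier could replace the toy heuristic without changing `route`.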