Hey HN, glad to see this here. I’m the founder and CEO of Not Diamond. Not Diamond makes it easy to train custom AI model routers on your own data, so you can outperform any single model by routing each query to the highest-quality model for it. We beat every foundation model on every major benchmark at lower cost and latency.<p>To train a router, you just upload a dataset with your inputs and eval scores for different models. It’s completely agnostic to your choice of scoring metrics, frameworks, or tools. And if you don’t have your own eval data, you can still use Not Diamond’s base router out of the box. It takes under 5 minutes to set up.<p>Some other features worth noting:<p>• Python, TypeScript, and REST API support<p>• Option to route to faster/cheaper models when doing so doesn’t impact quality<p>• Joint prompt optimization interface<p>• Online, real-time personalization that tailors model recommendations to individual end users<p>• Blazing fast inference (<100ms latency)<p>• Easy deployments to your private infra<p>Would love to hear what folks think.
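For anyone wondering what "inputs plus eval scores for different models" might look like in practice, here is a rough sketch. It is not Not Diamond's API, upload format, or routing algorithm. The row layout, the model names, the `cost` table, and the `pick_model` helper are all hypothetical. It only shows how per-model scores could be turned into a preferred model for each input, including the "cheaper model when quality isn't impacted" tradeoff via a tolerance.

```python
# Hypothetical illustration only: NOT Not Diamond's API, data format, or algorithm.
from typing import Dict, List

# Each row pairs an input with an eval score per candidate model (higher is better).
# The scoring metric is whatever you already use; the values here are made up.
dataset: List[dict] = [
    {"input": "Summarize this contract", "scores": {"model_a": 0.92, "model_b": 0.90}},
    {"input": "Prove this lemma", "scores": {"model_a": 0.95, "model_b": 0.60}},
]

# Assumed relative cost per call for each model (made-up numbers).
cost: Dict[str, float] = {"model_a": 10.0, "model_b": 1.0}


def pick_model(scores: Dict[str, float], cost: Dict[str, float], tolerance: float = 0.0) -> str:
    """Return the cheapest model whose score is within `tolerance` of the best score."""
    best = max(scores.values())
    # Keep every model that is "good enough" relative to the top scorer.
    candidates = [m for m, s in scores.items() if best - s <= tolerance]
    # Among those, prefer the cheapest; break ties by name for determinism.
    return min(candidates, key=lambda m: (cost[m], m))


# Per-row target model, i.e. the kind of label a router could learn to predict.
labels = [pick_model(row["scores"], cost, tolerance=0.05) for row in dataset]
```

With `tolerance=0.0` this always picks the top-scoring model. Raising the tolerance trades a small amount of quality for cost on rows where the models are close.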
I've been working with the team at Not Diamond and trying out the private beta for a couple of weeks, and the routing experience is great. It makes it really easy to use different models and to route based on different tradeoffs. All I had to do was grab API keys from the different LLM providers and set it up, which was quick.