I have seen some amazing benchmarks used to rank LLMs' abilities, and it got me thinking: are there similar benchmarks for propensity modelling, churn prediction, or other types of models?<p>Are there best practices for comparing model performance beyond benchmark data when the models may have different underlying datasets?
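To make the second question more concrete, here is a minimal sketch of the kind of comparison I mean, assuming a shared tabular dataset with a binary churn label (the synthetic data and the two model choices are just placeholders):<p><pre><code># Minimal sketch: compare two churn-style classifiers on the same
# metric (ROC AUC) with stratified cross-validation.
# The data is synthetic and stands in for a real propensity/churn table.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Stand-in for a real churn dataset: 20 features, imbalanced binary label.
X, y = make_classification(n_samples=5000, n_features=20,
                           weights=[0.9, 0.1], random_state=0)

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
for name, model in [("logistic", LogisticRegression(max_iter=1000)),
                    ("gbm", GradientBoostingClassifier())]:
    scores = cross_val_score(model, X, y, cv=cv, scoring="roc_auc")
    print(f"{name}: mean ROC AUC = {scores.mean():.3f} (+/- {scores.std():.3f})")
</code></pre><p>That works when the candidates share a dataset; what I don't know is how to rank models trained on different underlying datasets, beyond reporting the same metric on each model's own held-out split.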
On PapersWithCode, different datasets have benchmarks: <a href="https://paperswithcode.com/datasets" rel="nofollow">https://paperswithcode.com/datasets</a><p>You can also break down by task here: <a href="https://paperswithcode.com/sota" rel="nofollow">https://paperswithcode.com/sota</a><p>For churn, you might go to time series forecasting first:
<a href="https://paperswithcode.com/task/time-series-forecasting" rel="nofollow">https://paperswithcode.com/task/time-series-forecasting</a><p>They have this subtask, which is a bit different because it's about novel products rather than continued sales, for example:<p><a href="https://paperswithcode.com/task/new-product-sales-forecasting" rel="nofollow">https://paperswithcode.com/task/new-product-sales-forecastin...</a><p>But you get the idea of how they organise by task.
I'm curious about other benchmarks and interfaces too and would like to see what else people use.<p>I think HuggingFace and Kaggle have some overlap, with different tasks that have benchmarks.
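For example, HuggingFace's evaluate library ships standard metrics such as ROC AUC, so at least the scoring can be kept consistent across models even when the underlying data differs; a tiny sketch with made-up labels and scores:<p><pre><code># Sketch: score a model's held-out predictions with a standard metric
# via HuggingFace's evaluate library. Labels and scores are made up.
import evaluate

roc_auc = evaluate.load("roc_auc")

# Model A, evaluated on its own held-out set.
result_a = roc_auc.compute(references=[0, 1, 1, 0, 1],
                           prediction_scores=[0.2, 0.8, 0.6, 0.3, 0.9])
print("model A ROC AUC:", result_a["roc_auc"])
</code></pre><p>Kaggle competition leaderboards play a similar role, but only within the fixed dataset of each competition.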