科技回声 (Tech Echo)

A tech news platform built with Next.js, serving global technology news and discussion.

Show HN: An adaptive classifier that detects hallucinations in LLM/RAG outputs

1 point · by codelion · 2 months ago

1 comment

codelion · 2 months ago
I built an open-source hallucination detector that identifies when LLM outputs contain information not present in the source context. The tool is particularly useful for RAG systems where ensuring factual accuracy is critical.

Unlike most hallucination detection approaches that require separate LLM calls (which add cost and latency), this is a lightweight classifier built on HuggingFace transformers. It's adaptive, meaning it continuously improves as it processes more examples.

Technical approach:

- Uses a prototype memory system that maintains class examples for quick adaptation
- Combines transformer embeddings with an adaptive neural layer
- Trained on the RAGTruth benchmark dataset across QA, summarization, and data-to-text tasks
- Achieves 80.7% recall overall (51.5% F1), with the strongest performance on data-to-text generation

Example usage:

    from adaptive_classifier import AdaptiveClassifier

    # Load the pre-trained detector
    detector = AdaptiveClassifier.from_pretrained("adaptive-classifier/llm-hallucination-detector")

    # Format input with context, query, and response
    input_text = f"Context: {your_context}\nQuestion: {your_question}\nAnswer: {llm_response}"

    # Get prediction
    prediction = detector.predict(input_text)
    # Returns: [('HALLUCINATED', 0.72), ('NOT_HALLUCINATED', 0.28)]

Current limitations:

- Performance varies by task type (stronger on data-to-text, weaker on summarization precision)
- The initial version focuses on binary classification; token-level detection is planned
- The model is relatively small, so it won't catch subtle, nuanced hallucinations that require deep domain knowledge

The library's wider goal is to enable adaptive classification for use cases where models need to continuously learn from new examples. We've also built LLM routers and configuration optimizers with it.

Would love feedback from anyone working on RAG systems or LLM evaluation. What metrics or capabilities would be most useful to you in a hallucination detector?

Project: https://github.com/codelion/adaptive-classifier

Docs: https://github.com/codelion/adaptive-classifier#hallucination-detector
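The "prototype memory" idea the author describes can be illustrated independently of the library. The sketch below is my own toy code, not the project's actual implementation: each class keeps a running-mean prototype of its example embeddings, and prediction ranks classes by cosine similarity to each prototype. A real system would use transformer embeddings; here a trivial hashed bag-of-words vector stands in so the example is self-contained.

```python
import math
from collections import defaultdict

def embed(text, dim=64):
    """Toy stand-in for a transformer embedding: a normalized hashed bag-of-words."""
    v = [0.0] * dim
    for tok in text.lower().split():
        v[hash(tok) % dim] += 1.0
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

class PrototypeMemory:
    """One running-mean prototype per class; adapts as new labeled examples arrive."""

    def __init__(self, dim=64):
        self.dim = dim
        self.sums = defaultdict(lambda: [0.0] * dim)  # per-class embedding sums
        self.counts = defaultdict(int)                # per-class example counts

    def add_example(self, text, label):
        # Incorporating an example only updates that class's running sum,
        # so adaptation is cheap: no retraining pass is needed.
        e = embed(text, self.dim)
        s = self.sums[label]
        for i, x in enumerate(e):
            s[i] += x
        self.counts[label] += 1

    def predict(self, text):
        """Return (label, cosine_similarity) pairs, best match first."""
        e = embed(text, self.dim)
        scores = []
        for label, s in self.sums.items():
            n = self.counts[label]
            proto = [x / n for x in s]  # mean of the class's example embeddings
            pnorm = math.sqrt(sum(x * x for x in proto)) or 1.0
            sim = sum(a * b for a, b in zip(e, proto)) / pnorm
            scores.append((label, sim))
        return sorted(scores, key=lambda t: -t[1])
```

Each `add_example` call shifts its class prototype slightly, which is what makes this style of classifier "adaptive" without a gradient-descent retraining step; the library layers an adaptive neural head on top of this memory, which the sketch omits.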