Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
Updated Jul 20, 2025
FactSumm: Factual Consistency Scorer for Abstractive Summarization
Codebase, data and models for the SummaC paper in TACL
Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"
The official code for the EMNLP 2022 paper "How Far are We from Robust Long Abstractive Summarization?".
The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.
Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarization
An answer generation perturbation pipeline to validate factuality metrics in NLP
Project for the EMNLP 2023 Findings paper "Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment"
Factual Explainable Recommendation Framework: Datasets & Evaluation
GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.
Real-time LLM hallucination guardrail — NLI + RAG fact-checking with token-level streaming halt. Drop-in for any LLM backend.
AlignRuScore - Adapting AlignScore, a metric for factual consistency evaluation, to Russian Language.