Official code for ACL2025 "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"
Official codebase for the ACL 2025 Findings paper: Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval.
smallevals — CPU-fast, GPU-blazing fast offline retrieval evaluation for RAG systems with tiny QA models.
Research-grade neuro-symbolic RAG framework where retrieval is a policy, not a vector search, built for evaluation, ablation, and reliability analysis.
A systems-level analysis of static RAG pipelines, isolating ingestion, retrieval, and ranking boundaries to expose structural failure modes before generation.
Visual RAPTOR ColBERT Integration System - Multimodal document retrieval with SigLIP, PyMuPDF, and evaluation metrics.
A controlled experiment evaluating whether hybrid (dense + sparse) retrieval surfaces evidence that dense-only RAG systems misrank—without changing generation behavior.