SOTAVerified

Sentence

Papers

Showing 76-100 of 10752 papers

Title | Status | Hype
Deduplicating Training Data Makes Language Models Better | Code | 2
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing | Code | 2
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings | Code | 2
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Code | 2
DreamLIP: Language-Image Pre-training with Long Captions | Code | 2
Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Code | 2
BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Code | 2
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric | Code | 2
CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers | Code | 2
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Code | 2
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | Code | 2
AutoRE: Document-Level Relation Extraction with Large Language Models | Code | 2
"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset | Code | 2
A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Code | 2
Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations | Code | 2
Abstractive Summarization of Spoken and Written Instructions with BERT | Code | 2
Learning representations of learning representations | Code | 2
ANAH: Analytical Annotation of Hallucinations in Large Language Models | Code | 2
ARAGOG: Advanced RAG Output Grading | Code | 2
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems | Code | 2
MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction | Code | 2
One Thousand and One Pairs: A "novel" challenge for long-context language models | Code | 2
PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Code | 2
CLUE: A Chinese Language Understanding Evaluation Benchmark | Code | 2
DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models | Code | 2

No leaderboard results yet.