SOTAVerified

Sentence

Papers

Showing 51-100 of 10752 papers

| Title | Status | Hype |
| --- | --- | --- |
| RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | Code | 2 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | Code | 2 |
| Segment and Caption Anything | Code | 2 |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Code | 2 |
| Exploring Human-Like Translation Strategy with Large Language Models | Code | 2 |
| Active Retrieval Augmented Generation | Code | 2 |
| SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection | Code | 2 |
| SimCSE: Simple Contrastive Learning of Sentence Embeddings | Code | 2 |
| Improving CLIP Training with Language Rewrites | Code | 2 |
| TEACH: Temporal Action Composition for 3D Humans | Code | 2 |
| Thought Anchors: Which LLM Reasoning Steps Matter? | Code | 2 |
| Toward Controlled Generation of Text | Code | 2 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Code | 2 |
| DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory | Code | 2 |
| BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings | Code | 2 |
| Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling | Code | 2 |
| Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding | Code | 2 |
| Compositional Visual Generation with Composable Diffusion Models | Code | 2 |
| Compositional Entailment Learning for Hyperbolic Vision-Language Models | Code | 2 |
| Comprehending and Ordering Semantics for Image Captioning | Code | 2 |
| Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation | Code | 2 |
| DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement | Code | 2 |
| BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs | Code | 2 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Code | 2 |
| AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction | Code | 2 |
| Deduplicating Training Data Makes Language Models Better | Code | 2 |
| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing | Code | 2 |
| DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings | Code | 2 |
| MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Code | 2 |
| DreamLIP: Language-Image Pre-training with Long Captions | Code | 2 |
| Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Code | 2 |
| BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Code | 2 |
| BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric | Code | 2 |
| CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers | Code | 2 |
| Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Code | 2 |
| Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | Code | 2 |
| AutoRE: Document-Level Relation Extraction with Large Language Models | Code | 2 |
| "I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset | Code | 2 |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Code | 2 |
| Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations | Code | 2 |
| Abstractive Summarization of Spoken and Written Instructions with BERT | Code | 2 |
| Learning representations of learning representations | Code | 2 |
| ANAH: Analytical Annotation of Hallucinations in Large Language Models | Code | 2 |
| ARAGOG: Advanced RAG Output Grading | Code | 2 |
| beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems | Code | 2 |
| MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction | Code | 2 |
| One Thousand and One Pairs: A "novel" challenge for long-context language models | Code | 2 |
| PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Code | 2 |
| CLUE: A Chinese Language Understanding Evaluation Benchmark | Code | 2 |
| DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models | Code | 2 |

No leaderboard results yet.