Sentence

Papers

Showing 51–100 of 10752 papers

Title	Date	Tasks	Status	Hype	Score
RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models	May 4, 2023	Information Retrieval, Open-Domain Question Answering	Code Available	2	5
RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder	May 24, 2022	Decoder, Information Retrieval	Code Available	2	5
DreamLIP: Language-Image Pre-training with Long Captions	Mar 25, 2024	Contrastive Learning, Image-text Retrieval	Code Available	2	5
Segment and Caption Anything	Dec 1, 2023	Caption Generation, object-detection	Code Available	2	5
Enhancing Retrieval-Augmented Generation: A Study of Best Practices	Jan 13, 2025	In-Context Learning, RAG	Code Available	2	5
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations	Aug 22, 2023	Decoder, Machine Translation	Code Available	2	5
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection	May 16, 2024	object-detection, Object Detection	Code Available	2	5
SimCSE: Simple Contrastive Learning of Sentence Embeddings	Apr 18, 2021	Contrastive Learning, Data Augmentation	Code Available	2	5
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning	Feb 5, 2024	In-Context Learning, Metric Learning	Code Available	2	5
TEACH: Temporal Action Composition for 3D Humans	Sep 9, 2022	Motion Synthesis, Sentence	Code Available	2	5
Thought Anchors: Which LLM Reasoning Steps Matter?	Jun 23, 2025	counterfactual, Sentence	Code Available	2	5
Toward Controlled Generation of Text	Mar 2, 2017	Attribute, Sentence	Code Available	2	5
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions	Aug 16, 2023	Motion Expressions Guided Video Segmentation, Object	Code Available	2	5
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings	Apr 21, 2022	Contrastive Learning, Language Modeling	Code Available	2	5
DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory	Oct 10, 2024	Document Translation, Machine Translation	Code Available	2	5
Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling	Jul 16, 2023	Diagnostic, Language Modelling	Code Available	2	5
Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding	Nov 15, 2023	Highlight Detection, Moment Retrieval	Code Available	2	5
Compositional Visual Generation with Composable Diffusion Models	Jun 3, 2022	Sentence	Code Available	2	5
Compositional Entailment Learning for Hyperbolic Vision-Language Models	Oct 9, 2024	Language Modelling, Representation Learning	Code Available	2	5
Comprehending and Ordering Semantics for Image Captioning	Jun 14, 2022	Cross-Modal Retrieval, Image Captioning	Code Available	2	5
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation	Apr 4, 2024	Contrastive Learning, Referring Expression	Code Available	2	5
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement	Jun 18, 2025	Graph Generation, Hallucination	Code Available	2	5
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs	Jul 17, 2023	Instruction Following, Sentence	Code Available	2	5
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation	Jan 11, 2022	Sentence, Speech-to-Speech Translation	Code Available	2	5
Deduplicating Training Data Makes Language Models Better	Jul 14, 2021	Language Modeling, Language Modelling	Code Available	2	5
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings	Nov 9, 2023	Language Modeling, Language Modelling	Code Available	2	5
Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations	Feb 20, 2024	Sentence	Code Available	2	5
AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction	Sep 3, 2024	Relation, Relation Extraction	Code Available	2	5
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing	Feb 21, 2022	Few-Shot Learning, Sentence	Code Available	2	5
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric	Dec 16, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	2	5
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval	Jul 2, 2023	Biomedical Information Retrieval, Contrastive Learning	Code Available	2	5
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval	Apr 21, 2022	Cross-Modal Retrieval, Image Retrieval	Code Available	2	5
CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers	Oct 1, 2022	Grammatical Error Correction, Sentence	Code Available	2	5
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training	Jun 2, 2023	Language Modeling, Language Modelling	Code Available	2	5
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations	Sep 26, 2019	Common Sense Reasoning, GPU	Code Available	2	5
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale	Mar 13, 2024	Constituency Grammar Induction, Language Modeling	Code Available	2	5
AutoRE: Document-Level Relation Extraction with Large Language Models	Mar 21, 2024	Document-level Relation Extraction, Relation	Code Available	2	5
"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset	May 18, 2022	Sentence	Code Available	2	5
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems	Sep 16, 2024	Collaborative Filtering, Recommendation Systems	Code Available	2	5
ARAGOG: Advanced RAG Output Grading	Apr 1, 2024	Document Embedding, Language Modeling	Code Available	2	5
Abstractive Summarization of Spoken and Written Instructions with BERT	Aug 21, 2020	Abstractive Text Summarization, Articles	Code Available	2	5
Learning representations of learning representations	Apr 12, 2024	Sentence	Code Available	2	5
ANAH: Analytical Annotation of Hallucinations in Large Language Models	May 30, 2024	Generative Question Answering, Hallucination	Code Available	2	5
A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models	Oct 5, 2024	Language Modeling, Language Modelling	Code Available	2	5
BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model	Aug 20, 2024	Language Modeling, Language Modelling	Code Available	2	5
MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction	Apr 23, 2022	Grammatical Error Correction, Sentence	Code Available	2	5
One Thousand and One Pairs: A "novel" challenge for long-context language models	Jun 24, 2024	Retrieval, Sentence	Code Available	2	5
PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification	Aug 30, 2019	Paraphrase Identification, Sentence	Code Available	2	5
CLUE: A Chinese Language Understanding Evaluation Benchmark	Apr 13, 2020	General Classification, Machine Reading Comprehension	Code Available	2	5
DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models	Mar 15, 2024	RAG, Retrieval	Code Available	2	5
