Sentence

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 10752 papers

Title	Date	Tasks	Status	Hype
Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models	Oct 18, 2022	Language ModellingSentence	CodeCode Available	8
Large Concept Models: Language Modeling in a Sentence Representation Space	Dec 11, 2024	Language ModelingLanguage Modelling	CodeCode Available	7
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation	Jun 24, 2024	parameter-efficient fine-tuningSentence	CodeCode Available	7
Interactive Prompt Debugging with Sequence Salience	Apr 11, 2024	Sentencetext-classification	CodeCode Available	7
AutoTrain: No-code training for state-of-the-art models	Oct 21, 2024	Classificationimage-classification	CodeCode Available	7
Factuality Enhanced Language Models for Open-Ended Text Generation	Jun 9, 2022	MisconceptionsSentence	CodeCode Available	5
KBLaM: Knowledge Base augmented Language Model	Oct 14, 2024	8kGPU	CodeCode Available	5
Efficient Few-Shot Learning Without Prompts	Sep 22, 2022	Few-Shot LearningFew-Shot Text Classification	CodeCode Available	4
What Makes Good In-Context Examples for GPT-3?	Jan 17, 2021	Few-Shot LearningNatural Language Understanding	CodeCode Available	4
Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation	Mar 29, 2022	Binary ClassificationSegmentation	CodeCode Available	4
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA	Sep 4, 2024	Question AnsweringSentence	CodeCode Available	4
2D Matryoshka Sentence Embeddings	Feb 22, 2024	RAGRepresentation Learning	CodeCode Available	4
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora	Dec 31, 2020	SentenceTranslation	CodeCode Available	3
Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models	Nov 27, 2024	ClassificationSentence	CodeCode Available	3
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models	May 23, 2024	HallucinationSentence	CodeCode Available	3
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation	May 30, 2023	Machine TranslationSegmentation	CodeCode Available	3
Language Models are Few-Shot Learners	May 28, 2020	answerability predictionArticles	CodeCode Available	3
Bridging Language and Items for Retrieval and Recommendation	Mar 6, 2024	RetrievalSentence	CodeCode Available	3
Diffusion-LM Improves Controllable Text Generation	May 27, 2022	Language ModelingLanguage Modelling	CodeCode Available	3
PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts	Oct 17, 2017	General ClassificationSentence	CodeCode Available	3
Zero-shot Entity Linking with Less Data	Jul 1, 2022	Entity LinkingMulti-Task Learning	CodeCode Available	3
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models	Oct 4, 2024	Dense Video CaptioningSentence	CodeCode Available	2
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training	Jun 2, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation	May 19, 2023	HallucinationMachine Translation	CodeCode Available	2
Enhancing Retrieval-Augmented Generation: A Study of Best Practices	Jan 13, 2025	In-Context LearningRAG	CodeCode Available	2
DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models	Mar 15, 2024	RAGRetrieval	CodeCode Available	2
Exploring Human-Like Translation Strategy with Large Language Models	May 6, 2023	HallucinationMachine Translation	CodeCode Available	2
Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling	Jul 16, 2023	DiagnosticLanguage Modelling	CodeCode Available	2
DreamLIP: Language-Image Pre-training with Long Captions	Mar 25, 2024	Contrastive LearningImage-text Retrieval	CodeCode Available	2
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale	Mar 13, 2024	Constituency Grammar InductionLanguage Modeling	CodeCode Available	2
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval	Apr 21, 2022	Cross-Modal RetrievalImage Retrieval	CodeCode Available	2
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning	Feb 5, 2024	In-Context LearningMetric Learning	CodeCode Available	2
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings	Nov 9, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Active Retrieval Augmented Generation	May 11, 2023	RetrievalRetrieval-augmented Generation	CodeCode Available	2
DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory	Oct 10, 2024	Document TranslationMachine Translation	CodeCode Available	2
Comprehending and Ordering Semantics for Image Captioning	Jun 14, 2022	Cross-Modal RetrievalImage Captioning	CodeCode Available	2
CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers	Oct 1, 2022	Grammatical Error CorrectionSentence	CodeCode Available	2
CLUE: A Chinese Language Understanding Evaluation Benchmark	Apr 13, 2020	General ClassificationMachine Reading Comprehension	CodeCode Available	2
Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding	Nov 15, 2023	Highlight DetectionMoment Retrieval	CodeCode Available	2
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings	Apr 21, 2022	Contrastive LearningLanguage Modeling	CodeCode Available	2
Compositional Entailment Learning for Hyperbolic Vision-Language Models	Oct 9, 2024	Language ModellingRepresentation Learning	CodeCode Available	2
Compositional Visual Generation with Composable Diffusion Models	Jun 3, 2022	Sentence	CodeCode Available	2
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric	Dec 16, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	2
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation	Jan 11, 2022	SentenceSpeech-to-Speech Translation	CodeCode Available	2
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation	Apr 4, 2024	Contrastive LearningReferring Expression	CodeCode Available	2
Deduplicating Training Data Makes Language Models Better	Jul 14, 2021	Language ModelingLanguage Modelling	CodeCode Available	2
Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations	Feb 20, 2024	Sentence	CodeCode Available	2
AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction	Sep 3, 2024	RelationRelation Extraction	CodeCode Available	2
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement	Jun 18, 2025	Graph GenerationHallucination	CodeCode Available	2
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval	Jul 2, 2023	Biomedical Information RetrievalContrastive Learning	CodeCode Available	2

Show:10 25 50

← PrevPage 1 of 216Next →

No leaderboard results yet.