SOTAVerified

Language Modeling

Papers

Showing 1295113000 of 14182 papers

TitleStatusHype
Linguistic Versus Latent Relations for Modeling Coherent Flow in ParagraphsCode0
Accommodating Audio Modality in CLIP for Multimodal ProcessingCode0
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between ActionsCode0
HateBERT: Retraining BERT for Abusive Language Detection in EnglishCode0
HATE-ITA: New Baselines for Hate Speech Detection in ItalianCode0
Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?Code0
Debiasing Pre-Trained Language Models via Efficient Fine-TuningCode0
DATETIME: A new benchmark to measure LLM translation and reasoning capabilitiesCode0
Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase ExtractionCode0
Large Language Model Can Be a Foundation for Hidden Rationale-Based RetrievalCode0
Attention as a Guide for Simultaneous Speech TranslationCode0
Attacks on Third-Party APIs of Large Language ModelsCode0
Large Language Model Capabilities in Perioperative Risk Prediction and PrognosticationCode0
DataVisT5: A Pre-trained Language Model for Jointly Understanding Text and Data VisualizationCode0
Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?Code0
Heaps' Law in GPT-Neo Large Language Model Emulated CorporaCode0
A Transformer with Stack AttentionCode0
Data Similarity is Not Enough to Explain Language Model PerformanceCode0
Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared TaskCode0
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense ReasoningCode0
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag CompetitionCode0
"It doesn't look good for a date": Transforming Critiques into Preferences for Conversational Recommendation SystemsCode0
“It doesn’t look good for a date”: Transforming Critiques into Preferences for Conversational Recommendation SystemsCode0
Helpful assistant or fruitful facilitator? Investigating how personas affect language model behaviorCode0
Data Selection for Fine-tuning Large Language Models Using Transferred Shapley ValuesCode0
Large Language Model Critics for Execution-Free Evaluation of Code ChangesCode0
Data Noising as Smoothing in Neural Network Language ModelsCode0
Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?Code0
Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural NetworksCode0
Learning to Maximize Mutual Information for Chain-of-Thought DistillationCode0
Alibaba-Translate China's Submission for WMT 2022 Metrics Shared TaskCode0
Item-side Fairness of Large Language Model-based Recommendation SystemCode0
A Toolkit for Efficient Learning of Lexical Units for Speech RecognitionCode0
Learning to Plan for Language Modeling from Unlabeled DataCode0
DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQLCode0
DATA: Differentiable ArchiTecture ApproximationCode0
DataChat: Prototyping a Conversational Agent for Dataset Search and VisualizationCode0
Can discrete information extraction prompts generalize across language models?Code0
Data augmentation using prosody and false starts to recognize non-native children's speechCode0
Iterative Counterfactual Data AugmentationCode0
Data Augmentation for Biomedical Factoid Question AnsweringCode0
Linking Theories and Methods in Cognitive Sciences via Joint Embedding of the Scientific Literature: The Example of Cognitive ControlCode0
Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of TransformersCode0
Heterogeneous Subgraph Transformer for Fake News DetectionCode0
AlgebraNetsCode0
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language ModelsCode0
DarijaBanking: A New Resource for Overcoming Language Barriers in Banking Intent Detection for Moroccan Arabic SpeakersCode0
DALLMi: Domain Adaption for LLM-based Multi-label ClassifierCode0
Large Language Model-Driven Curriculum Design for Mobile NetworksCode0
Cynical Selection of Language Model Training DataCode0
Show:102550
← PrevPage 260 of 284Next →

No leaderboard results yet.