SOTAVerified

Language Modeling

Papers

Showing 3051–3100 of 14182 papers

Title | Status | Hype
Critic-Guided Decoding for Controlled Text Generation | Code | 1
Mark My Words: Analyzing and Evaluating Language Model Watermarks | Code | 1
Mass-Producing Failures of Multimodal Systems with Language Models | Code | 1
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment | Code | 1
Improving Seq2Seq Grammatical Error Correction via Decoding Interventions | Code | 1
Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Code | 1
Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach | Code | 1
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | Code | 1
Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change | Code | 1
CDLM: Cross-Document Language Modeling | Code | 1
MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning | Code | 1
A Practical Deep Learning-Based Acoustic Side Channel Attack on Keyboards | Code | 1
Making AI Less "Thirsty": Uncovering and Addressing the Secret Water Footprint of AI Models | Code | 1
Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations | Code | 1
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression | Code | 1
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Code | 1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Code | 1
Making Language Models Better Tool Learners with Execution Feedback | Code | 1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Code | 1
Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models | Code | 1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Code | 1
Data Augmentation using Pre-trained Transformer Models | Code | 1
Approaching Deep Learning through the Spectral Dynamics of Weights | Code | 1
DARTS: Differentiable Architecture Search | Code | 1
DALE: Generative Data Augmentation for Low-Resource Legal NLP | Code | 1
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups | Code | 1
Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring | Code | 1
Incorporating External POS Tagger for Punctuation Restoration | Code | 1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Code | 1
TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Code | 1
Stabilizing Transformers for Reinforcement Learning | Code | 1
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Code | 1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Code | 1
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | Code | 1
MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Code | 1
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization | Code | 1
CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Code | 1
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning | Code | 1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | Code | 1
∞-former: Infinite Memory Transformer | Code | 1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Code | 1
CycleFormer : TSP Solver Based on Language Modeling | Code | 1
AutoScrum: Automating Project Planning Using Large Language Models | Code | 1
InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model | Code | 1
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation | Code | 1
MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models | Code | 1
Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought | Code | 1
InforMask: Unsupervised Informative Masking for Language Model Pretraining | Code | 1
RealFormer: Transformer Likes Residual Attention | Code | 1
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Code | 1
Page 62 of 284

No leaderboard results yet.