SOTAVerified

Language Modeling

Papers

Showing 31013125 of 14182 papers

TitleStatusHype
Approaching Deep Learning through the Spectral Dynamics of WeightsCode1
Massive Editing for Large Language Models via Meta LearningCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training ModelCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
DARTS: Differentiable Architecture SearchCode1
MambaLRP: Explaining Selective State Space Sequence ModelsCode1
Mapping Memes to Words for Multimodal Hateful Meme ClassificationCode1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed LanguageCode1
Cross-Thought for Sentence Encoder Pre-trainingCode1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference AccelerationCode1
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-trainingCode1
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language ModelCode1
Intent Representation Learning with Large Language Model for RecommendationCode1
CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy RewardCode1
Analysing Discrete Self Supervised Speech Representation for Spoken Language ModelingCode1
InternLM-Law: An Open Source Chinese Legal Large Language ModelCode1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documentsCode1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNACode1
Interpretable Language Modeling via Induction-head Ngram ModelsCode1
MarianCG: a code generation transformer model inspired by machine translationCode1
InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature InterpretationCode1
MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based FinetuningCode1
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across ModalitiesCode1
Making AI Less "Thirsty": Uncovering and Addressing the Secret Water Footprint of AI ModelsCode1
Show:102550
← PrevPage 125 of 568Next →

No leaderboard results yet.