SOTAVerified

Language Modeling

Papers

Showing 31013150 of 14182 papers

TitleStatusHype
MatSciBERT: A Materials Domain Language Model for Text Mining and Information ExtractionCode1
Data-to-Text Generation with Iterative Text EditingCode1
Approaching Deep Learning through the Spectral Dynamics of WeightsCode1
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge DistillationCode1
Materials Informatics Transformer: A Language Model for Interpretable Materials Properties PredictionCode1
Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation ApproachCode1
Data Efficient Masked Language Modeling for Vision and LanguageCode1
Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action DetectionCode1
Matching Networks for One Shot LearningCode1
Cross-Thought for Sentence Encoder Pre-trainingCode1
MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics EducationCode1
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-trainingCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource DomainsCode1
CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy RewardCode1
Analysing Discrete Self Supervised Speech Representation for Spoken Language ModelingCode1
Massive Editing for Large Language Models via Meta LearningCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
Debiasing Methods in Natural Language Understanding Make Bias More AccessibleCode1
Interpreting Language Models with Contrastive ExplanationsCode1
CTAL: Pre-training Cross-modal Transformer for Audio-and-Language RepresentationsCode1
Investigating Fairness Disparities in Peer Review: A Language Model Enhanced ApproachCode1
Analysing The Impact of Sequence Composition on Language Model Pre-TrainingCode1
CTRAN: CNN-Transformer-based Network for Natural Language UnderstandingCode1
Mass-Producing Failures of Multimodal Systems with Language ModelsCode1
CTRL: A Conditional Transformer Language Model for Controllable GenerationCode1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward PassesCode1
Towards Evaluating Generalist Agents: An Automated Benchmark in Open WorldCode1
Markovian Transformers for Informative Language ModelingCode1
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test ConstructionCode1
Avoiding Inference Heuristics in Few-shot Prompt-based FinetuningCode1
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply ChainsCode1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documentsCode1
Invariant Language ModelingCode1
A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial OptimizationCode1
CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language TechnologiesCode1
InvestLM: A Large Language Model for Investment using Financial Domain Instruction TuningCode1
Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model RecommendationCode1
IoT-LM: Large Multisensory Language Models for the Internet of ThingsCode1
IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language ModelingCode1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed LanguageCode1
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation TasksCode1
Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question AnsweringCode1
Mapping Memes to Words for Multimodal Hateful Meme ClassificationCode1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNACode1
Talking-Heads AttentionCode1
MarianCG: a code generation transformer model inspired by machine translationCode1
DALE: Generative Data Augmentation for Low-Resource Legal NLPCode1
TAPEX: Table Pre-training via Learning a Neural SQL ExecutorCode1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference AccelerationCode1
Show:102550
← PrevPage 63 of 284Next →

No leaderboard results yet.