SOTAVerified

Language Modeling

Papers

Showing 18011850 of 14182 papers

TitleStatusHype
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-trainingCode1
Enhancing the Protein Tertiary Structure Prediction by Multiple Sequence Alignment GenerationCode1
LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary CaptioningCode1
Enhancing Vision-Language Model with Unmasked Token AlignmentCode1
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music RetrievalCode1
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and OptimizationCode1
Improving Conversational Recommendation Systems via Counterfactual Data SimulationCode1
CREAM: Consistency Regularized Self-Rewarding Language ModelsCode1
Improving Aspect Sentiment Quad Prediction via Template-Order Data AugmentationCode1
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling ResearchCode1
Entropy-Regularized Token-Level Policy Optimization for Language Agent ReinforcementCode1
BiLD: Bi-directional Logits Difference Loss for Large Language Model DistillationCode1
SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing TasksCode1
Bilinear MLPs enable weight-based mechanistic interpretabilityCode1
Improving Biomedical Pretrained Language Models with KnowledgeCode1
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language ModelCode1
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and GenerationCode1
Logical Fallacy DetectionCode1
Critic-Guided Decoding for Controlled Text GenerationCode1
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market DomainCode1
Improving Contrastive Learning of Sentence Embeddings with Case-Augmented Positives and Retrieved NegativesCode1
LOLA -- An Open-Source Massively Multilingual Large Language ModelCode1
Improving End-to-End SLU performance with Prosodic Attention and DistillationCode1
CPT: Efficient Deep Neural Network Training via Cyclic PrecisionCode1
Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language ModelCode1
Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model EvaluationCode1
Improved training of end-to-end attention models for speech recognitionCode1
AuditWen:An Open-Source Large Language Model for AuditCode1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and GenerationCode1
BioBART: Pretraining and Evaluation of A Biomedical Generative Language ModelCode1
Evaluating Human-Language Model InteractionCode1
CPM: A Large-scale Generative Chinese Pre-trained Language ModelCode1
CPLLM: Clinical Prediction with Large Language ModelsCode1
Bioformer: an efficient transformer language model for biomedical text miningCode1
Crafting Large Language Models for Enhanced InterpretabilityCode1
Protein Structure Tokenization: Benchmarking and New RecipeCode1
Coupling Large Language Models with Logic Programming for Robust and General Reasoning from TextCode1
Improving antibody language models with native pairingCode1
Imputing Out-of-Vocabulary Embeddings with LOVE Makes LanguageModels Robust with Little CostCode1
Counterfactual Data Augmentation for Neural Machine TranslationCode1
Evaluating Language Model Finetuning Techniques for Low-resource LanguagesCode1
Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational CurriculaCode1
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attentionCode1
Implicit Language Models are RNNs: Balancing Parallelization and ExpressivityCode1
LXMERT: Learning Cross-Modality Encoder Representations from TransformersCode1
Biomedical Event Extraction with Hierarchical Knowledge GraphsCode1
Implicit Unlikelihood Training: Improving Neural Text Generation with Reinforcement LearningCode1
Evaluating Morphological Alignment of Tokenizers in 70 LanguagesCode1
Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social MediaCode1
cosFormer: Rethinking Softmax in AttentionCode1
Show:102550
← PrevPage 37 of 284Next →

No leaderboard results yet.