SOTAVerified

Language Modeling

Papers

Showing 18511900 of 14182 papers

TitleStatusHype
Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic ChangeCode1
Critic-Guided Decoding for Controlled Text GenerationCode1
Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question AnsweringCode1
CriticEval: Evaluating Large Language Model as CriticCode1
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model BiasCode1
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and OptimizationCode1
An Interpretable Ensemble of Graph and Language Models for Improving Search Relevance in E-CommerceCode1
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-modelsCode1
Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text GenerationCode1
Improving Passage Retrieval with Zero-Shot Question GenerationCode1
Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine TranslationCode1
Exploiting Cloze Questions for Few Shot Text Classification and Natural Language InferenceCode1
CREAM: Consistency Regularized Self-Rewarding Language ModelsCode1
MemCap: Memorizing Style Knowledge for Image CaptioningCode1
"Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced DistillationCode1
Improving Multi-Party Dialogue Discourse Parsing via Domain IntegrationCode1
MemeSem:A Multi-modal Framework for Sentimental Analysis of Meme via Transfer LearningCode1
BLADE: Benchmarking Language Model Agents for Data-Driven ScienceCode1
AudioBERT: Audio Knowledge Augmented Language ModelCode1
Blank Language ModelsCode1
Adaptive Attention Span in TransformersCode1
Improving Mandarin Speech Recogntion with Block-augmented TransformerCode1
Improving NER's Performance with Massive financial corpusCode1
Crafting Large Language Models for Enhanced InterpretabilityCode1
Merging Feed-Forward Sublayers for Compressed TransformersCode1
Exploring Large Language Model for Graph Data Understanding in Online Job RecommendationsCode1
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language ModelCode1
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model EvaluatorsCode1
Exploring Quantization for Efficient Pre-Training of Transformer Language ModelsCode1
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language ModelCode1
Exploring Stochastic Autoregressive Image Modeling for Visual RepresentationCode1
Exploring the Limits of Language ModelingCode1
CDLM: Cross-Document Language ModelingCode1
Improving Neural Machine Translation Models with Monolingual DataCode1
Improving Transformer Optimization Through Better InitializationCode1
CPM: A Large-scale Generative Chinese Pre-trained Language ModelCode1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and GenerationCode1
CPT: Efficient Deep Neural Network Training via Cyclic PrecisionCode1
AttributionBench: How Hard is Automatic Attribution Evaluation?Code1
Adaptive Attention Span in Computer VisionCode1
CPLLM: Clinical Prediction with Large Language ModelsCode1
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEditCode1
RARR: Researching and Revising What Language Models Say, Using Language ModelsCode1
Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language ModelsCode1
Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game ModelsCode1
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video UnderstandingCode1
BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant SupervisionCode1
Extensive Self-Contrast Enables Feedback-Free Language Model AlignmentCode1
Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM AgentsCode1
Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based TechniquesCode1
Show:102550
← PrevPage 38 of 284Next →

No leaderboard results yet.