SOTAVerified

Language Modeling

Papers

Showing 4901-4950 of 14182 papers

Title | Status | Hype
HistBERT: A Pre-trained Language Model for Diachronic Lexical Semantic Analysis | Code | 0
Evolving Subnetwork Training for Large Language Models | Code | 0
Improving Complex Knowledge Base Question Answering via Question-to-Action and Question-to-Question Alignment | Code | 0
Circuit Stability Characterizes Language Model Generalization | Code | 0
Improving Context Aware Language Models | Code | 0
Large Memory Layers with Product Keys | Code | 0
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model | Code | 0
NegatER: Unsupervised Discovery of Negatives in Commonsense Knowledge Bases | Code | 0
Contrastive Language Prompting to Ease False Positives in Medical Anomaly Detection | Code | 0
Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Code | 0
Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation | Code | 0
Examining Language Modeling Assumptions Using an Annotated Literary Dialect Corpus | Code | 0
Large Product Key Memory for Pretrained Language Models | Code | 0
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition | Code | 0
From Markov to Laplace: How Mamba In-Context Learns Markov Chains | Code | 0
ChartFormer: A Large Vision Language Model for Converting Chart Images into Tactile Accessible SVGs | Code | 0
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling | Code | 0
Dissecting vocabulary biases datasets through statistical testing and automated data augmentation for artifact mitigation in Natural Language Inference | Code | 0
Decomposed Prompting to Answer Questions on a Course Discussion Board | Code | 0
From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text | Code | 0
High-risk learning: acquiring new word vectors from tiny data | Code | 0
Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection | Code | 0
Contrastive learning of T cell receptor representations | Code | 0
Auto-tagging of Short Conversational Sentences using Natural Language Processing Methods | Code | 0
Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities | Code | 0
Cross-lingual Information Retrieval with BERT | Code | 0
Disentangling and Integrating Relational and Sensory Information in Transformer Architectures | Code | 0
Drop Dropout on Single-Epoch Language Model Pretraining | Code | 0
DropMicroFluidAgents (DMFAs): Autonomous Droplet Microfluidic Research Framework Through Large Language Model Agents | Code | 0
When Low Resource NLP Meets Unsupervised Language Model: Meta-pretraining Then Meta-learning for Few-shot Text Classification | Code | 0
exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models | Code | 0
Improving Generalization Performance by Switching from Adam to SGD | Code | 0
Improving Grammatical Error Correction with Machine Translation Pairs | Code | 0
Applying a Pre-trained Language Model to Spanish Twitter Humor Prediction | Code | 0
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization | Code | 0
Improving In-Context Learning with Small Language Model Ensembles | Code | 0
Low-rank passthrough neural networks | Code | 0
Improving Information Extraction on Business Documents with Specific Pre-Training Tasks | Code | 0
Claim Optimization in Computational Argumentation | Code | 0
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation | Code | 0
A Neural Language Model for Dynamically Representing the Meanings of Unknown Words and Entities in a Discourse | Code | 0
Improving Language Generation with Sentence Coherence Objective | Code | 0
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization | Code | 0
A Content-Based Novelty Measure for Scholarly Publications: A Proof of Concept | Code | 0
Character-Level Language Modeling with Deeper Self-Attention | Code | 0
Discriminative Policy Optimization for Token-Level Reward Models | Code | 0
DrugTar Improves Druggability Prediction by Integrating Large Language Models and Gene Ontologies | Code | 0
DSC IIT-ISM at SemEval-2020 Task 6: Boosting BERT with Dependencies for Definition Extraction | Code | 0
Hierarchical Quantized Representations for Script Generation | Code | 0
A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering | Code | 0

No leaderboard results yet.