SOTAVerified

Language Modeling

Papers

Showing 52515300 of 14182 papers

TitleStatusHype
CodeEditor: Learning to Edit Source Code with Pre-trained ModelsCode0
Efficient Inference for Large Language Model-based Generative RecommendationCode0
Guiding In-Context Learning of LLMs through Quality Estimation for Machine TranslationCode0
Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and BridgingCode0
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HDCode0
BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large VocabulariesCode0
A Language Model of Java Methods with Train/Test DeduplicationCode0
LLM-enhanced Self-training for Cross-domain Constituency ParsingCode0
Efficient Language Model Training through Cross-Lingual and Progressive Transfer LearningCode0
Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and HealthCode0
GTA: Gated Toxicity Avoidance for LM Performance PreservationCode0
Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision MakingCode0
Canonical and Surface Morphological Segmentation for Nguni LanguagesCode0
Interpretable-by-Design Text Understanding with Iteratively Generated Concept BottleneckCode0
Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LMCode0
CodeKGC: Code Language Model for Generative Knowledge Graph ConstructionCode0
Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language ModelingCode0
Attacks on Third-Party APIs of Large Language ModelsCode0
Barack's Wife Hillary: Using Knowledge Graphs for Fact-Aware Language ModelingCode0
Ankh: Optimized Protein Language Model Unlocks General-Purpose ModellingCode0
G-Safeguard: A Topology-Guided Security Lens and Treatment on LLM-based Multi-agent SystemsCode0
Alternating Synthetic and Real Gradients for Neural Language ModelingCode0
Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change AnalysisCode0
Blank Collapse: Compressing CTC emission for the faster decodingCode0
Alternative structures for character-level RNNsCode0
Cross-Lingual UMLS Named Entity Linking using UMLS Dictionary Fine-TuningCode0
Alternative Weighting Schemes for ELMo EmbeddingsCode0
Interpreting Biomedical VLMs on High-Imbalance Out-of-Distributions: An Insight into BiomedCLIP on RadiologyCode0
Conversations in Galician: a Large Language Model for an Underrepresented LanguageCode0
Interpreting Large Text-to-Image Diffusion Models with Dictionary LearningCode0
Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context ModelingCode0
Exploring Graph Representations of Logical Forms for Language ModelingCode0
From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine ReaderCode0
GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge?Code0
Interweaving Memories of a Siamese Large Language ModelCode0
Into the crossfire: evaluating the use of a language model to crowdsource gun violence reportsCode0
BLCU-ICALL at SemEval-2022 Task 1: Cross-Attention Multitasking Framework for Definition ModelingCode0
Differentially Private Steering for Large Language Model AlignmentCode0
Intra-Layer Recurrence in Transformers for Language ModelingCode0
Baseline: A Library for Rapid Modeling, Experimentation and Development of Deep Learning Algorithms targeting NLPCode0
Data augmentation using prosody and false starts to recognize non-native children's speechCode0
Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender BiasCode0
Chain-of-Model Learning for Language ModelCode0
Introducing Aspects of Creativity in Automatic Poetry GenerationCode0
A Transformer with Stack AttentionCode0
Conveyor: Efficient Tool-aware LLM Serving with Tool Partial ExecutionCode0
Efficient Machine Translation Domain AdaptationCode0
Exploring Language Model Generalization in Low-Resource Extractive QACode0
Chain of Code: Reasoning with a Language Model-Augmented Code EmulatorCode0
Group and Shuffle: Efficient Structured Orthogonal ParametrizationCode0
Show:102550
← PrevPage 106 of 284Next →

No leaderboard results yet.