SOTAVerified

Language Modeling

Papers

Showing 41514200 of 14182 papers

TitleStatusHype
PoeLM: A Meter- and Rhyme-Controllable Language Model for Unsupervised Poetry GenerationCode0
Team Ohio State at CMCL 2021 Shared Task: Fine-Tuned RoBERTa for Eye-Tracking Data PredictionCode0
Show and Guide: Instructional-Plan Grounded Vision and Language ModelCode0
UIO at SemEval-2023 Task 12: Multilingual fine-tuning for sentiment classification in low-resource languagesCode0
Wanda++: Pruning Large Language Models via Regional GradientsCode0
Structured Content Preservation for Unsupervised Text Style TransferCode0
The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language ModelsCode0
Anatomy of Neural Language ModelsCode0
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language RepresentationCode0
End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern ArchitecturesCode0
LLMs as Educational Analysts: Transforming Multimodal Data Traces into Actionable Reading Assessment ReportsCode0
Frequency Is What You Need: Word-frequency Masking Benefits Vision-Language Model Pre-trainingCode0
KEST: Kernel Distance Based Efficient Self-Training for Improving Controllable Text GenerationCode0
End-to-End Attention-based Large Vocabulary Speech RecognitionCode0
LegiLM: A Fine-Tuned Legal Language Model for Data ComplianceCode0
LEGOBench: Scientific Leaderboard Generation BenchmarkCode0
Benchmarking Large Language Model Uncertainty for Prompt OptimizationCode0
Adaptation of domain-specific transformer models with text oversampling for sentiment analysis of social media posts on Covid-19 vaccinesCode0
CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model AgentsCode0
KG-BERT: BERT for Knowledge Graph CompletionCode0
Autoencoders as Tools for Program SynthesisCode0
KGLink: A column type annotation method that combines knowledge graph and pre-trained language modelCode0
CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven AgentsCode0
Benchmarking Long-tail Generalization with Likelihood SplitsCode0
LLMSat: A Large Language Model-Based Goal-Oriented Agent for Autonomous Space ExplorationCode0
Developing Safe and Responsible Large Language Model : Can We Balance Bias Reduction and Language Understanding in Large Language Models?Code0
Tracr-Injection: Distilling Algorithms into Pre-trained Language ModelsCode0
KidLM: Advancing Language Models for Children -- Early Insights and Future DirectionsCode0
KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney DiseaseCode0
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language ModelCode0
End-to-end Named Entity Recognition and Relation Extraction using Pre-trained Language ModelsCode0
A Tailored Pre-Training Model for Task-Oriented Dialog GenerationCode0
GPoeT-2: A GPT-2 Based Poem GeneratorCode0
Are VLMs Really BlindCode0
KitchenScale: Learning to predict ingredient quantities from recipe contextsCode0
Benchmarking Misuse Mitigation Against Covert AdversariesCode0
LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-trainingCode0
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag CompetitionCode0
KL-Divergence Guided Temperature SamplingCode0
KLMo: Knowledge Graph Enhanced Pretrained Language Model with Fine-Grained RelationshipsCode0
KL Penalty Control via Perturbation for Direct Preference OptimizationCode0
Length Optimization in Conformal PredictionCode0
Benchmarking Pre-trained Language Models for Multilingual NER: TraSpaS at the BSNLP2021 Shared TaskCode0
End-to-End Speech Recognition and Disfluency Removal with Acoustic Language Model PretrainingCode0
Benchmarking Sequential Visual Input Reasoning and Prediction in Multimodal Large Language ModelsCode0
Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed TrainingCode0
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image CaptioningCode0
Knowing Where and What: Unified Word Block Pretraining for Document UnderstandingCode0
End-to-End Task-Oriented Dialog Modeling with Semi-Structured Knowledge ManagementCode0
AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuningCode0
Show:102550
← PrevPage 84 of 284Next →

No leaderboard results yet.