SOTAVerified

Language Modeling

Papers

Showing 44014450 of 14182 papers

TitleStatusHype
Language Alignment via Nash-learning and Adaptive feedback0
video-SALMONN: Speech-Enhanced Audio-Visual Large Language ModelsCode0
Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification0
CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans0
TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech SynthesizersCode1
FIRST: Faster Improved Listwise Reranking with Single Token DecodingCode2
Inferring Pluggable Types with Machine Learning0
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship EmbeddingsCode1
GiusBERTo: A Legal Language Model for Personal Data De-identification in Italian Court of Auditors Decisions0
Brain-Like Language Processing via a Shallow Untrained Multihead Attention NetworkCode0
Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation0
MoA: Mixture of Sparse Attention for Automatic Large Language Model CompressionCode2
Safely Learning with Private Data: A Federated Learning Framework for Large Language ModelCode1
InternLM-Law: An Open Source Chinese Legal Large Language ModelCode1
A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative GenerationCode0
TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems0
Unsupervised Morphological Tree Tokenizer0
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM PipelinesCode0
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path PlanningCode2
Advantage Alignment Algorithms0
MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate0
Factual Dialogue Summarization via Learning from Large Language Models0
A Large Language Model Outperforms Other Computational Approaches to the High-Throughput Phenotyping of Physician Notes0
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal BehaviorsCode1
Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning0
Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model EvaluationCode0
Demystifying Language Model Forgetting with Low-rank Example Associations0
SPL: A Socratic Playground for Learning Powered by Large Language Model0
Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods0
Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models0
Asynchronous Large Language Model Enhanced Planner for Autonomous DrivingCode2
APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking0
Ranking LLMs by compression0
Exploring Spatial Representations in the Historical Lake District Texts with LLM-based Relation Extraction0
How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics0
VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language ModelCode1
LiveMind: Low-latency Large Language Models with Simultaneous InferenceCode1
Information Guided Regularization for Fine-tuning Language ModelsCode0
Measuring Sample Importance in Data Pruning for Language Models based on Information Entropy0
Enhancing the LLM-Based Robot Manipulation Through Human-Robot Collaboration0
Transferable speech-to-text large language model alignment module0
Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided DecodingCode0
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model PromptsCode3
The Impact of Auxiliary Patient Data on Automated Chest X-Ray Report Generation and How to Incorporate It0
LIT: Large Language Model Driven Intention Tracking for Proactive Human-Robot Collaboration -- A Robot Sous-Chef Application0
Block-level Text Spotting with LLMs0
In-Context Former: Lightning-fast Compressing Context for Large Language Model0
BiLD: Bi-directional Logits Difference Loss for Large Language Model DistillationCode1
VisualRWKV: Exploring Recurrent Neural Networks for Visual Language ModelsCode3
Towards Holistic Language-video Representation: the language model-enhanced MSR-Video to Text Dataset0
Show:102550
← PrevPage 89 of 284Next →

No leaderboard results yet.