SOTAVerified

Language Modeling

Papers

Showing 42014250 of 14182 papers

TitleStatusHype
Reasoning-Grounded Natural Language Explanations for Language ModelsCode0
Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity0
Test-Time Training Provably Improves Transformers as In-context Learners0
TigerLLM -- A Family of Bangla Large Language ModelsCode0
Don't Forget It! Conditional Sparse Autoencoder Clamping Works for Unlearning0
BriLLM: Brain-inspired Large Language Model0
Large language model-powered AI systems achieve self-replication with no human intervention0
LLM Agents for Education: Advances and Applications0
Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models0
ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction TuningCode0
Hybrid Agents for Image Restoration0
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More0
LLMs Working in Harmony: A Survey on the Technological Aspects of Building Effective LLM-Based Multi Agent Systems0
SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable0
TacticExpert: Spatial-Temporal Graph Language Model for Basketball Tactics0
Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model0
NeurIPS 2023 LLM Efficiency Fine-tuning Competition0
PRISM: Preference Refinement via Implicit Scene Modeling for 3D Vision-Language Preference-Based Reinforcement Learning0
MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis0
SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation0
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation0
Tempest: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search0
Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge0
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs0
Reinforcement Learning is all You Need0
Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity0
NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language ModelCode0
SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability0
Language-Enhanced Representation Learning for Single-Cell TranscriptomicsCode0
Why LLMs Cannot Think and How to Fix It0
Token Weighting for Long-Range Language ModelingCode0
xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation0
Toward a method for LLM-enabled Indoor Navigation0
Medical Large Language Model Benchmarks Should Prioritize Construct Validity0
Leveraging Knowledge Graphs and LLMs for Context-Aware Messaging0
Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo0
BAMBI: Developing Baby Language Models for Italian0
Global Position Aware Group Choreography using Large Language Model0
Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback0
Accelerating MoE Model Inference with Expert Sharding0
Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study0
D3PO: Preference-Based Alignment of Discrete Diffusion Models0
A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models0
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method0
Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations0
Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model AdaptationCode0
Training Plug-n-Play Knowledge Modules with Deep Context Distillation0
Understanding the Quality-Diversity Trade-off in Diffusion Language ModelsCode0
Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity DocumentsCode0
Position-Aware Depth Decay Decoding (D^3): Boosting Large Language Model Inference Efficiency0
Show:102550
← PrevPage 85 of 284Next →

No leaderboard results yet.