SOTAVerified

Language Modeling

Papers

Showing 1201-1250 of 14182 papers

Title | Status | Hype
Generative Modeling for Mathematical Discovery | Code | 2
LLMs Working in Harmony: A Survey on the Technological Aspects of Building Effective LLM-Based Multi Agent Systems | - | 0
NeurIPS 2023 LLM Efficiency Fine-tuning Competition | - | 0
SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable | - | 0
TacticExpert: Spatial-Temporal Graph Language Model for Basketball Tactics | - | 0
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Code | 2
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More | - | 0
Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model | - | 0
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation | - | 0
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Code | 3
MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis | - | 0
PRISM: Preference Refinement via Implicit Scene Modeling for 3D Vision-Language Preference-Based Reinforcement Learning | - | 0
Hybrid Agents for Image Restoration | - | 0
SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation | - | 0
OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Code | 2
Tempest: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search | - | 0
Toward a method for LLM-enabled Indoor Navigation | - | 0
Leveraging Knowledge Graphs and LLMs for Context-Aware Messaging | - | 0
Medical Large Language Model Benchmarks Should Prioritize Construct Validity | - | 0
Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo | - | 0
xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation | - | 0
Global Position Aware Group Choreography using Large Language Model | - | 0
Token Weighting for Long-Range Language Modeling | Code | 0
Language-Enhanced Representation Learning for Single-Cell Transcriptomics | Code | 0
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | Code | 4
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs | - | 0
Reinforcement Learning is all You Need | - | 0
Why LLMs Cannot Think and How to Fix It | - | 0
BAMBI: Developing Baby Language Models for Italian | - | 0
SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability | - | 0
NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model | Code | 0
Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge | - | 0
Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity | - | 0
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment | Code | 3
Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents | Code | 0
D3PO: Preference-Based Alignment of Discrete Diffusion Models | - | 0
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method | - | 0
Understanding the Quality-Diversity Trade-off in Diffusion Language Models | Code | 0
Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback | - | 0
LongProLIP: A Probabilistic Vision-Language Model with Long Context Text | Code | 2
Training Plug-n-Play Knowledge Modules with Deep Context Distillation | - | 0
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Code | 1
A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models | - | 0
Position-Aware Depth Decay Decoding (D^3): Boosting Large Language Model Inference Efficiency | - | 0
Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations | - | 0
OASIS: Order-Augmented Strategy for Improved Code Search | - | 0
Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study | - | 0
BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Code | 1
Mellow: a small audio language model for reasoning | Code | 2
Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model Adaptation | Code | 0
Page 25 of 284

No leaderboard results yet.