SOTAVerified

Language Modeling

Papers

Showing 651700 of 14182 papers

TitleStatusHype
GOFA: A Generative One-For-All Model for Joint Graph Language ModelingCode2
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive DistillationCode2
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language ModelCode2
iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvementCode2
SOLO: A Single Transformer for Scalable Vision-Language ModelingCode2
PsycoLLM: Enhancing LLM for Psychological Understanding and EvaluationCode2
Just read twice: closing the recall gap for recurrent language modelsCode2
Language Representations Can be What Recommenders Need: Findings and PotentialsCode2
Mixture of A Million ExpertsCode2
MiniGPT-Med: Large Language Model as a General Interface for Radiology DiagnosisCode2
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document UnderstandingCode2
AutoFlow: Automated Workflow Generation for Large Language Model AgentsCode2
IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script GenerationCode2
RegMix: Data Mixture as Regression for Language Model Pre-trainingCode2
Learning Formal Mathematics From Intrinsic MotivationCode2
Teola: Towards End-to-End Optimization of LLM-based ApplicationsCode2
RoboUniView: Visual-Language Model with Unified View Representation for Robotic ManipulationCode2
EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and VotingCode2
FIRST: Faster Improved Listwise Reranking with Single Token DecodingCode2
MoA: Mixture of Sparse Attention for Automatic Large Language Model CompressionCode2
Asynchronous Large Language Model Enhanced Planner for Autonomous DrivingCode2
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path PlanningCode2
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU TasksCode2
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for EnsemblingCode2
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLMCode2
Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and ReactionCode2
AgentReview: Exploring Peer Review Dynamics with LLM AgentsCode2
Watch Every Step! LLM Agent Learning via Iterative Step-Level Process RefinementCode2
mDPO: Conditional Preference Optimization for Multimodal Large Language ModelsCode2
Large Scale Transfer Learning for Tabular Data via Language ModelingCode2
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning AbilitiesCode2
Explore the Limits of Omni-modal Pretraining at ScaleCode2
On Softmax Direct Preference Optimization for RecommendationCode2
Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language ModelCode2
StreamBench: Towards Benchmarking Continuous Improvement of Language AgentsCode2
Discovering Preference Optimization Algorithms with and for Large Language ModelsCode2
RS-Agent: Automating Remote Sensing Tasks through Intelligent AgentCode2
LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language ModelCode2
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean DataCode2
BLSP-Emo: Towards Empathetic Large Speech-Language ModelsCode2
Small-E: Small Language Model with Linear Attention for Efficient Speech SynthesisCode2
Simplified and Generalized Masked Diffusion for Discrete DataCode2
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social ExperiencesCode2
Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language ModelsCode2
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language ModelsCode2
Block Transformer: Global-to-Local Language Modeling for Fast InferenceCode2
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-FlowCode2
SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLMCode2
Generative Pre-trained Speech Language Model with Efficient Hierarchical TransformerCode2
GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language ModelCode2
Show:102550
← PrevPage 14 of 284Next →

No leaderboard results yet.