SOTAVerified

Language Modeling

Papers

Showing 48514900 of 14182 papers

TitleStatusHype
M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions0
AdaFisher: Adaptive Second Order Optimization via Fisher InformationCode2
Chain of Tools: Large Language Model is an Automatic Multi-tool Learner0
gzip Predicts Data-dependent Scaling LawsCode1
LoQT: Low-Rank Adapters for Quantized PretrainingCode2
Towards Multi-Task Multi-Modal Models: A Video Generative Perspective0
A Survey of Multimodal Large Language Model from A Data-centric PerspectiveCode2
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search0
Semantic Importance-Aware Communications with Semantic Correction Using Large Language Models0
Theoretical Analysis of Weak-to-Strong Generalization0
How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect0
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationCode1
MoEUT: Mixture-of-Experts Universal TransformersCode2
A transfer learning framework for weak-to-strong generalization0
Evolutionary Large Language Model for Automated Feature TransformationCode1
Finetuning Large Language Model for Personalized RankingCode1
Large Language Model Pruning0
Large Language Model (LLM) for Standard Cell Layout Design Optimization0
Large Language Model Sentinel: LLM Agent for Adversarial Purification0
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign UsersCode1
Enhancing Augmentative and Alternative Communication with Card Prediction and Colourful Semantics0
LM4LV: A Frozen Large Language Model for Low-level Vision TasksCode2
RAEE: A Robust Retrieval-Augmented Early Exiting Framework for Efficient Inference0
DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZCode5
Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems0
DnA-Eval: Enhancing Large Language Model Evaluation through Decomposition and Aggregation0
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMsCode0
iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization0
Sparse Matrix in Large Language Model Fine-tuningCode1
Composed Image Retrieval for Remote SensingCode2
Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor0
Scaling Laws for Discriminative Classification in Large Language Models0
Sparse maximal update parameterization: A holistic approach to sparse training dynamicsCode2
Emergence of a High-Dimensional Abstraction Phase in Language TransformersCode0
SEP: Self-Enhanced Prompt Tuning for Visual-Language ModelCode0
Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs0
GECKO: Generative Language Model for English, Code and Korean0
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning0
Aya 23: Open Weight Releases to Further Multilingual Progress0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language UnderstandingCode0
AutoCoder: Enhancing Code Large Language Model with AIEV-InstructCode4
Extracting Prompts by Inverting LLM OutputsCode2
Lessons from the Trenches on Reproducible Evaluation of Language Models0
BiMix: A Bivariate Data Mixing Law for Language Model Pretraining0
Efficient Medical Question Answering with Knowledge-Augmented Question GenerationCode0
Not All Language Model Features Are LinearCode2
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model InferenceCode1
Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation0
From Text to Pixel: Advancing Long-Context Understanding in MLLMsCode1
Large language models can be zero-shot anomaly detectors for time series?Code2
Show:102550
← PrevPage 98 of 284Next →

No leaderboard results yet.