SOTAVerified

Language Modeling

Papers

Showing 48764900 of 14182 papers

TitleStatusHype
DnA-Eval: Enhancing Large Language Model Evaluation through Decomposition and Aggregation0
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMsCode0
iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization0
Sparse Matrix in Large Language Model Fine-tuningCode1
Composed Image Retrieval for Remote SensingCode2
Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor0
Scaling Laws for Discriminative Classification in Large Language Models0
Sparse maximal update parameterization: A holistic approach to sparse training dynamicsCode2
Emergence of a High-Dimensional Abstraction Phase in Language TransformersCode0
SEP: Self-Enhanced Prompt Tuning for Visual-Language ModelCode0
Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs0
GECKO: Generative Language Model for English, Code and Korean0
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning0
Aya 23: Open Weight Releases to Further Multilingual Progress0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language UnderstandingCode0
AutoCoder: Enhancing Code Large Language Model with AIEV-InstructCode4
Extracting Prompts by Inverting LLM OutputsCode2
Lessons from the Trenches on Reproducible Evaluation of Language Models0
BiMix: A Bivariate Data Mixing Law for Language Model Pretraining0
Efficient Medical Question Answering with Knowledge-Augmented Question GenerationCode0
Not All Language Model Features Are LinearCode2
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model InferenceCode1
Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation0
From Text to Pixel: Advancing Long-Context Understanding in MLLMsCode1
Large language models can be zero-shot anomaly detectors for time series?Code2
Show:102550
← PrevPage 196 of 568Next →

No leaderboard results yet.