16k

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 146 papers

Title	Date	Tasks	Status	Hype
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence	Jun 17, 2024	16kLanguage Modeling	CodeCode Available	9
Global Structure-from-Motion Revisited	Jul 29, 2024	16k	CodeCode Available	7
Code Llama: Open Foundation Models for Code	Aug 24, 2023	16kCode Generation	CodeCode Available	6
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness	May 27, 2022	16k4k	CodeCode Available	6
Learning to (Learn at Test Time): RNNs with Expressive Hidden States	Jul 5, 2024	16k8k	CodeCode Available	5
Long-form factuality in large language models	Mar 27, 2024	16kForm	CodeCode Available	4
FlashDMoE: Fast Distributed MoE in a Single Kernel	Jun 5, 2025	16kCPU	CodeCode Available	3
M+: Extending MemoryLLM with Scalable Long-Term Memory	Feb 1, 2025	16kGPU	CodeCode Available	3
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation	Oct 4, 2024	16kCode Generation	CodeCode Available	3
LinFusion: 1 GPU, 1 Minute, 16K Image	Sep 3, 2024	16kCausal Inference	CodeCode Available	3
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data	Aug 7, 2024	16k2k	CodeCode Available	3
Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset	May 17, 2024	16kBenchmarking	CodeCode Available	3
SnapKV: LLM Knows What You are Looking for Before Generation	Apr 22, 2024	16kGPU	CodeCode Available	3
Training-Free Long-Context Scaling of Large Language Models	Feb 27, 2024	16k	CodeCode Available	3
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding	Aug 28, 2023	16kCode Completion	CodeCode Available	3
Investigating Efficiently Extending Transformers for Long Input Summarization	Aug 8, 2022	16kLong-range modeling	CodeCode Available	3
UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents	May 27, 2025	16k	CodeCode Available	2
Training Long-Context LLMs Efficiently via Chunk-wise Optimization	May 22, 2025	16kGPU	CodeCode Available	2
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key	Jan 16, 2025	16kHallucination	CodeCode Available	2
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K	Feb 6, 2024	16kBenchmarking	CodeCode Available	2
Giraffe: Adventures in Expanding Context Lengths in LLMs	Aug 21, 2023	16k4k	CodeCode Available	2
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding	Jun 29, 2023	16kImage Captioning	CodeCode Available	2
MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention	May 24, 2025	16k4k	CodeCode Available	1
Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs	Feb 4, 2025	16kDescriptive	CodeCode Available	1
Denial-of-Service Poisoning Attacks against Large Language Models	Oct 14, 2024	16kSpeech-to-Text	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 6Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Suprime2	1'"	1	—	Unverified