16k

Papers

Showing 1–50 of 146 papers

Title	Date	Tasks	Status	Hype
UniCode^2: Cascaded Large-scale Codebooks for Unified Multimodal Understanding and Generation	Jun 25, 2025	16k	Unverified	0
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling	Jun 12, 2025	16k, Retrieval	Code Available	0
How Far Are We from Optimal Reasoning Efficiency?	Jun 8, 2025	16k, Benchmarking	Code Available	0
FlashDMoE: Fast Distributed MoE in a Single Kernel	Jun 5, 2025	16k, CPU	Code Available	3
FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian	May 28, 2025	16k	Unverified	0
UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents	May 27, 2025	16k	Code Available	2
SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences	May 27, 2025	16k, Long-Context Understanding	Code Available	0
MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention	May 24, 2025	16k, 4k	Code Available	1
Training Long-Context LLMs Efficiently via Chunk-wise Optimization	May 22, 2025	16k, GPU	Code Available	2
PSC: Extending Context Window of Large Language Models via Phase Shift Calibration	May 18, 2025	16k, Position	Code Available	0
Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM	May 13, 2025	16k, 8k	Code Available	0
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning	May 12, 2025	16k, Benchmarking	Unverified	0
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications	Mar 21, 2025	16k, 4k	Code Available	0
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims	Mar 11, 2025	16k, Abstract generation	Unverified	0
X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second	Mar 9, 2025	16k, CT Reconstruction	Code Available	0
Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation	Feb 26, 2025	16k, 2k	Unverified	0
EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts	Feb 20, 2025	16k, Decoder	Unverified	0
CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification	Feb 12, 2025	16k, 4k	Unverified	0
Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs	Feb 4, 2025	16k, Descriptive	Code Available	1
M+: Extending MemoryLLM with Scalable Long-Term Memory	Feb 1, 2025	16k, GPU	Code Available	3
Parallel Sequence Modeling via Generalized Spatial Propagation Network	Jan 21, 2025	16k, Computational Efficiency	Unverified	0
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key	Jan 16, 2025	16k, Hallucination	Code Available	2
Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning	Dec 30, 2024	16k, Binary Classification	Unverified	0
SparseAccelerate: Efficient Long-Context Inference for Mid-Range GPUs	Dec 9, 2024	16k	Unverified	0
MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences	Dec 9, 2024	16k	Unverified	0
CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels	Dec 3, 2024	16k	Code Available	0
Bimanual Dexterity for Complex Tasks	Nov 20, 2024	16k	Unverified	0
Piecing It All Together: Verifying Multi-Hop Multimodal Claims	Nov 14, 2024	16k, All	Unverified	0
Model Editing for LLMs4Code: How Far are We?	Nov 11, 2024	16k, Code Generation	Code Available	0
Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation	Nov 11, 2024	16k, Benchmarking	Code Available	0
Denial-of-Service Poisoning Attacks against Large Language Models	Oct 14, 2024	16k, Speech-to-Text	Code Available	1
Neural Fourier Modelling: A Highly Compact Approach to Time-Series Analysis	Oct 7, 2024	16k, Anomaly Detection	Code Available	1
Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extension	Oct 5, 2024	16k, Data Augmentation	Unverified	0
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation	Oct 4, 2024	16k, Code Generation	Code Available	3
Extending Context Window of Large Language Models from a Distributional Perspective	Oct 2, 2024	16k, 8k	Code Available	0
LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs	Sep 3, 2024	16k, Benchmarking	Code Available	1
LinFusion: 1 GPU, 1 Minute, 16K Image	Sep 3, 2024	16k, Causal Inference	Code Available	3
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data	Aug 7, 2024	16k, 2k	Code Available	3
Global Structure-from-Motion Revisited	Jul 29, 2024	16k	Code Available	7
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images	Jul 16, 2024	16k	Code Available	1
Using Large Language Models for Generating Smart Contracts for Health Insurance from Textual Policies	Jul 9, 2024	16k, Task 2	Unverified	0
Learning to (Learn at Test Time): RNNs with Expressive Hidden States	Jul 5, 2024	16k, 8k	Code Available	5
LongIns: A Challenging Long-context Instruction-based Exam for LLMs	Jun 25, 2024	16k, 4k	Unverified	0
Inferring Pluggable Types with Machine Learning	Jun 21, 2024	16k, Language Modeling	Unverified	0
LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors	Jun 20, 2024	16k, Instruction Following	Code Available	1
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models	Jun 20, 2024	16k, 4k	Unverified	0
Code-Switching Red-Teaming: LLM Evaluation for Safety and Multilingual Understanding	Jun 17, 2024	16k, Language Modelling	Code Available	0
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence	Jun 17, 2024	16k, Language Modeling	Code Available	9
An Empirical Study of Mamba-based Language Models	Jun 12, 2024	16k, In-Context Learning	Unverified	0
Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset	May 17, 2024	16k, Benchmarking	Code Available	3

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Suprime2	1'"	1	—	Unverified