SOTAVerified

2k

Papers

Showing 125 of 288 papers

TitleStatusHype
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group QuantizationCode2
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group QuantizationCode2
Understanding and Improving Length Generalization in Recurrent Models0
A strengthened bound on the number of states required to characterize maximum parsimony distance0
Structured Variational D-Decomposition for Accurate and Stable Low-Rank Approximation0
Latent Wavelet Diffusion: Enabling 4K Image Synthesis for Free0
Tradeoffs between Mistakes and ERM Oracle Calls in Online and Transductive Online Learning0
Test-Time Training Done Right0
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language ModelsCode1
MMP-2K: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment DatabaseCode1
Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questionsCode1
PIIvot: A Lightweight NLP Anonymization Framework for Question-Anchored Tutoring Dialogues0
Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning0
UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning0
ViMRHP: A Vietnamese Benchmark Dataset for Multimodal Review Helpfulness Prediction via Human-AI Collaborative AnnotationCode0
Calibrating Translation Decoding with Quality Estimation on LLMsCode0
aiXamine: Simplified LLM Safety and Security0
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis0
Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading0
On Linear Representations and Pretraining Data Frequency in Language Models0
Seedream 3.0 Technical Report0
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration0
FlashDepth: Real-time Streaming Video Depth Estimation at 2K ResolutionCode3
FastVAR: Linear Visual Autoregressive Modeling via Cached Token PruningCode2
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual ScenesCode2
Show:102550
← PrevPage 1 of 12Next →

No leaderboard results yet.