SOTAVerified

2k

Papers

Showing 125 of 288 papers

TitleStatusHype
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and VideosCode5
Long-context LLMs Struggle with Long In-context LearningCode5
Scaling Granite Code Models to 128K ContextCode4
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D AssetsCode4
MovieChat+: Question-aware Sparse Memory for Long Video Question AnsweringCode4
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object DetectionCode4
Highly Accurate Dichotomous Image SegmentationCode4
FlashDepth: Real-time Streaming Video Depth Estimation at 2K ResolutionCode3
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionCode3
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality DataCode3
Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View SynthesisCode3
CAMixerSR: Only Details Need More "Attention"Code3
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group QuantizationCode2
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group QuantizationCode2
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual ScenesCode2
FastVAR: Linear Visual Autoregressive Modeling via Cached Token PruningCode2
Ultra-Resolution Adaptation with EaseCode2
Elevating Flow-Guided Video Inpainting with Reference GenerationCode2
VFIMamba: Video Frame Interpolation with State Space ModelsCode2
Task Me AnythingCode2
Linear Attention Sequence ParallelismCode2
AIR-Bench: Benchmarking Large Audio-Language Models via Generative ComprehensionCode2
STICKERCONV: Generating Multimodal Empathetic Responses from ScratchCode2
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion ModelsCode2
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion ModelsCode2
Show:102550
← PrevPage 1 of 12Next →

No leaderboard results yet.