SOTAVerified

2k

Papers

Showing 1-50 of 288 papers

| Title | Status | Hype |
|---|---|---|
| Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Code | 5 |
| Long-context LLMs Struggle with Long In-context Learning | Code | 5 |
| Scaling Granite Code Models to 128K Context | Code | 4 |
| CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets | Code | 4 |
| MovieChat+: Question-aware Sparse Memory for Long Video Question Answering | Code | 4 |
| SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection | Code | 4 |
| Highly Accurate Dichotomous Image Segmentation | Code | 4 |
| FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution | Code | 3 |
| MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction | Code | 3 |
| 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data | Code | 3 |
| Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Code | 3 |
| CAMixerSR: Only Details Need More "Attention" | Code | 3 |
| MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization | Code | 2 |
| FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning | Code | 2 |
| TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes | Code | 2 |
| Ultra-Resolution Adaptation with Ease | Code | 2 |
| Elevating Flow-Guided Video Inpainting with Reference Generation | Code | 2 |
| VFIMamba: Video Frame Interpolation with State Space Models | Code | 2 |
| Task Me Anything | Code | 2 |
| Linear Attention Sequence Parallelism | Code | 2 |
| AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Code | 2 |
| STICKERCONV: Generating Multimodal Empathetic Responses from Scratch | Code | 2 |
| HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models | Code | 2 |
| Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models | Code | 2 |
| HHAvatar: Gaussian Head Avatar with Dynamic Hairs | Code | 2 |
| GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis | Code | 2 |
| PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training | Code | 2 |
| XGen-7B Technical Report | Code | 2 |
| RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars | Code | 2 |
| High-fidelity 3D Human Digitization from Single 2K Resolution Images | Code | 2 |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Code | 2 |
| Any-resolution Training for High-resolution Image Synthesis | Code | 2 |
| Towards Metrical Reconstruction of Human Faces | Code | 2 |
| FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset | Code | 2 |
| 360MonoDepth: High-Resolution 360° Monocular Depth Estimation | Code | 2 |
| Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models | Code | 1 |
| MMP-2K: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment Database | Code | 1 |
| Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions | Code | 1 |
| CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Code | 1 |
| ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression | Code | 1 |
| SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark | Code | 1 |
| How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs | Code | 1 |
| TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | Code | 1 |
| HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration | Code | 1 |
| Scene-Text Grounding for Text-Based Video Question Answering | Code | 1 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Code | 1 |
| Training Matting Models without Alpha Labels | Code | 1 |
| Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Code | 1 |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | Code | 1 |

No leaderboard results yet.