2k

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 288 papers

Title	Date	Tasks	Status	Hype
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos	Jan 7, 2025	2kLanguage Modeling	CodeCode Available	5
Long-context LLMs Struggle with Long In-context Learning	Apr 2, 2024	2kIn-Context Learning	CodeCode Available	5
Scaling Granite Code Models to 128K Context	Jul 18, 2024	2k4k	CodeCode Available	4
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets	May 30, 2024	2k3D geometry	CodeCode Available	4
MovieChat+: Question-aware Sparse Memory for Long Video Question Answering	Apr 26, 2024	2kQuestion Answering	CodeCode Available	4
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection	Mar 11, 2024	2D Object Detection2k	CodeCode Available	4
Highly Accurate Dichotomous Image Segmentation	Mar 6, 2022	2k3D Reconstruction	CodeCode Available	4
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution	Apr 9, 2025	2kDecision Making	CodeCode Available	3
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction	Feb 17, 2025	2kAutonomous Driving	CodeCode Available	3
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data	Aug 7, 2024	16k2k	CodeCode Available	3
Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Jun 10, 2024	2k3DGS	CodeCode Available	3
CAMixerSR: Only Details Need More "Attention"	Feb 29, 2024	2k8k	CodeCode Available	3
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization	Jul 14, 2025	2kImage Generation	CodeCode Available	2
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization	Jul 10, 2025	2kQuantization	CodeCode Available	2
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning	Mar 30, 2025	2kGPU	CodeCode Available	2
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes	Mar 30, 2025	2kImage Generation	CodeCode Available	2
Ultra-Resolution Adaptation with Ease	Mar 20, 2025	2k4k	CodeCode Available	2
Elevating Flow-Guided Video Inpainting with Reference Generation	Dec 12, 2024	2kVideo Inpainting	CodeCode Available	2
VFIMamba: Video Frame Interpolation with State Space Models	Jul 2, 2024	2k4k	CodeCode Available	2
Task Me Anything	Jun 17, 2024	2kAttribute	CodeCode Available	2
Linear Attention Sequence Parallelism	Apr 3, 2024	2k	CodeCode Available	2
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension	Feb 12, 2024	2kAutomatic Speech Recognition	CodeCode Available	2
STICKERCONV: Generating Multimodal Empathetic Responses from Scratch	Jan 20, 2024	2kEmpathetic Response Generation	CodeCode Available	2
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models	Dec 21, 2023	2kImage Inpainting	CodeCode Available	2
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models	Dec 21, 2023	2k	CodeCode Available	2
HHAvatar: Gaussian Head Avatar with Dynamic Hairs	Dec 5, 2023	2k	CodeCode Available	2
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis	Dec 4, 2023	2kDepth Estimation	CodeCode Available	2
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training	Sep 19, 2023	2kPosition	CodeCode Available	2
XGen-7B Technical Report	Sep 7, 2023	2k8k	CodeCode Available	2
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars	May 22, 2023	2kImage Matting	CodeCode Available	2
High-fidelity 3D Human Digitization from Single 2K Resolution Images	Mar 27, 2023	2k3D Human Reconstruction	CodeCode Available	2
Hyena Hierarchy: Towards Larger Convolutional Language Models	Feb 21, 2023	2k8k	CodeCode Available	2
Any-resolution Training for High-resolution Image Synthesis	Apr 14, 2022	2kImage Generation	CodeCode Available	2
Towards Metrical Reconstruction of Human Faces	Apr 13, 2022	2k3D Face Reconstruction	CodeCode Available	2
FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset	Mar 26, 2022	2k3D Face Reconstruction	CodeCode Available	2
360MonoDepth: High-Resolution 360deg Monocular Depth Estimation	Jan 1, 2022	2kDepth Estimation	CodeCode Available	2
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models	May 29, 2025	2k4k	CodeCode Available	1
MMP-2K: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment Database	May 25, 2025	2kDiversity	CodeCode Available	1
Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions	May 23, 2025	2kBenchmarking	CodeCode Available	1
CascadeV: An Implementation of Wurstchen Architecture for Video Generation	Jan 28, 2025	2kVideo Generation	CodeCode Available	1
ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression	Dec 4, 2024	2kLogical Reasoning	CodeCode Available	1
SEED4D: A Synthetic Ego--Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark	Dec 1, 2024	2k4D reconstruction	CodeCode Available	1
How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs	Oct 24, 2024	2kMachine Translation	CodeCode Available	1
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models	Oct 14, 2024	2kBenchmarking	CodeCode Available	1
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration	Oct 2, 2024	2kDenoising	CodeCode Available	1
Scene-Text Grounding for Text-Based Video Question Answering	Sep 22, 2024	2kContrastive Learning	CodeCode Available	1
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models	Aug 28, 2024	2k4k	CodeCode Available	1
Training Matting Models without Alpha Labels	Aug 20, 2024	2kImage Matting	CodeCode Available	1
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector	Jun 17, 2024	2kHallucination	CodeCode Available	1
Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum	May 21, 2024	2k8k	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 6Next →

No leaderboard results yet.