SOTAVerified

2k

Papers

Showing 26-50 of 288 papers

Title | Status | Hype
HHAvatar: Gaussian Head Avatar with Dynamic Hairs | Code | 2
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis | Code | 2
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training | Code | 2
XGen-7B Technical Report | Code | 2
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars | Code | 2
High-fidelity 3D Human Digitization from Single 2K Resolution Images | Code | 2
Hyena Hierarchy: Towards Larger Convolutional Language Models | Code | 2
Any-resolution Training for High-resolution Image Synthesis | Code | 2
Towards Metrical Reconstruction of Human Faces | Code | 2
FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset | Code | 2
360MonoDepth: High-Resolution 360° Monocular Depth Estimation | Code | 2
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models | Code | 1
MMP-2K: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment Database | Code | 1
Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions | Code | 1
CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Code | 1
ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression | Code | 1
SEED4D: A Synthetic Ego--Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark | Code | 1
How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs | Code | 1
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | Code | 1
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration | Code | 1
Scene-Text Grounding for Text-Based Video Question Answering | Code | 1
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Code | 1
Training Matting Models without Alpha Labels | Code | 1
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Code | 1
Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | Code | 1

No leaderboard results yet.