SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1060110650 of 661570 papers

TitleStatusHype
Universal Segmentation at Arbitrary Granularity with Language InstructionCode2
Towards Learning a Generalist Model for Embodied NavigationCode2
Hulk: A Universal Knowledge Translator for Human-Centric TasksCode2
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D GaussiansCode2
GaussianHead: High-fidelity Head Avatars with Learnable Gaussian DerivationCode2
SchurVINS: Schur Complement-Based Lightweight Visual Inertial Navigation SystemCode2
Data Management For Training Large Language Models: A SurveyCode2
D-Bot: Database Diagnosis System using Large Language ModelsCode2
ImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationCode2
VEXIR2Vec: An Architecture-Neutral Embedding Framework for Binary SimilarityCode2
Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal ForecastingCode2
DeepCache: Accelerating Diffusion Models for FreeCode2
3D Face Reconstruction with the Geometric Guidance of Facial Part SegmentationCode2
CoLLiE: Collaborative Training of Large Language Models in an Efficient WayCode2
Dense Optical Tracking: Connecting the DotsCode2
Gaussian Grouping: Segment and Edit Anything in 3D ScenesCode2
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style AdapterCode2
Segment and Caption AnythingCode2
FSGS: Real-Time Few-shot View Synthesis using Gaussian SplattingCode2
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion ModelsCode2
CompGS: Smaller and Faster Gaussian Splatting with Vector QuantizationCode2
Fast ODE-based Sampling for Diffusion Models in Around 5 StepsCode2
Distributed Global Structure-from-Motion with a Deep Front-EndCode2
VTimeLLM: Empower LLM to Grasp Video MomentsCode2
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and PlanningCode2
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model GenerationCode2
HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from VideoCode2
AlignBench: Benchmarking Chinese Alignment of Large Language ModelsCode2
Zero Bubble Pipeline ParallelismCode2
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person PerspectivesCode2
BioCLIP: A Vision Foundation Model for the Tree of LifeCode2
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous DrivingCode2
HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional InpaintingCode2
HUGS: Human Gaussian SplatsCode2
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving ApplicationsCode2
A Graph-Based Approach for Category-Agnostic Pose EstimationCode2
MMA-Diffusion: MultiModal Attack on Diffusion ModelsCode2
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation SamplingCode2
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-AllocationCode2
Biomedical knowledge graph-optimized prompt generation for large language modelsCode2
Neural Fields with Thermal Activations for Arbitrary-Scale Super-ResolutionCode2
FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher InformationCode2
Zooming Out on Zooming In: Advancing Super-Resolution for Remote SensingCode2
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D GenerationCode2
Gaussian Shell Maps for Efficient 3D Human GenerationCode2
TransNeXt: Robust Foveal Visual Perception for Vision TransformersCode2
War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World WarsCode2
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive DecodingCode2
Graph Prompt Learning: A Comprehensive Survey and BeyondCode2
SatCLIP: Global, General-Purpose Location Embeddings with Satellite ImageryCode2
Show:102550
← PrevPage 213 of 13232Next →