SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 67016750 of 661570 papers

TitleStatusHype
TextSSR: Diffusion-based Data Synthesis for Scene Text RecognitionCode2
RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian SplattingCode2
Global Estimation of Building-Integrated Facade and Rooftop Photovoltaic Potential by Integrating 3D Building Footprint and Spatio-Temporal DatasetsCode2
Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic PerspectiveCode2
SF-Loc: A Visual Mapping and Geo-Localization System based on Sparse Visual Structure FramesCode2
LamRA: Large Multimodal Model as Your Advanced Retrieval AssistantCode2
SfM-Free 3D Gaussian Splatting via Hierarchical TrainingCode2
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified FlowsCode2
NLPrompt: Noise-Label Prompt Learning for Vision-Language ModelsCode2
V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and PredictionCode2
Commit0: Library Generation from ScratchCode2
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMsCode2
InstantSwap: Fast Customized Concept Swapping across Sharp Shape DifferencesCode2
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual PreferencesCode2
TinyFusion: Diffusion Transformers Learned ShallowCode2
Refine3DNet: Scaling Precision in 3D Object Reconstruction from Multi-View RGB Images using AttentionCode2
CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and RerankingCode2
BIGCity: A Universal Spatiotemporal Model for Unified Trajectory and Traffic State Data AnalysisCode2
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context SparsificationCode2
Scaling New Frontiers: Insights into Large Recommendation ModelsCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image ClassificationCode2
Playable Game GenerationCode2
Ref-GS: Directional Factorization for 2D Gaussian SplattingCode2
Real-Time Metric-Semantic Mapping for Autonomous Navigation in Outdoor EnvironmentsCode2
Automatic Differentiation-based Full Waveform Inversion with Flexible WorkflowsCode2
PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video GenerationCode2
Forensics Adapter: Unleashing CLIP for Generalizable Face Forgery DetectionCode2
VLSBench: Unveiling Visual Leakage in Multimodal SafetyCode2
OpenQDC: Open Quantum Data CommonsCode2
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-TuningCode2
KV Shifting Attention Enhances Language ModelingCode2
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long VideosCode2
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-WorldCode2
DeMo: Decoupled Momentum OptimizationCode2
TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian SplattingCode2
L4acados: Learning-based models for acados, applied to Gaussian process-based predictive controlCode2
SADG: Segment Any Dynamic Gaussian Without Object TrackersCode2
SuperGaussians: Enhancing Gaussian Splatting Using Primitives with Spatially Varying ColorsCode2
Lost & Found: Tracking Changes from Egocentric Observations in 3D Dynamic Scene GraphsCode2
Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action RecognitionCode2
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial TasksCode2
Auto-Encoded Supervision for Perceptual Image Super-ResolutionCode2
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary SegmentationCode2
Det-SAM2:Technical Report on the Self-Prompting Segmentation Framework Based on Segment Anything Model 2Code2
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language ModelsCode2
ETAP: Event-based Tracking of Any PointCode2
OMNI-DC: Highly Robust Depth Completion with Multiresolution Depth IntegrationCode2
Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVsCode2
GaussianSpeech: Audio-Driven Gaussian AvatarsCode2
Show:102550
← PrevPage 135 of 13232Next →