SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 10601-10650 of 661570 papers

Title | Status | Hype
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training | Code | 2
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video | Code | 2
Gradient Boosting Reinforcement Learning | Code | 2
AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind | Code | 2
CompGS: Smaller and Faster Gaussian Splatting with Vector Quantization | Code | 2
Diffusion Actor-Critic with Entropy Regulator | Code | 2
Contextual Object Detection with Multimodal Large Language Models | Code | 2
V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding | Code | 2
Exploration-Driven Generative Interactive Environments | Code | 2
PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution | Code | 2
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation | Code | 2
FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space | Code | 2
DreamGaussian4D: Generative 4D Gaussian Splatting | Code | 2
GraphKAN: Enhancing Feature Extraction with Graph Kolmogorov Arnold Networks | Code | 2
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Code | 2
Referring to Any Person | Code | 2
Training Language Models to Self-Correct via Reinforcement Learning | Code | 2
GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning | Code | 2
Training on test proteins improves fitness, structure, and function prediction | Code | 2
mGPT: Few-Shot Learners Go Multilingual | Code | 2
Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion | Code | 2
TimeFilter: Patch-Specific Spatial-Temporal Graph Filtration for Time Series Forecasting | Code | 2
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | Code | 2
YOLOPv2: Better, Faster, Stronger for Panoptic Driving Perception | Code | 2
CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions | Code | 2
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation | Code | 2
A large-scale multicenter breast cancer DCE-MRI benchmark dataset with expert segmentations | Code | 2
MonoOcc: Digging into Monocular Semantic Occupancy Prediction | Code | 2
Self-Supervised Any-Point Tracking by Contrastive Random Walks | Code | 2
Click-Calib: A Robust Extrinsic Calibration Method for Surround-View Systems | Code | 2
ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis | Code | 2
Brain Latent Progression: Individual-based Spatiotemporal Disease Progression on 3D Brain MRIs via Latent Diffusion | Code | 2
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review | Code | 2
MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks | Code | 2
Generating Long Semantic IDs in Parallel for Recommendation | Code | 2
Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities | Code | 2
Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Code | 2
Three New Validators and a Large-Scale Benchmark Ranking for Unsupervised Domain Adaptation | Code | 2
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents | Code | 2
Learning from All Vehicles | Code | 2
LambdaNetworks: Modeling Long-Range Interactions Without Attention | Code | 2
Next Patch Prediction for Autoregressive Visual Generation | Code | 2
The Stable Artist: Steering Semantics in Diffusion Latent Space | Code | 2
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters | Code | 2
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding | Code | 2
SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers | Code | 2
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion | Code | 2
Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation | Code | 2
Active Generalized Category Discovery | Code | 2
COLD: A Benchmark for Chinese Offensive Language Detection | Code | 2
Page 213 of 13232