SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 63016350 of 661570 papers

TitleStatusHype
Recurrent Diffusion for Large-Scale Parameter GenerationCode2
A generalizable 3D framework and model for self-supervised learning in medical imagingCode2
Avoiding Shortcuts: Enhancing Channel-Robust Specific Emitter Identification via Single-Source Domain GeneralizationCode2
Investigating the Scalability of Approximate Sparse Retrieval Algorithms to Massive DatasetsCode2
A Survey on Diffusion Models for Anomaly DetectionCode2
Agent-R: Training Language Model Agents to Reflect via Iterative Self-TrainingCode2
Reasoning Language Models: A BlueprintCode2
Advancing Language Model Reasoning through Reinforcement Learning and Inference ScalingCode2
Beyond Any-Shot Adaptation: Predicting Optimization Outcome for Robustness Gains without Extra PayCode2
Diffusion Models in Recommendation Systems: A SurveyCode2
LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual TasksCode2
Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education SystemsCode2
Discrete Prior-based Temporal-coherent Content Prediction for Blind Face Video RestorationCode2
FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable LocalizationCode2
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context ScenarioCode2
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the KeyCode2
Prompt-CAM: A Simpler Interpretable Transformer for Fine-Grained AnalysisCode2
Practical Continual Forgetting for Pre-trained Vision ModelsCode2
Lossless Compression of Vector IDs for Approximate Nearest Neighbor SearchCode2
A Simple Aerial Detection Baseline of Multimodal Language ModelsCode2
Scaling up self-supervised learning for improved surgical foundation modelsCode2
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh GenerationCode2
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image GenerationCode2
Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic DesignCode2
Densely Connected Parameter-Efficient Tuning for Referring Image SegmentationCode2
What Limits LLM-based Human Simulation: LLMs or Our Design?Code2
The Devil is in Temporal Token: High Quality Video Reasoning SegmentationCode2
GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection ChallengeCode2
Vision Foundation Models for Computed TomographyCode2
CityDreamer4D: Compositional Generative Model of Unbounded 4D CitiesCode2
Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous DrivingCode2
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal UnderstandingCode2
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal UnderstandingCode2
PokerBench: Training Large Language Models to become Professional Poker PlayersCode2
LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process ThinkingCode2
OptiChat: Bridging Optimization Models and Practitioners with Large Language ModelsCode2
Flow: Modularized Agentic Workflow AutomationCode2
RWKV-UNet: Improving UNet with Long-Range Cooperation for Effective Medical Image SegmentationCode2
Enhancing Retrieval-Augmented Generation: A Study of Best PracticesCode2
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific LiteratureCode2
Imagine while Reasoning in Space: Multimodal Visualization-of-ThoughtCode2
SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM)Code2
Leveraging ASIC AI Chips for Homomorphic EncryptionCode2
AlphaNet: Scaling Up Local-frame-based Atomistic Interatomic PotentialCode2
A User's Guide to KSig: GPU-Accelerated Computation of the Signature KernelCode2
Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-ResolutionCode2
Deep Learning and Foundation Models for Weather Prediction: A SurveyCode2
F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian SplattingCode2
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation ModelsCode2
ChemAgent: Self-updating Library in Large Language Models Improves Chemical ReasoningCode2
Show:102550
← PrevPage 127 of 13232Next →