SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1320113250 of 474278 papers

TitleStatusHype
Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual DescriptorCode0
Molecular Machine Learning Using Euler Characteristic TransformsCode0
Rectifying Adversarial Sample with Low Entropy Prior for Test-Time Defense0
Effects of structure on reasoning in instance-level Self-DiscoverCode0
Be the Change You Want to See: Revisiting Remote Sensing Change Detection PracticesCode1
Leveraging Out-of-Distribution Unlabeled Images: Semi-Supervised Semantic Segmentation with an Open-Vocabulary ModelCode0
Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition0
Adaptive Gate-Aware Mamba Networks for Magnetic Resonance Fingerprinting0
Flow-Anchored Consistency ModelsCode2
Hybrid-View Attention for csPCa Classification in TRUSCode0
Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky0
STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and BenchmarkingCode0
CORE-ReID V2: Advancing the Domain Adaptation for Object Re-Identification with Optimized Training and Ensemble FusionCode0
Cross-domain Hyperspectral Image Classification based on Bi-directional Domain AdaptationCode0
MvHo-IB: Multi-View Higher-Order Information Bottleneck for Brain Disorder DiagnosisCode0
A Fuzzy Supervisor Agent Design for Clinical Reasoning Assistance in a Multi-Agent Educational Clinical Scenario SimulationCode0
JoyTTS: LLM-based Spoken Chatbot With Voice CloningCode0
Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback0
GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation ScalingCode0
LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling0
Wildlife Target Re-Identification Using Self-supervised Learning in Non-Urban SettingsCode0
Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving0
Can LLMs Identify Critical Limitations within Scientific Research? A Systematic Evaluation on AI Research Papers0
Prompt learning with bounding box constraints for medical image segmentationCode0
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion0
Answer Matching Outperforms Multiple Choice for Language Model Evaluation0
Explainable AI for Comprehensive Risk Assessment for Financial Reports: A Lightweight Hierarchical Transformer Network ApproachCode0
IMASHRIMP: Automatic White Shrimp (Penaeus vannamei) Biometrical Analysis from Laboratory Images Using Computer Vision and Deep LearningCode0
CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD EditingCode0
RGC-VQA: An Exploration Database for Robotic-Generated Video Quality AssessmentCode0
MTCNet: Motion and Topology Consistency Guided Learning for Mitral Valve Segmentationin 4D UltrasoundCode0
Listwise Preference Alignment Optimization for Tail Item RecommendationCode0
DoMIX: An Efficient Framework for Exploiting Domain Knowledge in Fine-TuningCode0
CrowdTrack: A Benchmark for Difficult Multiple Pedestrian Tracking in Real ScenariosCode0
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMsCode0
Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive CachingCode0
From Sentences to Sequences: Rethinking Languages in Biological SystemCode0
Temporally-Aware Supervised Contrastive Learning for Polyp Counting in ColonoscopyCode0
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement LearningCode0
PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal ViewCode0
F^2TTA: Free-Form Test-Time Adaptation on Cross-Domain Medical Image Classification via Image-Level Disentangled Prompt TuningCode0
DistZO2: High-Throughput and Memory-Efficient Zeroth-Order Fine-tuning LLMs with Distributed Parallel ComputingCode0
Cautious Next Token PredictionCode1
From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images0
AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-benchCode2
Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and PhysicsCode1
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic AgentsCode3
MathOptAI.jl: Embed trained machine learning predictors into JuMP modelsCode2
ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation0
Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization0
Show:102550
← PrevPage 265 of 9486Next →