SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1315113200 of 474278 papers

TitleStatusHype
Voyaging into Unbounded Dynamic Scenes from a Single View0
NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World ModelsCode0
A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models0
Graph Collaborative Attention Network for Link Prediction in Knowledge GraphsCode0
Combining Graph Neural Networks and Mixed Integer Linear Programming for Molecular Inference under the Two-Layered Model0
LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language ModelsCode1
CTR-Guided Generative Query Suggestion in Conversational Search0
Addressing The Devastating Effects Of Single-Task Data Poisoning In Exemplar-Free Continual LearningCode0
Taming Anomalies with Down-Up Sampling Networks: Group Center Preserving Reconstruction for 3D Anomaly Detection0
Quantum Stochastic Walks for Portfolio Optimization: Theory and Implementation on Financial Networks0
Stochastic Human Motion Prediction with Memory of Action Transition and Action CharacteristicCode0
Temporal Continual Learning with Prior Compensation for Human Motion PredictionCode0
Learning Disentangled Stain and Structural Representations for Semi-Supervised Histopathology SegmentationCode0
skfolio: Portfolio Optimization in PythonCode5
Taylor-Model Physics-Informed Neural Networks (PINNs) for Ordinary Differential EquationsCode0
PresentAgent: Multimodal Agent for Presentation Video GenerationCode2
All-atom inverse protein folding through discrete flow matchingCode0
Open-Vocabulary Object Detection in UAV Imagery: A Review and Future PerspectivesCode0
Low-Light Enhancement via Encoder-Decoder Network with Illumination GuidanceCode0
Team RAS in 9th ABAW Competition: Multimodal Compound Expression Recognition Approach0
Four Shades of Life Sciences: A Dataset for Disinformation Detection in the Life SciencesCode0
Chat2SPaT: A Large Language Model Based Tool for Automating Traffic Signal Control Plan ManagementCode0
SciVid: Cross-Domain Evaluation of Video Models in Scientific ApplicationsCode0
ObjectRL: An Object-Oriented Reinforcement Learning CodebaseCode0
MLASDO: a software tool to detect and explain clinical and omics inconsistencies applied to the Parkinson's Progression Markers Initiative cohortCode0
On the rankability of visual embeddingsCode0
MGSfM: Multi-Camera Geometry Driven Global Structure-from-MotionCode0
Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional VideosCode0
Task-Specific Generative Dataset Distillation with Difficulty-Guided Sampling0
GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation0
Agent-Based Detection and Resolution of Incompleteness and Ambiguity in Interactions with Large Language Models0
LTLCrit: A Temporal Logic-based LLM Critic for Safe and Efficient Embodied Agents0
Recon, Answer, Verify: Agents in Search of Truth0
Dyn-O: Building Structured World Models with Object-Centric Representations0
Be the Change You Want to See: Revisiting Remote Sensing Change Detection PracticesCode1
LRM-1B: Towards Large Routing Model0
Transforming Calabi-Yau Constructions: Generating New Calabi-Yau Manifolds with Transformers0
EvoAgentX: An Automated Framework for Evolving Agentic WorkflowsCode7
AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions0
CoreCodeBench: A Configurable Multi-Scenario Repository-Level BenchmarkCode1
GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph LearningCode2
Behaviour Space Analysis of LLM-driven Meta-heuristic Discovery0
Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations0
Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps0
Evaluating the Evaluators: Trust in Adversarial Robustness Tests0
Helping CLIP See Both the Forest and the Trees: A Decomposition and Description Approach0
SAMed-2: Selective Memory Enhanced Medical Segment Anything ModelCode1
Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation0
Communication Efficient, Differentially Private Distributed Optimization using Correlation-Aware Sketching0
Large Language Models for Combinatorial Optimization: A Systematic Review0
Show:102550
← PrevPage 264 of 9486Next →