SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 38013850 of 661570 papers

TitleStatusHype
AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment0
Mathematical Foundations of Deep Learning0
SEAR: Simple and Efficient Adaptation of Visual Geometric Transformers for RGB+Thermal 3D ReconstructionCode0
Embodied Foundation Models at the Edge: A Survey of Deployment Constraints and Mitigation Strategies0
Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models0
A Proposal-Free Query-Guided Network for Grounded Multimodal Named Entity Recognition0
When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution0
The Mechanics of CNN Filtering with Rectification0
Implicit Patterns in LLM-Based Binary Analysis0
How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation0
A Re-ranking Method using K-nearest Weighted Fusion for Person Re-identificationCode0
CoPRS: Learning Positional Prior from Chain-of-Thought for Reasoning SegmentationCode0
MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image UnderstandingCode0
LMEB: Long-horizon Memory Embedding BenchmarkCode0
R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic SegmentationCode0
EntropyCache: Decoded Token Entropy Guided KV Caching for Diffusion Language ModelsCode0
Efficient Video Diffusion with Sparse Information Transmission for Video CompressionCode0
MOSAIC: Multi-Objective Slice-Aware Iterative Curation for AlignmentCode0
Benchmarking PDF Parsers on Table Extraction with LLM-based Semantic EvaluationCode0
Memento-Skills: Let Agents Design AgentsCode0
ProCal: Probability Calibration for Neighborhood-Guided Source-Free Domain AdaptationCode0
Statistical Characteristic-Guided Denoising for Rapid High-Resolution Transmission Electron Microscopy ImagingCode0
RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language ModelsCode0
PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and AlignmentCode0
GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian SplattingCode0
Rethinking MLLM Itself as a Segmenter with a Single Segmentation TokenCode0
cuGenOpt: A GPU-Accelerated General-Purpose Metaheuristic Framework for Combinatorial OptimizationCode0
What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?Code0
StreamingThinker: Large Language Models Can Think While ReadingCode0
Object-Centric Representation Learning for Enhanced 3D Semantic Scene Graph PredictionCode0
Multimodal OCR: Parse Anything from DocumentsCode0
MeInTime: Bridging Age Gap in Identity-Preserving Face RestorationCode0
MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language ModelsCode0
TAU-R1: Visual Language Model for Traffic Anomaly UnderstandingCode0
Motion-o: Trajectory-Grounded Video ReasoningCode0
Synergizing Deep Learning and Biological Heuristics for Extreme Long-Tail White Blood Cell ClassificationCode0
FILT3R: Latent State Adaptive Kalman Filter for Streaming 3D ReconstructionCode0
Shoe Style-Invariant and Ground-Aware Learning for Dense Foot Contact EstimationCode0
Expanding mmWave Datasets for Human Pose Estimation with Unlabeled Data and LiDAR DatasetsCode0
SuperOcc: Toward Cohesive Temporal Modeling for Superquadric-based 3D Occupancy PredictionCode0
What Is Wrong with Synthetic Data for Scene Text Recognition? A Strong Synthetic Engine with Diverse Simulations and Self-EvolutionCode0
PFGNet: A Fully Convolutional Frequency-Guided Peripheral Gating Network for Efficient Spatiotemporal Predictive LearningCode0
A Multi-Agent Perception-Action Alliance for Efficient Long Video ReasoningCode0
To See is Not to Master: Teaching LLMs to Use Private Libraries for Code GenerationCode0
Probing Cultural Signals in Large Language Models through Author ProfilingCode0
SR-Nav: Spatial Relationships Matter for Zero-shot Object Goal NavigationCode0
HAViT: Historical Attention Vision TransformerCode0
MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-EvolutionCode0
HORNet: Task-Guided Frame Selection for Video Question Answering with Vision-Language ModelsCode0
DriftGuard: Mitigating Asynchronous Data Drift in Federated LearningCode0
Show:102550
← PrevPage 77 of 13232Next →