SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 18251–18300 of 474278 papers

Title | Status | Hype
An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem | Code | 1
Sparse identification of nonlinear dynamics and Koopman operators with Shallow Recurrent Decoder Networks | Code | 1
Time Series Embedding Methods for Classification Tasks: A Review | Code | 1
Unraveling Normal Anatomy via Fluid-Driven Anomaly Randomization | Code | 1
Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning | Code | 1
WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control | Code | 1
Leveraging Textual Anatomical Knowledge for Class-Imbalanced Semi-Supervised Multi-Organ Segmentation | Code | 1
Enhancing Biomedical Relation Extraction with Directionality | Code | 1
Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System | Code | 1
Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement | Code | 1
Enhancing kelp forest detection in remote sensing images using crowdsourced labels with Mixed Vision Transformers and ConvNeXt segmentation models | Code | 1
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation | Code | 1
Towards Robust Multimodal Open-set Test-time Adaptation via Adaptive Entropy-aware Optimization | Code | 1
Do as We Do, Not as You Think: the Conformity of Large Language Models | Code | 1
MixRec: Individual and Collective Mixing Empowers Data Augmentation for Recommender Systems | Code | 1
EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion | Code | 1
Implicit Neural Surface Deformation with Explicit Velocity Fields | Code | 1
Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models | Code | 1
Emotion estimation from video footage with LSTM | Code | 1
Can Large Language Models Understand Preferences in Personalized Recommendation? | Code | 1
KAA: Kolmogorov-Arnold Attention for Enhancing Attentive Graph Neural Networks | Code | 1
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament | Code | 1
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction | Code | 1
Need for Speed: A Comprehensive Benchmark of JPEG Decoders in Python | Code | 1
Online Preference Alignment for Language Models via Count-based Exploration | Code | 1
Multi-Instance Partial-Label Learning with Margin Adjustment | Code | 1
SRMT: Shared Memory for Multi-agent Lifelong Pathfinding | Code | 1
NExtLong: Toward Effective Long-Context Training without Long Documents | Code | 1
REX: Causal Discovery based on Machine Learning and Explainability techniques | Code | 1
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation | Code | 1
Tackling Small Sample Survival Analysis via Transfer Learning: A Study of Colorectal Cancer Prognosis | Code | 1
DSTSA-GCN: Advancing Skeleton-Based Gesture Recognition with Semantic-Aware Spatio-Temporal Topology Modeling | Code | 1
MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks | Code | 1
Noise-Resilient Point-wise Anomaly Detection in Time Series Using Weak Segment Labels | Code | 1
LASER: Lip Landmark Assisted Speaker Detection for Robustness | Code | 1
WinPCA: A package for windowed principal component analysis | Code | 1
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning | Code | 1
Generating with Fairness: A Modality-Diffused Counterfactual Framework for Incomplete Multimodal Recommendations | Code | 1
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Code | 1
FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients | Code | 1
Med-R^2: Crafting Trustworthy LLM Physicians via Retrieval and Reasoning of Evidence-Based Medicine | Code | 1
Multi-Modality Collaborative Learning for Sentiment Analysis | Code | 1
Physics of Skill Learning | Code | 1
SMamba: Sparse Mamba for Event-based Object Detection | Code | 1
Modality Interactive Mixture-of-Experts for Fake News Detection | Code | 1
Towards Accurate Unified Anomaly Segmentation | Code | 1
Assisting Mathematical Formalization with A Learning-based Premise Retriever | Code | 1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Code | 1
TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism | Code | 1
EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Code | 1
Page 366 of 9486