SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2025120300 of 474278 papers

TitleStatusHype
TrackMe:A Simple and Effective Multiple Object Tracking Annotation ToolCode1
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional BootstrappingCode1
Upsampling DINOv2 features for unsupervised vision tasks and weakly supervised materials segmentationCode1
Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference LearningCode1
Cooperation and Fairness in Multi-Agent Reinforcement LearningCode1
NeuralMAG: Fast and Generalizable Micromagnetic Simulation with Deep Neural NetsCode1
How Many Van Goghs Does It Take to Van Gogh? Finding the Imitation ThresholdCode1
MambaSOD: Dual Mamba-Driven Cross-Modal Fusion Network for RGB-D Salient Object DetectionCode1
Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-AttentionCode1
EViT-Unet: U-Net Like Efficient Vision Transformer for Medical Image Segmentation on Mobile and Edge DevicesCode1
Evaluating Deep Unlearning in Large Language ModelsCode1
Non-Invasive to Invasive: Enhancing FFA Synthesis from CFP with a Benchmark Dataset and a Novel NetworkCode1
Quanta Video RestorationCode1
GlitchMiner: Mining Glitch Tokens in Large Language Models via Gradient-based Discrete OptimizationCode1
DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine DomainCode1
UniMTS: Unified Pre-training for Motion Time SeriesCode1
LoGU: Long-form Generation with Uncertainty ExpressionsCode1
EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary SearchCode1
MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart ProblemsCode1
Enhancing Large Language Models' Situated Faithfulness to External ContextsCode1
Synthesizing Post-Training Data for LLMs through Multi-Agent SimulationCode1
Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor EnvironmentsCode1
How EEG preprocessing shapes decoding performanceCode1
Zero-shot Generalist Graph Anomaly Detection with Unified Neighborhood PromptsCode1
Toward Generalizing Visual Brain Decoding to Unseen SubjectsCode1
ControlSR: Taming Diffusion Models for Consistent Real-World Image Super ResolutionCode1
syren-new: Precise formulae for the linear and nonlinear matter power spectra with massive neutrinos and dynamical dark energyCode1
DRACO: Differentiable Reconstruction for Arbitrary CBCT OrbitsCode1
Decomposing The Dark Matter of Sparse AutoencodersCode1
Croc: Pretraining Large Multimodal Models with Cross-Modal ComprehensionCode1
TimeSeriesExam: A time series understanding examCode1
CoMAL: Collaborative Multi-Agent Large Language Models for Mixed-Autonomy TrafficCode1
Shape Transformation Driven by Active Contour for Class-Imbalanced Semi-Supervised Medical Image SegmentationCode1
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control AgentsCode1
ST-MoE-BERT: A Spatial-Temporal Mixture-of-Experts Framework for Long-Term Cross-City Mobility PredictionCode1
xPerT: Extended Persistence TransformerCode1
ANT: Adaptive Noise Schedule for Time Series Diffusion ModelsCode1
Personalized Image Generation with Large Multimodal ModelsCode1
Unlocking the Full Potential of High-Density Surface EMG: Novel Non-Invasive High-Yield Motor Unit DecompositionCode1
MomentumSMoE: Integrating Momentum into Sparse Mixture of ExpertsCode1
Do LLMs "know" internally when they follow instructions?Code1
Rethinking Transformer for Long Contextual Histopathology Whole Slide Image AnalysisCode1
Self-supervised contrastive learning performs non-linear system identificationCode1
Paths-over-Graph: Knowledge Graph Empowered Large Language Model ReasoningCode1
Almost-Linear RNNs Yield Highly Interpretable Symbolic Codes in Dynamical Systems ReconstructionCode1
FedMSE: Federated learning for IoT network intrusion detectionCode1
Distance between Relevant Information Pieces Causes Bias in Long-Context LLMsCode1
LESS: Label-Efficient and Single-Stage Referring 3D SegmentationCode1
Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-GuidanceCode1
ControlAgent: Automating Control System Design via Novel Integration of LLM Agents and Domain ExpertiseCode1
Show:102550
← PrevPage 406 of 9486Next →