SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1705117100 of 474278 papers

TitleStatusHype
A Multi-Power Law for Loss Curve Prediction Across Learning Rate SchedulesCode1
Sampling Innovation-Based Adaptive Compressive SensingCode1
Atlas: Multi-Scale Attention Improves Long Context Image ModelingCode1
GS-I^3: Gaussian Splatting for Surface Reconstruction from Illumination-Inconsistent ImagesCode1
Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene UnderstandingCode1
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature AnalysisCode1
EgoEvGesture: Gesture Recognition Based on Egocentric Event CameraCode1
SynLlama: Generating Synthesizable Molecules and Their Analogs with Large Language ModelsCode1
Modality-Composable Diffusion Policy via Inference-Time Distribution-level CompositionCode1
History-Aware Transformation of ReID Features for Multiple Object TrackingCode1
Exploring Contextual Attribute Density in Referring Expression CountingCode1
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?Code1
MAVEN: Multi-modal Attention for Valence-Arousal Emotion NetworkCode1
DPF-Net: Physical Imaging Model Embedded Data-Driven Underwater Image EnhancementCode1
VISO-Grasp: Vision-Language Informed Spatial Object-centric 6-DoF Active View Planning and Grasping in Clutter and InvisibilityCode1
EXAONE Deep: Reasoning Enhanced Language ModelsCode1
LLM-Driven Multi-step Translation from C to Rust using Static AnalysisCode1
Will Pre-Training Ever End? A First Step Toward Next-Generation Foundation MLLMs via Self-Improving Systematic CognitionCode1
Semi-Decision-Focused Learning with Deep Ensembles: A Practical Framework for Robust Portfolio OptimizationCode1
Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language ModelsCode1
Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank AdaptationCode1
TERL: Large-Scale Multi-Target Encirclement Using Transformer-Enhanced Reinforcement LearningCode1
Hyperbolic Safety-Aware Vision-Language ModelsCode1
Revisiting Training-Inference Trigger Intensity in Backdoor AttacksCode1
QDM: Quadtree-Based Region-Adaptive Sparse Diffusion Models for Efficient Image Super-ResolutionCode1
Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie DubbingCode1
SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM PlanningCode1
Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic EnvironmentsCode1
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop TrainingCode1
Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud AnalysisCode1
SEAL: Semantic Aware Image WatermarkingCode1
O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language ModelsCode1
3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene ReconstructionCode1
Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video ReconstructionCode1
CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and ReasoningCode1
UStyle: Waterbody Style Transfer of Underwater Scenes by Depth-Guided Feature SynthesisCode1
Observation-only learning of neural mapping schemes for gappy satellite-derived ocean colour parametersCode1
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech TokensCode1
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion GenerationCode1
Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-seq Data AnalysisCode1
Variational Bayesian Personalized RankingCode1
LuSeg: Efficient Negative and Positive Obstacles Segmentation via Contrast-Driven Multi-Modal Feature Fusion on the LunarCode1
Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open SpaceCode1
Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-tuningCode1
A Survey of Cross-domain Graph Learning: Progress and Future DirectionsCode1
GNNs as Predictors of Agentic Workflow PerformancesCode1
GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion PriorCode1
Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty?Code1
APLA: A Simple Adaptation Method for Vision TransformersCode1
Similarity-Aware Token Pruning: Your VLM but FasterCode1
Show:102550
← PrevPage 342 of 9486Next →