SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1895119000 of 474278 papers

TitleStatusHype
RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object DetectionCode1
Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-trainingCode1
Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity CollaborationCode1
CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal DynamicsCode1
Differential Alignment for Domain Adaptive Object DetectionCode1
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and GroundingCode1
CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language ModelsCode1
Boosting Fine-Grained Visual Anomaly Detection with Coarse-Knowledge-Aware Adversarial LearningCode1
GIRAFFE: Design Choices for Extending the Context Length of Visual Language ModelsCode1
MedMax: Mixed-Modal Instruction Tuning for Training Biomedical AssistantsCode1
XPath Agent: An Efficient XPath Programming Agent Based on LLM for Web CrawlerCode1
Assessing the Limitations of Large Language Models in Clinical Fact DecompositionCode1
RCLMuFN: Relational Context Learning and Multiplex Fusion Network for Multimodal Sarcasm DetectionCode1
TimeCHEAT: A Channel Harmony Strategy for Irregularly Sampled Multivariate Time Series AnalysisCode1
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image SegmentationCode1
A Knowledge-enhanced Pathology Vision-language Foundation Model for Cancer DiagnosisCode1
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance SegmentationCode1
Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan ScriptCode1
MT-LENS: An all-in-one Toolkit for Better Machine Translation EvaluationCode1
Re-Attentional Controllable Video Diffusion EditingCode1
Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality SettingsCode1
Universal Domain Adaptive Object Detection via Dual Probabilistic AlignmentCode1
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language ModelsCode1
TS-SatFire: A Multi-Task Satellite Image Time-Series Dataset for Wildfire Detection and PredictionCode1
Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters ThemselvesCode1
SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning TypesCode1
Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model ArchitectureCode1
Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPsCode1
3D^2-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar ModelingCode1
GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-trainingCode1
AMI-Net: Adaptive Mask Inpainting Network for Industrial Anomaly Detection and LocalizationCode1
SpeechPrune: Context-aware Token Pruning for Speech Information RetrievalCode1
Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic SegmentationCode1
Data-driven Precipitation Nowcasting Using Satellite ImageryCode1
Bridging the Gap: Enhancing LLM Performance for Low-Resource African Languages with New Benchmarks, Fine-Tuning, and Cultural AdjustmentsCode1
Conditional Diffusion Models Based Conditional Independence TestingCode1
MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion ModelsCode1
RAG Playground: A Framework for Systematic Evaluation of Retrieval Strategies and Prompt Engineering in RAG SystemsCode1
Does VLM Classification Benefit from LLM Description Semantics?Code1
Region-Based Optimization in Continual Learning for Audio Deepfake DetectionCode1
IDEA-Bench: How Far are Generative Models from Professional Designing?Code1
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIPCode1
Bayesian Flow Is All You Need to Sample Out-of-Distribution Chemical SpacesCode1
Aligning Visual and Semantic Interpretability through Visually Grounded Concept Bottleneck ModelsCode1
Deep Random Features for Scalable Interpolation of Spatiotemporal DataCode1
Relation-Guided Adversarial Learning for Data-free Knowledge TransferCode1
StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric PriorsCode1
IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image GenerationCode1
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM EnhancementCode1
Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video DenoisingCode1
Show:102550
← PrevPage 380 of 9486Next →