SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 16301 to 16350 of 474278 papers

Title | Status | Hype
Cross-Channel Unlabeled Sensing over a Union of Signal Subspaces | - | 0
Wavelet Scattering Transform and Fourier Representation for Offline Detection of Malicious Clients in Federated Learning | - | 0
FedVLMBench: Benchmarking Federated Fine-Tuning of Vision-Language Models | - | 0
A Survey on the Role of Artificial Intelligence and Machine Learning in 6G-V2X Applications | - | 0
MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | - | 0
Towards Efficient and Effective Alignment of Large Language Models | - | 0
Foundation Model-Aided Deep Reinforcement Learning for RIS-Assisted Wireless Communication | - | 0
Advancing Exchange Rate Forecasting: Leveraging Machine Learning and AI for Enhanced Accuracy in Global Financial Markets | - | 0
Vision Generalist Model: A Survey | - | 0
Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era | - | 0
Wasserstein Hypergraph Neural Network | - | 0
UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images | - | 0
Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning | - | 0
Hidden in Plain Sight: Evaluation of the Deception Detection Capabilities of LLMs in Multimodal Settings | - | 0
An Effective End-to-End Solution for Multimodal Action Recognition | - | 0
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model | Code | 2
DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding | - | 0
Leveraging LLMs for Mission Planning in Precision Agriculture | - | 0
A Novel Lightweight Transformer with Edge-Aware Fusion for Remote Sensing Image Captioning | - | 0
Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective | Code | 1
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation | Code | 7
Towards Practical Alzheimer's Disease Diagnosis: A Lightweight and Interpretable Spiking Neural Model | Code | 1
ScaleLSD: Scalable Deep Line Segment Detection Streamlined | Code | 1
Evasion Attacks Against Bayesian Predictive Models | Code | 0
HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person Scenarios | Code | 0
On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention | Code | 0
Improving Personalized Search with Regularized Low-Rank Parameter Updates | Code | 0
Consistent Story Generation with Asymmetry Zigzag Sampling | Code | 0
MMME: A Spontaneous Multi-Modal Micro-Expression Dataset Enabling Visual-Physiological Fusion | Code | 0
SRPL-SFDA: SAM-Guided Reliable Pseudo-Labels for Source-Free Domain Adaptation in Medical Image Segmentation | Code | 0
Apollo: A Posteriori Label-Only Membership Inference Attack Towards Machine Unlearning | Code | 0
Discrete Scale-invariant Metric Learning for Efficient Collaborative Filtering | Code | 0
IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic Environments | Code | 2
Non-Contact Health Monitoring During Daily Personal Care Routines | Code | 1
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following | Code | 2
DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt | Code | 1
Unmasking real-world audio deepfakes: A data-centric approach | Code | 1
OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment | Code | 0
ScoreMix: Improving Face Recognition via Score Composition in Diffusion Generators | - | 0
MetricHMR: Metric Human Mesh Recovery from Monocular Images | - | 0
Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers | - | 0
Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question Answering | Code | 0
Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs | Code | 0
Empirical and computer-aided robustness analysis of long-step and accelerated methods in smooth convex optimization | Code | 0
Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction | - | 0
Prompt Variability Effects On LLM Code Generation | - | 0
Auto-Compressing Networks | - | 0
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting | Code | 2
CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | Code | 2
LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge | Code | 1
Page 327 of 9486