SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1225112300 of 474278 papers

TitleStatusHype
Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language ModelsCode0
Topology Enhanced MARL for Multi-Vehicle Cooperative Decision-Making of CAVsCode0
Generate to Ground: Multimodal Text Conditioning Boosts Phrase Grounding in Medical Vision-Language ModelsCode0
Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal DataCode0
Vidi: Large Multimodal Models for Video Understanding and Editing0
CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling0
Improving physics-informed neural network extrapolation via transfer learning and adaptive activation functionsCode0
Mixture of Raytraced ExpertsCode0
MOFSimBench: Evaluating Universal Machine Learning Interatomic Potentials In Metal--Organic Framework Molecular ModelingCode0
CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence PruningCode0
CompressedVQA-HDR: Generalized Full-reference and No-reference Quality Assessment Models for Compressed High Dynamic Range VideosCode0
Dataset Ownership Verification for Pre-trained Masked ModelsCode0
MS-DETR: Towards Effective Video Moment Retrieval and Highlight Detection by Joint Motion-Semantic LearningCode0
Open-Vocabulary Indoor Object Grounding with 3D Hierarchical Scene GraphCode0
Wavelet-based Decoupling Framework for low-light Stereo Image EnhancementCode0
Text-driven Multiplanar Visual Interaction for Semi-supervised Medical Image SegmentationCode0
QuRe: Query-Relevant Retrieval through Hard Negative Sampling in Composed Image RetrievalCode0
CytoSAE: Interpretable Cell Embeddings for HematologyCode0
DeltaDiff: Reality-Driven Diffusion with AnchorResiduals for Faithful SRCode0
DyG-RAG: Dynamic Graph Retrieval-Augmented Generation with Event-Centric ReasoningCode0
Cross-modal Ship Re-Identification via Optical and SAR Imagery: A Novel Dataset and MethodCode0
TRIQA: Image Quality Assessment by Contrastive Pretraining on Ordered Distortion TripletsCode0
The benefits of query-based KGQA systems for complex and temporal questions in LLM eraCode0
Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image ClassificationCode0
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMsCode0
The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and ScientistCode0
AU-Blendshape for Fine-grained Stylized 3D Facial Expression ManipulationCode0
BOOKCOREF: Coreference Resolution at Book ScaleCode0
Out-of-distribution data supervision towards biomedical semantic segmentationCode0
RadioDiff-3D: A 3D3D Radio Map Dataset and Generative Diffusion Based Benchmark for 6G Environment-Aware CommunicationCode0
Text-ADBench: Text Anomaly Detection Benchmark based on LLMs EmbeddingCode0
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language ModelsCode0
MS-DGCNN++: A Multi-Scale Fusion Dynamic Graph Neural Network with Biological Knowledge Integration for LiDAR Tree Species ClassificationCode0
FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scaleCode3
Second-Order Bounds for [0,1]-Valued Regression via Betting Loss0
A Bayesian Incentive Mechanism for Poison-Resilient Federated Learning0
YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive AssociationCode0
Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?0
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training0
Information-Theoretic Generalization Bounds of Replay-based Continual Learning0
Non-Adaptive Adversarial Face Generation0
A Privacy-Preserving Framework for Advertising Personalization Incorporating Federated Learning and Differential Privacy0
Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts0
Heat Kernel Goes Topological0
A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning0
Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models0
Federated Learning in Open- and Closed-Loop EMG Decoding: A Privacy and Performance Perspective0
Safeguarding Federated Learning-based Road Condition Classification0
Ranking Vectors Clustering: Theory and Applications0
Draw an Ugly Person An Exploration of Generative AIs Perceptions of Ugliness0
Show:102550
← PrevPage 246 of 9486Next →