SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7301-7325 of 474278 papers

Title | Status | Hype
GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models | Code | 0
QwenCLIP: Boosting Medical Vision-Language Pretraining via LLM Embeddings and Prompt tuning | Code | 0
A Brain Wave Encodes a Thousand Tokens: Modeling Inter-Cortical Neural Interactions for Effective EEG-based Emotion Recognition | Code | 0
FusionFM: All-in-One Multi-Modal Image Fusion with Flow Matching | Code | 0
CSV-Decode: Certifiable Sub-Vocabulary Decoding for Efficient Large Language Model Inference | Code | 0
How Language Directions Align with Token Geometry in Multilingual LLMs | Code | 0
MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM | Code | 0
DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality | Code | 0
DEXTER: Diffusion-Guided EXplanations with TExtual Reasoning for Vision Models | Code | 0
DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platforms | Code | 0
P3-LLM: An Integrated NPU-PIM Accelerator for LLM Inference Using Hybrid Numerical Formats | Code | 0
Assessing LLMs for Serendipity Discovery in Knowledge Graphs: A Case for Drug Repurposing | | 0
HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models | Code | 0
FunReason-MT Technical Report: Advanced Data Synthesis Solution for Real-world Multi-Turn Tool-use | | 0
DeceptionBench: A Comprehensive Benchmark for AI Deception Behaviors in Real-world Scenarios | Code | 0
SCALAR: Scale-wise Controllable Visual Autoregressive Learning | Code | 0
Trainable Dynamic Mask Sparse Attention | Code | 0
See it. Say it. Sorted: Agentic System for Compositional Diagram Generation | Code | 0
PID-controlled Langevin Dynamics for Faster Sampling of Generative Models | Code | 0
Seg-VAR: Image Segmentation with Visual Autoregressive Modeling | Code | 0
Medical Knowledge Intervention Prompt Tuning for Medical Image Classification | Code | 0
R^2Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection | Code | 0
Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs | Code | 0
BioMedJImpact: A Comprehensive Dataset and LLM Pipeline for AI Engagement and Scientific Impact Analysis of Biomedical Journals | Code | 0
Temporal Object-Aware Vision Transformer for Few-Shot Video Object Detection | Code | 0
Page 293 of 18972