SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 12801–12850 of 474278 papers

Title | Status | Hype
A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding | | 0
A Language-Driven Framework for Improving Personalized Recommendations: Merging LLMs with Traditional Algorithms | | 0
What Demands Attention in Urban Street Scenes? From Scene Understanding towards Road Safety: A Survey of Vision-driven Datasets and Studies | | 0
4KAgent: Agentic Any Image to 4K Super-Resolution | | 0
OpenDPDv2: A Unified Learning and Optimization Framework for Neural Network Digital Predistortion | | 0
SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers | Code | 0
Barriers in Integrating Medical Visual Question Answering into Radiology Workflows: A Scoping Review and Clinicians' Insights | | 0
Boosting Parameter Efficiency in LLM-Based Recommendation through Sophisticated Pruning | Code | 0
Addressing Imbalanced Domain-Incremental Learning through Dual-Balance Collaborative Experts | Code | 0
GR-LLMs: Recent Advances in Generative Recommendation Based on Large Language Models | | 0
Explainable Artificial Intelligence in Biomedical Image Analysis: A Comprehensive Survey | | 0
FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation | | 0
What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models | | 0
Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings | | 0
MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection | Code | 0
Foundation models for time series forecasting: Application in conformal prediction | | 0
Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework | | 0
Learning from Sparse Point Labels for Dense Carcinosis Localization in Advanced Ovarian Cancer Assessment | | 0
A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality | | 0
MagiC: Evaluating Multimodal Cognition Toward Grounded Visual Reasoning | | 0
Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model | | 0
Reading a Ruler in the Wild | | 0
Temporal Information Retrieval via Time-Specifier Model Merging | Code | 0
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data | Code | 0
MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning | Code | 0
ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture | Code | 0
CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs | | 0
Failure Forecasting Boosts Robustness of Sim2Real Rhythmic Insertion Policies | | 0
Evaluating Attribute Confusion in Fashion Text-to-Image Generation | | 0
LinguaMark: Do Multimodal Models Speak Fairly? A Benchmark-Based Evaluation | | 0
Benchmarking Waitlist Mortality Prediction in Heart Transplantation Through Time-to-Event Modeling using New Longitudinal UNOS Dataset | | 0
Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Code | 2
InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior | Code | 0
Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection | Code | 0
Integrating External Tools with Large Language Models to Improve Accuracy | | 0
Residual Prior-driven Frequency-aware Network for Image Fusion | Code | 0
Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation | | 0
Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices | | 0
Fast Gaussian Processes under Monotonicity Constraints | | 0
MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting Models | Code | 2
Artificial Generals Intelligence: Mastering Generals.io with Reinforcement Learning | | 0
MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval | Code | 0
From large-eddy simulations to deep learning: A U-net model for fast urban canopy flow predictions | Code | 0
GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning | | 0
The Safety Gap Toolkit: Evaluating Hidden Dangers of Open-Source Models | Code | 0
Gradients as an Action: Towards Communication-Efficient Federated Recommender Systems via Adaptive Action Sharing | Code | 0
SciMaster: Towards General-Purpose Scientific AI Agents, Part I. X-Master as Foundation: Can We Lead on Humanity's Last Exam? | | 0
HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars | | 0
Multi-Sense Embeddings for Language Models and Knowledge Distillation | Code | 0
CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings | Code | 0
Page 257 of 9486