SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 16251–16300 of 474,278 papers

Title | Status | Hype
Geometry-Informed Neural Operator Transformer | Code | 1
BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text | Code | 1
DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Code | 1
Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer | Code | 1
Simplified and Secure MCP Gateways for Enterprise AI Integration | Code | 1
AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers | Code | 1
mrCAD: Multimodal Refinement of Computer-aided Designs | Code | 1
Efficient Reasoning for LLMs through Speculative Chain-of-Thought | Code | 1
Relative Contrastive Learning for Sequential Recommendation with Similarity-based Positive Pair Selection | Code | 1
Neurosymbolic Association Rule Mining from Tabular Data | Code | 1
LRFusionPR: A Polar BEV-Based LiDAR-Radar Fusion Network for Place Recognition | Code | 1
AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings | Code | 1
AndroidGen: Building an Android Language Agent under Data Scarcity | Code | 1
Semantic-Aligned Learning with Collaborative Refinement for Unsupervised VI-ReID | Code | 1
ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development | Code | 1
Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation | Code | 1
R-Sparse R-CNN: SAR Ship Detection Based on Background-Aware Sparse Learnable Proposals | Code | 1
TSRM: A Lightweight Temporal Feature Encoding Architecture for Time Series Forecasting and Imputation | Code | 1
Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation | Code | 1
Neurophysiologically Realistic Environment for Comparing Adaptive Deep Brain Stimulation Algorithms in Parkinson Disease | Code | 1
Clinical knowledge in LLMs does not translate to human interactions | Code | 1
CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval | Code | 1
Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization | Code | 1
Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers | Code | 1
Task-Oriented Communications for Visual Navigation with Edge-Aerial Collaboration in Low Altitude Economy | Code | 1
PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models | Code | 1
Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional | Code | 1
What is the Added Value of UDA in the VFM Era? | Code | 1
DOSE : Drum One-Shot Extraction from Music Mixture | Code | 1
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Code | 1
LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method | Code | 1
MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS | Code | 1
Action Flow Matching for Continual Robot Learning | Code | 1
E-InMeMo: Enhanced Prompting for Visual In-Context Learning | Code | 1
VideoMultiAgents: A Multi-Agent Framework for Video Question Answering | Code | 1
Mamba-Sea: A Mamba-based Framework with Global-to-Local Sequence Augmentation for Generalizable Medical Image Segmentation | Code | 1
TableCenterNet: A one-stage network for table structure recognition | Code | 1
PhysioSync: Temporal and Cross-Modal Contrastive Learning Inspired by Physiological Synchronization for EEG-Based Emotion Recognition | Code | 1
iVR-GS: Inverse Volume Rendering for Explorable Visualization via Editable 3D Gaussian Splatting | Code | 1
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency | Code | 1
FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding | Code | 1
Beyond Cox Models: Assessing the Performance of Machine-Learning Methods in Non-Proportional Hazards and Non-Linear Survival Analysis | Code | 1
Quadratic Interest Network for Multimodal Click-Through Rate Prediction | Code | 1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation | Code | 1
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams | Code | 1
CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos | Code | 1
A Comprehensive Survey of Synthetic Tabular Data Generation | Code | 1
IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery | Code | 1
Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution | Code | 1
VideoVista-CulturalLingo: 360° Horizons-Bridging Cultures, Languages, and Domains in Video Comprehension | Code | 1
Page 326 of 9,486