SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1660116650 of 474278 papers

TitleStatusHype
Data-Efficient Challenges in Visual Inductive Priors: A Retrospective0
Draft-based Approximate Inference for LLMsCode1
Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting FrameworkCode0
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation ModelsCode3
From Legal Texts to Defeasible Deontic Logic via LLMs: A Study in Automated Semantic Analysis0
Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations0
Enhancing Video Memorability Prediction with Text-Motion Cross-modal Contrastive Loss and Its Application in Video Summarization0
RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping0
SurfR: Surface Reconstruction with Multi-scale Attention0
LLaVA-c: Continual Improved Visual Instruction Tuning0
CanadaFireSat: Toward high-resolution wildfire forecasting with multiple modalities0
Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting0
HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation0
Cross-Spectral Body Recognition with Side Information Embedding: Benchmarks on LLCM and Analyzing Range-Induced Occlusions on IJB-MDF0
ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations0
Towards Robust Real-World Multivariate Time Series Forecasting: A Unified Framework for Dependency, Asynchrony, and Missingness0
Rethinking Range-View LiDAR Segmentation in Adverse Weather0
Generalizable Articulated Object Reconstruction from Casually Captured RGBD Videos0
MAMBO: High-Resolution Generative Approach for Mammography Images0
Can LLMs Ground when they (Don't) Know: A Study on Direct and Loaded Political Questions0
Structured Variational D-Decomposition for Accurate and Stable Low-Rank Approximation0
PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly0
On The Impact of Merge Request Deviations on Code Review Practices0
Explainable Compliance Detection with Multi-Hop Natural Language Inference on Assurance Case Structure0
Teaching Physical Awareness to LLMs through Sounds0
FloorplanMAE:A self-supervised framework for complete floorplan generation from partial inputs0
RHealthTwin: Towards Responsible and Multimodal Digital Twins for Personalized Well-being0
Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task0
Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation0
IntTrajSim: Trajectory Prediction for Simulating Multi-Vehicle driving at Signalized Intersections0
Evaluating Generative Vehicle Trajectory Models for Traffic Intersection Dynamics0
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning0
FROST-EMA: Finnish and Russian Oral Speech Dataset of Electromagnetic Articulography Measurements with L1, L2 and Imitated L2 Accents0
SEMA: a Scalable and Efficient Mamba like Attention via Token Localization and Averaging0
Your Agent Can Defend Itself against Backdoor Attacks0
SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models0
Reinforce LLM Reasoning through Multi-Agent Reflection0
Spatiotemporal deep learning models for detection of rapid intensification in cyclones0
HASFL: Heterogeneity-aware Split Federated Learning over Edge Computing Systems0
Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive-k0
Re-Thinking the Automatic Evaluation of Image-Text Alignment in Text-to-Image Models0
DCD: A Semantic Segmentation Model for Fetal Ultrasound Four-Chamber View0
Fairness is Not Silence: Unmasking Vacuous Neutrality in Small Language Models0
MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding0
TrajFlow: Multi-modal Motion Prediction via Flow Matching0
Flow-Lenia: Emergent evolutionary dynamics in mass conservative continuous cellular automata0
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation0
Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization0
Societal AI Research Has Become Less Interdisciplinary0
Multimodal Representation Alignment for Cross-modal Information Retrieval0
Show:102550
← PrevPage 333 of 9486Next →