SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8801-8850 of 661,570 papers

Title | Status | Hype
Bridging Domains through Subspace-Aware Model Merging | | 0
Reject, Resample, Repeat: Understanding Parallel Reasoning in Language Model Inference | | 0
L^3: Scene-agnostic Visual Localization in the Wild | | 0
Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization | | 0
ER-Pose: Rethinking Keypoint-Driven Representation Learning for Real-Time Human Pose Estimation | | 0
Towards Robust Retrieval-Augmented Generation Based on Knowledge Graph: A Comparative Analysis | | 0
From Word2Vec to Transformers: Text-Derived Composition Embeddings for Filtering Combinatorial Electrocatalysts | | 0
RL unknotter, hard unknots and unknotting number | | 0
An Interpretable Generative Framework for Anomaly Detection in High-Dimensional Financial Time Series | | 0
CAST: Modeling Visual State Transitions for Consistent Video Retrieval | | 0
mmGAT: Pose Estimation by Graph Attention with Mutual Features from mmWave Radar Point Cloud | | 0
SciTaRC: Benchmarking QA on Scientific Tabular Data that Requires Language Reasoning and Complex Computation | | 0
MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention | | 0
A Hybrid Vision Transformer Approach for Mathematical Expression Recognition | | 0
Comparative Analysis of Patch Attack on VLM-Based Autonomous Driving Architectures | | 0
Using Multimodal and Language-Agnostic Sentence Embeddings for Abstractive Summarization | | 0
Research and Prototyping Study of an LLM-Based Chatbot for Electromagnetic Simulations | | 0
Evolution Strategy-Based Calibration for Low-Bit Quantization of Speech Models | | 0
Is continuous CoT better suited for multi-lingual reasoning? | | 0
Quantifying Cross-Lingual Transfer in Paralinguistic Speech Tasks | | 0
ViSA-Enhanced Aerial VLN: A Visual-Spatial Reasoning Enhanced Framework for Aerial Vision-Language Navigation | | 0
SPEX: A Vision-Language Model for Land Cover Extraction on Spectral Remote Sensing Images | Code | 0
Feedback Control for Small Budget Pacing | | 0
Structure and Progress Aware Diffusion for Medical Image Segmentation | | 0
ALOOD: Exploiting Language Representations for LiDAR-based Out-of-Distribution Object Detection | Code | 0
Speed3R: Sparse Feed-forward 3D Reconstruction Models | | 0
C^2FG: Control Classifier-Free Guidance via Score Discrepancy Analysis | | 0
NCL-UoR at SemEval-2026 Task 5: Embedding-Based Methods, Fine-Tuning, and LLMs for Word Sense Plausibility Rating | Code | 0
Beyond the Markovian Assumption: Robust Optimization via Fractional Weyl Integrals in Imbalanced Data | | 0
First-Order Geometry, Spectral Compression, and Structural Compatibility under Bounded Computation | | 0
ImprovedGS+: A High-Performance C++/CUDA Re-Implementation Strategy for 3D Gaussian Splatting | | 0
The Coupling Within: Flow Matching via Distilled Normalizing Flows | | 0
Impact of LLMs news Sentiment Analysis on Stock Price Movement Prediction | | 0
ModalImmune: Immunity Driven Unlearning via Self Destructive Training | | 0
SwiftEmbed: Ultra-Fast Text Embeddings via Static Token Lookup for Real-Time Applications | | 0
Improving Conditional VAE with Non-Volume Preserving transformations | | 0
BRIDGE: Benchmark for multi-hop Reasoning In long multimodal Documents with Grounded Evidence | | 0
Bootstrapping Audiovisual Speech Recognition in Zero-AV-Resource Scenarios with Synthetic Visual Data | | 0
Privacy-Preserving End-to-End Full-Duplex Speech Dialogue Models | | 0
Learning Multiple Utterance-Level Attribute Representations with a Unified Speech Encoder | | 0
Novel Semantic Prompting for Zero-Shot Action Recognition | | 0
Causal Retrieval with Semantic Consideration | | 0
Co-LoRA: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients | | 0
Point-based Instance Completion with Scene Constraints | | 0
From Semantic To Instance: A Semi-Self-Supervised Learning Approach | | 0
Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization | | 0
A Simple "Motivation" Can Enhance Reinforcement Finetuning of Large Reasoning Models | | 0
Noisy PDE Training Requires Bigger PINNs | | 0
Towards Practical Benchmarking of Data Cleaning Techniques: On Generating Authentic Errors via Large Language Models | | 0
Post-Disaster Affected Area Segmentation with a Vision Transformer (ViT)-based EVAP Model using Sentinel-2 and Formosat-5 Imagery | | 0
Page 177 of 13232