SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 17351-17400 of 474278 papers

| Title | Status | Hype |
| --- | --- | --- |
| NSUN2 as a potential prognostic as well as therapeutic target in cancer by regulating m5C modification | | 0 |
| The influence of cell phenotype on collective cell invasion into the extracellular matrix | Code | 0 |
| A Framework for Controllable Multi-objective Learning with Annealed Stein Variational Hypernetworks | | 0 |
| SDS-Net: Shallow-Deep Synergism-detection Network for infrared small target detection | Code | 0 |
| Text-to-LoRA: Instant Transformer Adaption | Code | 0 |
| LETS Forecast: Learning Embedology for Time Series Forecasting | Code | 1 |
| PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time | | 0 |
| BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions | | 0 |
| MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning | | 0 |
| AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification | Code | 1 |
| Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey | | 0 |
| MATP-BENCH: Can MLLM Be a Good Automated Theorem Prover for Multimodal Problems? | | 0 |
| DriveAction: A Benchmark for Exploring Human-like Driving Decisions in VLA Models | | 0 |
| AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search | Code | 0 |
| Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration | | 0 |
| Transformers Beyond Order: A Chaos-Markov-Gaussian Framework for Short-Term Sentiment Forecasting of Any Financial OHLC timeseries Data | | 0 |
| Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models | Code | 1 |
| Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning | Code | 0 |
| FADE: Frequency-Aware Diffusion Model Factorization for Video Editing | Code | 1 |
| STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving | Code | 1 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Code | 1 |
| SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms | | 0 |
| SMAR: Soft Modality-Aware Routing Strategy for MoE-based Multimodal Large Language Models Preserving Language Capabilities | | 0 |
| Assessing the Impact of Anisotropy in Neural Representations of Speech: A Case Study on Keyword Spotting | | 0 |
| An Active Learning-Based Streaming Pipeline for Reduced Data Training of Structure Finding Models in Neutron Diffractometry | Code | 0 |
| Permutation-Free High-Order Interaction Tests | Code | 0 |
| FPDANet: A Multi-Section Classification Model for Intelligent Screening of Fetal Ultrasound | | 0 |
| Implicit Neural Representation-Based MRI Reconstruction Method with Sensitivity Map Constraints | | 0 |
| Reliable Evaluation of MRI Motion Correction: Dataset and Insights | | 0 |
| ResPF: Residual Poisson Flow for Efficient and Physically Consistent Sparse-View CT Reconstruction | | 0 |
| Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues | | 0 |
| Prompting Wireless Networks: Reinforced In-Context Learning for Power Control | | 0 |
| Multi-Modal Large Models Based Beam Prediction: An Example Empowered by DeepSeek | | 0 |
| Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games | | 0 |
| A cautious user's guide in applying HMMs to physical systems | | 0 |
| Functional Architecture of the Human Hypothalamus: Cortical Coupling and Subregional Organization Using 7-Tesla fMRI | | 0 |
| Impact of the WHO's 90-70-90 Strategy on HPV-Related Cervical Cancer Control: A Mathematical Model Evaluation in China | | 0 |
| Spectral Derivatives | Code | 0 |
| NILMFormer: Non-Intrusive Load Monitoring that Accounts for Non-Stationarity | Code | 1 |
| AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition | | 0 |
| Into the Unknown: From Structure to Disorder in Protein Function Prediction | | 0 |
| Integrating Complexity and Biological Realism: High-Performance Spiking Neural Networks for Breast Cancer Detection | | 0 |
| DermaCon-IN: A Multi-concept Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research | Code | 0 |
| AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation | | 0 |
| Neural Responses to Affective Sentences Reveal Signatures of Depression | | 0 |
| Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems | | 0 |
| Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models | | 0 |
| Reconstructing Heterogeneous Biomolecules via Hierarchical Gaussian Mixtures and Part Discovery | | 0 |
| Audio-Aware Large Language Models as Judges for Speaking Styles | | 0 |
| Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition | | 0 |