SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1865118700 of 474278 papers

TitleStatusHype
Confidence intervals for forced alignment boundaries using model ensemblesCode0
System Calls for Malware Detection and Classification: Methodologies and Applications0
Memory Access Characterization of Large Language Models in CPU Environment and its Potential Impacts0
Sensitivity-Aware Density Estimation in Multiple Dimensions0
The Promise of Spiking Neural Networks for Ubiquitous Computing: A Survey and New Perspectives0
Not All Jokes Land: Evaluating Large Language Models Understanding of Workplace Humor0
SOC-DGL: Social Interaction Behavior Inspired Dual Graph Learning Framework for Drug-Target Interaction IdentificationCode0
Reasoning-Table: Exploring Reinforcement Learning for Table ReasoningCode2
Synthesis of discrete-continuous quantum circuits with multimodal diffusion modelsCode2
STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent FrameworkCode1
EfficientFER: EfficientNetv2 Based Deep Learning Approach for Facial Expression RecognitionCode1
Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian LawsCode0
Compiler Optimization via LLM Reasoning for Efficient Model ServingCode2
FlexSelect: Flexible Token Selection for Efficient Long Video Understanding0
L3A: Label-Augmented Analytic Adaptation for Multi-Label Class Incremental LearningCode0
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document ParsingCode0
Affordance Benchmark for MLLMsCode0
Predicting Empirical AI Research Outcomes with Language Models0
PFMBench: Protein Foundation Model BenchmarkCode1
LD-RPMNet: Near-Sensor Diagnosis for Railway Point Machines0
Self-Supervised-ISAR-Net Enables Fast Sparse ISAR Imaging0
Near-Field Directional Modulation for RIS-Aided Movable Antenna MIMO Systems with Hardware Impairments0
ModuLM: Enabling Modular and Multimodal Molecular Relational Learning with Large Language Models0
ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation0
Uncertainty-Aware Metabolic Stability Prediction with Dual-View Contrastive Learning0
Explainable-AI powered stock price prediction using time series transformers: A Case Study on BIST1000
A Group-Wise Narrow Beam Design for Uplink Channel Estimation in Hybrid Beamforming Systems0
Bridging Quantum and Classical Computing in Drug Design: Architecture Principles for Improved Molecule GenerationCode0
TRUST -- Transformer-Driven U-Net for Sparse Target Recovery0
anyECG-chat: A Generalist ECG-MLLM for Flexible ECG Input and Multi-Task Understanding0
Training Beam Design for Channel Estimation in Hybrid mmWave MIMO Systems0
ProtInvTree: Deliberate Protein Inverse Folding with Reward-guided Tree Search0
Projection Pursuit Density Ratio Estimation0
Evaluating the Unseen Capabilities: How Many Theorems Do LLMs Know?0
Uncovering Bias Mechanisms in Observational Studies0
A Reinforcement Learning Approach for RIS-aided Fair Communications0
Protap: A Benchmark for Protein Modeling on Realistic Downstream ApplicationsCode1
Can AI Master Econometrics? Evidence from Econometrics AI Agent on Expert-Level Tasks0
Designing DSIC Mechanisms for Data Sharing in the Era of Large Language Models0
NR4DER: Neural Re-ranking for Diversified Exercise RecommendationCode0
Fast or Slow? Integrating Fast Intuition and Deliberate Thinking for Enhancing Visual Question Answering0
GThinker: Towards General Multimodal Reasoning via Cue-Guided RethinkingCode0
SocialEval: Evaluating Social Intelligence of Large Language ModelsCode0
Probing Neural Topology of Large Language ModelsCode0
SealQA: Raising the Bar for Reasoning in Search-Augmented Language ModelsCode3
DeepVerse: 4D Autoregressive Video Generation as a World Model0
Pi-SQL: Enhancing Text-to-SQL with Fine-Grained Guidance from Pivot Programming Languages0
Fighting Fire with Fire (F3): A Training-free and Efficient Visual Adversarial Example Purification Method in LVLMs0
A Review on Coarse to Fine-Grained Animal Action Recognition0
AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature PromptingCode0
Show:102550
← PrevPage 374 of 9486Next →