SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 79518000 of 661570 papers

TitleStatusHype
UniCom: Unified Multimodal Modeling via Compressed Continuous Semantic Representations0
WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation0
Large Language Models as Annotators for Machine Translation Quality Estimation0
eLasmobranc Dataset: An Image Dataset for Elasmobranch Species Recognition and Biodiversity Monitoring0
CacheSolidarity: Preventing Prefix Caching Side Channels in Multi-tenant LLM Serving Systems0
Event-based Photometric Stereo via Rotating Illumination and Per-Pixel Learning0
Deep Randomized Distributed Function Computation (DeepRDFC): Neural Distributed Channel Simulation0
A PUF-Based Approach for Copy Protection of Intellectual Property in Neural Network Models0
Prioritizing Gradient Sign Over Modulus: An Importance-Aware Framework for Wireless Federated Learning0
Phase-Interface Instance Segmentation as a Visual Sensor for Laboratory Process Monitoring0
Interpretable Chinese Metaphor Identification via LLM-Assisted MIPVU Rule Script Generation: A Comparative Protocol Study0
PolGS++: Physically-Guided Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction0
Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services0
Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization0
Evaluating randomized smoothing as a defense against adversarial attacks in trajectory prediction0
ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning0
A dataset of medication images with instance segmentation masks for preventing adverse drug events0
BALD-SAM: Disagreement-based Active Prompting in Interactive Segmentation0
PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words0
Human Presence Detection via Wi-Fi Range-Filtered Doppler Spectrum on Commodity Laptops0
Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis0
From Images to Words: Efficient Cross-Modal Knowledge Distillation to Language Models from Black-box Teachers0
Semantic Landmark Particle Filter for Robot Localisation in Vineyards0
V_0.5: Generalist Value Model as a Prior for Sparse RL Rollouts0
SiDiaC-v.2.0: Sinhala Diachronic Corpus Version 2.00
SNPgen: Phenotype-Supervised Genotype Representation and Synthetic Data Generation via Latent Diffusion0
Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models0
A Hybrid Knowledge-Grounded Framework for Safety and Traceability in Prescription Verification0
When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS0
ECoLAD: Deployment-Oriented Evaluation for Automotive Time-Series Anomaly Detection0
Bridging the Skill Gap in Clinical CBCT Interpretation with CBCTRepD0
LLM2Vec-Gen: Generative Embeddings from Large Language Models2
Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control0
Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density EstimatorsCode0
When should we trust the annotation? Selective prediction for molecular structure retrieval from mass spectra0
Bio-Inspired Self-Supervised Learning for Wrist-worn IMU Signals0
Pointy - A Lightweight Transformer for Point Cloud Foundation ModelsCode0
Contact Coverage-Guided Exploration for General-Purpose Dexterous Manipulation0
Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style0
GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations0
ForwardFlow: Simulation only statistical inference using deep learning0
The Discrete Charm of the MLP: Binary Routing of Continuous Signals in Transformer Feed-Forward Layers0
Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots through LLM-Generated Probes0
MCMC Informed Neural Emulators for Uncertainty Quantification in Dynamical Systems0
Too Vivid to Be Real? Benchmarking and Calibrating Generative Color FidelityCode0
Artificial Intelligence as a Catalyst for Innovation in Software Engineering0
Leech Lattice Vector Quantization for Efficient LLM Compression0
Factorized Neural Implicit DMD for Parametric Dynamics0
Cross-Species Transfer Learning for Electrophysiology-to-Transcriptomics Mapping in Cortical GABAergic Interneurons0
RCTs & Human Uplift Studies: Methodological Challenges and Practical Solutions for Frontier AI Evaluation0
Show:102550
← PrevPage 160 of 13232Next →