SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 20551 to 20600 of 474278 papers

Title | Status | Hype
My Answer Is NOT 'Fair': Mitigating Social Bias in Vision-Language Models via Fair and Biased Residuals | - | 0
SEMFED: Semantic-Aware Resource-Efficient Federated Learning for Heterogeneous NLP Tasks | - | 0
Zero-Trust Foundation Models: A New Paradigm for Secure and Collaborative Artificial Intelligence for Internet of Things | - | 0
Emergent LLM behaviors are observationally equivalent to data leakage | Code | 0
USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models | Code | 0
MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning | - | 0
Emotion Classification In-Context in Spanish | - | 0
Retrieval Visual Contrastive Decoding to Mitigate Object Hallucinations in Large Vision-Language Models | Code | 0
Risk-aware Direct Preference Optimization under Nested Risk Measure | Code | 0
CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting | - | 0
The NaijaVoices Dataset: Cultivating Large-Scale, High-Quality, Culturally-Rich Speech Data for African Languages | - | 0
ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking | Code | 1
Kernel Quantile Embeddings and Associated Probability Metrics | Code | 0
Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction | Code | 1
Learning with Expected Signatures: Theory and Applications | - | 0
Ctrl-DNA: Controllable Cell-Type-Specific Regulatory DNA Design via Constrained RL | Code | 1
OpenNIRScap: An Open-Source, Low-Cost Wearable Near-Infrared Spectroscopy-based Brain Interfacing Cap | Code | 1
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation | Code | 1
Covariate-Adjusted Deep Causal Learning for Heterogeneous Panel Data Models | - | 0
A Characterization of Reny's Weakly Sequentially Rational Equilibrium through ε-Perfect γ-Weakly Sequentially Rational Equilibrium | - | 0
DuRep: Dual-Mode Speech Representation Learning via ASR-Aware Distillation | - | 0
Leveraging Cascaded Binary Classification and Multimodal Fusion for Dementia Detection through Spontaneous Speech | - | 0
Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection | - | 0
ReverbFX: A Dataset of Room Impulse Responses Derived from Reverb Effect Plugins for Singing Voice Dereverberation | - | 0
Holes in Latent Space: Topological Signatures Under Adversarial Influence | - | 0
Avoid Forgetting by Preserving Global Knowledge Gradients in Federated Learning with Non-IID Data | - | 0
Stochastic Preconditioning for Neural Field Optimization | - | 0
ART-DECO: Arbitrary Text Guidance for 3D Detailizer Construction | - | 0
The challenge of hidden gifts in multi-agent reinforcement learning | - | 0
Reconceptualizing Smart Microscopy: From Data Collection to Knowledge Creation by Multi-Agent Integration | - | 0
The Impact of a Chatbot's Ephemerality-Framing on Self-Disclosure Perceptions | - | 0
Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework | - | 0
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models | - | 0
MetaSTNet: Multimodal Meta-learning for Cellular Traffic Conformal Prediction | - | 0
ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image | - | 0
Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners | - | 0
Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review | - | 0
Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments | - | 0
In-context learning capabilities of Large Language Models to detect suicide risk among adolescents from speech transcripts | - | 0
Robust fine-tuning of speech recognition models via model merging: application to disordered speech | - | 0
Large Language Models for IT Automation Tasks: Are We There Yet? | - | 0
Synergising Hierarchical Data Centers and Power Networks: A Privacy-Preserving Approach | - | 0
Byzantine-Resilient Distributed P2P Energy Trading via Spatial-Temporal Anomaly Detection | - | 0
Algorithmic Control Improves Residential Building Energy and EV Management when PV Capacity is High but Battery Capacity is Low | - | 0
AstroVisBench: A Code Benchmark for Scientific Computing and Visualization in Astronomy | - | 0
CardioPatternFormer: Pattern-Guided Attention for Interpretable ECG Classification with Transformer Architecture | - | 0
BrainStratify: Coarse-to-Fine Disentanglement of Intracranial Neural Dynamics | - | 0
Federated Learning-Distillation Alternation for Resource-Constrained IoT | - | 0
WeatherEdit: Controllable Weather Editing with 4D Gaussian Field | Code | 2
Intraday Functional PCA Forecasting of Cryptocurrency Returns | - | 0
Page 412 of 9486