SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1555115600 of 474278 papers

TitleStatusHype
Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups0
Model-Agnostic, Temperature-Informed Sampling Enhances Cross-Year Crop Mapping with Deep Learning0
eLog analysis for accelerators: status and future outlook0
Rethinking Distributional IVs: KAN-Powered D-IV-LATE & Model Choice0
The Safety Reminder: A Soft Prompt to Reactivate Delayed Safety Awareness in Vision-Language Models0
SmartHome-Bench: A Comprehensive Benchmark for Video Anomaly Detection in Smart Homes Using Multi-Modal Large Language ModelsCode1
Constraint-Guided Prediction Refinement via Deterministic Diffusion Trajectories0
A large-scale, physically-based synthetic dataset for satellite pose estimation0
HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance0
Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors0
Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs0
NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models0
Leveraging MIMIC Datasets for Better Digital Health: A Review on Open Problems, Progress Highlights, and Future Promises0
AFBS:Buffer Gradient Selection in Semi-asynchronous Federated Learning0
Free Privacy Protection for Wireless Federated Learning: Enjoy It or Suffer from It?0
Unsupervised risk factor identification across cancer types and data modalities via explainable artificial intelligence0
MetaEformer: Unveiling and Leveraging Meta-patterns for Complex and Dynamic Systems Load Forecasting0
DiffS-NOCS: 3D Point Cloud Reconstruction through Coloring Sketches to NOCS Maps Using Diffusion Models0
Synesthesia of Machines (SoM)-Enhanced Sub-THz ISAC Transmission for Air-Ground Network0
QFFT, Question-Free Fine-Tuning for Adaptive ReasoningCode2
Active Adversarial Noise Suppression for Image Forgery Localization0
Transforming Chatbot Text: A Sequence-to-Sequence Approach0
CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making0
Federated Neuroevolution O-RAN: Enhancing the Robustness of Deep Reinforcement Learning xApps0
Humanity's Last Code Exam: Can Advanced LLMs Conquer Human's Hardest Code Competition?Code2
CliniDial: A Naturally Occurring Multimodal Dialogue Dataset for Team Reflection in Action During Clinical OperationCode0
Surprise Calibration for Better In-Context Learning0
Structured Program Synthesis using LLMs: Results and Insights from the IPARC Challenge0
Efficient multi-view training for 3D Gaussian Splatting0
Enhancing Clinical Models with Pseudo Data for De-identificationCode0
The Reflexive Integrated Information Unit: A Differentiable Primitive for Artificial ConsciousnessCode0
MLDebugging: Towards Benchmarking Code Debugging Across Multi-Library ScenariosCode0
Hybrid Meta-Learning Framework for Anomaly Forecasting in Nonlinear Dynamical Systems via Physics-Inspired Simulation and Deep Ensembles0
Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language ModelsCode2
Cross-architecture universal feature coding via distribution alignment0
Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models?0
Building Trustworthy AI by Addressing its 16+2 Desiderata with Goal-Directed Commonsense Reasoning0
Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption0
SecurityLingua: Efficient Defense of LLM Jailbreak Attacks via Security-Aware Prompt Compression0
Alphabet Index Mapping: Jailbreaking LLMs through Semantic Dissimilarity0
Rethinking Optimization: A Systems-Based Approach to Social Externalities0
From Experts to a Generalist: Toward General Whole-Body Control for Humanoid Robots0
RL from Physical Feedback: Aligning Large Motion Models with Humanoid Control0
Adapting by Analogy: OOD Generalization of Visuomotor Policies via Functional Correspondence0
Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the TorusCode0
Humans, Machine Learning, and Language Models in Union: A Cognitive Study on Table UnionabilityCode0
SoundMind: RL-Incentivized Logic Reasoning for Audio-Language ModelsCode5
Frequency Dynamic Convolutions for Sound Event Detection0
Automated Risk Management Mechanisms in DeFi Lending Protocols: A Crosschain Comparative Analysis of Aave and CompoundCode0
Latent Representation Learning of Multi-scale Thermophysics: Application to Dynamics in Shocked Porous Energetic MaterialCode0
Show:102550
← PrevPage 312 of 9486Next →