SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 22001–22050 of 474278 papers

Title | Status | Hype
Digital Semantic Communications: An Alternating Multi-Phase Training Strategy with Mask Attack | Code | 1
scASDC: Attention Enhanced Structural Deep Clustering for Single-cell RNA-seq Data | Code | 1
Learning Rule-Induced Subgraph Representations for Inductive Relation Prediction | Code | 1
DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts | Code | 1
UGrid: An Efficient-And-Rigorous Neural Multigrid Solver for Linear PDEs | Code | 1
The impact of internal variability on benchmarking deep learning climate emulators | Code | 1
A Jailbroken GenAI Model Can Cause Substantial Harm: GenAI-powered Applications are Vulnerable to PromptWares | Code | 1
UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios | Code | 1
Relevance Filtering for Embedding-based Retrieval | Code | 1
EasyInv: Toward Fast and Better DDIM Inversion | Code | 1
COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis | Code | 1
KIF: Knowledge Identification and Fusion for Language Model Continual Learning | Code | 1
Kolmogorov-Arnold Network for Online Reinforcement Learning | Code | 1
Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery | Code | 1
Higher-order-ReLU-KANs (HRKANs) for solving physics-informed neural networks (PINNs) more accurately, robustly and faster | Code | 1
Improving Ontology Requirements Engineering with OntoChat and Participatory Prompting | Code | 1
GuidedNet: Semi-Supervised Multi-Organ Segmentation via Labeled Data Guide Unlabeled Data | Code | 1
rule4ml: An Open-Source Tool for Resource Utilization and Latency Estimation for ML Models on FPGA | Code | 1
Cell Morphology-Guided Small Molecule Generation with GFlowNets | Code | 1
On the Element-Wise Representation and Reasoning in Zero-Shot Image Recognition: A Systematic Survey | Code | 1
Masked adversarial neural network for cell type deconvolution in spatial transcriptomics | Code | 1
LLMJudge: LLMs for Relevance Judgments | Code | 1
Unsupervised Episode Detection for Large-Scale News Events | Code | 1
PRISM Lite: A lightweight model for interactive 3D placenta segmentation in ultrasound | Code | 1
Unleashing Artificial Cognition: Integrating Multiple AI Systems | Code | 1
Masked Graph Autoencoders with Contrastive Augmentation for Spatially Resolved Transcriptomics Data | Code | 1
Exploring Scalability in Large-Scale Time Series in DeepVATS framework | Code | 1
LiDAR-Event Stereo Fusion with Hallucinations | Code | 1
SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes | Code | 1
Efficient and Accurate Pneumonia Detection Using a Novel Multi-Scale Transformer Approach | Code | 1
Risk and cross validation in ridge regression with correlated samples | Code | 1
A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery | Code | 1
Diffusion Guided Language Modeling | Code | 1
EMTeC: A Corpus of Eye Movements on Machine-Generated Texts | Code | 1
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning | Code | 1
Scalable Transformer for High Dimensional Multivariate Time Series Forecasting | Code | 1
EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI Agents | Code | 1
Tackling Noisy Clients in Federated Learning with End-to-end Label Correction | Code | 1
MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models | Code | 1
Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Code | 1
Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance | Code | 1
pyBregMan: A Python library for Bregman Manifolds | Code | 1
TheGlueNote: Learned Representations for Robust and Flexible Note Alignment | Code | 1
Learning Fine-Grained Grounded Citations for Attributed Large Language Models | Code | 1
Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation | Code | 1
Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness | Code | 1
Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate | Code | 1
Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization | Code | 1
Model-Based Transfer Learning for Contextual Reinforcement Learning | Code | 1
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection | Code | 1
Page 441 of 9486