SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1910119150 of 474278 papers

TitleStatusHype
Dynamic Malware Classification of Windows PE Files using CNNs and Greyscale Images Derived from Runtime API Call Argument Conversion0
Towards Scalable Schema Mapping using Large Language Models0
HESEIA: A community-based dataset for evaluating social biases in large language models, co-designed in real school settings in Latin America0
Robust Federated Learning against Model Perturbation in Edge Networks0
Online Fair Division with Additional Information0
Guiding Generative Storytelling with Knowledge Graphs0
Coordinated Beamforming for RIS-Empowered ISAC Systems over Secure Low-Altitude Networks0
Interactive Video Generation via Domain Adaptation0
DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation0
Bi-Manual Joint Camera Calibration and Scene Representation0
DiG-Net: Enhancing Quality of Life through Hyper-Range Dynamic Gesture Recognition in Assistive Robotics0
Black-box Adversarial Attacks on CNN-based SLAM Algorithms0
SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping0
Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction0
MGS3: A Multi-Granularity Self-Supervised Code Search Framework0
Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation0
SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems0
Bootstrapping LLM Robustness for VLM Safety via Reducing the Pretraining Modality Gap0
E^2GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness0
Three Kinds of Negation in Knowledge and Their Mathematical Foundations0
How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning0
P: A Universal Measure of Predictive Intelligence0
Mixture-of-Experts for Personalized and Semantic-Aware Next Location Prediction0
Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning0
AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models0
The Butterfly Effect in Pathology: Exploring Security in Pathology Foundation ModelsCode0
Attractor learning for spatiotemporally chaotic dynamical systems using echo state networks with transfer learning0
Beyond Linear Steering: Unified Multi-Attribute Control for Language Models0
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs0
Sparsity-Driven Parallel Imaging Consistency for Improved Self-Supervised MRI Reconstruction0
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation0
RCCDA: Adaptive Model Updates in the Presence of Concept Drift under a Constrained Resource Budget0
LKD-KGC: Domain-Specific KG Construction via LLM-driven Knowledge Dependency Parsing0
Towards Unified Modeling in Federated Multi-Task Learning via Subspace Decoupling0
Benchmarking Foundation Models for Zero-Shot Biometric Tasks0
Reasoning Can Hurt the Inductive Abilities of Large Language Models0
LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework0
Faithful and Robust LLM-Driven Theorem Proving for NLI Explanations0
Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering0
The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It0
SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification0
CREFT: Sequential Multi-Agent LLM for Character Relation Extraction0
Boosting Automatic Exercise Evaluation Through Musculoskeletal Simulation-Based IMU Data Augmentation0
Localizing Persona Representations in LLMs0
Deep Learning Weather Models for Subregional Ocean Forecasting: A Case Study on the Canary Current Upwelling System0
Deformable Attention Mechanisms Applied to Object Detection, case of Remote Sensing0
Object Centric Concept Bottlenecks0
Cross-Attention Speculative Decoding0
Eye of Judgement: Dissecting the Evaluation of Russian-speaking LLMs with POLLUX0
Cloud Optical Thickness Retrievals Using Angle Invariant Attention Based Deep Learning Models0
Show:102550
← PrevPage 383 of 9486Next →