SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1955119600 of 474278 papers

TitleStatusHype
Multi-Agent Environments for Vehicle Routing ProblemsCode1
Why do language models perform worse for morphologically complex languages?Code1
Hugging Rain Man: A Novel Facial Action Units Dataset for Analyzing Atypical Facial Expressions in Children with Autism Spectrum DisorderCode1
HARP: A Large-Scale Higher-Order Ambisonic Room Impulse Response DatasetCode1
Detecting Human Artifacts from Text-to-Image ModelsCode1
Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction ProblemsCode1
Planning-Driven Programming: A Large Language Model Programming WorkflowCode1
Neuromorphic Attitude Estimation and ControlCode1
Revisiting the Integration of Convolution and Attention for Vision BackboneCode1
UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource LanguagesCode1
G-RAG: Knowledge Expansion in Material ScienceCode1
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic SegmentationCode1
RestorerID: Towards Tuning-Free Face Restoration with ID PreservationCode1
StackEval: Benchmarking LLMs in Coding AssistanceCode1
Quantization without TearsCode1
Breaking Information Cocoons: A Hyperbolic Graph-LLM Framework for Exploration and Exploitation in Recommender SystemsCode1
Zero-Shot Low-Light Image Enhancement via Joint Frequency Domain Priors Guided DiffusionCode1
CODE-CL: Conceptor-Based Gradient Projection for Deep Continual LearningCode1
Regional Attention for Shadow RemovalCode1
Learning to Cooperate with Humans using Generative AgentsCode1
Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error FeedbackCode1
UniFlow: A Foundation Model for Unified Urban Spatio-Temporal Flow PredictionCode1
On the Consistency of Video Large Language Models in Temporal ComprehensionCode1
Attentive Contextual Attention for Cloud RemovalCode1
WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous DrivingCode1
Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing ImagesCode1
OpenMS WebApps: Building User-Friendly Solutions for MS AnalysisCode1
CryptoFormalEval: Integrating LLMs and Formal Verification for Automated Cryptographic Protocol Vulnerability DetectionCode1
Globally Correlation-Aware Hard Negative GenerationCode1
Robust Planning with Compound LLM Architectures: An LLM-Modulo ApproachCode1
DRL-Based Optimization for AoI and Energy Consumption in C-V2X Enabled IoVCode1
Unsupervised Homography Estimation on Multimodal Image Pair via Alternating OptimizationCode1
DATTA: Domain-Adversarial Test-Time Adaptation for Cross-Domain WiFi-Based Human Activity RecognitionCode1
Unsupervised Foundation Model-Agnostic Slide-Level Representation LearningCode1
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic SegmentationCode1
DT-LSD: Deformable Transformer-based Line Segment DetectionCode1
DrugGen: Advancing Drug Discovery with Large Language Models and Reinforcement Learning FeedbackCode1
LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image CompressionCode1
X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose EstimationCode1
OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater ImagingCode1
Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait SynthesisCode1
Teaching VLMs to Localize Specific Objects from In-context ExamplesCode1
Translating Electrocardiograms to Cardiac Magnetic Resonance Imaging Useful for Cardiac Assessment and Disease Screening: A Multi-Center Study AI for ECG to CMR Translation StudyCode1
Selective Attention: Enhancing Transformer through Principled Context ControlCode1
Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive TestingCode1
PyAWD: A Library for Generating Large Synthetic Datasets of Acoustic Wave Propagation with DevitoCode1
CDI: Copyrighted Data Identification in Diffusion ModelsCode1
NPGPT: Natural Product-Like Compound Generation with GPT-based Chemical Language ModelsCode1
UrbanDiT: A Foundation Model for Open-World Urban Spatio-Temporal LearningCode1
LEDRO: LLM-Enhanced Design Space Reduction and Optimization for Analog CircuitsCode1
Show:102550
← PrevPage 392 of 9486Next →