SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1835118400 of 474278 papers

TitleStatusHype
PIXELS: Progressive Image Xemplar-based Editing with Latent SurgeryCode1
A Simple Graph Contrastive Learning Framework for Short Text ClassificationCode1
ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark DatasetCode1
Neural Honeytrace: A Robust Plug-and-Play Watermarking Framework against Model Extraction AttacksCode1
FLOL: Fast Baselines for Real-World Low-Light EnhancementCode1
Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part SegmentationCode1
NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision ProcessesCode1
Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective ScenesCode1
LAVCap: LLM-based Audio-Visual Captioning using Optimal TransportCode1
Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial NetworksCode1
BN-Pool: a Bayesian Nonparametric Approach to Graph PoolingCode1
A Study of In-Context-Learning-Based Text-to-SQL ErrorsCode1
FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time TrainingCode1
Towards Robust and Realistic Human Pose Estimation via WiFi SignalsCode1
Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis PlanningCode1
GRAPPA - A Hybrid Graph Neural Network for Predicting Pure Component Vapor PressuresCode1
Multimodal LLMs Can Reason about Aesthetics in Zero-ShotCode1
Efficient Traffic Prediction Through Spatio-Temporal DistillationCode1
CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random WalksCode1
Score-based 3D molecule generation with neural fieldsCode1
WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher LearningCode1
ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of MindCode1
MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense AnticipationCode1
Enhancing Graph Representation Learning with Localized Topological FeaturesCode1
Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy HessiansCode1
Generative diffusion model with inverse renormalization group flowsCode1
MeshMask: Physics-Based Simulations with Masked Graph Neural NetworksCode1
NeurOp-Diff:Continuous Remote Sensing Image Super-Resolution via Neural Operator DiffusionCode1
GOTPR: General Outdoor Text-based Place Recognition Using Scene Graph Retrieval with OpenStreetMapCode1
Knowledge Graph-based Retrieval-Augmented Generation for Schema MatchingCode1
DualOpt: A Dual Divide-and-Optimize Algorithm for the Large-scale Traveling Salesman ProblemCode1
SwinTExCo: Exemplar-based video colorization using Swin TransformerCode1
Learning Motion and Temporal Cues for Unsupervised Video Object SegmentationCode1
CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement LearningCode1
Enhancing the De-identification of Personally Identifiable Information in Educational DataCode1
CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code GenerationCode1
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene UnderstandingCode1
Advancing Semantic Future Prediction through Multimodal Visual Sequence TransformersCode1
Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual AwarenessCode1
Poseidon: A ViT-based Architecture for Multi-Frame Pose Estimation with Adaptive Frame Weighting and Multi-Scale Feature FusionCode1
EmoNeXt: an Adapted ConvNeXt for Facial Emotion RecognitionCode1
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African LanguagesCode1
GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion GenerationCode1
A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction FollowingCode1
D^2-DPM: Dual Denoising for Quantized Diffusion Probabilistic ModelsCode1
AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual SegmentationCode1
Gandalf the Red: Adaptive Security for LLMsCode1
Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-BoundCode1
An Adaptive Orthogonal Convolution Scheme for Efficient and Flexible CNN ArchitecturesCode1
Enhancing Automated Interpretability with Output-Centric Feature DescriptionsCode1
Show:102550
← PrevPage 368 of 9486Next →