SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1960119650 of 474278 papers

TitleStatusHype
Heterogeneous Federated Learning: State-of-the-art and Research ChallengesCode1
Hyper-Representations as Generative Models: Sampling Unseen Neural Network WeightsCode1
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial StatementsCode1
Navigating the GAN Parameter Space for Semantic Image EditingCode1
Occlusion-aware Non-Rigid Point Cloud Registration via Unsupervised Neural Deformation CorrentropyCode1
Physics-constrained convolutional neural networks for inverse problems in spatiotemporal partial differential equationsCode1
LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New DatasetCode1
DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and ClassificationCode1
Automatic Change-Point Detection in Time Series via Deep LearningCode1
Optimization-Free Test-Time Adaptation for Cross-Person Activity RecognitionCode1
CounselBench: A Large-Scale Expert Evaluation and Adversarial Benchmark of Large Language Models in Mental Health CounselingCode1
FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language ModelsCode1
Discovering Objects that Can MoveCode1
Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary PerspectiveCode1
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion PredictionCode1
Hierarchical Disentanglement-Alignment Network for Robust SAR Vehicle RecognitionCode1
Deep learning-based Crop Row Detection for Infield Navigation of Agri-RobotsCode1
Prediction-Guided Distillation for Dense Object DetectionCode1
Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual GroundingCode1
PointCaM: Cut-and-Mix for Open-Set Point Cloud LearningCode1
Conformal Prediction for Zero-Shot ModelsCode1
CASS: Cross Architectural Self-Supervision for Medical Image AnalysisCode1
Traceable Federated Continual LearningCode1
Distilling Model Failures as Directions in Latent SpaceCode1
Distillation and Refinement of Reasoning in Small Language Models for Document Re-rankingCode1
Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent CommunitiesCode1
Contextual Instance Decoupling for Robust Multi-Person Pose EstimationCode1
EgoVSR: Towards High-Quality Egocentric Video Super-ResolutionCode1
Referring Image Segmentation Using Text SupervisionCode1
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct SolutionsCode1
FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone NavigationCode1
Enhancing Multimodal Large Language Models Complex Reason via Similarity ComputationCode1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
ModDrop++: A Dynamic Filter Network with Intra-subject Co-training for Multiple Sclerosis Lesion Segmentation with Missing ModalitiesCode1
CDAC: Cross-domain Attention Consistency in Transformer for Domain Adaptive Semantic SegmentationCode1
COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMsCode1
Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization MicroscopyCode1
ASCON: Anatomy-aware Supervised Contrastive Learning Framework for Low-dose CT DenoisingCode1
ViDeBERTa: A powerful pre-trained language model for VietnameseCode1
Semantic Drift Compensation for Class-Incremental LearningCode1
On the Effectiveness of Spectral Discriminators for Perceptual Quality ImprovementCode1
3D Interaction Geometric Pre-training for Molecular Relational LearningCode1
Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data AugmentationCode1
NeuRI: Diversifying DNN Generation via Inductive Rule InferenceCode1
MG-TAR: Multi-View Graph Convolutional Networks for Traffic Accident Risk PredictionCode1
Capsules with Inverted Dot-Product Attention RoutingCode1
Lenna: Language Enhanced Reasoning Detection AssistantCode1
Preference-grounded Token-level Guidance for Language Model Fine-tuningCode1
Aligning Large Language Models through Synthetic FeedbackCode1
CVPT: Cross-Attention help Visual Prompt Tuning adapt visual taskCode1
Show:102550
← PrevPage 393 of 9486Next →