SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2170121750 of 474278 papers

TitleStatusHype
ml_edm package: a Python toolkit for Machine Learning based Early Decision MakingCode1
Staircase Cascaded Fusion of Lightweight Local Pattern Recognition and Long-Range Dependencies for Structural Crack SegmentationCode1
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation ModelsCode1
MedDec: A Dataset for Extracting Medical Decisions from Discharge SummariesCode1
Causal-Guided Active Learning for Debiasing Large Language ModelsCode1
Multivariate Time-Series Anomaly Detection based on Enhancing Graph Attention Networks with Topological AnalysisCode1
O-Mamba: O-shape State-Space Model for Underwater Image EnhancementCode1
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive SurveyCode1
Enhancing Knowledge Tracing with Concept Map and Response DisentanglementCode1
T3M: Text Guided 3D Human Motion Synthesis from SpeechCode1
Memory-Efficient LLM Training with Online Subspace DescentCode1
MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM AgentsCode1
RoundTable: Leveraging Dynamic Schema and Contextual Autocomplete for Enhanced Query Precision in Tabular Question AnsweringCode1
Search-Based LLMs for Code OptimizationCode1
VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors EmbeddingCode1
ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor ReconstructionCode1
Contrastive Representation Learning for Dynamic Link Prediction in Temporal NetworksCode1
An Evaluation of Deep Learning Models for Stock Market Trend PredictionCode1
Tackling Data Heterogeneity in Federated Learning via Loss DecompositionCode1
SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination DetectionCode1
Scribbles for All: Benchmarking Scribble Supervised Segmentation Across DatasetsCode1
ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving ScenesCode1
Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and VisualAnalysis StrategyCode1
Non-Homophilic Graph Pre-Training and Prompt LearningCode1
OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and FusionCode1
GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language ModelsCode1
Generalized SAM: Efficient Fine-Tuning of SAM for Variable Input Image SizesCode1
UMAD: University of Macau Anomaly Detection Benchmark DatasetCode1
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language ModelsCode1
FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image EditingCode1
Quantization-aware Matrix Factorization for Low Bit Rate Image CompressionCode1
Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and TransformersCode1
Cross-Domain Foundation Model Adaptation: Pioneering Computer Vision Models for Geophysical Data AnalysisCode1
Self-Learning for Personalized Keyword Spotting on Ultra-Low-Power Audio SensorsCode1
Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video EnhancementCode1
A Benchmark for AI-based Weather Data AssimilationCode1
TWLV-I: Analysis and Insights from Holistic Evaluation on Video Foundation ModelsCode1
CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity PredictionCode1
Sum of Squares CircuitsCode1
Interpretable Long-term Action Quality AssessmentCode1
OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts RemovalCode1
MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt TuningCode1
Approaching Deep Learning through the Spectral Dynamics of WeightsCode1
Does It Look Sequential? An Analysis of Datasets for Evaluation of Sequential RecommendationsCode1
FocusLLM: Precise Understanding of Long Context by Dynamic CondensingCode1
Great Memory, Shallow Reasoning: Limits of kNN-LMsCode1
MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMsCode1
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and GenerationCode1
Positional Prompt Tuning for Efficient 3D Representation LearningCode1
SCANS: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation SteeringCode1
Show:102550
← PrevPage 435 of 9486Next →