SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2140121450 of 474278 papers

TitleStatusHype
Dual-stream Feature Augmentation for Domain GeneralizationCode1
Hybrid Cost Volume for Memory-Efficient Optical FlowCode1
Residual Stream Analysis with Multi-Layer SAEsCode1
SPACE: A Python-based Simulator for Evaluating Decentralized Multi-Robot Task Allocation AlgorithmsCode1
Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain AdaptationCode1
Learning to Solve Combinatorial Optimization under Positive Linear Constraints via Non-Autoregressive Neural NetworksCode1
Sparse Rewards Can Self-Train Dialogue AgentsCode1
CISCA and CytoDArk0: a Cell Instance Segmentation and Classification method for histo(patho)logical image Analyses and a new, open, Nissl-stained dataset for brain cytoarchitecture studiesCode1
AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language ModelCode1
Accelerating Training with Neuron Interaction and Nowcasting NetworksCode1
Refining Wikidata Taxonomy using Large Language ModelsCode1
SDformerFlow: Spatiotemporal swin spikeformer for event-based optical flow estimationCode1
Diagram Formalization Enhanced Multi-Modal Geometry Problem SolverCode1
EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-ResolutionCode1
BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer TrainingCode1
3D-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric PriorsCode1
Efficient Training of Large Vision Models via Advanced Automated Progressive LearningCode1
LITE: A Paradigm Shift in Multi-Object Tracking with Efficient ReID Feature IntegrationCode1
Enhancing Uncertainty Quantification in Drug Discovery with Censored Regression LabelsCode1
CoxKAN: Kolmogorov-Arnold Networks for Interpretable, High-Performance Survival AnalysisCode1
Operator Learning with Gaussian ProcessesCode1
Practical Forecasting of Cryptocoins Timeseries using Correlation PatternsCode1
MouseSIS: A Frames-and-Events Dataset for Space-Time Instance Segmentation of MiceCode1
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any ArchitectureCode1
Debate on Graph: a Flexible and Reliable Reasoning Framework for Large Language ModelsCode1
Parallel AutoRegressive Models for Multi-Agent Combinatorial OptimizationCode1
Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilitiesCode1
Few-shot Adaptation of Medical Vision-Language ModelsCode1
HUMOS: Human Motion Model Conditioned on Body ShapeCode1
UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing ImagesCode1
Towards Autonomous Cybersecurity: An Intelligent AutoML Framework for Autonomous Intrusion DetectionCode1
Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface ReconstructionCode1
Sirius: Contextual Sparsity with Correction for Efficient LLMsCode1
KAN See In the DarkCode1
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality DataCode1
LowFormer: Hardware Efficient Design for Convolutional Transformer BackbonesCode1
MAS4POI: a Multi-Agents Collaboration System for Next POI RecommendationCode1
Planning In Natural Language Improves LLM Search For Code GenerationCode1
Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image SegmentationCode1
LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-ResolutionCode1
Evaluating Open-Source Sparse Autoencoders on Disentangling Factual Knowledge in GPT-2 SmallCode1
Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and EvaluationCode1
iSeg: An Iterative Refinement-based Framework for Training-free SegmentationCode1
"Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced DistillationCode1
Rethinking HTG Evaluation: Bridging Generation and RecognitionCode1
UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse ViewsCode1
ExpLLM: Towards Chain of Thought for Facial Expression RecognitionCode1
Evaluation Study on SAM 2 for Class-agnostic Instance-level SegmentationCode1
RouterRetriever: Routing over a Mixture of Expert Embedding ModelsCode1
Explainable AI for computational pathology identifies model limitations and tissue biomarkersCode1
Show:102550
← PrevPage 429 of 9486Next →