SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2175121800 of 474278 papers

TitleStatusHype
Toward Enhancing Vehicle Color Recognition in Adverse Conditions: A Dataset and BenchmarkCode1
Low-Light Object Tracking: A BenchmarkCode1
Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of DistractorsCode1
Great Memory, Shallow Reasoning: Limits of kNN-LMsCode1
CHOTA: A Higher Order Accuracy Metric for Cell TrackingCode1
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and GenerationCode1
FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual LocalizationCode1
Sum of Squares CircuitsCode1
TWLV-I: Analysis and Insights from Holistic Evaluation on Video Foundation ModelsCode1
MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt TuningCode1
Interpretable Long-term Action Quality AssessmentCode1
A Benchmark for AI-based Weather Data AssimilationCode1
NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei SegmentationCode1
CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity PredictionCode1
OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts RemovalCode1
V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?Code1
A toolbox for calculating objective image properties in aesthetics researchCode1
Security Attacks on LLM-based Code Completion ToolsCode1
Makeup-Guided Facial Privacy Protection via Untrained Neural Network PriorsCode1
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene UnderstandingCode1
Event Stream based Sign Language Translation: A High-Definition Benchmark Dataset and A New AlgorithmCode1
Training Matting Models without Alpha LabelsCode1
Generalizable Facial Expression RecognitionCode1
Navigating Spatio-Temporal Heterogeneity: A Graph Transformer Approach for Traffic ForecastingCode1
Neural Exploratory Landscape Analysis for Meta-Black-Box-OptimizationCode1
Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic SegmentationCode1
MUSES: 3D-Controllable Image Generation via Multi-Modal Agent CollaborationCode1
Multi-view Hand Reconstruction with a Point-Embedded TransformerCode1
CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese NetworkCode1
TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer LearningCode1
DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News DetectionCode1
Recurrent Neural Networks Learn to Store and Generate Sequences using Non-Linear RepresentationsCode1
EPiC: Cost-effective Search-based Prompt Engineering of LLMs for Code GenerationCode1
Hologram Reasoning for Solving Algebra Problems with Geometry DiagramsCode1
SubgoalXL: Subgoal-based Expert Learning for Theorem ProvingCode1
An Efficient Sign Language Translation Using Spatial Configuration and Motion Dynamics with LLMsCode1
SenPa-MAE: Sensor Parameter Aware Masked Autoencoder for Multi-Satellite Self-Supervised PretrainingCode1
SysBench: Can Large Language Models Follow System Messages?Code1
Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question AnsweringCode1
Prompt-Agnostic Adversarial Perturbation for Customized Diffusion ModelsCode1
MUSE: Mamba is Efficient Multi-scale Learner for Text-video RetrievalCode1
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language ModelsCode1
Language Modeling on Tabular Data: A Survey of Foundations, Techniques and EvolutionCode1
Wave-Mask/Mix: Exploring Wavelet-Based Augmentations for Time Series ForecastingCode1
ViLReF: An Expert Knowledge Enabled Vision-Language Retinal Foundation ModelCode1
MPL: Lifting 3D Human Pose from Multi-view 2D PosesCode1
Task-level Distributionally Robust Optimization for Large Language Model-based Dense RetrievalCode1
Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image EnhancementCode1
CHECKWHY: Causal Fact Verification via Argument StructureCode1
Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring TechniqueCode1
Show:102550
← PrevPage 436 of 9486Next →