SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 77017750 of 177340 papers

TitleStatusHype
Graph Neural Networks for Learning Equivariant Representations of Neural NetworksCode2
Diversified and Personalized Multi-rater Medical Image SegmentationCode2
A Multimodal Vision Foundation Model for Clinical DermatologyCode2
Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image FusionCode2
AID: Attention Interpolation of Text-to-Image DiffusionCode2
Efficient Image Pre-Training with Siamese Cropped Masked AutoencodersCode2
Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion ModelCode2
Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene AffordanceCode2
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTsCode2
Infrared Small Target Detection with Scale and Location SensitivityCode2
Pre-trained Vision and Language Transformers Are Few-Shot Incremental LearnersCode2
Linear Attention Sequence ParallelismCode2
LeGrad: An Explainability Method for Vision Transformers via Feature Formation SensitivityCode2
OmniGS: Fast Radiance Field Reconstruction using Omnidirectional Gaussian SplattingCode2
DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object DetectionCode2
Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular StructuresCode2
RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length VideosCode2
SmartControl: Enhancing ControlNet for Handling Rough Visual ConditionsCode2
The CAST package for training and assessment of spatial prediction models in RCode2
Manipulating Large Language Models to Increase Product VisibilityCode2
DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering DocumentationCode2
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context ExamplesCode2
GoMVS: Geometrically Consistent Cost Aggregation for Multi-View StereoCode2
SFSORT: Scene Features-based Simple Online Real-Time TrackerCode2
Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image SegmentationCode2
LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence ParallelismCode2
NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and ResultsCode2
Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content ReferencesCode2
LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query EfficiencyCode2
An empirical study of LLaMA3 quantization: from LLMs to MLLMsCode2
JGLUE: Japanese General Language Understanding EvaluationCode2
Two Tales of Persona in LLMs: A Survey of Role-Playing and PersonalizationCode2
Gradformer: Graph Transformer with Exponential DecayCode2
Large Language Models for Next Point-of-Interest RecommendationCode2
S^2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image ClassificationCode2
Paint by Inpaint: Learning to Add Image Objects by Removing Them FirstCode2
WorldGPT: Empowering LLM as Multimodal World ModelCode2
GraCo: Granularity-Controllable Interactive SegmentationCode2
FeNNol: an Efficient and Flexible Library for Building Force-field-enhanced Neural Network PotentialsCode2
Time Evidence Fusion Network: Multi-source View in Long-Term Time Series ForecastingCode2
Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale AttentionCode2
OverlapMamba: Novel Shift State Space Model for LiDAR-based Place RecognitionCode2
Evaluation of Retrieval-Augmented Generation: A SurveyCode2
From NeRFs to Gaussian Splats, and BackCode2
Improving Point-based Crowd Counting and Localization Based on Auxiliary Point GuidanceCode2
xFinder: Robust and Pinpoint Answer Extraction for Large Language ModelsCode2
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in MammographyCode2
ProtT3: Protein-to-Text Generation for Text-based Protein UnderstandingCode2
ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous VehiclesCode2
Efficient Visual State Space Model for Image DeblurringCode2
Show:102550
← PrevPage 155 of 3547Next →