SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1275112800 of 474278 papers

TitleStatusHype
DINO in the Room: Leveraging 2D Foundation Models for 3D SegmentationCode2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation LearningCode2
LLM-PySC2: Starcraft II learning environment for Large Language ModelsCode2
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task AlignmentCode2
Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional RepresentationCode2
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D VisionCode2
Gaussian Shell Maps for Efficient 3D Human GenerationCode2
Holodeck: Language Guided Generation of 3D Embodied AI EnvironmentsCode2
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-ExpertsCode2
Parameter-Efficient Fine-Tuning for Foundation ModelsCode2
LLM4Ranking: An Easy-to-use Framework of Utilizing Large Language Models for Document RerankingCode2
End-to-end Learnable Clustering for Intent Learning in RecommendationCode2
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive DecodingCode2
Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting MaskCode2
Process Reward Models That ThinkCode2
Demonstration of Robust and Efficient Quantum Property Learning with Shallow ShadowsCode2
Efficient, Multimodal, and Derivative-Free Bayesian Inference With Fisher-Rao Gradient FlowsCode2
Comprehending and Ordering Semantics for Image CaptioningCode2
Guide to k-mer approaches for genomics across the tree of lifeCode2
RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone DesignCode2
DINO-Foresight: Looking into the Future with DINOCode2
Healthsheet: Development of a Transparency Artifact for Health DatasetsCode2
Feature Fusion Based on Mutual-Cross-Attention Mechanism for EEG Emotion RecognitionCode2
The Equalization Losses: Gradient-Driven Training for Long-tailed Object RecognitionCode2
Deep Bidirectional Language-Knowledge Graph PretrainingCode2
Model-Based Imitation Learning for Urban DrivingCode2
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked InputsCode2
Foundations and Recent Trends in Multimodal Mobile Agents: A SurveyCode2
Training Deep AutoEncoders for Collaborative FilteringCode2
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image TranslationCode2
S^2IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series ForecastingCode2
Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event DetectionCode2
T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT SegmentationCode2
StyleTalk: One-shot Talking Head Generation with Controllable Speaking StylesCode2
When Spiking neural networks meet temporal attention image decoding and adaptive spiking neuronCode2
Benchmarking Benchmark Leakage in Large Language ModelsCode2
K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather ConditionsCode2
Kinetics: Rethinking Test-Time Scaling LawsCode2
Spatial-Semantic Collaborative Cropping for User Generated ContentCode2
Automatic and Universal Prompt Injection Attacks against Large Language ModelsCode2
The GigaMIDI Dataset with Features for Expressive Music Performance DetectionCode2
Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI CollaborationCode2
ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary PlanningCode2
OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy PredictionCode2
Atlas: Few-shot Learning with Retrieval Augmented Language ModelsCode2
PokerKit: A Comprehensive Python Library for Fine-Grained Multi-Variant Poker Game SimulationsCode2
BWT construction and search at the terabase scaleCode2
DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual UnderstandingCode2
TorchGeo: Deep Learning With Geospatial DataCode2
Transductive Active Learning: Theory and ApplicationsCode2
Show:102550
← PrevPage 256 of 9486Next →