SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7951-8000 of 661570 papers

Title | Status | Hype
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models | Code | 2
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models | Code | 2
KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge | Code | 2
Yuan 2.0-M32: Mixture of Experts with Attention Router | Code | 2
Multi-Behavior Generative Recommendation | Code | 2
NoteLLM-2: Multimodal Large Representation Models for Recommendation | Code | 2
Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment | Code | 2
XTrack: Multimodal Training Boosts RGB-X Video Object Trackers | Code | 2
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | Code | 2
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models | Code | 2
Easy Problems That LLMs Get Wrong | Code | 2
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales | Code | 2
Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation | Code | 2
Improved Techniques for Optimization-Based Jailbreaking on Large Language Models | Code | 2
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models | Code | 2
DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs | Code | 2
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning | Code | 2
Poisoning Attacks and Defenses in Recommender Systems: A Survey | Code | 2
Composer's Assistant 2: Interactive Multi-Track MIDI Infilling with Fine-Grained User Control | Code | 2
Evaluating the World Model Implicit in a Generative Model | Code | 2
How Far Can We Compress Instant-NGP-Based NeRF? | Code | 2
BLSP-Emo: Towards Empathetic Large Speech-Language Models | Code | 2
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models | Code | 2
QuickLLaMA: Query-aware Inference Acceleration for Large Language Models | Code | 2
Split-and-Fit: Learning B-Reps via Structure-Aware Voronoi Partitioning | Code | 2
MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models | Code | 2
GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Code | 2
D3still: Decoupled Differential Distillation for Asymmetric Image Retrieval | Code | 2
LVBench: An Extreme Long Video Understanding Benchmark | Code | 2
Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models | Code | 2
Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding Network | Code | 2
Treeffuser: Probabilistic Predictions via Conditional Diffusions with Gradient-Boosted Trees | Code | 2
Classic GNNs are Strong Baselines: Reassessing GNNs for Node Classification | Code | 2
On Softmax Direct Preference Optimization for Recommendation | Code | 2
Fredformer: Frequency Debiased Transformer for Time Series Forecasting | Code | 2
Understanding Hallucinations in Diffusion Models through Mode Interpolation | Code | 2
Toward Controlled Generation of Text | Code | 2
Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction | Code | 2
Extracting Prompts by Inverting LLM Outputs | Code | 2
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Code | 2
Optimal Transport Aggregation for Visual Place Recognition | Code | 2
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions | Code | 2
HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending | Code | 2
Turning a CLIP Model into a Scene Text Detector | Code | 2
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Code | 2
Moving Object Segmentation in Point Cloud Data using Hidden Markov Models | Code | 2
ChangeViT: Unleashing Plain Vision Transformers for Change Detection | Code | 2
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Code | 2
TroL: Traversal of Layers for Large Language and Vision Models | Code | 2
One-for-More: Continual Diffusion Model for Anomaly Detection | Code | 2
Page 160 of 13232