SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 79017950 of 661570 papers

TitleStatusHype
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in MammographyCode2
ProtT3: Protein-to-Text Generation for Text-based Protein UnderstandingCode2
ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous VehiclesCode2
Efficient Visual State Space Model for Image DeblurringCode2
OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Fused Geometric and Semantic GuidanceCode2
S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language ModelsCode2
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision ModelsCode2
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation ModelsCode2
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion ModelsCode2
KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World KnowledgeCode2
Yuan 2.0-M32: Mixture of Experts with Attention RouterCode2
Multi-Behavior Generative RecommendationCode2
NoteLLM-2: Multimodal Large Representation Models for RecommendationCode2
Seeing the Image: Prioritizing Visual Correlation by Contrastive AlignmentCode2
XTrack: Multimodal Training Boosts RGB-X Video Object TrackersCode2
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous DrivingCode2
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language ModelsCode2
Easy Problems That LLMs Get WrongCode2
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective RationalesCode2
Hybrid Fourier Score Distillation for Efficient One Image to 3D Object GenerationCode2
Improved Techniques for Optimization-Based Jailbreaking on Large Language ModelsCode2
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language ModelsCode2
DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMsCode2
Audio Mamba: Bidirectional State Space Model for Audio Representation LearningCode2
Poisoning Attacks and Defenses in Recommender Systems: A SurveyCode2
Composer's Assistant 2: Interactive Multi-Track MIDI Infilling with Fine-Grained User ControlCode2
Evaluating the World Model Implicit in a Generative ModelCode2
How Far Can We Compress Instant-NGP-Based NeRF?Code2
BLSP-Emo: Towards Empathetic Large Speech-Language ModelsCode2
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion ModelsCode2
QuickLLaMA: Query-aware Inference Acceleration for Large Language ModelsCode2
Split-and-Fit: Learning B-Reps via Structure-Aware Voronoi PartitioningCode2
MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence ModelsCode2
GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly DetectionCode2
D3still: Decoupled Differential Distillation for Asymmetric Image RetrievalCode2
LVBench: An Extreme Long Video Understanding BenchmarkCode2
Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal ModelsCode2
Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding NetworkCode2
Treeffuser: Probabilistic Predictions via Conditional Diffusions with Gradient-Boosted TreesCode2
Classic GNNs are Strong Baselines: Reassessing GNNs for Node ClassificationCode2
On Softmax Direct Preference Optimization for RecommendationCode2
Fredformer: Frequency Debiased Transformer for Time Series ForecastingCode2
Understanding Hallucinations in Diffusion Models through Mode InterpolationCode2
Toward Controlled Generation of TextCode2
Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and ReactionCode2
Extracting Prompts by Inverting LLM OutputsCode2
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded ScenesCode2
Optimal Transport Aggregation for Visual Place RecognitionCode2
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human InteractionsCode2
HairCLIPv2: Unifying Hair Editing via Proxy Feature BlendingCode2
Show:102550
← PrevPage 159 of 13232Next →