SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 10051-10100 of 661570 papers

Title | Status | Hype
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback | Code | 2
U-shaped Vision Mamba for Single Image Dehazing | Code | 2
MOMENT: A Family of Open Time-series Foundation Models | Code | 2
Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection | Code | 2
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model | Code | 2
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K | Code | 2
Fine-Tuned Language Models Generate Stable Inorganic Materials as Text | Code | 2
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning | Code | 2
A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation | Code | 2
Linear-time Minimum Bayes Risk Decoding with Reference Aggregation | Code | 2
Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback | Code | 2
Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses | Code | 2
Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods | Code | 2
4D-Rotor Gaussian Splatting: Towards Efficient Novel View Synthesis for Dynamic Scenes | Code | 2
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions | Code | 2
Rethinking Optimization and Architecture for Tiny Language Models | Code | 2
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Code | 2
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives | Code | 2
Position: What Can Large Language Models Tell Us about Time Series Analysis | Code | 2
Light and Optimal Schrödinger Bridge Matching | Code | 2
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models | Code | 2
Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector | Code | 2
Guidance with Spherical Gaussian Constraint for Conditional Diffusion | Code | 2
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Code | 2
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model | Code | 2
FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition | Code | 2
See More Details: Efficient Image Super-Resolution by Experts Mining | Code | 2
Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency Perspective | Code | 2
Flora: Low-Rank Adapters Are Secretly Gradient Compressors | Code | 2
Graph-enhanced Large Language Models in Asynchronous Plan Reasoning | Code | 2
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models | Code | 2
Training-Free Consistent Text-to-Image Generation | Code | 2
Large Language Models are Geographically Biased | Code | 2
Retrieval-Augmented Score Distillation for Text-to-3D Generation | Code | 2
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model | Code | 2
Robot Trajectron: Trajectory Prediction-based Shared Control for Robot Manipulation | Code | 2
GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Code | 2
Minusformer: Improving Time Series Forecasting by Progressively Learning Residuals | Code | 2
KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion | Code | 2
Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning | Code | 2
Jailbreaking Attack against Multimodal Large Language Model | Code | 2
Federated Learning with New Knowledge: Fundamentals, Advances, and Futures | Code | 2
Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models | Code | 2
More Agents Is All You Need | Code | 2
Affordable Generative Agents | Code | 2
Change Point Detection with Copula Entropy based Two-Sample Test | Code | 2
EffiBench: Benchmarking the Efficiency of Automatically Generated Code | Code | 2
ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation | Code | 2
GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning | Code | 2
Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance | Code | 2