SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2100121050 of 474278 papers

TitleStatusHype
QuForge: A Library for Qudits SimulationCode1
Infer Human's Intentions Before Following Natural Language InstructionsCode1
CodonMPNN for Organism Specific and Codon Optimal Inverse FoldingCode1
Train Once, Deploy Anywhere: Matryoshka Representation Learning for Multimodal RecommendationCode1
Robust Scene Change Detection Using Visual Foundation Models and Cross-Attention MechanismsCode1
Topological SLAM in colonoscopies leveraging deep features and topological priorsCode1
Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn InteractionCode1
HazeSpace2M: A Dataset for Haze Aware Single Image DehazingCode1
Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image TranslationCode1
Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion ModelCode1
Counterfactual Token Generation in Large Language ModelsCode1
Navigating the Maze of Explainable AI: A Systematic Approach to Evaluating Methods and MetricsCode1
Moner: Motion Correction in Undersampled Radial MRI with Unsupervised Neural RepresentationCode1
Beyond Redundancy: Information-aware Unsupervised Multiplex Graph Structure LearningCode1
ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology AnalysisCode1
HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic WorkflowsCode1
BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained DevicesCode1
Search for Efficient Large Language ModelsCode1
First Place Solution to the ECCV 2024 BRAVO Challenge: Evaluating Robustness of Vision Foundation Models for Semantic SegmentationCode1
FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text CompressionCode1
Dashing for the Golden Snitch: Multi-Drone Time-Optimal Motion Planning with Multi-Agent Reinforcement LearningCode1
CodeInsight: A Curated Dataset of Practical Coding Solutions from Stack OverflowCode1
Vision-Language Model Fine-Tuning via Simple Parameter-Efficient ModificationCode1
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularizationCode1
CaBRNet, an open-source library for developing and evaluating Case-Based Reasoning ModelsCode1
Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI GenerationCode1
Plurals: A System for Guiding LLMs Via Simulated Social EnsemblesCode1
GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer LearningCode1
Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion ModelsCode1
Scalable Multi-Robot Informative Path Planning for Target Mapping via Deep Reinforcement LearningCode1
Inline Photometrically Calibrated Hybrid Visual SLAMCode1
Semi-LLIE: Semi-supervised Contrastive Learning with Mamba-based Low-light Image EnhancementCode1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance ScalingCode1
Enhancing Nighttime UAV Tracking with Light Distribution SuppressionCode1
Training Language Models to Win Debates with Self-Play Improves Judge AccuracyCode1
DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare DataCode1
EventHallusion: Diagnosing Event Hallucinations in Video LLMsCode1
SDCL: Students Discrepancy-Informed Correction Learning for Semi-supervised Medical Image SegmentationCode1
Face Forgery Detection with Elaborate BackboneCode1
HVT: A Comprehensive Vision Framework for Learning in Non-Euclidean SpaceCode1
TiM4Rec: An Efficient Sequential Recommendation Model Based on Time-Aware Structured State Space Duality ModelCode1
In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow UnderstandingCode1
Fine-Tuning is Fine, if CalibratedCode1
TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based ModelsCode1
FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL BenchmarkCode1
PDT: Uav Target Detection Dataset for Pests and Diseases TreeCode1
MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object ScenariosCode1
Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual RecognitionCode1
Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference SpeedCode1
XTRUST: On the Multilingual Trustworthiness of Large Language ModelsCode1
Show:102550
← PrevPage 421 of 9486Next →