SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1710117150 of 474278 papers

TitleStatusHype
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic SpaceCode1
A Novel Decomposed Feature-Oriented Framework for Open-Set Semantic Segmentation on LiDAR DataCode1
Image-Goal Navigation Using Refined Feature Guidance and Scene Graph EnhancementCode1
Simulating Dual-Pixel Images From Ray Tracing For Depth EstimationCode1
Exploring Performance-Complexity Trade-Offs in Sound Event Detection ModelsCode1
Rethinking Few-Shot Adaptation of Vision-Language Models in Two StagesCode1
Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion ModelsCode1
CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal ControlCode1
BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion ModelCode1
GMG: A Video Prediction Method Based on Global Focus and Motion GuidedCode1
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web SearchCode1
Interpretable Image Classification via Non-parametric Part Prototype LearningCode1
Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSMCode1
OmniSTVG: Toward Spatio-Temporal Omni-Object Video GroundingCode1
OODD: Test-time Out-of-Distribution Detection with Dynamic DictionaryCode1
Label Unbalance in High-frequency TradingCode1
Enhancing Facial Privacy Protection via Weakening Diffusion PurificationCode1
Large-scale Pre-training for Grounded Video Caption GenerationCode1
From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLMCode1
VisTai: Benchmarking Vision-Language Models for Traditional Chinese in TaiwanCode1
Automatic quality control in multi-centric fetal brain MRI super-resolution reconstructionCode1
OCCUQ: Exploring Efficient Uncertainty Quantification for 3D Occupancy PredictionCode1
UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?Code1
StableFusion: Continual Video Retrieval via Frame AdaptationCode1
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-InterventionCode1
EFC++: Elastic Feature Consolidation with Prototype Re-balancing for Cold Start Exemplar-free Incremental LearningCode1
MetricGrids: Arbitrary Nonlinear Approximation with Elementary Metric Grids based Implicit Neural RepresentationCode1
Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor CoresCode1
Low Complexity Point Tracking of the Myocardium in 2D EchocardiographyCode1
Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in HistopathologyCode1
AI-assisted Early Detection of Pancreatic Ductal Adenocarcinoma on Contrast-enhanced CTCode1
CoSTA: Cost-Sensitive Toolpath Agent for Multi-turn Image EditingCode1
Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers for Robust Speaker EmbeddingsCode1
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object DetectionCode1
KVQ: Boosting Video Quality Assessment via Saliency-guided Local PerceptionCode1
Mamba time series forecasting with uncertainty quantificationCode1
AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention DisruptionCode1
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape GameCode1
Panopticon: Advancing Any-Sensor Foundation Models for Earth ObservationCode1
Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion AttacksCode1
TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language ModelsCode1
An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable SimulationCode1
RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground SimulationCode1
High-Resolution Uplink Sensing in Millimeter-Wave ISAC SystemsCode1
Image Quality Assessment: From Human to Machine PreferenceCode1
Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identificationCode1
ZeroMerge: Parameter-Free KV Cache Compression for Memory-Efficient Long-Context LLMsCode1
The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based GenerationCode1
ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective ReasoningCode1
Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset and a MeasurementCode1
Show:102550
← PrevPage 343 of 9486Next →