SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 21151–21200 of 474278 papers

Title | Status | Hype
Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model | Code | 1
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models | Code | 1
DiffEditor: Enhancing Speech Editing with Semantic Enrichment and Acoustic Consistency | Code | 1
A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation | Code | 1
Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation | Code | 1
A Case Study of Web App Coding with OpenAI Reasoning Models | Code | 1
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning | Code | 1
Enhancing Agricultural Environment Perception via Active Vision and Zero-Shot Learning | Code | 1
PRAGA: Prototype-aware Graph Adaptive Aggregation for Spatial Multi-modal Omics Analysis | Code | 1
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning | Code | 1
Familiarity-Aware Evidence Compression for Retrieval-Augmented Generation | Code | 1
Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework | Code | 1
Fundus image enhancement through direct diffusion bridges | Code | 1
Reinforcement Learning-based Model Predictive Control for Greenhouse Climate Control | Code | 1
Language Models Learn to Mislead Humans via RLHF | Code | 1
Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning | Code | 1
MEXMA: Token-level objectives improve sentence representations | Code | 1
MambaClinix: Hierarchical Gated Convolution and Mamba-Based U-Net for Enhanced 3D Medical Image Segmentation | Code | 1
PromSec: Prompt Optimization for Secure Generation of Functional Source Code with Large Language Models (LLMs) | Code | 1
Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC | Code | 1
CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs | Code | 1
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering | Code | 1
Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving | Code | 1
DenoMamba: A fused state-space model for low-dose CT denoising | Code | 1
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Code | 1
LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension | Code | 1
Mastering Chess with a Transformer Model | Code | 1
MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts | Code | 1
Multi-Grid Graph Neural Networks with Self-Attention for Computational Mechanics | Code | 1
BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Code | 1
SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation | Code | 1
Generalized compression and compressive search of large datasets | Code | 1
Linguini: A benchmark for language-agnostic linguistic reasoning | Code | 1
DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion | Code | 1
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning | Code | 1
MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion | Code | 1
Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models | Code | 1
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning | Code | 1
Massively Multi-Person 3D Human Motion Forecasting with Scene Context | Code | 1
Self-Supervised Speed of Sound Recovery for Aberration-Corrected Photoacoustic Computed Tomography | Code | 1
HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios | Code | 1
Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement | Code | 1
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation | Code | 1
Leveraging Symmetry to Accelerate Learning of Trajectory Tracking Controllers for Free-Flying Robotic Systems | Code | 1
Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent | Code | 1
Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs | Code | 1
Ultrasound Image Enhancement with the Variance of Diffusion Models | Code | 1
Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers | Code | 1
Contrasformer: A Brain Network Contrastive Transformer for Neurodegenerative Condition Identification | Code | 1
LOLA -- An Open-Source Massively Multilingual Large Language Model | Code | 1