SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1695117000 of 474278 papers

TitleStatusHype
SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real WorldCode1
MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction FusionCode1
FedAWA: Adaptive Optimization of Aggregation Weights in Federated Learning Using Client VectorsCode1
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language ModelsCode1
Agentic Keyframe Search for Video Question AnsweringCode1
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video UnderstandingCode1
Probabilistic Prompt Distribution Learning for Animal Pose EstimationCode1
QCPINN: Quantum-Classical Physics-Informed Neural Networks for Solving PDEsCode1
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the EdgeCode1
SALT: Singular Value Adaptation with Low-Rank TransformationCode1
CaKE: Circuit-aware Editing Enables Generalizable Knowledge LearnersCode1
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video DiffusionCode1
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language ModelsCode1
Design and Implementation of an FPGA-Based Hardware Accelerator for TransformerCode1
Narrative Trails: A Method for Coherent Storyline Extraction via Maximum Capacity Path OptimizationCode1
Performance-bounded Online Ensemble Learning Method Based on Multi-armed bandits and Its Applications in Real-time Safety AssessmentCode1
A Bird Song Detector for improving bird identification through Deep Learning: a case study from DoñanaCode1
SkyLadder: Better and Faster Pretraining via Context Window SchedulingCode1
HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario GenerationCode1
From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level AlignmentCode1
Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation SystemsCode1
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose EstimationCode1
DeCaFlow: A Deconfounding Causal Generative ModelCode1
Visual Position Prompt for MLLM based Visual GroundingCode1
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning TransferCode1
PiEEG kit - bioscience Lab in home for your Brain and BodyCode1
Efficient Personalization of Quantized Diffusion Model without BackpropagationCode1
EarthScape: A Multimodal Dataset for Surficial Geologic Mapping and Earth Surface AnalysisCode1
What Makes a Reward Model a Good Teacher? An Optimization PerspectiveCode1
Explainable AI Components for Narrative Map ExtractionCode1
EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?Code1
EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point CloudsCode1
Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal TransportCode1
UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection FrameworkCode1
Ambient Noise Full Waveform Inversion with Neural OperatorsCode1
BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?Code1
Multi-focal Conditioned Latent Diffusion for Person Image SynthesisCode1
Improving Adversarial Transferability on Vision Transformers via Forward Propagation RefinementCode1
High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous FlightCode1
When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation LearningCode1
Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired TrainingCode1
MP-GUI: Modality Perception with MLLMs for GUI UnderstandingCode1
MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning SegmentationCode1
MeshFleet: Filtered and Annotated 3D Vehicle Dataset for Domain Specific Generative ModelingCode1
Advancing Medical Representation Learning Through High-Quality DataCode1
FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data ClassificationCode1
Capturing Smile Dynamics with the Quintic Volatility Model: SPX, Skew-Stickiness Ratio and VIXCode1
Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future PerspectivesCode1
VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape RoomsCode1
Inferring Event Descriptions from Time Series with Language ModelsCode1
Show:102550
← PrevPage 340 of 9486Next →