SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1840118450 of 474278 papers

TitleStatusHype
DH-Mamba: Exploring Dual-domain Hierarchical State Space Models for MRI ReconstructionCode1
OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM TrainingCode1
Dataset Distillation via Committee VotingCode1
TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry OperationsCode1
Toward Realistic Camouflaged Object Detection: Benchmarks and MethodCode1
Estimating Musical Surprisal in AudioCode1
Split Federated Learning Empowered Vehicular Edge Intelligence: Concept, Adaptive Design, and Future DirectionsCode1
RePoseD: Efficient Relative Pose Estimation With Known Depth InformationCode1
Aligning First, Then Fusing: A Novel Weakly Supervised Multimodal Violence Detection MethodCode1
LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language ModelsCode1
Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs TrainingCode1
How GPT learns layer by layerCode1
A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor FusionCode1
MECD+: Unlocking Event-Level Causal Graph Discovery for Video ReasoningCode1
Skip Mamba Diffusion for Monocular 3D Semantic Scene CompletionCode1
MathReader : Text-to-Speech for Mathematical DocumentsCode1
RadAlign: Advancing Radiology Report Generation with Vision-Language Concept AlignmentCode1
D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generationCode1
Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous LearningCode1
Multi-task Visual Grounding with Coarse-to-Fine Consistency ConstraintsCode1
ZNO-Eval: Benchmarking reasoning capabilities of large language models in UkrainianCode1
UR2P-Dehaze: Learning a Simple Image Dehaze Enhancer via Unpaired Rich Physical PriorCode1
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM TrainingCode1
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video CaptioningCode1
CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D ApplicationsCode1
3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D ShapesCode1
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlappingCode1
VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware SparsificationCode1
Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMsCode1
Flash Window Attention: speedup the attention computation for Swin TransformerCode1
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical ReasoningCode1
NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without ReferencesCode1
Retrieval-Augmented Dialogue Knowledge Aggregation for Expressive Conversational Speech SynthesisCode1
CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object DetectionCode1
Challenging reaction prediction models to generalize to novel chemistryCode1
Exploring Pose-Based Anomaly Detection for Retail Security: A Real-World Shoplifting Dataset and BenchmarkCode1
HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake DetectionCode1
EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware AccelerationCode1
Merging Feed-Forward Sublayers for Compressed TransformersCode1
kANNolo: Sweet and Smooth Approximate k-Nearest Neighbors SearchCode1
Understanding Impact of Human Feedback via Influence FunctionsCode1
Interpretable Enzyme Function Prediction via Residue-Level DetectionCode1
Pose-independent 3D Anthropometry from Sparse DataCode1
From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster trainingCode1
From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living ActivitiesCode1
Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse LanguagesCode1
Super-class guided Transformer for Zero-Shot Attribute ClassificationCode1
ExPO: Explainable Phonetic Trait-Oriented Network for Speaker VerificationCode1
StructSR: Refuse Spurious Details in Real-World Image Super-ResolutionCode1
Learning to generate feasible graphs using graph grammarsCode1
Show:102550
← PrevPage 369 of 9486Next →