SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 9351-9400 of 661570 papers

Title | Status | Hype
An Image Grid Can Be Worth a Video: Zero-shot Video Question Answering Using a VLM | Code | 2
A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint | Code | 2
IDGenRec: LLM-RecSys Alignment with Textual ID Learning | Code | 2
Attention Calibration for Disentangled Text-to-Image Personalization | Code | 2
Generative Medical Segmentation | Code | 2
Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving | Code | 2
Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model | Code | 2
Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance | Code | 2
Multi-Task Dense Prediction via Mixture of Low-Rank Experts | Code | 2
EgoLifter: Open-world 3D Segmentation for Egocentric Perception | Code | 2
A Survey on 3D Egocentric Human Pose Estimation | Code | 2
Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Code | 2
MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation | Code | 2
OmniVid: A Generative Framework for Universal Video Understanding | Code | 2
BVR Gym: A Reinforcement Learning Environment for Beyond-Visual-Range Air Combat | Code | 2
Mechanistic Design and Scaling of Hybrid Architectures | Code | 2
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models | Code | 2
Unsupervised Learning for Joint Beamforming Design in RIS-aided ISAC Systems | Code | 2
Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms | Code | 2
Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Code | 2
AID: Attention Interpolation of Text-to-Image Diffusion | Code | 2
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders | Code | 2
LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection | Code | 2
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions | Code | 2
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting | Code | 2
An End-to-End Structure with Novel Position Mechanism and Improved EMD for Stock Forecasting | Code | 2
RepairAgent: An Autonomous, LLM-Based Agent for Program Repair | Code | 2
Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos | Code | 2
AI-Generated Video Detection via Spatio-Temporal Anomaly Learning | Code | 2
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance | Code | 2
Invertible Diffusion Models for Compressed Sensing | Code | 2
Is Your LiDAR Placement Optimized for 3D Scene Understanding? | Code | 2
QKFormer: Hierarchical Spiking Transformer using Q-K Attention | Code | 2
Visually Guided Generative Text-Layout Pre-training for Document Intelligence | Code | 2
DeGCN: Deformable Graph Convolutional Networks for Skeleton-Based Action Recognition | Code | 2
LSTTN: A Long-Short Term Transformer-based Spatio-temporal Neural Network for Traffic Flow Forecasting | Code | 2
Understanding Long Videos with Multimodal Language Models | Code | 2
TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation | Code | 2
Grappa -- A Machine Learned Molecular Mechanics Force Field | Code | 2
Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion | Code | 2
DreamLIP: Language-Image Pre-training with Long Captions | Code | 2
Composed Video Retrieval via Enriched Context and Discriminative Embeddings | Code | 2
Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding | Code | 2
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation | Code | 2
Elysium: Exploring Object-level Perception in Videos via MLLM | Code | 2
Few-Shot Bearing Fault Diagnosis Via Ensembling Transformer-Based Model With Mahalanobis Distance Metric Learning From Multiscale Features | Code | 2
CFAT: Unleashing TriangularWindows for Image Super-resolution | Code | 2
A Transformer approach for Electricity Price Forecasting | Code | 2
CoverUp: Effective High Coverage Test Generation for Python | Code | 2
CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Code | 2
Page 188 of 13232