SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 13101–13150 of 177340 papers

Title | Status | Hype
FakeBench: Probing Explainable Fake Image Detection via Large Multimodal Models | Code | 2
A General Framework for Jersey Number Recognition in Sports Video | Code | 2
MobileQuant: Mobile-friendly Quantization for On-device Language Models | Code | 2
STAEformer: Spatio-Temporal Adaptive Embedding Makes Vanilla Transformer SOTA for Traffic Forecasting | Code | 2
Unsupervised Misaligned Infrared and Visible Image Fusion via Cross-Modality Image Generation and Registration | Code | 2
Wavelet Diffusion Neural Operator | Code | 2
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation | Code | 2
FedModule: A Modular Federated Learning Framework | Code | 2
Dawn of the transformer era in speech emotion recognition: closing the valence gap | Code | 2
NetTrack: Tracking Highly Dynamic Objects with a Net | Code | 2
QuadTree Attention for Vision Transformers | Code | 2
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer | Code | 2
XSimGCL: Towards Extremely Simple Graph Contrastive Learning for Recommendation | Code | 2
Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs | Code | 2
fluke: Federated Learning Utility frameworK for Experimentation and research | Code | 2
EgoLifter: Open-world 3D Segmentation for Egocentric Perception | Code | 2
Are Large Kernels Better Teachers than Transformers for ConvNets? | Code | 2
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Code | 2
Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving | Code | 2
Visual Autoregressive Modeling for Image Super-Resolution | Code | 2
MBQ: Modality-Balanced Quantization for Large Vision-Language Models | Code | 2
Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation | Code | 2
Spherical Fourier Neural Operators: Learning Stable Dynamics on the Sphere | Code | 2
ChatterBox: Multi-round Multimodal Referring and Grounding | Code | 2
PHUDGE: Phi-3 as Scalable Judge | Code | 2
Towards Realistic Generative 3D Face Models | Code | 2
SustainDC: Benchmarking for Sustainable Data Center Control | Code | 2
Platypus: Quick, Cheap, and Powerful Refinement of LLMs | Code | 2
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly | Code | 2
TextSLAM: Visual SLAM with Semantic Planar Text Features | Code | 2
Trust, but Verify: Cross-Modality Fusion for HD Map Change Detection | Code | 2
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation | Code | 2
SemiCD-VL: Visual-Language Model Guidance Makes Better Semi-supervised Change Detector | Code | 2
Gotta Hear Them All: Sound Source Aware Vision to Audio Generation | Code | 2
Audio-Visual Segmentation with Semantics | Code | 2
A Simple and Effective Pruning Approach for Large Language Models | Code | 2
S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations | Code | 2
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models | Code | 2
MATCHA: Towards Matching Anything | Code | 2
FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models | Code | 2
SEGA: Instructing Text-to-Image Models using Semantic Guidance | Code | 2
On the detection of synthetic images generated by diffusion models | Code | 2
Universal Neural Functionals | Code | 2
Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras | Code | 2
Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation | Code | 2
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Code | 2
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Code | 2
TIM: A Time Interval Machine for Audio-Visual Action Recognition | Code | 2
Beyond MOT: Semantic Multi-Object Tracking | Code | 2
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning | Code | 2
Page 263 of 3547