SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1090110950 of 661570 papers

TitleStatusHype
Event-Based Video Frame Interpolation With Cross-Modal Asymmetric Bidirectional Motion FieldsCode2
OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box PromptsCode2
In-Context Editing: Learning Knowledge from Self-Induced DistributionsCode2
Differentiable and accelerated spherical harmonic and Wigner transformsCode2
A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to SearchCode2
shapr: Explaining Machine Learning Models with Conditional Shapley Values in R and PythonCode2
Improving Diffusion Inverse Problem Solving with Decoupled Noise AnnealingCode2
Safety Alignment Should Be Made More Than Just a Few Tokens DeepCode2
Transcoders Find Interpretable LLM Feature CircuitsCode2
OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner FrameworkCode2
Humanity's Last Code Exam: Can Advanced LLMs Conquer Human's Hardest Code Competition?Code2
Early Detection and Localization of Pancreatic Cancer by Label-Free Tumor SynthesisCode2
STAIR: Improving Safety Alignment with Introspective ReasoningCode2
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent EducationCode2
Multi-modal Queried Object Detection in the WildCode2
3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric DataCode2
Universal Physics Transformers: A Framework For Efficiently Scaling Neural OperatorsCode2
Conditional Image-to-Video Generation with Latent Flow Diffusion ModelsCode2
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial SoundCode2
Window Function-less DFT with Reduced Noise and Latency for Real-Time Music AnalysisCode2
An Efficient Sparse Kernel Generator for O(3)-Equivariant Deep NetworksCode2
OpenP5: An Open-Source Platform for Developing, Training, and Evaluating LLM-based Recommender SystemsCode2
Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based MethodCode2
Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data ScarcityCode2
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware DiffusionCode2
CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive SurveyCode2
Hierarchical Integration Diffusion Model for Realistic Image DeblurringCode2
Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian SplattingCode2
MeshLoc: Mesh-Based Visual LocalizationCode2
Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language ModelsCode2
Learning Semantic-Aware Knowledge Guidance for Low-Light Image EnhancementCode2
Agent AI: Surveying the Horizons of Multimodal InteractionCode2
β-DPO: Direct Preference Optimization with Dynamic βCode2
RedCode: Risky Code Execution and Generation Benchmark for Code AgentsCode2
Protecting Privacy in Multimodal Large Language Models with MLLMU-BenchCode2
GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language ModelCode2
A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous SpeechCode2
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction TuningCode2
FreeInit: Bridging Initialization Gap in Video Diffusion ModelsCode2
GUICourse: From General Vision Language Models to Versatile GUI AgentsCode2
The CLRS Algorithmic Reasoning BenchmarkCode2
Language Models are Realistic Tabular Data GeneratorsCode2
Video Quality Assessment: A Comprehensive SurveyCode2
BEBLID: Boosted efficient binary local image descriptorCode2
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized AttentionCode2
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial ScenariosCode2
End-to-End Ontology Learning with Large Language ModelsCode2
TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud DetectionCode2
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference TimeCode2
FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal UnderstandingCode2
Show:102550
← PrevPage 219 of 13232Next →