SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1205112100 of 177340 papers

TitleStatusHype
Perceptually Transparent Binaural Auralization of Simulated Sound FieldsCode2
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large ModelsCode2
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic GraspingCode2
RAR: Retrieving And Ranking Augmented MLLMs for Visual RecognitionCode2
BackFed: An Efficient & Standardized Benchmark Suite for Backdoor Attacks in Federated LearningCode2
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion ModelsCode2
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI ToolCode2
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and DetectionCode2
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion ModelsCode2
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph GenerationCode2
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech SynthesisCode2
SATO: Stable Text-to-Motion FrameworkCode2
TopoNets: High Performing Vision and Language Models with Brain-Like TopographyCode2
Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian SplattingCode2
3DAffordSplat: Efficient Affordance Reasoning with 3D GaussiansCode2
AlphaNet: Scaling Up Local-frame-based Atomistic Interatomic PotentialCode2
Exploring the Compositional Deficiency of Large Language Models in Mathematical ReasoningCode2
Open-Source Ground-based Sky Image Datasets for Very Short-term Solar Forecasting, Cloud Analysis and Modeling: A Comprehensive SurveyCode2
Syllabus: Portable Curricula for Reinforcement Learning AgentsCode2
ORFD: A Dataset and Benchmark for Off-Road Freespace DetectionCode2
NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEGCode2
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step QuestionsCode2
DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car ReconstructionCode2
EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and ModelCode2
Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-ResolutionCode2
Predictive Data Selection: The Data That Predicts Is the Data That TeachesCode2
Graph Meets LLMs: Towards Large Graph ModelsCode2
The first Cadenza challenges: using machine learning competitions to improve music for listeners with a hearing lossCode2
AutoSDF: Shape Priors for 3D Completion, Reconstruction and GenerationCode2
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion ModelsCode2
Heron-Bench: A Benchmark for Evaluating Vision Language Models in JapaneseCode2
Foundational Models in Medical Imaging: A Comprehensive Survey and Future VisionCode2
PDE Generalization of In-Context Operator Networks: A Study on 1D Scalar Nonlinear Conservation LawsCode2
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNsCode2
DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMsCode2
IDRNet: Intervention-Driven Relation Network for Semantic SegmentationCode2
HyperSteer: Activation Steering at Scale with HypernetworksCode2
CSL: A Large-scale Chinese Scientific Literature DatasetCode2
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMsCode2
Matryoshka Representation LearningCode2
UniSim: A Neural Closed-Loop Sensor SimulatorCode2
Multi-Space Alignments Towards Universal LiDAR SegmentationCode2
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative SynchronizationCode2
Omni-Dimensional Dynamic ConvolutionCode2
Towards Relation-centered Pooling and Convolution for Heterogeneous Graph Learning NetworksCode2
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-VerificationCode2
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion ProcessCode2
Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene ReconstructionCode2
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AICode2
NNetscape Navigator: Complex Demonstrations for Web Agents Without a DemonstratorCode2
Show:102550
← PrevPage 242 of 3547Next →