SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1000110050 of 661570 papers

TitleStatusHype
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object DetectionCode2
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural EmbeddingsCode2
Contrastive learning of cell state dynamics in response to perturbationsCode2
KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender SystemsCode2
The Devil is in Temporal Token: High Quality Video Reasoning SegmentationCode2
ReplayCAD: Generative Diffusion Replay for Continual Anomaly DetectionCode2
A Simple Episodic Linear Probe Improves Visual Recognition in the WildCode2
SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3DCode2
SALT: Introducing a Framework for Hierarchical Segmentations in Medical Imaging using Softmax for Arbitrary Label TreesCode2
Skinned Motion Retargeting with Dense Geometric Interaction PerceptionCode2
DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT SpaceCode2
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting SynthesisCode2
DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving ScenesCode2
Grounding-IQA: Multimodal Language Grounding Model for Image Quality AssessmentCode2
PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty AwarenessCode2
SHINE-Mapping: Large-Scale 3D Mapping Using Sparse Hierarchical Implicit Neural RepresentationsCode2
Hierarchical Temporal Context Learning for Camera-based Semantic Scene CompletionCode2
COALA: A Practical and Vision-Centric Federated Learning PlatformCode2
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera VideosCode2
Follow Anything: Open-set detection, tracking, and following in real-timeCode2
WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace SettingCode2
OPEN: Object-wise Position Embedding for Multi-view 3D Object DetectionCode2
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within GenerationCode2
ECG-Chat: A Large ECG-Language Model for Cardiac Disease DiagnosisCode2
HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented GenerationCode2
Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music with Lightweight FinetuningCode2
Removal then Selection: A Coarse-to-Fine Fusion Perspective for RGB-Infrared Object DetectionCode2
Heating Up Quasi-Monte Carlo Graph Random Features: A Diffusion Kernel PerspectiveCode2
CMGAN: Conformer-based Metric GAN for Speech EnhancementCode2
ZClip: Adaptive Spike Mitigation for LLM Pre-TrainingCode2
Domain-Independent Dynamic ProgrammingCode2
Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational PathologyCode2
GRPose: Learning Graph Relations for Human Image Generation with Pose PriorsCode2
BitVLA: 1-bit Vision-Language-Action Models for Robotics ManipulationCode2
Text-space Graph Foundation Models: Comprehensive Benchmarks and New InsightsCode2
Rawsamble: Overlapping and Assembling Raw Nanopore Signals using a Hash-based Seeding MechanismCode2
Explanation-Preserving Augmentation for Semi-Supervised Graph Representation LearningCode2
ST-LLM: Large Language Models Are Effective Temporal LearnersCode2
VICRegL: Self-Supervised Learning of Local Visual FeaturesCode2
Adaptive Rectangular Convolution for Remote Sensing PansharpeningCode2
DCT-Net: Domain-Calibrated Translation for Portrait StylizationCode2
GaussianToken: An Effective Image Tokenizer with 2D Gaussian SplattingCode2
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation ModelsCode2
Few-shot Novel View Synthesis using Depth Aware 3D Gaussian SplattingCode2
Scaling New Frontiers: Insights into Large Recommendation ModelsCode2
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning ModelsCode2
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor QueriesCode2
Coding Speech through Vocal Tract KinematicsCode2
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil EngineeringCode2
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and BenchmarkingCode2
Show:102550
← PrevPage 201 of 13232Next →