SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 10651-10700 of 661570 papers

Title | Status | Hype
Graph Prompt Learning: A Comprehensive Survey and Beyond | Code | 2
TransNeXt: Robust Foveal Visual Perception for Vision Transformers | Code | 2
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Code | 2
Text-Driven Image Editing via Learnable Regions | Code | 2
SatCLIP: Global, General-Purpose Location Embeddings with Satellite Imagery | Code | 2
SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors | Code | 2
SEED-Bench-2: Benchmarking Multimodal Large Language Models | Code | 2
War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars | Code | 2
Panacea: Panoramic and Controllable Video Generation for Autonomous Driving | Code | 2
Photo-SLAM: Real-time Simultaneous Localization and Photorealistic Mapping for Monocular, Stereo, and RGB-D Cameras | Code | 2
Source-Free Domain Adaptation with Frozen Multimodal Foundation Model | Code | 2
OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving | Code | 2
CoSeR: Bridging Image and Language for Cognitive Super-Resolution | Code | 2
LLMGA: Multimodal Large Language Model based Generation Assistant | Code | 2
SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation | Code | 2
SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion | Code | 2
Optimal Transport Aggregation for Visual Place Recognition | Code | 2
XLB: A differentiable massively parallel lattice Boltzmann library in Python | Code | 2
On Bringing Robots Home | Code | 2
YUAN 2.0: A Large Language Model with Localized Filtering-based Attention | Code | 2
GS-IR: 3D Gaussian Splatting for Inverse Rendering | Code | 2
Flow-Guided Diffusion for Video Inpainting | Code | 2
Algorithm Evolution Using Large Language Model | Code | 2
Sketch Video Synthesis | Code | 2
NeuRAD: Neural Rendering for Autonomous Driving | Code | 2
Adapter is All You Need for Tuning Visual Tasks | Code | 2
MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation | Code | 2
OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Code | 2
Differentiable and accelerated spherical harmonic and Wigner transforms | Code | 2
Controlled Text Generation via Language Model Arithmetic | Code | 2
GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Code | 2
GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence | Code | 2
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design | Code | 2
PyVRP: a high-performance VRP solver package | Code | 2
SegVol: Universal and Interactive Volumetric Medical Image Segmentation | Code | 2
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model | Code | 2
Learning to Fly in Seconds | Code | 2
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models | Code | 2
ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs | Code | 2
Compact 3D Gaussian Representation for Radiance Field | Code | 2
Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models | Code | 2
Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images | Code | 2
Intrinsic Image Decomposition via Ordinal Shading | Code | 2
A Survey of Graph Meets Large Language Model: Progress and Future Directions | Code | 2
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey | Code | 2
GAIA: a benchmark for General AI Assistants | Code | 2
SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction | Code | 2
Swift Parameter-free Attention Network for Efficient Super-Resolution | Code | 2
Diffusion Model Alignment Using Direct Preference Optimization | Code | 2
AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance | Code | 2
Page 214 of 13232