SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 10651–10700 of 661570 papers

Title | Status | Hype
Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities | Code | 2
Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Code | 2
Three New Validators and a Large-Scale Benchmark Ranking for Unsupervised Domain Adaptation | Code | 2
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents | Code | 2
Learning from All Vehicles | Code | 2
LambdaNetworks: Modeling Long-Range Interactions Without Attention | Code | 2
Next Patch Prediction for Autoregressive Visual Generation | Code | 2
The Stable Artist: Steering Semantics in Diffusion Latent Space | Code | 2
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters | Code | 2
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding | Code | 2
SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers | Code | 2
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion | Code | 2
Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation | Code | 2
Active Generalized Category Discovery | Code | 2
COLD: A Benchmark for Chinese Offensive Language Detection | Code | 2
Accurate and Efficient Stereo Matching via Attention Concatenation Volume | Code | 2
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark | Code | 2
PFGM++: Unlocking the Potential of Physics-Inspired Generative Models | Code | 2
SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory Prediction | Code | 2
CAMAv2: A Vision-Centric Approach for Static Map Element Annotation | Code | 2
Depth Information Assisted Collaborative Mutual Promotion Network for Single Image Dehazing | Code | 2
Text Image Inpainting via Global Structure-Guided Diffusion Models | Code | 2
Robot Trajectron: Trajectory Prediction-based Shared Control for Robot Manipulation | Code | 2
RoboDepth: Robust Out-of-Distribution Depth Estimation under Corruptions | Code | 2
A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations | Code | 2
Scaling Laws for Galaxy Images | Code | 2
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase | Code | 2
Facial Appearance Capture at Home with Patch-Level Reflectance Prior | Code | 2
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control | Code | 2
Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion | Code | 2
CodePDE: An Inference Framework for LLM-driven PDE Solver Generation | Code | 2
Pan-Mamba: Effective pan-sharpening with State Space Model | Code | 2
Deform3DGS: Flexible Deformation for Fast Surgical Scene Reconstruction with Gaussian Splatting | Code | 2
HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting? | Code | 2
Correspondence-Free Non-Rigid Point Set Registration Using Unsupervised Clustering Analysis | Code | 2
A generalizable 3D framework and model for self-supervised learning in medical imaging | Code | 2
Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting | Code | 2
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook | Code | 2
On the Feasibility of Using LLMs to Autonomously Execute Multi-host Network Attacks | Code | 2
All-In-One Medical Image Restoration via Task-Adaptive Routing | Code | 2
One Net to Rule Them All: Domain Randomization in Quadcopter Racing Across Different Platforms | Code | 2
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Code | 2
Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models | Code | 2
Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning | Code | 2
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency | Code | 2
Depth-Aware Generative Adversarial Network for Talking Head Video Generation | Code | 2
CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation | Code | 2
Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning | Code | 2
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Code | 2
SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Code | 2
Page 214 of 13232