SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 12751-12800 of 177340 papers

Title | Status | Hype
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval | Code | 2
AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field | Code | 2
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Code | 2
Integrating Artificial Intelligence and Augmented Reality in Robotic Surgery: An Initial dVRK Study Using a Surgical Education Scenario | Code | 2
VMBench: A Benchmark for Perception-Aligned Video Motion Generation | Code | 2
SyntheX: Scaling Up Learning-based X-ray Image Analysis Through In Silico Experiments | Code | 2
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement | Code | 2
pyPESTO: A modular and scalable tool for parameter estimation for dynamic models | Code | 2
PyTopo3D: A Python Framework for 3D SIMP-based Topology Optimization | Code | 2
AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models | Code | 2
Scaling Data Generation in Vision-and-Language Navigation | Code | 2
HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond | Code | 2
Geomstats: A Python Package for Riemannian Geometry in Machine Learning | Code | 2
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM | Code | 2
Large Continual Instruction Assistant | Code | 2
FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model | Code | 2
Diffusion Posterior Sampling for General Noisy Inverse Problems | Code | 2
A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles | Code | 2
Multitask Prompted Training Enables Zero-Shot Task Generalization | Code | 2
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning | Code | 2
Affordable Generative Agents | Code | 2
SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM) | Code | 2
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding | Code | 2
Protein Large Language Models: A Comprehensive Survey | Code | 2
Statewide Visual Geolocalization in the Wild | Code | 2
Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study | Code | 2
SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment | Code | 2
Graph Prompt Learning: A Comprehensive Survey and Beyond | Code | 2
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | Code | 2
Position: What Can Large Language Models Tell Us about Time Series Analysis | Code | 2
Cloud2BIM: An open-source automatic pipeline for efficient conversion of large-scale point clouds into IFC format | Code | 2
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data | Code | 2
Continuous Temporal Domain Generalization | Code | 2
Map-Relative Pose Regression for Visual Re-Localization | Code | 2
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning | Code | 2
Aligning Language Models with Demonstrated Feedback | Code | 2
A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy | Code | 2
ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs | Code | 2
Can AI Assistants Know What They Don't Know? | Code | 2
WildFusion: Individual Animal Identification with Calibrated Similarity Fusion | Code | 2
X-Avatar: Expressive Human Avatars | Code | 2
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models | Code | 2
An L-BFGS-B approach for linear and nonlinear system identification under ℓ1 and group-Lasso regularization | Code | 2
Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey | Code | 2
Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion | Code | 2
Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems | Code | 2
Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning from Leading Indicators | Code | 2
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities | Code | 2
MegaScenes: Scene-Level View Synthesis at Scale | Code | 2
PointDreamer: Zero-shot 3D Textured Mesh Reconstruction from Colored Point Cloud | Code | 2