SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9851 to 9900 of 661,570 papers

Title | Status | Hype
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models | Code | 2
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory | Code | 2
Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements | Code | 2
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts | Code | 2
Enhancing Vectorized Map Perception with Historical Rasterized Maps | Code | 2
RoboBERT: An End-to-end Multimodal Robotic Manipulation Model | Code | 2
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval | Code | 2
AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field | Code | 2
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Code | 2
Integrating Artificial Intelligence and Augmented Reality in Robotic Surgery: An Initial dVRK Study Using a Surgical Education Scenario | Code | 2
VMBench: A Benchmark for Perception-Aligned Video Motion Generation | Code | 2
SyntheX: Scaling Up Learning-based X-ray Image Analysis Through In Silico Experiments | Code | 2
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement | Code | 2
pyPESTO: A modular and scalable tool for parameter estimation for dynamic models | Code | 2
PyTopo3D: A Python Framework for 3D SIMP-based Topology Optimization | Code | 2
AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models | Code | 2
Scaling Data Generation in Vision-and-Language Navigation | Code | 2
HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond | Code | 2
Geomstats: A Python Package for Riemannian Geometry in Machine Learning | Code | 2
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM | Code | 2
Large Continual Instruction Assistant | Code | 2
FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model | Code | 2
Diffusion Posterior Sampling for General Noisy Inverse Problems | Code | 2
A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles | Code | 2
Multitask Prompted Training Enables Zero-Shot Task Generalization | Code | 2
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning | Code | 2
Affordable Generative Agents | Code | 2
SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM) | Code | 2
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding | Code | 2
Protein Large Language Models: A Comprehensive Survey | Code | 2
Statewide Visual Geolocalization in the Wild | Code | 2
Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study | Code | 2
SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment | Code | 2
Graph Prompt Learning: A Comprehensive Survey and Beyond | Code | 2
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | Code | 2
Position: What Can Large Language Models Tell Us about Time Series Analysis | Code | 2
Cloud2BIM: An open-source automatic pipeline for efficient conversion of large-scale point clouds into IFC format | Code | 2
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data | Code | 2
Continuous Temporal Domain Generalization | Code | 2
Map-Relative Pose Regression for Visual Re-Localization | Code | 2
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning | Code | 2
Aligning Language Models with Demonstrated Feedback | Code | 2
A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy | Code | 2
ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs | Code | 2
Can AI Assistants Know What They Don't Know? | Code | 2
WildFusion: Individual Animal Identification with Calibrated Similarity Fusion | Code | 2
X-Avatar: Expressive Human Avatars | Code | 2
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models | Code | 2
An L-BFGS-B approach for linear and nonlinear system identification under ℓ1 and group-Lasso regularization | Code | 2
Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey | Code | 2
Page 198 of 13,232