SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 2751-2800 of 659,983 papers

Title | Status | Hype
syftr: Pareto-Optimal Generative AI | Code | 3
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models | Code | 3
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning | Code | 3
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System | Code | 3
Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling | Code | 3
Unbiased Estimator for Distorted Conics in Camera Calibration | Code | 3
360Zhinao Technical Report | Code | 3
LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training | Code | 3
E5-V: Universal Embeddings with Multimodal Large Language Models | Code | 3
Affordance-based Robot Manipulation with Flow Matching | Code | 3
Harnessing the Universal Geometry of Embeddings | Code | 3
FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance | Code | 3
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning | Code | 3
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Code | 3
Patches Are All You Need? | Code | 3
Cascade Prompt Learning for Vision-Language Model Adaptation | Code | 3
Relational Multi-Task Learning: Modeling Relations between Data and Tasks | Code | 3
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning | Code | 3
Deciphering Oracle Bone Language with Diffusion Models | Code | 3
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance | Code | 3
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances | Code | 3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Code | 3
Deep Photo Style Transfer | Code | 3
Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models | Code | 3
Generalized Decoding for Pixel, Image, and Language | Code | 3
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks | Code | 3
SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More | Code | 3
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning | Code | 3
StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting | Code | 3
GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting | Code | 3
REPLUG: Retrieval-Augmented Black-Box Language Models | Code | 3
Query-Based Adversarial Prompt Generation | Code | 3
GRAG: Graph Retrieval-Augmented Generation | Code | 3
Conformer: Convolution-augmented Transformer for Speech Recognition | Code | 3
Producing and Leveraging Online Map Uncertainty in Trajectory Prediction | Code | 3
Efficient Inference for Large Reasoning Models: A Survey | Code | 3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases | Code | 3
CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns | Code | 3
RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion | Code | 3
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation | Code | 3
EXP-Bench: Can AI Conduct AI Research Experiments? | Code | 3
CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground Environments | Code | 3
Neural Ordinary Differential Equations | Code | 3
LEADS: Lightweight Embedded Assisted Driving System | Code | 3
Fine-Tuning Language Models with Just Forward Passes | Code | 3
USB: A Unified Semi-supervised Learning Benchmark for Classification | Code | 3
ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks | Code | 3
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning | Code | 3
The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities | Code | 3
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives | Code | 3
Page 56 of 13,200