SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 82518300 of 661570 papers

TitleStatusHype
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic SegmentationCode2
PyMatting: A Python Library for Alpha MattingCode2
ARAGOG: Advanced RAG Output GradingCode2
Learning to See Through ObstructionsCode2
Learning Convex Optimization ModelsCode2
Oscar: Object-Semantics Aligned Pre-training for Vision-Language TasksCode2
AdapterFusion: Non-Destructive Task Composition for Transfer LearningCode2
Language Models Can Improve Event Prediction by Few-Shot Abductive ReasoningCode2
DeepRobust: A PyTorch Library for Adversarial Attacks and DefensesCode2
Transformers as Policies for Variable Action EnvironmentsCode2
Simulation-Based Inference for Global Health DecisionsCode2
Implementation of UAV Coordination Based on a Hierarchical Multi-UAV Simulation PlatformCode2
Practical Continual Forgetting for Pre-trained Vision ModelsCode2
TabDPT: Scaling Tabular Foundation ModelsCode2
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
Learning to Compose Dynamic Tree Structures for Visual ContextsCode2
Generative Sparse Detection Networks for 3D Single-shot Object DetectionCode2
HelloBench: Evaluating Long Text Generation Capabilities of Large Language ModelsCode2
Generative causal testing to bridge data-driven models and scientific theories in language neuroscienceCode2
Few-shot Knowledge Transfer for Fine-grained Cartoon Face GenerationCode2
Simplifying Object Segmentation with PixelLib LibraryCode2
Boundary-Aware Segmentation Network for Mobile and Web ApplicationsCode2
Poisoning Attacks against Recommender Systems: A SurveyCode2
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and ToxicityCode2
Interpreting and Editing Vision-Language Representations to Mitigate HallucinationsCode2
MBRL-Lib: A Modular Library for Model-based Reinforcement LearningCode2
VideoRAG: Retrieval-Augmented Generation over Video CorpusCode2
Fast Transformers with Clustered AttentionCode2
Language Model CascadesCode2
TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement LearningCode2
Non-Metric Space Library ManualCode2
DeepSVG: A Hierarchical Generative Network for Vector Graphics AnimationCode2
TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device LearningCode2
FedML: A Research Library and Benchmark for Federated Machine LearningCode2
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer LearningCode2
The Open Catalyst 2020 (OC20) Dataset and Community ChallengesCode2
GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist CollaborationCode2
M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image GenerationCode2
DeText: A Deep Text Ranking Framework with BERTCode2
Efficient Video Face Enhancement with Enhanced Spatial-Temporal ConsistencyCode2
SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?Code2
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame InterpolationCode2
Sibyl: Simple yet Effective Agent Framework for Complex Real-world ReasoningCode2
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal AlignmentCode2
A Change Detection Reality CheckCode2
Top2Vec: Distributed Representations of TopicsCode2
OpenBot: Turning Smartphones into RobotsCode2
Delving into Inter-Image Invariance for Unsupervised Visual RepresentationsCode2
Flightmare: A Flexible Quadrotor SimulatorCode2
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data RewardsCode2
Show:102550
← PrevPage 166 of 13232Next →