SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1890118950 of 474278 papers

TitleStatusHype
Do Language Models Understand Time?Code1
Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LNCode1
MixRec: Heterogeneous Graph Collaborative FilteringCode1
Robust Tracking via Mamba-based Context-aware Token LearningCode1
Hybrid CNN-LSTM based Indoor Pedestrian Localization with CSI Fingerprint MapsCode1
PowerMLP: An Efficient Version of KANCode1
Crabs: Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box SettingsCode1
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and GenerationCode1
Context-DPO: Aligning Language Models for Context-FaithfulnessCode1
TRecViT: A Recurrent Video TransformerCode1
SemiDFL: A Semi-Supervised Paradigm for Decentralized Federated LearningCode1
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous InferenceCode1
Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with AdaptersCode1
GraphAvatar: Compact Head Avatars with GNN-Generated 3D GaussiansCode1
Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly DetectionCode1
HA-RDet: Hybrid Anchor Rotation Detector for Oriented Object DetectionCode1
Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language ModelsCode1
Knowledge Editing with Dynamic Knowledge Graphs for Multi-Hop Question AnsweringCode1
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language ModelingCode1
When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?Code1
Balans: Multi-Armed Bandits-based Adaptive Large Neighborhood Search for Mixed-Integer Programming ProblemCode1
Beyond Outcomes: Transparent Assessment of LLM Reasoning in GamesCode1
Physics-Based Adversarial Attack on Near-Infrared Human Detector for Nighttime Surveillance Camera SystemsCode1
EscapeBench: Pushing Language Models to Think Outside the BoxCode1
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal GroundingCode1
Neural Combinatorial Optimization for Stochastic Flexible Job Shop Scheduling ProblemsCode1
QueryCDR: Query-Based Controllable Distortion Rectification Network for Fisheye ImagesCode1
Event-based Photometric Bundle AdjustmentCode1
Adaptive Calibration: A Unified Conversion Framework of Spiking Neural NetworkCode1
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World TasksCode1
Generative AI Toolkit -- a framework for increasing the quality of LLM-based applications over their whole life cycleCode1
Autonomous Microscopy Experiments through Large Language Model AgentsCode1
Real-time One-Step Diffusion-based Expressive Portrait Videos GenerationCode1
G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4oCode1
Exploring Multi-Modal Data with Tool-Augmented LLM Agents for Precise Causal DiscoveryCode1
Plug-and-Play Tri-Branch Invertible Block for Image RescalingCode1
I0T: Embedding Standardization Method Towards Zero Modality GapCode1
Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language ModelsCode1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object SegmentationCode1
CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute EditingCode1
Boosting Fine-Grained Visual Anomaly Detection with Coarse-Knowledge-Aware Adversarial LearningCode1
XPath Agent: An Efficient XPath Programming Agent Based on LLM for Web CrawlerCode1
Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-trainingCode1
DocFusion: A Unified Framework for Document Parsing TasksCode1
SnakModel: Lessons Learned from Training an Open Danish Large Language ModelCode1
EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented GenerationCode1
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and GroundingCode1
Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan ScriptCode1
4DRGS: 4D Radiative Gaussian Splatting for Efficient 3D Vessel Reconstruction from Sparse-View Dynamic DSA ImagesCode1
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance SegmentationCode1
Show:102550
← PrevPage 379 of 9486Next →