SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2160121650 of 474278 papers

TitleStatusHype
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language ModelsCode1
Toward Automated Simulation Research Workflow through LLM Prompt Engineering DesignCode1
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language ModelsCode1
μgat: Improving Single-Page Document Parsing by Providing Multi-Page ContextCode1
On the Benefits of Visual Stabilization for Frame- and Event-based PerceptionCode1
VFLIP: A Backdoor Defense for Vertical Federated Learning via Identification and PurificationCode1
TrafficGamer: Reliable and Flexible Traffic Simulation for Safety-Critical Scenarios with Game-Theoretic OraclesCode1
Legilimens: Practical and Unified Content Moderation for Large Language Model ServicesCode1
EPO: Hierarchical LLM Agents with Environment Preference OptimizationCode1
More Text, Less Point: Towards 3D Data-Efficient Point-Language UnderstandingCode1
MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image FusionCode1
NAS-BNN: Neural Architecture Search for Binary Neural NetworksCode1
Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language ModelsCode1
Distribution Backtracking Builds A Faster Convergence Trajectory for Diffusion DistillationCode1
Mamba or Transformer for Time Series Forecasting? Mixture of Universals (MoU) Is All You NeedCode1
Trading with Time Series Causal Discovery: An Empirical StudyCode1
A Survey on Facial Expression Recognition of Static and Dynamic EmotionsCode1
SVDD 2024: The Inaugural Singing Voice Deepfake Detection ChallengeCode1
CBF-LLM: Safe Control for LLM AlignmentCode1
Segmentation-guided Layer-wise Image Vectorization with Gradient FillsCode1
Can Unconfident LLM Annotations Be Used for Confident Conclusions?Code1
What makes math problems hard for reinforcement learning: a case studyCode1
DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document UnderstandingCode1
LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive StreamingCode1
PAT: Pruning-Aware Tuning for Large Language ModelsCode1
Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object DetectionCode1
CMTA: Cross-Modal Temporal Alignment for Event-guided Video DeblurringCode1
T-FAKE: Synthesizing Thermal Images for Facial LandmarkingCode1
MTMamba++: Enhancing Multi-Task Dense Scene Understanding via Mamba-Based DecodersCode1
Mamba2MIL: State Space Duality Based Multiple Instance Learning for Computational PathologyCode1
GPU-Accelerated Counterfactual Regret MinimizationCode1
ERX: A Fast Real-Time Anomaly Detection Algorithm for Hyperspectral Line ScanningCode1
CVPT: Cross-Attention help Visual Prompt Tuning adapt visual taskCode1
LyCon: Lyrics Reconstruction from the Bag-of-Words Using Large Language ModelsCode1
MMASD+: A Novel Dataset for Privacy-Preserving Behavior Analysis of Children with Autism Spectrum DisorderCode1
No Regrets: Investigating and Improving Regret Approximations for Curriculum DiscoveryCode1
Measuring Human Contribution in AI-Assisted Content GenerationCode1
TourSynbio: A Multi-Modal Large Model and Agent Framework to Bridge Text and Protein Sequences for Protein EngineeringCode1
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency DetectionCode1
AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent SystemsCode1
RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language ModelsCode1
Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven ApproachCode1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion GuidanceCode1
XG-NID: Dual-Modality Network Intrusion Detection using a Heterogeneous Graph Neural Network and Large Language ModelCode1
DRL-Based Federated Self-Supervised Learning for Task Offloading and Resource Allocation in ISAC-Enabled Vehicle Edge ComputingCode1
DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-RaysCode1
SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space ModelsCode1
DCT-CryptoNets: Scaling Private Inference in the Frequency DomainCode1
Grounded Multi-Hop VideoQA in Long-Form Egocentric VideosCode1
Center Direction Network for Grasping Point Localization on ClothsCode1
Show:102550
← PrevPage 433 of 9486Next →