SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 18701–18750 of 474278 papers

Title | Status | Hype
No Preference Left Behind: Group Distributional Preference Optimization | Code | 1
UniBrain: A Unified Model for Cross-Subject Brain Decoding | Code | 1
Long Context vs. RAG for LLMs: An Evaluation and Revisits | Code | 1
Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues | Code | 1
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Code | 1
RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model | Code | 1
ReNeg: Learning Negative Embedding with Reward Guidance | Code | 1
An Engorgio Prompt Makes Large Language Model Babble on | Code | 1
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations | Code | 1
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions | Code | 1
EEG-Reptile: An Automatized Reptile-Based Meta-Learning Library for BCIs | Code | 1
Fortran2CPP: Automating Fortran-to-C++ Translation using LLMs via Multi-Turn Dialogue and Dual-Agent Integration | Code | 1
Toward Adaptive Reasoning in Large Language Models with Thought Rollback | Code | 1
Multi-P^2A: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models | Code | 1
CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Code | 1
MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes | Code | 1
Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation | Code | 1
BeSplat: Gaussian Splatting from a Single Blurry Image and Event Stream | Code | 1
TLS_Finder: An algorithm for Identifying Tertiary Lymphoid Structures Using Immune Cell Spatial Coordinates | Code | 1
Generating Editable Head Avatars with 3D Gaussian GANs | Code | 1
Context-Aware Deep Learning for Multi Modal Depression Detection | Code | 1
RAG with Differential Privacy | Code | 1
Jasper and Stella: distillation of SOTA embedding models | Code | 1
On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages | Code | 1
Improving Generalization for AI-Synthesized Voice Detection | Code | 1
Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning | Code | 1
IUST_PersonReId: A New Domain in Person Re-Identification Datasets | Code | 1
MotionMap: Representing Multimodality in Human Pose Forecasting | Code | 1
Improving Integrated Gradient-based Transferable Adversarial Examples by Refining the Integration Path | Code | 1
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning | Code | 1
Efficiently Serving Large Multimodal Models Using EPD Disaggregation | Code | 1
Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model | Code | 1
Probabilistic Mission Design in Neuro-Symbolic Systems | Code | 1
FedCFA: Alleviating Simpson's Paradox in Model Aggregation with Counterfactual Federated Learning | Code | 1
Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation | Code | 1
Underwater Image Restoration via Polymorphic Large Kernel CNNs | Code | 1
VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis | Code | 1
Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks | Code | 1
Towards Modality Generalization: A Benchmark and Prospective Analysis | Code | 1
Extract Free Dense Misalignment from CLIP | Code | 1
Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering | Code | 1
ReducedLUT: Table Decomposition with "Don't Care" Conditions | Code | 1
ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation | Code | 1
GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network | Code | 1
AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation | Code | 1
LangYa: Revolutionizing Cross-Spatiotemporal Ocean Forecasting | Code | 1
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization | Code | 1
Point-DeepONet: A Deep Operator Network Integrating PointNet for Nonlinear Analysis of Non-Parametric 3D Geometries and Load Conditions | Code | 1
RaSeRec: Retrieval-Augmented Sequential Recommendation | Code | 1
Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight | Code | 1
Page 375 of 9486