SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 16301-16350 of 474,278 papers

| Title | Status | Hype |
|---|---|---|
| LLMSR@XLLM25: Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation | Code | 1 |
| Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control | Code | 1 |
| Disentangling and Generating Modalities for Recommendation in Missing Modality Scenarios | Code | 1 |
| SemanticSugarBeets: A Multi-Task Framework and Dataset for Inspecting Harvest and Storage Characteristics of Sugar Beets | Code | 1 |
| Private Federated Learning using Preference-Optimized Synthetic Data | Code | 1 |
| Physics-guided and fabrication-aware inverse design of photonic devices using diffusion models | Code | 1 |
| MMHCL: Multi-Modal Hypergraph Contrastive Learning for Recommendation | Code | 1 |
| SonarT165: A Large-scale Benchmark and STFTrack Framework for Acoustic Object Tracking | Code | 1 |
| LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement | Code | 1 |
| Survey of Video Diffusion Models: Foundations, Implementations, and Applications | Code | 1 |
| FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation | Code | 1 |
| PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning | Code | 1 |
| Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation | Code | 1 |
| Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction | Code | 1 |
| NLCTables: A Dataset for Marrying Natural Language Conditions with Table Discovery | Code | 1 |
| Heterogeneous networks in drug-target interaction prediction | Code | 1 |
| SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems | Code | 1 |
| A Python Tool for Reconstructing Full News Text from GDELT | Code | 1 |
| Mitigating Degree Bias in Graph Representation Learning with Learnable Structural Augmentation and Structural Self-Attention | Code | 1 |
| KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking | Code | 1 |
| Exploring ℓ0 Sparsification for Inference-free Sparse Retrievers | Code | 1 |
| Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images | Code | 1 |
| Event2Vec: Processing neuromorphic events directly by representations in vector space | Code | 1 |
| Shape-Guided Clothing Warping for Virtual Try-On | Code | 1 |
| NeuGaze: Reshaping the future BCI | Code | 1 |
| Completing A Systematic Review in Hours instead of Months with Interactive AI Agents | Code | 1 |
| Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification | Code | 1 |
| ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages | Code | 1 |
| CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation | Code | 1 |
| Enhancing the Patent Matching Capability of Large Language Models via the Memory Graph | Code | 1 |
| AlignRAG: Leveraging Critique Learning for Evidence-Sensitive Retrieval-Augmented Reasoning | Code | 1 |
| HSANET: A Hybrid Self-Cross Attention Network For Remote Sensing Change Detection | Code | 1 |
| IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs | Code | 1 |
| RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search | Code | 1 |
| NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models | Code | 1 |
| Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction | Code | 1 |
| Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark | Code | 1 |
| NTIRE 2025 Challenge on Real-World Face Restoration: Methods and Results | Code | 1 |
| Unconstrained Monotonic Calibration of Predictions in Deep Ranking Systems | Code | 1 |
| Integrating LLM-Generated Views into Mean-Variance Optimization Using the Black-Litterman Model | Code | 1 |
| Visual Consensus Prompting for Co-Salient Object Detection | Code | 1 |
| Template-Based Financial Report Generation in Agentic and Decomposed Information Retrieval | Code | 1 |
| FedCIA: Federated Collaborative Information Aggregation for Privacy-Preserving Recommendation | Code | 1 |
| Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations | Code | 1 |
| Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Calibration | Code | 1 |
| Understanding the Repeat Curse in Large Language Models from a Feature Perspective | Code | 1 |
| Fighting Fires from Space: Leveraging Vision Transformers for Enhanced Wildfire Detection and Characterization | Code | 1 |
| SupResDiffGAN a new approach for the Super-Resolution task | Code | 1 |
| KAN or MLP? Point Cloud Shows the Way Forward | Code | 1 |
| Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Code | 1 |
Page 327 of 9486