SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1910119150 of 474278 papers

TitleStatusHype
Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion ModelCode1
A physics-informed transformer neural operator for learning generalized solutions of initial boundary value problemsCode1
Augmenting Sequential Recommendation with Balanced Relevance and DiversityCode1
HARP: A challenging human-annotated math reasoning benchmarkCode1
Repository-Level Graph Representation Learning for Enhanced Security Patch DetectionCode1
PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud CompletionCode1
Revisiting Weight Averaging for Model MergingCode1
Fast Prompt Alignment for Text-to-Image GenerationCode1
Concept Bottleneck Large Language ModelsCode1
Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill LearningCode1
EmoVerse: Exploring Multimodal Large Language Models for Sentiment and Emotion UnderstandingCode1
TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt TuningCode1
PepMNet: a hybrid deep learning model for predicting peptide properties using hierarchical graph representationsCode1
SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level promptingCode1
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image CaptioningCode1
Magneto: Combining Small and Large Language Models for Schema MatchingCode1
Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language ModelsCode1
ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query DecoderCode1
Adversarial Vulnerabilities in Large Language Models for Time Series ForecastingCode1
GDSG: Graph Diffusion-based Solution Generator for Optimization Problems in MEC NetworksCode1
NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision AnalysisCode1
Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and TrainingCode1
EOV-Seg: Efficient Open-Vocabulary Panoptic SegmentationCode1
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data FlywheelCode1
Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud CompletionCode1
SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp SegmentationCode1
Boundary Exploration of Next Best View Policy in 3D Robotic ScanningCode1
SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber WorldCode1
T-TIME: Test-Time Information Maximization Ensemble for Plug-and-Play BCIsCode1
A New Federated Learning Framework Against Gradient Inversion AttacksCode1
Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion ModelCode1
FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction ErrorCode1
Mask prior-guided denoising diffusion improves inverse protein foldingCode1
Optimizing Personalized Federated Learning through Adaptive Layer-Wise LearningCode1
Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving SequencesCode1
Unlocking the Potential of Reverse Distillation for Anomaly DetectionCode1
EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based VisionCode1
On Evaluating the Durability of Safeguards for Open-Weight LLMsCode1
Scaling Sequential Recommendation Models with TransformersCode1
RFL: Simplifying Chemical Structure Recognition with Ring-Free LanguageCode1
Efficient 3D Recognition with Event-driven Spike Sparse ConvolutionCode1
Cloud Object Detector Adaptation by Integrating Different Source KnowledgeCode1
Monte Carlo Tree Search based Space Transfer for Black-box OptimizationCode1
Towards Automated Cross-domain Exploratory Data Analysis through Large Language ModelsCode1
ReCap: Better Gaussian Relighting with Cross-Environment CapturesCode1
Temporal Linear Item-Item Model for Sequential RecommendationCode1
IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design PatentsCode1
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge GraphsCode1
Modeling Dual-Exposure Quad-Bayer Patterns for Joint Denoising and DeblurringCode1
PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and ModelsCode1
Show:102550
← PrevPage 383 of 9486Next →