SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5001–5050 of 661,570 papers

Title | Status | Hype
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key | Code | 2
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization | Code | 2
KAN or MLP: A Fairer Comparison | Code | 2
ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Code | 2
Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale | Code | 2
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation | Code | 2
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers | Code | 2
Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing | Code | 2
Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss | Code | 2
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models | Code | 2
CompassJudger-2: Towards Generalist Judge Model via Verifiable Rewards | Code | 2
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment | Code | 2
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model | Code | 2
HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset | Code | 2
ResumeAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language Models | Code | 2
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data | Code | 2
Multiview Scene Graph | Code | 2
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation | Code | 2
N-HiTS: Neural Hierarchical Interpolation for Time Series Forecasting | Code | 2
DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection | Code | 2
Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models | Code | 2
Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors | Code | 2
DiMeR: Disentangled Mesh Reconstruction Model | Code | 2
Can Large Language Model Agents Simulate Human Trust Behavior? | Code | 2
TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing | Code | 2
Fast Best-of-N Decoding via Speculative Rejection | Code | 2
Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness | Code | 2
On the Generalization of BasicVSR++ to Video Deblurring and Denoising | Code | 2
Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation | Code | 2
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation | Code | 2
One Quantizer is Enough: Toward a Lightweight Audio Codec | Code | 2
Side Adapter Network for Open-Vocabulary Semantic Segmentation | Code | 2
PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Code | 2
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models | Code | 2
Style-Based Global Appearance Flow for Virtual Try-On | Code | 2
BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis | Code | 2
Open-Vocabulary DETR with Conditional Matching | Code | 2
Wavelet and Prototype Augmented Query-based Transformer for Pixel-level Surface Defect Detection | Code | 2
FG^2: Fine-Grained Cross-View Localization by Fine-Grained Feature Matching | Code | 2
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions | Code | 2
iFormer: Integrating ConvNet and Transformer for Mobile Application | Code | 2
FairyGen: Storied Cartoon Video from a Single Child-Drawn Character | Code | 2
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities | Code | 2
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization | Code | 2
Tuning Language Models by Proxy | Code | 2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | Code | 2
TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation | Code | 2
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation | Code | 2
DEA-Net: Single image dehazing based on detail-enhanced convolution and content-guided attention | Code | 2
A Plug-and-Play Bregman ADMM Module for Inferring Event Branches in Temporal Point Processes | Code | 2
Page 101 of 13,232