SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 62016250 of 177340 papers

TitleStatusHype
Can Graph Learning Improve Planning in LLM-based Agents?Code2
Universal Segmentation at Arbitrary Granularity with Language InstructionCode2
UniPCGC: Towards Practical Point Cloud Geometry Compression via an Efficient Unified ApproachCode2
Do Transformers Really Perform Bad for Graph Representation?Code2
A Comprehensive Survey on Continual Learning in Generative ModelsCode2
ConvMAE: Masked Convolution Meets Masked AutoencodersCode2
Hyperion - A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAMCode2
Learning Spatiotemporal Features with 3D Convolutional NetworksCode2
FlowDec: A flow-based full-band general audio codec with high perceptual qualityCode2
Map-free Visual Relocalization: Metric Pose Relative to a Single ImageCode2
Towards Comprehensive Detection of Chinese Harmful MemesCode2
Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X CommunicationsCode2
TerDiT: Ternary Diffusion Models with TransformersCode2
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMsCode2
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal MathematicsCode2
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV TrackingCode2
ICP-Flow: LiDAR Scene Flow Estimation with ICPCode2
Large Selective Kernel Network for Remote Sensing Object DetectionCode2
Crafting Better Contrastive Views for Siamese Representation LearningCode2
GeoDiff: a Geometric Diffusion Model for Molecular Conformation GenerationCode2
LGRNet: Local-Global Reciprocal Network for Uterine Fibroid Segmentation in Ultrasound VideosCode2
CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement LearningCode2
3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh GenerationCode2
Learning charges and long-range interactions from energies and forcesCode2
PubLayNet: largest dataset ever for document layout analysisCode2
A Simple Image Segmentation Framework via In-Context ExamplesCode2
IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation ModelCode2
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and SynthesisCode2
Large Language Models are Zero-Shot Rankers for Recommender SystemsCode2
Learning to Prompt with Text Only Supervision for Vision-Language ModelsCode2
MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and AccompanimentCode2
Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object DetectionCode2
Koopman neural operator as a mesh-free solver of non-linear partial differential equationsCode2
Analytic Federated LearningCode2
EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerceCode2
Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning LanguagesCode2
Concept Bottleneck Language Models For protein designCode2
LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language ModelCode2
Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus AreasCode2
LibCity: An Open Library for Traffic PredictionCode2
Cartesian atomic cluster expansion for machine learning interatomic potentialsCode2
Skill Expansion and Composition in Parameter SpaceCode2
Training-Free Activation Sparsity in Large Language ModelsCode2
TexPainter: Generative Mesh Texturing with Multi-view ConsistencyCode2
Benchmarking the Robustness of LiDAR Semantic Segmentation ModelsCode2
What Matters in Transformers? Not All Attention is NeededCode2
Easy-to-Hard Generalization: Scalable Alignment Beyond Human SupervisionCode2
Rulebook: bringing co-routines to reinforcement learning environmentsCode2
Generative Semantic SegmentationCode2
Explainable Fake News Detection With Large Language Model via Defense Among Competing WisdomCode2
Show:102550
← PrevPage 125 of 3547Next →