SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1225112300 of 177340 papers

TitleStatusHype
HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian SplattingCode2
LifelongAgentBench: Evaluating LLM Agents as Lifelong LearnersCode2
A Survey of Generative AI for de novo Drug Design: New Frontiers in Molecule and Protein GenerationCode2
JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language ModelsCode2
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement LearningCode2
Context and Geometry Aware Voxel Transformer for Semantic Scene CompletionCode2
An Item is Worth a Prompt: Versatile Image Editing with Disentangled ControlCode2
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIPCode2
Objaverse++: Curated 3D Object Dataset with Quality AnnotationsCode2
Depth-Aware Video Frame InterpolationCode2
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context SparsificationCode2
Mining Error Templates for Grammatical Error CorrectionCode2
Collaborative Expert LLMs Guided Multi-Objective Molecular OptimizationCode2
I^2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene ForecastingCode2
MONAI Label: A framework for AI-assisted Interactive Labeling of 3D Medical ImagesCode2
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit AssignmentCode2
Self-Supervised Learning for Recommender Systems: A SurveyCode2
Demystifying and Enhancing the Efficiency of Large Language Model Based Search AgentsCode2
Topological Deep Learning: Going Beyond Graph DataCode2
Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct RenderingCode2
Traj-LO: In Defense of LiDAR-Only Odometry Using an Effective Continuous-Time TrajectoryCode2
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal PerceptionCode2
GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs?Code2
SAFIRE: Segment Any Forged Image RegionCode2
TweetNLP: Cutting-Edge Natural Language Processing for Social MediaCode2
ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving ObjectCode2
Essential-Web v1.0: 24T tokens of organized web dataCode2
Compressing Context to Enhance Inference Efficiency of Large Language ModelsCode2
Video Compression for Spatiotemporal Earth System DataCode2
YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-IDCode2
WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model BenchmarkCode2
OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event UnderstandingCode2
Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar CreationCode2
FlowReasoner: Reinforcing Query-Level Meta-AgentsCode2
SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large ObjectsCode2
VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-ResolutionCode2
Low-resource finetuning of foundation models beats state-of-the-art in histopathologyCode2
Sat2lod2: A Software For Automated Lod-2 Modeling From Satellite-Derived Orthophoto And Digital Surface ModelCode2
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for EnsemblingCode2
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness BenchmarkCode2
Next3D: Generative Neural Texture Rasterization for 3D-Aware Head AvatarsCode2
MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis AttentionCode2
ResAD: A Simple Framework for Class Generalizable Anomaly DetectionCode2
RENO: Real-Time Neural Compression for 3D LiDAR Point CloudsCode2
AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction ErrorCode2
A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild ImagesCode2
Simul-Whisper: Attention-Guided Streaming Whisper with Truncation DetectionCode2
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in VideosCode2
Preserving Fairness Generalization in Deepfake DetectionCode2
Merging Context Clustering with Visual State Space Models for Medical Image SegmentationCode2
Show:102550
← PrevPage 246 of 3547Next →