SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 32513300 of 659983 papers

TitleStatusHype
SAM-Med2DCode3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-AgentsCode3
MoMA: Multimodal LLM Adapter for Fast Personalized Image GenerationCode3
DEADiff: An Efficient Stylization Diffusion Model with Disentangled RepresentationsCode3
GaussianCity: Generative Gaussian Splatting for Unbounded 3D City GenerationCode3
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate DetailsCode3
ResearchTown: Simulator of Human Research CommunityCode3
From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point SupervisionCode3
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMsCode3
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for LocomotionCode3
TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug DiscoveryCode3
MathArena: Evaluating LLMs on Uncontaminated Math CompetitionsCode3
Frequency-aware Feature Fusion for Dense Image PredictionCode3
VoiceBench: Benchmarking LLM-Based Voice AssistantsCode3
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D GenerationCode3
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM AgentsCode3
GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and ReconstructionCode3
Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry ProfessionalsCode3
Scoring Time Intervals using Non-Hierarchical Transformer For Automatic Piano TranscriptionCode3
PointCNN: Convolution On X-Transformed PointsCode3
OverleafCopilot: Empowering Academic Writing in Overleaf with Large Language ModelsCode3
Jailbreak Attacks and Defenses against Multimodal Generative Models: A SurveyCode3
Infrared and Visible Image Fusion: From Data Compatibility to Task AdaptionCode3
Game-theoretic LLM: Agent Workflow for Negotiation GamesCode3
Tracking Anything with Decoupled Video SegmentationCode3
ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground VehicleCode3
BeautyMap: Binary-Encoded Adaptable Ground Matrix for Dynamic Points Removal in Global MapsCode3
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth EstimationCode3
Investigating Efficiently Extending Transformers for Long Input SummarizationCode3
Multiple Object Tracking as ID PredictionCode3
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 TrainingCode3
MoAI: Mixture of All Intelligence for Large Language and Vision ModelsCode3
MMAUD: A Comprehensive Multi-Modal Anti-UAV Dataset for Modern Miniature Drone ThreatsCode3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentationCode3
Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data ProcessingCode3
A Joint Representation Using Continuous and Discrete Features for Cardiovascular Diseases Risk Prediction on Chest CT ScansCode3
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth EstimationCode3
VBench: Comprehensive Benchmark Suite for Video Generative ModelsCode3
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated TextCode3
Theoretically Achieving Continuous Representation of Oriented Bounding BoxesCode3
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought ReasoningCode3
Do generative video models understand physical principles?Code3
Distance Adaptive Beam Search for Provably Accurate Graph-Based Nearest Neighbor SearchCode3
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online VideosCode3
VISTA3D: Versatile Imaging SegmenTation and Annotation model for 3D Computed TomographyCode3
Remote Sensing Temporal Vision-Language Models: A Comprehensive SurveyCode3
STG-Mamba: Spatial-Temporal Graph Learning via Selective State Space ModelCode3
Rethinking Evaluation Metrics of Open-Vocabulary SegmentaionCode3
Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory MappingCode3
ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language ModelsCode3
Show:102550
← PrevPage 66 of 13200Next →