SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,323 code links · 4,818 tasks

Papers

Showing 3551-3600 of 661,570 papers

Title | Status | Hype
Long-Context Autoregressive Video Modeling with Next-Frame Prediction | Code | 3
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation | Code | 3
Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion | Code | 3
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow | Code | 3
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer | Code | 3
Consistency Models Made Easy | Code | 3
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Code | 3
UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction | Code | 3
Scalable Optimization in the Modular Norm | Code | 3
SupeRANSAC: One RANSAC to Rule Them All | Code | 3
Wordflow: Social Prompt Engineering for Large Language Models | Code | 3
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing | Code | 3
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines | Code | 3
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models | Code | 3
Face Anonymization Made Simple | Code | 3
Locating and Editing Factual Associations in GPT | Code | 3
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network | Code | 3
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos | Code | 3
ImageInWords: Unlocking Hyper-Detailed Image Descriptions | Code | 3
Flow Q-Learning | Code | 3
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs | Code | 3
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility | Code | 3
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition | Code | 3
The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report | Code | 3
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding | Code | 3
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation | Code | 3
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers | Code | 3
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning | Code | 3
Unlimited-Size Diffusion Restoration | Code | 3
TorchSparse: Efficient Point Cloud Inference Engine | Code | 3
Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis | Code | 3
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection | Code | 3
From Matching to Generation: A Survey on Generative Information Retrieval | Code | 3
SAM-Med2D | Code | 3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Code | 3
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Code | 3
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Code | 3
GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Code | 3
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details | Code | 3
ResearchTown: Simulator of Human Research Community | Code | 3
From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision | Code | 3
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs | Code | 3
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion | Code | 3
TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery | Code | 3
MathArena: Evaluating LLMs on Uncontaminated Math Competitions | Code | 3
Frequency-aware Feature Fusion for Dense Image Prediction | Code | 3
VoiceBench: Benchmarking LLM-Based Voice Assistants | Code | 3
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation | Code | 3
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents | Code | 3
GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction | Code | 3
Page 72 of 13232