SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 60016050 of 177340 papers

TitleStatusHype
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head SynthesisCode2
VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned PriorsCode2
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language ModelsCode2
Exploring CLIP for Assessing the Look and Feel of ImagesCode2
Visual Perception by Large Language Model's WeightsCode2
MCP-Solver: Integrating Language Models with Constraint Programming SystemsCode2
SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point CloudCode2
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose EstimationCode2
Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual EditingCode2
Sheared LLaMA: Accelerating Language Model Pre-training via Structured PruningCode2
CMB: A Comprehensive Medical Benchmark in ChineseCode2
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D PolicyCode2
StructChart: On the Schema, Metric, and Augmentation for Visual Chart UnderstandingCode2
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision MakingCode2
The P^3 dataset: Pixels, Points and Polygons for Multimodal Building VectorizationCode2
Protein Representation Learning by Geometric Structure PretrainingCode2
SegNeXt: Rethinking Convolutional Attention Design for Semantic SegmentationCode2
JudgeLM: Fine-tuned Large Language Models are Scalable JudgesCode2
DeepInteraction: 3D Object Detection via Modality InteractionCode2
Internal Consistency and Self-Feedback in Large Language Models: A SurveyCode2
Hybrid-SORT: Weak Cues Matter for Online Multi-Object TrackingCode2
PartIR: Composing SPMD Partitioning Strategies for Machine LearningCode2
SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine CollaborationCode2
FastVID: Dynamic Density Pruning for Fast Video Large Language ModelsCode2
Embedding Earth: Self-supervised contrastive pre-training for dense land cover classificationCode2
AudioDec: An Open-source Streaming High-fidelity Neural Audio CodecCode2
Self-Normalizing Neural NetworksCode2
Discovering uncertainty: Gaussian constitutive neural networks with correlated weightsCode2
InterCode: Standardizing and Benchmarking Interactive Coding with Execution FeedbackCode2
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer DevicesCode2
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUsCode2
Defending LLMs against Jailbreaking Attacks via BacktranslationCode2
TabDDPM: Modelling Tabular Data with Diffusion ModelsCode2
MCIBI++: Soft Mining Contextual Information Beyond Image for Semantic SegmentationCode2
RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human expertsCode2
3D LiDAR Mapping in Dynamic Environments Using a 4D Implicit Neural RepresentationCode2
Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image GenerationCode2
Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive DecodingCode2
RepoHyper: Search-Expand-Refine on Semantic Graphs for Repository-Level Code CompletionCode2
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their MixCode2
Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement NetworkCode2
SRGS: Super-Resolution 3D Gaussian SplattingCode2
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red TeamingCode2
AdaNeRF: Adaptive Sampling for Real-time Rendering of Neural Radiance FieldsCode2
Language Models can Self-Lengthen to Generate Long TextsCode2
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective ResamplingCode2
Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace ProjectionCode2
Multi-modal Molecule Structure-text Model for Text-based Retrieval and EditingCode2
MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text ExpertiseCode2
VOOM: Robust Visual Object Odometry and Mapping using Hierarchical LandmarksCode2
Show:102550
← PrevPage 121 of 3547Next →