SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 98519900 of 177340 papers

TitleStatusHype
Optimizing Model Selection for Compound AI SystemsCode2
MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and EditingCode2
A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context ReasoningCode2
Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven CyberattacksCode2
GroundingSuite: Measuring Complex Multi-Granular Pixel GroundingCode2
What Limits LLM-based Human Simulation: LLMs or Our Design?Code2
Zero-Shot Vision Encoder Grafting via LLM SurrogatesCode2
OpenGlue: Open Source Graph Neural Net Based Pipeline for Image MatchingCode2
Omni-Kernel Network for Image RestorationCode2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View TransformationCode2
StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map ConstructionCode2
Trends, Applications, and Challenges in Human Attention ModellingCode2
MixFormerV2: Efficient Fully Transformer TrackingCode2
PartSTAD: 2D-to-3D Part Segmentation Task AdaptationCode2
Tri^2-plane: Thinking Head Avatar via Feature PyramidCode2
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space ModelCode2
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and DistillationCode2
Interpretable Pre-Trained Transformers for Heart Time-Series DataCode2
Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft DatasetCode2
RigNet: Neural Rigging for Articulated CharactersCode2
Building Cooperative Embodied Agents Modularly with Large Language ModelsCode2
Pretrained Transformers for Text Ranking: BERT and BeyondCode2
Global Convergence and Generalization Bound of Gradient-Based Meta-Learning with Deep Neural NetsCode2
MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question AnsweringCode2
Balanced MSE for Imbalanced Visual RegressionCode2
A Review of Safe Reinforcement Learning: Methods, Theory and ApplicationsCode2
A Unified Evaluation of Textual Backdoor Learning: Frameworks and BenchmarksCode2
3D Object Detection for Autonomous Driving: A Comprehensive SurveyCode2
Is Attention All That NeRF Needs?Code2
Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022Code2
Place Recognition: A Comprehensive Review, Current Challenges and Future DirectionsCode2
Reporting Eye-Tracking Data Quality: Towards a New StandardCode2
TEACH: Temporal Action Composition for 3D HumansCode2
ZeroEGGS: Zero-shot Example-based Gesture Generation from SpeechCode2
GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak PromptsCode2
SPARF: Neural Radiance Fields from Sparse and Noisy PosesCode2
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision MakingCode2
Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language ModelsCode2
Single channel voice separation for unknown number of speakers under reverberant and noisy settingsCode2
Reconstructing Hands in 3D with TransformersCode2
PET-NeuS: Positional Encoding Tri-Planes for Neural SurfacesCode2
BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via Adaptive Block-Based Gaussian SplattingCode2
Person Re-IdentificationCode2
Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur PriorCode2
You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure CorrectionCode2
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned GuidanceCode2
AIM: Adapting Image Models for Efficient Video Action RecognitionCode2
Shifts 2.0: Extending The Dataset of Real Distributional ShiftsCode2
SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAMCode2
Leveraging Procedural Generation to Benchmark Reinforcement LearningCode2
Show:102550
← PrevPage 198 of 3547Next →