SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 65016550 of 661570 papers

TitleStatusHype
A Closer Look at Learned Optimization: Stability, Robustness, and Inductive BiasesCode2
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement LearningCode2
MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep LearningCode2
RecDiff: Diffusion Model for Social RecommendationCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
Computational Life: How Well-formed, Self-replicating Programs Emerge from Simple InteractionCode2
PEM: Prototype-based Efficient MaskFormer for Image SegmentationCode2
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMsCode2
A Survey of Personalized Large Language Models: Progress and Future DirectionsCode2
TIES-Merging: Resolving Interference When Merging ModelsCode2
CLIP-Mesh: Generating textured meshes from text using pretrained image-text modelsCode2
What's In My Big Data?Code2
Continual Pre-training of Language ModelsCode2
Multi-Modal UAV Detection, Classification and Tracking Algorithm -- Technical Report for CVPR 2024 UG2 ChallengeCode2
HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned SamplingCode2
F^2-NeRF: Fast Neural Radiance Field Training with Free Camera TrajectoriesCode2
Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-ResolutionCode2
Omegance: A Single Parameter for Various Granularities in Diffusion-Based SynthesisCode2
Star-convex Polyhedra for 3D Object Detection and Segmentation in MicroscopyCode2
Recent Advances in Speech Language Models: A SurveyCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
SRFormer: Text Detection Transformer with Incorporated Segmentation and RegressionCode2
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic SegmentationCode2
LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its HybridCode2
Large Language Model Safety: A Holistic SurveyCode2
Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment NetworkCode2
MiniGPT-5: Interleaved Vision-and-Language Generation via Generative VokensCode2
pyrtklib: An open-source package for tightly coupled deep learning and GNSS integration for positioning in urban canyonsCode2
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group QuantizationCode2
BayesFlow: Amortized Bayesian Workflows With Neural NetworksCode2
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic SegmentationCode2
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
Equivariant 3D-Conditional Diffusion Models for Molecular Linker DesignCode2
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and MergingCode2
InFoBench: Evaluating Instruction Following Ability in Large Language ModelsCode2
ForecastBench: A Dynamic Benchmark of AI Forecasting CapabilitiesCode2
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained DiffusionCode2
Improving Text-guided Object Inpainting with Semantic Pre-inpaintingCode2
INQUIRE: A Natural World Text-to-Image Retrieval BenchmarkCode2
Adaptive Personalized Federated LearningCode2
CodeBERTScore: Evaluating Code Generation with Pretrained Models of CodeCode2
A real-time dynamic obstacle tracking and mapping system for UAV navigation and collision avoidance with an RGB-D cameraCode2
Predictive Dynamic FusionCode2
MetaFormer: A Unified Meta Framework for Fine-Grained RecognitionCode2
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training FrameworkCode2
Deep Patch Visual OdometryCode2
PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human ModelingCode2
MemEngine: A Unified and Modular Library for Developing Advanced Memory of LLM-based AgentsCode2
LingoQA: Visual Question Answering for Autonomous DrivingCode2
MemLong: Memory-Augmented Retrieval for Long Text ModelingCode2
Show:102550
← PrevPage 131 of 13232Next →