SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1025110300 of 661570 papers

TitleStatusHype
FreGrad: Lightweight and Fast Frequency-aware Diffusion VocoderCode2
R-Judge: Benchmarking Safety Risk Awareness for LLM AgentsCode2
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask InpaintingCode2
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language ModelCode2
Enabling Efficient Equivariant Operations in the Fourier Basis via Gaunt Tensor ProductsCode2
Towards Language-Driven Video Inpainting via Multimodal Large Language ModelsCode2
LangProp: A code optimization framework using Large Language Models applied to drivingCode2
Spatial-Temporal Large Language Model for Traffic PredictionCode2
Adaptive Kalman-Informed TransformerCode2
Cooperative Edge Caching Based on Elastic Federated and Multi-Agent Deep Reinforcement Learning in Next-Generation NetworkCode2
Objects With Lighting: A Real-World Dataset for Evaluating Reconstruction and Rendering for Object RelightingCode2
Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling PriorCode2
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image SynthesisCode2
Autonomous Catheterization with Open-source Simulator and Expert TrajectoryCode2
RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series TasksCode2
Tri^2-plane: Thinking Head Avatar via Feature PyramidCode2
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space ModelCode2
PPSURF: Combining Patches and Point Convolutions for Detailed Surface ReconstructionCode2
Tuning Language Models by ProxyCode2
Adversarial Supervision Makes Layout-to-Image Diffusion Models ThriveCode2
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text AlignmentCode2
WAVES: Benchmarking the Robustness of Image WatermarksCode2
UV-SAM: Adapting Segment Anything Model for Urban Village IdentificationCode2
Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token DictionaryCode2
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)Code2
EmoLLMs: A Series of Emotional Large Language Models and Annotation Tools for Comprehensive Affective AnalysisCode2
MMToM-QA: Multimodal Theory of Mind Question AnsweringCode2
OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box PromptsCode2
Spatial-Semantic Collaborative Cropping for User Generated ContentCode2
Efficient4D: Fast Dynamic 3D Object Generation from a Single-view VideoCode2
Fixed Point Diffusion ModelsCode2
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language ModelsCode2
E3x: E(3)-Equivariant Deep Learning Made EasyCode2
Authorship Obfuscation in Multilingual Machine-Generated Text DetectionCode2
Improved Implicit Neural Representation with Fourier Reparameterized TrainingCode2
Integrate Any Omics: Towards genome-wide data integration for patient stratificationCode2
Fine-Grained Prototypes Distillation for Few-Shot Object DetectionCode2
PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image SegmentationCode2
CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal Covariance DesignCode2
PDE Generalization of In-Context Operator Networks: A Study on 1D Scalar Nonlinear Conservation LawsCode2
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health RecordsCode2
Graph Language ModelsCode2
Extending LLMs' Context Window with 100 SamplesCode2
Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-IdentificationCode2
Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained EvaluationCode2
Expected Shapley-Like Scores of Boolean Functions: Complexity and Applications to Probabilistic DatabasesCode2
Vehicle: Bridging the Embedding Gap in the Verification of Neuro-Symbolic ProgramsCode2
Mission: Impossible Language ModelsCode2
Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imageryCode2
Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and TrackingCode2
Show:102550
← PrevPage 206 of 13232Next →