SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 91019150 of 661570 papers

TitleStatusHype
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal PromptingCode2
VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality GenerationCode2
Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language ModelsCode2
LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language ModelsCode2
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large ImagesCode2
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting EditingCode2
An All-Atom Generative Model for Designing Protein ComplexesCode2
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by ThemselvesCode2
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual BackbonesCode2
Multimodal Automated Fact-Checking: A SurveyCode2
HINT: High-quality INPainting Transformer with Mask-Aware Encoding and Enhanced AttentionCode2
Synthetic Tumors Make AI Segment Tumors BetterCode2
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion ModelsCode2
Generative Diffusion Models on Graphs: Methods and ApplicationsCode2
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy ProjectionsCode2
BigSmall: Efficient Multi-Task Learning for Disparate Spatial and Temporal Physiological MeasurementsCode2
ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable TransformationCode2
UCTB: An Urban Computing Tool Box for Building Spatiotemporal Prediction ServicesCode2
Break-A-Scene: Extracting Multiple Concepts from a Single ImageCode2
Spectrum: Targeted Training on Signal to Noise RatioCode2
Exploiting Scale-Variant Attention for Segmenting Small Medical ObjectsCode2
FB-BEV: BEV Representation from Forward-Backward View TransformationsCode2
GPT-Driver: Learning to Drive with GPTCode2
Personalizing Text-to-Image Generation via Aesthetic GradientsCode2
LLaVA-Plus: Learning to Use Tools for Creating Multimodal AgentsCode2
Evolving Reservoirs for Meta Reinforcement LearningCode2
Transformers are Multi-State RNNsCode2
PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object DetectionCode2
TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful SpaceCode2
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination MitigationCode2
LLM2LLM: Boosting LLMs with Novel Iterative Data EnhancementCode2
Transfer CLIP for Generalizable Image DenoisingCode2
Multi-Session SLAM with Differentiable Wide-Baseline Pose OptimizationCode2
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature FieldsCode2
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement LearningCode2
H-Watch: An Open, Connected Platform for AI-Enhanced COVID19 Infection Symptoms Monitoring and Contact TracingCode2
Video-CCAM: Enhancing Video-Language Understanding with Causal Cross-Attention Masks for Short and Long VideosCode2
From News to Forecast: Integrating Event Analysis in LLM-Based Time Series Forecasting with ReflectionCode2
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language ModelsCode2
Trajectory Flow Matching with Applications to Clinical Time Series ModelingCode2
StoryTeller: Improving Long Video Description through Global Audio-Visual Character IdentificationCode2
Semantic-Conditional Diffusion Networks for Image CaptioningCode2
LiSenNet: Lightweight Sub-band and Dual-Path Modeling for Real-Time Speech EnhancementCode2
LiteASR: Efficient Automatic Speech Recognition with Low-Rank ApproximationCode2
An Introduction to Neural Data CompressionCode2
Unveiling the Magic of Code Reasoning through Hypothesis Decomposition and AmendmentCode2
Real Time Speech Enhancement in the Waveform DomainCode2
RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint ExtractionCode2
Mechanistic Design and Scaling of Hybrid ArchitecturesCode2
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic EnvironmentsCode2
Show:102550
← PrevPage 183 of 13232Next →