SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8001-8050 of 661,570 papers

Title | Status | Hype
DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability | Code | 2
RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation | Code | 2
Dynamic Spatial Sparsification for Efficient Vision Transformers and Convolutional Neural Networks | Code | 2
Odd-One-Out: Anomaly Detection by Comparing with Neighbors | Code | 2
E.T. the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness | Code | 2
MG-Verilog: Multi-grained Dataset Towards Enhanced LLM-assisted Verilog Generation | Code | 2
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding | Code | 2
Centerline Boundary Dice Loss for Vascular Segmentation | Code | 2
Benchmarking Predictive Coding Networks -- Made Simple | Code | 2
A Survey of Personalization: From RAG to Agent | Code | 2
Discovering symbolic expressions with parallelized tree search | Code | 2
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models | Code | 2
See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of Decomposition | Code | 2
RPN: Reconciled Polynomial Network Towards Unifying PGMs, Kernel SVMs, MLP and KAN | Code | 2
Language Representations Can be What Recommenders Need: Findings and Potentials | Code | 2
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition | Code | 2
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Code | 2
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration | Code | 2
MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos | Code | 2
Adaptive Parametric Activation | Code | 2
WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving | Code | 2
AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization | Code | 2
xLSTMTime : Long-term Time Series Forecasting With xLSTM | Code | 2
Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation | Code | 2
GOFA: A Generative One-For-All Model for Joint Graph Language Modeling | Code | 2
TTSDS -- Text-to-Speech Distribution Score | Code | 2
UrbanWorld: An Urban World Model for 3D City Generation | Code | 2
GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Code | 2
A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond | Code | 2
GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features | Code | 2
Weak-to-Strong Reasoning | Code | 2
PlacidDreamer: Advancing Harmony in Text-to-3D Generation | Code | 2
A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks | Code | 2
Forecasting GPU Performance for Deep Learning Training and Inference | Code | 2
Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation | Code | 2
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation | Code | 2
Decomposed Meta-Learning for Few-Shot Named Entity Recognition | Code | 2
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects | Code | 2
A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data | Code | 2
Perm: A Parametric Representation for Multi-Style 3D Hair Modeling | Code | 2
Tabular Data Augmentation for Machine Learning: Progress and Prospects of Embracing Generative AI | Code | 2
MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Code | 2
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention | Code | 2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Code | 2
radarODE: An ODE-Embedded Deep Learning Model for Contactless ECG Reconstruction from Millimeter-Wave Radar | Code | 2
500xCompressor: Generalized Prompt Compression for Large Language Models | Code | 2
VERINA: Benchmarking Verifiable Code Generation | Code | 2
MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents | Code | 2
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech | Code | 2
Causal Agent based on Large Language Model | Code | 2
Page 161 of 13232