SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 30513100 of 659983 papers

TitleStatusHype
Generative Multimodal Models are In-Context LearnersCode3
Attention is not not ExplanationCode3
Evaluating Language Model Agency through NegotiationsCode3
DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge DetectionCode3
Pheme: Efficient and Conversational Speech GenerationCode3
Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual ModelsCode3
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web TasksCode3
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-DesignCode3
SliceGPT: Compress Large Language Models by Deleting Rows and ColumnsCode3
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text SegmentationCode3
LongAlign: A Recipe for Long Context Alignment of Large Language ModelsCode3
Noise Contrastive Alignment of Language Models with Explicit RewardsCode3
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian SplattingCode3
Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series ForecastingCode3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language ModelsCode3
Magic-Me: Identity-Specific Video Customized DiffusionCode3
BitDelta: Your Fine-Tune May Only Be Worth One BitCode3
QuRating: Selecting High-Quality Data for Training Language ModelsCode3
LLMDFA: Analyzing Dataflow in Code with Large Language ModelsCode3
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-PositiveCode3
Codec-SUPERB: An In-Depth Analysis of Sound Codec ModelsCode3
Towards Building Multilingual Language Model for MedicineCode3
ChatMusician: Understanding and Generating Music Intrinsically with LLMCode3
Leveraging Enhanced Queries of Point Sets for Vectorized Map ConstructionCode3
Explicit Interaction for Fusion-Based Place RecognitionCode3
Diffusion Language Models Are Versatile Protein LearnersCode3
CAMixerSR: Only Details Need More "Attention"Code3
CLLMs: Consistency Large Language ModelsCode3
SynCode: LLM Generation with Grammar AugmentationCode3
Controllable Text Generation for Large Language Models: A SurveyCode3
RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly DetectionCode3
Generalizing Denoising to Non-Equilibrium Structures Improves Equivariant Force FieldsCode3
Retrieval Augmented Generation and Understanding in Vision: A Survey and New OutlookCode3
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical ImagesCode3
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain FrameworkCode3
Rotary Position Embedding for Vision TransformerCode3
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and ModulationCode3
The Elements of Differentiable ProgrammingCode3
Advancing LLM Reasoning Generalists with Preference TreesCode3
Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMsCode3
OGBench: Benchmarking Offline Goal-Conditioned RLCode3
HPNet: Dynamic Trajectory Forecasting with Historical Prediction AttentionCode3
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on GraphsCode3
NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous DrivingCode3
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning AgentCode3
VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement LearningCode3
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion ModelsCode3
ModernTCN: A Modern Pure Convolution Structure for General Time Series AnalysisCode3
Efficient Multimodal Large Language Models: A SurveyCode3
CV-VAE: A Compatible Video VAE for Latent Generative Video ModelsCode3
Show:102550
← PrevPage 62 of 13200Next →