SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 53515400 of 177340 papers

TitleStatusHype
Machine Learning in Asset Management—Part 2: Portfolio Construction—Weight Optimization. The Journal of Financial Data ScienceCode2
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR SummarizationCode2
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention CausalityCode2
InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized RationalesCode2
MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive DiversityCode2
BANet: Bilateral Aggregation Network for Mobile Stereo MatchingCode2
Axes that matter: PCA with a differenceCode2
Duoduo CLIP: Efficient 3D Understanding with Multi-View ImagesCode2
ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image SegmentationCode2
Neural Plasticity-Inspired Multimodal Foundation Model for Earth ObservationCode2
PromptDet: Towards Open-vocabulary Detection using Uncurated ImagesCode2
Geometric Transformer for Fast and Robust Point Cloud RegistrationCode2
Weak-to-Strong Extrapolation Expedites AlignmentCode2
A Survey of Pretraining on Graphs: Taxonomy, Methods, and ApplicationsCode2
Practical Compact Deep Compressed SensingCode2
PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision MakersCode2
BHViT: Binarized Hybrid Vision TransformerCode2
Observational Scaling Laws and the Predictability of Language Model PerformanceCode2
Improving Synthetic Image Detection Towards Generalization: An Image Transformation PerspectiveCode2
Melody transcription via generative pre-trainingCode2
ExT5: Towards Extreme Multi-Task Scaling for Transfer LearningCode2
Binding Language Models in Symbolic LanguagesCode2
FreGrad: Lightweight and Fast Frequency-aware Diffusion VocoderCode2
Isotropic3D: Image-to-3D Generation Based on a Single CLIP EmbeddingCode2
ReFinED: An Efficient Zero-shot-capable Approach to End-to-End Entity LinkingCode2
DetailCLIP: Detail-Oriented CLIP for Fine-Grained TasksCode2
Multi-instrument Music Synthesis with Spectrogram DiffusionCode2
Interpreting Object-level Foundation Models via Visual Precision SearchCode2
EDGE: Editable Dance Generation From MusicCode2
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent SpaceCode2
A Systematic Review on the Evaluation of Large Language Models in Theory of Mind TasksCode2
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language ModelsCode2
UltraSam: A Foundation Model for Ultrasound using Large Open-Access Segmentation DatasetsCode2
Mini-DALLE3: Interactive Text to Image by Prompting Large Language ModelsCode2
Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel BaselineCode2
Scalable Diffusion Models with State Space BackboneCode2
Temporally Consistent Transformers for Video GenerationCode2
Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token DictionaryCode2
Meta Prompting for AI SystemsCode2
VideoShield: Regulating Diffusion-based Video Generation Models via WatermarkingCode2
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution AdaptationCode2
FreeVC: Towards High-Quality Text-Free One-Shot Voice ConversionCode2
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature MimickingCode2
MS-DETR: Efficient DETR Training with Mixed SupervisionCode2
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical ReasoningCode2
Bokehlicious: Photorealistic Bokeh Rendering with Controllable AperturesCode2
Accelerating Transformers with Spectrum-Preserving Token MergingCode2
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to SeeCode2
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation LearningCode2
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image GenerationCode2
Show:102550
← PrevPage 108 of 3547Next →