SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1165111700 of 661570 papers

TitleStatusHype
Equivariant Multi-Modality Image FusionCode2
Pengi: An Audio Language Model for Audio TasksCode2
Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language ModelsCode2
Efficient Mixed Transformer for Single Image Super-ResolutionCode2
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool EmbeddingsCode2
TSGM: A Flexible Framework for Generative Modeling of Synthetic Time SeriesCode2
PointGPT: Auto-regressively Generative Pre-training from Point CloudsCode2
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language ModelsCode2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine TranslationCode2
DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical ImagesCode2
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the WildCode2
Causal Document-Grounded Dialogue Pre-trainingCode2
OpenShape: Scaling Up 3D Shape Representation Towards Open-World UnderstandingCode2
Structural Pruning for Diffusion ModelsCode2
Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload AwarenessCode2
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language ModelCode2
Segment Any Anomaly without Training via Hybrid Prompt RegularizationCode2
Going Denser with Open-Vocabulary Part SegmentationCode2
3D Registration with Maximal CliquesCode2
A Survey on Time-Series Pre-Trained ModelsCode2
Listen, Think, and UnderstandCode2
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized AttentionCode2
Evaluating Object Hallucination in Large Vision-Language ModelsCode2
Investigating image-based fallow weed detection performance on Raphanus sativus and Avena sativa at speeds up to 30 km h^-1Code2
TextSLAM: Visual SLAM with Semantic Planar Text FeaturesCode2
Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product NetworksCode2
Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenesCode2
Improving Language Model Negotiation with Self-Play and In-Context Learning from AI FeedbackCode2
MemoryBank: Enhancing Large Language Models with Long-Term MemoryCode2
DoReMi: Optimizing Data Mixtures Speeds Up Language Model PretrainingCode2
StructGPT: A General Framework for Large Language Model to Reason over Structured DataCode2
AbdomenAtlas-8K: Annotating 8,000 CT Volumes for Multi-Organ Segmentation in Three WeeksCode2
ICDAR 2023 Competition on Hierarchical Text Detection and RecognitionCode2
CLRerNet: Improving Confidence of Lane Detection with LaneIoUCode2
Interpretability at Scale: Identifying Causal Mechanisms in AlpacaCode2
Identity-Preserving Talking Face Generation with Landmark and Appearance PriorsCode2
Denoising Diffusion Models for Plug-and-Play Image RestorationCode2
Large Language Models are Zero-Shot Rankers for Recommender SystemsCode2
NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape EstimationCode2
Common Diffusion Noise Schedules and Sample Steps are FlawedCode2
Large Language Model Guided Tree-of-ThoughtCode2
Marsellus: A Heterogeneous RISC-V AI-IoT End-Node SoC with 2-to-8b DNN Acceleration and 30%-Boost Adaptive Body BiasingCode2
Diffusion Models for Imperceptible and Transferable Adversarial AttackCode2
ULIP-2: Towards Scalable Multimodal Pre-training for 3D UnderstandingCode2
OCRBench: On the Hidden Mystery of OCR in Large Multimodal ModelsCode2
Benchmarks and leaderboards for sound demixing tasksCode2
How to Index Item IDs for Recommendation Foundation ModelsCode2
WebCPM: Interactive Web Search for Chinese Long-form Question AnsweringCode2
An Inverse Scaling Law for CLIP TrainingCode2
InstructBLIP: Towards General-purpose Vision-Language Models with Instruction TuningCode2
Show:102550
← PrevPage 234 of 13232Next →