SOTAVerified

Descriptive

Papers

Showing 3140 of 1477 papers

TitleStatusHype
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual CompressionCode2
An Item is Worth a Prompt: Versatile Image Editing with Disentangled ControlCode2
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual CompressionCode2
GRiT: A Generative Region-to-text Transformer for Object UnderstandingCode2
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image ClassificationCode2
Fine-grained Image Captioning with CLIP RewardCode2
Customization Assistant for Text-to-image GenerationCode2
AmadeusGPT: a natural language interface for interactive animal behavioral analysisCode2
Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language ModelsCode2
K-LITE: Learning Transferable Visual Models with External KnowledgeCode2
Show:102550
← PrevPage 4 of 148Next →

No leaderboard results yet.