SOTAVerified

visual instruction following

Papers

Showing 1–10 of 24 papers

| Title | Status | Hype |
|---|---|---|
| Do we Really Need Visual Instructions? Towards Visual Instruction-Free Fine-tuning for Large Vision-Language Models | — | 0 |
| Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning | Code | 1 |
| GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding | Code | 2 |
| MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection | Code | 0 |
| M4CXR: Exploring Multi-task Potentials of Multi-modal Large Language Models for Chest X-ray Interpretation | — | 0 |
| Space-LLaVA: a Vision-Language Model Adapted to Extraterrestrial Applications | — | 0 |
| LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition | — | 0 |
| MMSci: A Dataset for Graduate-Level Multi-Discipline Multimodal Scientific Understanding | Code | 2 |
| Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification | — | 0 |
| Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags | — | 0 |
Page 1 of 3

No leaderboard results yet.