SOTAVerified

Large Language Model

Papers

Showing 18011810 of 6097 papers

TitleStatusHype
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model0
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene0
SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled SynthesisCode1
Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding0
HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models0
ROD-MLLM: Towards More Reliable Object Detection in Multimodal Large Language Models0
DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving0
VideoGLaMM : A Large Multimodal Model for Pixel-Level Visual Grounding in Videos0
Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question AnsweringCode1
Video-Bench: Human-Aligned Video Generation Benchmark0
Show:102550
← PrevPage 181 of 610Next →

No leaderboard results yet.