Descriptive

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 1477 papers

Title	Date	Tasks	Status	Hype
Visually Descriptive Language Model for Vector Graphics Reasoning	Apr 9, 2024	DescriptiveLanguage Modeling	CodeCode Available	9
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy	Mar 21, 2024	Contrastive LearningDescriptive	CodeCode Available	7
AudioGen: Textually Guided Audio Generation	Sep 30, 2022	Audio GenerationDescriptive	CodeCode Available	6
Fundamental Components of Deep Learning: A category-theoretic approach	Mar 13, 2024	Deep LearningDescriptive	CodeCode Available	5
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation	Jun 2, 2025	4kDescriptive	CodeCode Available	3
Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey	Dec 3, 2024	Change DetectionDescriptive	CodeCode Available	3
ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation	Sep 20, 2024	DescriptiveQuestion Answering	CodeCode Available	3
Descriptive Image Quality Assessment in the Wild	May 29, 2024	DescriptiveImage Quality Assessment	CodeCode Available	3
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation	Apr 15, 2024	Contrastive LearningDescriptive	CodeCode Available	3
A Survey on Self-Supervised Learning for Non-Sequential Tabular Data	Feb 2, 2024	Contrastive LearningDescriptive	CodeCode Available	3
Fine-Tuning Language Models from Human Preferences	Sep 18, 2019	DescriptiveLanguage Modelling	CodeCode Available	3
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning	Jun 18, 2025	Caption GenerationDescriptive	CodeCode Available	2
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model	Jun 11, 2025	cross-modal alignmentDescriptive	CodeCode Available	2
CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models	Jun 11, 2025	counterfactualDescriptive	CodeCode Available	2
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning	May 29, 2025	Anomaly DetectionDescriptive	CodeCode Available	2
RuleKit 2: Faster and simpler rule learning	Apr 29, 2025	Descriptive	CodeCode Available	2
Q-Insight: Understanding Image Quality via Visual Reinforcement Learning	Mar 28, 2025	DescriptiveImage Quality Assessment	CodeCode Available	2
Teaching LMMs for Image Quality Scoring and Interpreting	Mar 12, 2025	DescriptiveImage Quality Assessment	CodeCode Available	2
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification	Feb 12, 2025	DecoderDescriptive	CodeCode Available	2
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression	Jan 1, 2025	Descriptive	CodeCode Available	2
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression	Dec 5, 2024	DescriptiveVisual Question Answering	CodeCode Available	2
SensorLLM: Human-Intuitive Alignment of Multivariate Sensor Data with LLMs for Activity Recognition	Oct 14, 2024	Activity RecognitionDescriptive	CodeCode Available	2
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion	Sep 26, 2024	DescriptiveGeneralized Referring Expression Comprehension	CodeCode Available	2
SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description	Aug 24, 2024	DescriptiveSpeech Synthesis	CodeCode Available	2
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision	Jul 8, 2024	Action Quality AssessmentDescriptive	CodeCode Available	2

Show:10 25 50

← PrevPage 1 of 60Next →

No leaderboard results yet.