SOTAVerified

Descriptive

Papers

Showing 1-25 of 1477 papers

| Title | Status | Hype |
|---|---|---|
| DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization | | 0 |
| Assay2Mol: large language model-based drug design using BioAssay context | Code | 0 |
| Describe Anything Model for Visual Question Answering on Text-rich Images | Code | 1 |
| FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation | | 0 |
| Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual Descriptor | Code | 0 |
| Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization | | 0 |
| Dataset Distillation via Vision-Language Category Prototype | Code | 1 |
| Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization | | 0 |
| Experiential marketing strategy and tourism demand in the contribution of the positioning of the floating islands Los Uros, Puno | | 0 |
| DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Code | 1 |
| A Simple Contrastive Framework Of Item Tokenization For Generative Recommendation | | 0 |
| InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems | Code | 1 |
| SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning | Code | 2 |
| Uncovering Intention through LLM-Driven Code Snippet Description Generation | | 0 |
| A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation | | 0 |
| Evolvable Conditional Diffusion | | 0 |
| Rethinking Optimization: A Systems-Based Approach to Social Externalities | | 0 |
| Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables | | 0 |
| Alice and the Caterpillar: A more descriptive null model for assessing data mining results | Code | 0 |
| CoLMbo: Speaker Language Model for Descriptive Profiling | Code | 0 |
| ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model | Code | 2 |
| CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | Code | 2 |
| ArchiLense: A Framework for Quantitative Analysis of Architectural Styles Based on Vision Large Language Models | | 0 |
| ARGUS: Hallucination and Omission Evaluation in Video-LLMs | | 0 |
| The Influence of Tourist Experience on Revisit Decisions with the Mediation of Tourist Satisfaction | | 0 |

No leaderboard results yet.