SOTAVerified

Descriptive

Papers

Showing 101150 of 1477 papers

TitleStatusHype
JAMMIN-GPT: Text-based Improvisation using LLMs in Ableton LiveCode1
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel SizesCode1
FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis AssistantCode1
Language-Assisted 3D Feature Learning for Semantic Scene UnderstandingCode1
A Bi-directional Transformer for Musical Chord RecognitionCode1
Learning Concise and Descriptive Attributes for Visual RecognitionCode1
Leveraging Large Language Models for Enhancing the Understandability of Generated Unit TestsCode1
Logical Consistency and Greater Descriptive Power for Facial Hair Attribute LearningCode1
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text SupervisionCode1
FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language ModelsCode1
Field Convolutions for Surface CNNsCode1
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music RetrievalCode1
Modeling the Complexity and Descriptive Adequacy of Construction GrammarsCode1
MORE: Multi-Order RElation Mining for Dense Captioning in 3D ScenesCode1
Enhancing Monocular 3D Scene Completion with Diffusion ModelCode1
Multi-Grained Multimodal Interaction Network for Entity LinkingCode1
Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific ArticlesCode1
Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMsCode1
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic SegmentationCode1
ANNdotNET -- deep learning tool on .NET PlatformCode1
Text-Guided Neural Image InpaintingCode1
Neural-Symbolic Descriptive Action Model from Images: The Search for STRIPSCode1
NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup AnnotationsCode1
On the descriptive power of LiDAR intensity images for segment-based loop closing in 3-D SLAMCode1
FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font ApplicationsCode1
PDNS-Net: A Large Heterogeneous Graph Benchmark Dataset of Network Resolutions for Graph LearningCode1
HDCC: A Hyperdimensional Computing compiler for classification on embedded systems and high-performance computingCode1
Driving Style Recognition Using Interval Type-2 Fuzzy Inference System and Multiple Experts Decision MakingCode1
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For DrivingCode1
Dual-Level Collaborative Transformer for Image CaptioningCode1
A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM)Code1
DEER: Descriptive Knowledge Graph for Explaining Entity RelationshipsCode1
DOBF: A Deobfuscation Pre-Training Objective for Programming LanguagesCode1
EgoTaskQA: Understanding Human Tasks in Egocentric VideosCode1
DFR: Deep Feature Reconstruction for Unsupervised Anomaly SegmentationCode1
Aligning LLM Agents by Learning Latent Preference from User EditsCode1
Descriptive and Predictive Analysis of Euroleague Basketball Games and the Wisdom of Basketball CrowdsCode1
Can Machines Learn Morality? The Delphi ExperimentCode1
Deep Graph Matching under Quadratic ConstraintCode1
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and AccountabilityCode1
Deep Implicit Statistical Shape Models for 3D Medical Image DelineationCode1
Distilling BlackBox to Interpretable models for Efficient Transfer LearningCode1
Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language UnderstandingCode1
Controlling Latent Diffusion Using Latent CLIPCode1
Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading BooksCode1
Deep learning based geometric registration for medical images: How accurate can we get without visual features?Code1
A Recipe for Creating Multimodal Aligned Datasets for Sequential TasksCode1
Describe Anything Model for Visual Question Answering on Text-rich ImagesCode1
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractionsCode1
Contrastive Learning and Mixture of Experts Enables Precise Vector EmbeddingsCode1
Show:102550
← PrevPage 3 of 30Next →

No leaderboard results yet.