SOTAVerified

Attribute

Papers

Showing 901950 of 5387 papers

TitleStatusHype
Diffusion Guided Language ModelingCode1
Human Speech Perception in Noise: Can Large Language Models Paraphrase to Improve It?Code0
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling0
Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image SynthesisCode1
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon TasksCode2
Training LLMs to Recognize Hedges in Spontaneous NarrativesCode0
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning0
KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous drivingCode0
ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social ScienceCode0
MMPKUBase: A Comprehensive and High-quality Chinese Multi-modal Knowledge Graph0
SAT3D: Image-driven Semantic Attribute Transfer in 3D0
Regularized Contrastive Partial Multi-view Outlier Detection0
DERA: Dense Entity Retrieval for Entity Alignment in Knowledge Graphs0
Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed InputsCode1
Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio GenerationCode1
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text RetrievalCode1
DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and ExplanationCode0
PrivateGaze: Preserving User Privacy in Black-box Mobile Gaze Tracking ServicesCode0
Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control0
"Patriarchy Hurts Men Too." Does Your Model Agree? A Discussion on Fairness Assumptions0
LADDER: Language Driven Slice Discovery and Error RectificationCode1
Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme DetectionCode1
HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose EstimationCode0
VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative AdversaryCode0
Multi-Modal CLIP-Informed Protein Editing0
Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical RoutingCode1
Diffusion-driven lensless fiber endomicroscopic quantitative phase imaging towards digital pathology0
Unveiling Privacy Vulnerabilities: Investigating the Role of Structure in Graph Data0
A Reference-Based 3D Semantic-Aware Framework for Accurate Local Facial Attribute Editing0
Learning mental states estimation through self-observation: a developmental synergy between intentions and beliefs representations in a deep-learning model of Theory of Mind0
Lifelong Graph Learning for Graph SummarizationCode0
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control0
Hidden or Inferred: Fair Learning-To-Rank with Unknown DemographicsCode0
Quantifying the Role of Textual Predictability in Automatic Speech Recognition0
MLLM-CompBench: A Comparative Reasoning Benchmark for Multimodal LLMsCode1
Unveiling and Mitigating Bias in Audio Visual Segmentation0
VisMin: Visual Minimal-Change Understanding0
AI-Enhanced 7-Point Checklist for Melanoma Detection Using Clinical Knowledge Graphs and Data-Driven QuantificationCode0
Text2Place: Affordance-aware Text Guided Human Placement0
Regression under demographic parity constraints via unlabeled post-processing0
TimeInf: Time Series Data Contribution via Influence FunctionsCode1
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement0
Out of spuriousity: Improving robustness to spurious correlations without group annotations0
An Explainable Fast Deep Neural Network for Emotion Recognition0
Img2CAD: Reverse Engineering 3D CAD Models from Images through VLM-Assisted Conditional Factorization0
Are handcrafted filters helpful for attributing AI-generated images?0
PD-APE: A Parallel Decoding Framework with Adaptive Position Encoding for 3D Visual Grounding0
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video GenerationCode2
A Benchmark for Gaussian Splatting Compression and Quality Assessment StudyCode1
Learning Visual Grounding from Generative Vision and Language Model0
Show:102550
← PrevPage 19 of 108Next →

No leaderboard results yet.