SOTAVerified

Attribute

Papers

Showing 851-900 of 5387 papers

Title | Status | Hype
Does Vector Quantization Fail in Spatio-Temporal Forecasting? Exploring a Differentiable Sparse Soft-Vector Quantization Approach | Code | 1
Boosting Multi-modal Model Performance with Adaptive Gradient Modulation | Code | 1
Boosting Spike Camera Image Reconstruction from a Perspective of Dealing with Spike Fluctuations | Code | 1
FedTabDiff: Federated Learning of Diffusion Probabilistic Models for Synthetic Mixed-Type Tabular Data Generation | Code | 1
Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining? | Code | 1
FICE: Text-Conditioned Fashion Image Editing With Guided GAN Inversion | Code | 1
Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? | Code | 1
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding | Code | 1
EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models | Code | 1
Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning | Code | 1
Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation | Code | 1
TailorGAN: Making User-Defined Fashion Designs | Code | 1
One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework | Code | 1
Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection | Code | 1
FineRec: Exploring Fine-grained Sequential Recommendation | Code | 1
Finetuning CLIP to Reason about Pairwise Differences | Code | 1
FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Code | 1
FLAC: Fairness-Aware Representation Learning by Suppressing Attribute-Class Associations | Code | 1
Causality-Inspired Fair Representation Learning for Multimodal Recommendation | Code | 1
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval | Code | 1
Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation | Code | 1
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions | Code | 1
DeepDC: Deep Distance Correlation as a Perceptual Image Quality Evaluator | Code | 1
Critical Influence of Overparameterization on Sharpness-aware Minimization | Code | 1
Deep Extrapolation for Attribute-Enhanced Generation | Code | 1
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency | Code | 1
AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Code | 1
FUDGE: Controlled Text Generation With Future Discriminators | Code | 1
GaitFormer: Learning Gait Representations with Noisy Multi-Task Learning | Code | 1
HyperEditor: Achieving Both Authenticity and Cross-Domain Capability in Image Editing via Hypernetworks | Code | 1
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise | Code | 1
Do Theory of Mind Benchmarks Need Explicit Human-like Reasoning in Language Models? | Code | 1
Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle | Code | 1
C2F-FWN: Coarse-to-Fine Flow Warping Network for Spatial-Temporal Consistent Motion Transfer | Code | 1
C2T-Net: Channel-Aware Cross-Fused Transformer-Style Networks for Pedestrian Attribute Recognition | Code | 1
Initiative Defense against Facial Manipulation | Code | 1
Gender Bias in Masked Language Models for Multiple Languages | Code | 1
CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing | Code | 1
Dendrite Net: A White-Box Module for Classification, Regression, and System Identification | Code | 1
Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning | Code | 1
Towards Machine Unlearning Benchmarks: Forgetting the Personal Identities in Facial Recognition Systems | Code | 1
CAILA: Concept-Aware Intra-Layer Adapters for Compositional Zero-Shot Learning | Code | 1
Calendar Graph Neural Networks for Modeling Time Structures in Spatiotemporal User Behaviors | Code | 1
Generalized Domain Conditioned Adaptation Network | Code | 1
Learning a Practical SDR-to-HDRTV Up-conversion using New Dataset and Degradation Models | Code | 1
Mix and Match: Learning-free Controllable Text Generation using Energy Language Models | Code | 1
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers | Code | 1
Generative and Contrastive Self-Supervised Learning for Graph Anomaly Detection | Code | 1
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Code | 1
HoloFace: Augmenting Human-to-Human Interactions on HoloLens | Code | 0
