SOTAVerified

cross-modal alignment

Papers

Showing 1-10 of 342 papers

Title | Status | Hype
Transformer-based Spatial Grounding: A Comprehensive Survey | - | 0
Bridge Feature Matching and Cross-Modal Alignment with Mutual-filtering for Zero-shot Anomaly Detection | - | 0
CATVis: Context-Aware Thought Visualization | - | 0
Evaluating Attribute Confusion in Fashion Text-to-Image Generation | - | 0
RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models | Code | 1
Skywork-R1V3 Technical Report | Code | 7
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment | Code | 2
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams | Code | 3
DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning | - | 0
TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation | - | 0

No leaderboard results yet.