SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 2130 of 671 papers

TitleStatusHype
FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model EvaluationCode2
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object TrajectoryCode2
Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image AnalysisCode2
Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality InversionCode2
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific LiteratureCode2
Where am I? Cross-View Geo-localization with Natural Language DescriptionsCode2
Gramian Multimodal Representation Learning and AlignmentCode2
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language ModelsCode2
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial ApplicationsCode2
Towards Vision-Language Geo-Foundation Model: A SurveyCode2
Show:102550
← PrevPage 3 of 68Next →

No leaderboard results yet.