| Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector | Mar 26, 2025 | Binary ClassificationDeepFake Detection | CodeCode Available | 2 |
| Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping | Mar 25, 2025 | Computational PhenotypingLanguage Modeling | —Unverified | 0 |
| OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition | Mar 25, 2025 | Contrastive LearningIntent Recognition | —Unverified | 0 |
| FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model | Mar 25, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| SemEval-2025 Task 9: The Food Hazard Detection Challenge | Mar 25, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improved Alignment of Modalities in Large Vision Language Models | Mar 25, 2025 | GPUImage Captioning | —Unverified | 0 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |