| CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLMs Can Simulate Standardized Patients via Agent Coevolution | Dec 16, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 1 |
| MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator | Dec 16, 2024 | GSM8KLanguage Modeling | CodeCode Available | 4 |
| LMM-Regularized CLIP Embeddings for Image Classification | Dec 16, 2024 | Classificationimage-classification | —Unverified | 0 |
| Active Large Language Model-based Knowledge Distillation for Session-based Recommendation | Dec 15, 2024 | Active LearningKnowledge Distillation | —Unverified | 0 |
| Finding a Wolf in Sheep's Clothing: Combating Adversarial Text-To-Image Prompts with Text Summarization | Dec 15, 2024 | Adversarial TextBinary Classification | —Unverified | 0 |
| Embracing Large Language Models in Traffic Flow Forecasting | Dec 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval | Dec 15, 2024 | Image RetrievalInstruction Following | —Unverified | 0 |
| LAW: Legal Agentic Workflows for Custody and Fund Services Contracts | Dec 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Progressive Transformer for Unifying Binary Code Embedding and Knowledge Transfer | Dec 15, 2024 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| Superhuman performance of a large language model on the reasoning tasks of a physician | Dec 14, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives | Dec 14, 2024 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Inference Scaling for Bridging Retrieval and Augmented Generation | Dec 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Verify Summary Facts with Fine-Grained LLM Feedback | Dec 14, 2024 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |
| WEPO: Web Element Preference Optimization for LLM-based Web Navigation | Dec 14, 2024 | Autonomous Web NavigationLanguage Modeling | —Unverified | 0 |
| Optimizing Vision-Language Interactions Through Decoder-Only Models | Dec 14, 2024 | DecoderImage Captioning | —Unverified | 0 |
| EVLM: Self-Reflective Multimodal Reasoning for Cross-Dimensional Visual Editing | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WHAT-IF: Exploring Branching Narratives by Meta-Prompting Large Language Models | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Solving the Inverse Alignment Problem for Efficient RLHF | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model | Dec 13, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens | Dec 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Small Language Model as Data Prospector for Large Language Model | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PickLLM: Context-Aware RL-Assisted Large Language Model Routing | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |