| DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation | Dec 17, 2024 | Contrastive LearningImage Segmentation | CodeCode Available | 1 |
| Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models | Dec 17, 2024 | Causal Language ModelingLanguage Modeling | CodeCode Available | 0 |
| SnakModel: Lessons Learned from Training an Open Danish Large Language Model | Dec 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Harnessing Event Sensory Data for Error Pattern Prediction in Vehicles: A Language Model Approach | Dec 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| FocusChat: Text-guided Long Video Understanding via Spatiotemporal Information Filtering | Dec 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models | Dec 17, 2024 | Grammatical Error CorrectionLanguage Modeling | CodeCode Available | 0 |
| Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration | Dec 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LMUnit: Fine-grained Evaluation with Natural Language Unit Tests | Dec 17, 2024 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| iPrOp: Interactive Prompt Optimization for Large Language Models with a Human in the Loop | Dec 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training | Dec 17, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Large Language Models as Realistic Microservice Trace Generators | Dec 16, 2024 | Graph GenerationLanguage Modeling | CodeCode Available | 1 |
| Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey | Dec 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Endangered Alert: A Field-Validated Self-Training Scheme for Detecting and Protecting Threatened Wildlife on Roads and Roadsides | Dec 16, 2024 | Edge-computingLanguage Modeling | CodeCode Available | 0 |
| Krony-PT: GPT2 compressed with Kronecker Products | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Private Yet Social: How LLM Chatbots Support and Challenge Eating Disorder Recovery | Dec 16, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| The Impact of Token Granularity on the Predictive Power of Language Model Surprisal | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data | Dec 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Personalized LLM for Generating Customized Responses to the Same Query from Different Users | Dec 16, 2024 | Contrastive LearningDiversity | CodeCode Available | 0 |
| OmniVLM: A Token-Compressed, Sub-Billion-Parameter Vision-Language Model for Efficient On-Device Inference | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents | Dec 16, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| Embodied CoT Distillation From LLM To Off-the-shelf Agents | Dec 16, 2024 | Decision MakingIn-Context Learning | CodeCode Available | 3 |
| Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviews | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Whisper-GPT: A Hybrid Representation Audio Large Language Model | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLMs Can Simulate Standardized Patients via Agent Coevolution | Dec 16, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 1 |
| MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator | Dec 16, 2024 | GSM8KLanguage Modeling | CodeCode Available | 4 |
| LMM-Regularized CLIP Embeddings for Image Classification | Dec 16, 2024 | Classificationimage-classification | —Unverified | 0 |
| Active Large Language Model-based Knowledge Distillation for Session-based Recommendation | Dec 15, 2024 | Active LearningKnowledge Distillation | —Unverified | 0 |
| Finding a Wolf in Sheep's Clothing: Combating Adversarial Text-To-Image Prompts with Text Summarization | Dec 15, 2024 | Adversarial TextBinary Classification | —Unverified | 0 |
| Embracing Large Language Models in Traffic Flow Forecasting | Dec 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval | Dec 15, 2024 | Image RetrievalInstruction Following | —Unverified | 0 |
| LAW: Legal Agentic Workflows for Custody and Fund Services Contracts | Dec 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Progressive Transformer for Unifying Binary Code Embedding and Knowledge Transfer | Dec 15, 2024 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| Superhuman performance of a large language model on the reasoning tasks of a physician | Dec 14, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives | Dec 14, 2024 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Inference Scaling for Bridging Retrieval and Augmented Generation | Dec 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Verify Summary Facts with Fine-Grained LLM Feedback | Dec 14, 2024 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |
| WEPO: Web Element Preference Optimization for LLM-based Web Navigation | Dec 14, 2024 | Autonomous Web NavigationLanguage Modeling | —Unverified | 0 |
| Optimizing Vision-Language Interactions Through Decoder-Only Models | Dec 14, 2024 | DecoderImage Captioning | —Unverified | 0 |
| EVLM: Self-Reflective Multimodal Reasoning for Cross-Dimensional Visual Editing | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WHAT-IF: Exploring Branching Narratives by Meta-Prompting Large Language Models | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Solving the Inverse Alignment Problem for Efficient RLHF | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model | Dec 13, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens | Dec 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Small Language Model as Data Prospector for Large Language Model | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PickLLM: Context-Aware RL-Assisted Large Language Model Routing | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |