Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best? Mar 27, 2025 Hallucination Hallucination Evaluation
— Unverified 0Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation Mar 27, 2025 Code Generation Language Modeling
— Unverified 0MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness Mar 27, 2025 Language Modeling Language Modelling
— Unverified 0Prompting Vision-Language Model for Nuclei Instance Segmentation and Classification Mar 27, 2025 Cell Segmentation Contrastive Learning
Code Code Available 0LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning Mar 27, 2025 Decision Making Language Modeling
— Unverified 0A Multi-Modal Knowledge-Enhanced Framework for Vessel Trajectory Prediction Mar 27, 2025 Language Modeling Language Modelling
— Unverified 0Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without Them Mar 27, 2025 Continual Pretraining Language Modeling
— Unverified 0Debate-Driven Multi-Agent LLMs for Phishing Email Detection Mar 27, 2025 Language Modeling Language Modelling
— Unverified 0FakeReasoning: Towards Generalizable Forgery Detection and Reasoning Mar 27, 2025 Attribute Binary Classification
— Unverified 0Boosting Large Language Models with Mask Fine-Tuning Mar 27, 2025 Language Modeling Language Modelling
Code Code Available 0Controlling Large Language Model with Latent Actions Mar 27, 2025 CoLA Language Modeling
Code Code Available 0VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models via Voxel Representation Mar 27, 2025 Autonomous Navigation Language Modeling
— Unverified 0Using large language models to produce literature reviews: Usages and systematic biases of microphysics parametrizations in 2699 publications Mar 27, 2025 Language Modeling Language Modelling
— Unverified 0VALLR: Visual ASR Language Model for Lip Reading Mar 27, 2025 Automatic Speech Recognition Language Modeling
— Unverified 0The cell as a token: high-dimensional geometry in language models and cell embeddings Mar 26, 2025 Language Modeling Language Modelling
— Unverified 0LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions Mar 26, 2025 Anomaly Detection Language Modeling
— Unverified 0RALLRec+: Retrieval Augmented Large Language Model Recommendation with Reasoning Mar 26, 2025 Language Modeling Language Modelling
Code Code Available 0MoRE-LLM: Mixture of Rule Experts Guided by a Large Language Model Mar 26, 2025 Language Modeling Language Modelling
Code Code Available 0D4R -- Exploring and Querying Relational Graphs Using Natural Language and Large Language Models -- the Case of Historical Documents Mar 26, 2025 Language Modeling Language Modelling
— Unverified 0Exploring the Effect of Robotic Embodiment and Empathetic Tone of LLMs on Empathy Elicitation Mar 26, 2025 Chatbot Language Modeling
— Unverified 0A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications Mar 26, 2025 Language Modeling Language Modelling
Code Code Available 0Can Large Language Models Predict Associations Among Human Attitudes? Mar 26, 2025 Language Modeling Language Modelling
— Unverified 0ASGO: Adaptive Structured Gradient Optimization Mar 26, 2025 Language Modeling Language Modelling
— Unverified 0AutoRad-Lung: A Radiomic-Guided Prompting Autoregressive Vision-Language Model for Lung Nodule Malignancy Prediction Mar 26, 2025 Computed Tomography (CT) cross-modal alignment
— Unverified 0InfoBid: A Simulation Framework for Studying Information Disclosure in Auctions with Large Language Model-based Agents Mar 26, 2025 Language Modeling Language Modelling
— Unverified 0Dynamic Pyramid Network for Efficient Multimodal Large Language Model Mar 26, 2025 Language Modeling Language Modelling
Code Code Available 0CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing Mar 26, 2025 Language Modeling Language Modelling
— Unverified 0Generative Linguistics, Large Language Models, and the Social Nature of Scientific Success Mar 25, 2025 Language Modeling Language Modelling
— Unverified 01.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training Mar 25, 2025 Language Modeling Language Modelling
— Unverified 0Improved Alignment of Modalities in Large Vision Language Models Mar 25, 2025 GPU Image Captioning
— Unverified 0Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model Mar 25, 2025 Diversity Language Modeling
— Unverified 0FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model Mar 25, 2025 Denoising Language Modeling
— Unverified 0A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition Mar 25, 2025 Contrastive Learning Intent Recognition
— Unverified 0CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model Mar 25, 2025 Decision Making Language Modeling
— Unverified 0Optimizing Language Models for Inference Time Objectives using Reinforcement Learning Mar 25, 2025 Code Generation Language Modeling
— Unverified 0OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching Mar 25, 2025 Language Modeling Language Modelling
— Unverified 0PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping Mar 25, 2025 Computational Phenotyping Language Modeling
— Unverified 0Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery Mar 25, 2025 Language Modeling Language Modelling
— Unverified 0Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning Mar 25, 2025 Language Modeling Language Modelling
— Unverified 0SemEval-2025 Task 9: The Food Hazard Detection Challenge Mar 25, 2025 Decoder Language Modeling
— Unverified 0Solving Situation Puzzles with Large Language Model and External Reformulation Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0MMCR: Advancing Visual Language Model in Multimodal Multi-Turn Contextual Reasoning Mar 24, 2025 Diagnostic Language Modeling
— Unverified 0Manipulation and the AI Act: Large Language Model Chatbots and the Danger of Mirrors Mar 24, 2025 Chatbot Language Modeling
— Unverified 0TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling Mar 24, 2025 Continual Pretraining Language Modeling
— Unverified 0ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation Mar 24, 2025 Code Generation Language Modeling
— Unverified 0LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Unsupervised Acquisition of Discrete Grammatical Categories Mar 24, 2025 Language Acquisition Language Modeling
— Unverified 0CLEAR: Contrasting Textual Feedback with Experts and Amateurs for Reasoning Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0