| Using large language models to produce literature reviews: Usages and systematic biases of microphysics parametrizations in 2699 publications | Mar 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MoRE-LLM: Mixture of Rule Experts Guided by a Large Language Model | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| InfoBid: A Simulation Framework for Studying Information Disclosure in Auctions with Large Language Model-based Agents | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| D4R -- Exploring and Querying Relational Graphs Using Natural Language and Large Language Models -- the Case of Historical Documents | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Synthesizing world models for bilevel planning | Mar 26, 2025 | Large Language ModelProgram Synthesis | —Unverified | 0 |
| Dynamic Pyramid Network for Efficient Multimodal Large Language Model | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Operating Room Workflow Analysis via Reasoning Segmentation over Digital Twins | Mar 26, 2025 | Large Language ModelReasoning Segmentation | —Unverified | 0 |
| Qwen2.5-Omni Technical Report | Mar 26, 2025 | Automatic Speech Recognition (ASR)GSM8K | CodeCode Available | 7 |
| CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation | Mar 26, 2025 | Large Language ModelScheduling | CodeCode Available | 1 |
| Cross-Modal Prototype Allocation: Unsupervised Slide Representation Learning via Patch-Text Contrast in Computational Pathology | Mar 26, 2025 | DescriptiveLarge Language Model | —Unverified | 0 |
| RALLRec+: Retrieval Augmented Large Language Model Recommendation with Reasoning | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Exploring the Effect of Robotic Embodiment and Empathetic Tone of LLMs on Empathy Elicitation | Mar 26, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping | Mar 25, 2025 | Computational PhenotypingLanguage Modeling | —Unverified | 0 |
| Iterative Hypothesis Generation for Scientific Discovery with Monte Carlo Nash Equilibrium Self-Refining Trees | Mar 25, 2025 | Large Language Modelscientific discovery | —Unverified | 0 |
| LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Mar 25, 2025 | Cross-Modal RetrievalHallucination | CodeCode Available | 1 |
| FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs | Mar 25, 2025 | Efficient ExplorationInformation Retrieval | —Unverified | 0 |
| A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition | Mar 25, 2025 | Contrastive LearningIntent Recognition | —Unverified | 0 |
| ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation | Mar 25, 2025 | Action GenerationAutonomous Driving | —Unverified | 0 |
| Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SemEval-2025 Task 9: The Food Hazard Detection Challenge | Mar 25, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| 1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code CompletionLanguage Modeling | CodeCode Available | 1 |
| Membership Inference Attacks on Large-Scale Models: A Survey | Mar 25, 2025 | Large Language ModelSurvey | —Unverified | 0 |
| Cross-Tokenizer Distillation via Approximate Likelihood Matching | Mar 25, 2025 | Large Language Model | CodeCode Available | 2 |
| A Survey of Large Language Model Agents for Question Answering | Mar 24, 2025 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models | Mar 24, 2025 | Large Language ModelSarcasm Detection | —Unverified | 0 |
| Solving Situation Puzzles with Large Language Model and External Reformulation | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sun-Shine: A Large Language Model for Tibetan Culture | Mar 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training | Mar 24, 2025 | DiversityLarge Language Model | CodeCode Available | 1 |
| Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization | Mar 24, 2025 | GPULarge Language Model | —Unverified | 0 |
| ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation | Mar 24, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Manipulation and the AI Act: Large Language Model Chatbots and the Danger of Mirrors | Mar 24, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| Simulating Filter Bubble on Short-video Recommender System with Large Language Model Agents | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Payload-Aware Intrusion Detection with CMAE and Large Language Models | Mar 23, 2025 | Intrusion DetectionLanguage Modeling | —Unverified | 0 |
| AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs | Mar 23, 2025 | Large Language Model | —Unverified | 0 |
| LakotaBERT: A Transformer-based Model for Low Resource Lakota Language | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unleashing the power of text for credit default prediction: Comparing human-written and generative AI-refined texts | Mar 23, 2025 | Large Language ModelSemantic Similarity | —Unverified | 0 |
| WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model | Mar 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MEPNet: Medical Entity-balanced Prompting Network for Brain CT Report Generation | Mar 22, 2025 | AnatomyLarge Language Model | CodeCode Available | 1 |
| Large Language Model Compression via the Nested Activation-Aware Decomposition | Mar 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement | Mar 21, 2025 | Dimensionality ReductionLanguage Modeling | —Unverified | 0 |