| NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus | Sep 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models | Nov 11, 2024 | AttributeDialogue Generation | CodeCode Available | 0 | 5 |
| AgentStealth: Reinforcing Large Language Model for Anonymizing User-generated Text | Jun 26, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Chaining thoughts and LLMs to learn DNA structural biophysics | Mar 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| MuseChat: A Conversational Music Recommendation System for Videos | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation | Nov 15, 2023 | Constituency ParsingKnowledge Distillation | CodeCode Available | 0 | 5 |
| Narrative Shift Detection: A Hybrid Approach of Dynamic Topic Models and Large Language Models | Jun 25, 2025 | ArticlesChange Point Detection | CodeCode Available | 0 | 5 |
| Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval | Sep 30, 2024 | Cross-Modal RetrievalLarge Language Model | CodeCode Available | 0 | 5 |
| Are Generative AI Agents Effective Personalized Financial Advisors? | Apr 8, 2025 | Large Language Model | CodeCode Available | 0 | 5 |
| CellTypeAgent: Trustworthy cell type annotation with Large Language Models | May 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration | Nov 5, 2024 | Collaborative InferenceLarge Language Model | CodeCode Available | 0 | 5 |
| Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering | Dec 19, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Multi-Objective Large Language Model Unlearning | Dec 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM Pipelines | Jun 20, 2024 | BenchmarkingDecision Making | CodeCode Available | 0 | 5 |
| CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents | May 29, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 | 5 |
| CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations | Jul 8, 2025 | Generative Adversarial NetworkLarge Language Model | CodeCode Available | 0 | 5 |
| Causal Walk: Debiasing Multi-Hop Fact Verification with Front-Door Adjustment | Mar 5, 2024 | Causal Inferencecounterfactual | CodeCode Available | 0 | 5 |
| Multi-Lingual Cyber Threat Detection in Tweets/X Using ML, DL, and LLM: A Comparative Analysis | Feb 4, 2025 | Large Language Model | CodeCode Available | 0 | 5 |
| Multi-Programming Language Ensemble for Code Generation in Large Language Model | Sep 6, 2024 | Code GenerationHumanEval | CodeCode Available | 0 | 5 |
| MT4CrossOIE: Multi-stage Tuning for Cross-lingual Open Information Extraction | Aug 12, 2023 | Cross-Lingual TransferLanguage Modelling | CodeCode Available | 0 | 5 |
| mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale | Jun 26, 2025 | Anomaly DetectionBenchmarking | CodeCode Available | 0 | 5 |
| Multi-Armed Bandit Approach for Optimizing Training on Synthetic Data | Dec 6, 2024 | AttributeLarge Language Model | CodeCode Available | 0 | 5 |
| Multi-aspect Knowledge Distillation with Large Language Model | Jan 23, 2025 | image-classificationImage Classification | CodeCode Available | 0 | 5 |
| CASTILLO: Characterizing Response Length Distributions of Large Language Models | May 22, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 | 5 |
| Indian-BhED: A Dataset for Measuring India-Centric Biases in Large Language Models | Sep 15, 2023 | FairnessLanguage Modelling | CodeCode Available | 0 | 5 |
| MorphAgent: Empowering Agents through Self-Evolving Profiles and Decentralized Collaboration | Oct 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| MoRE-LLM: Mixture of Rule Experts Guided by a Large Language Model | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Cascading Large Language Models for Salient Event Graph Generation | Jun 26, 2024 | Graph GenerationLanguage Modeling | CodeCode Available | 0 | 5 |
| MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Apr 9, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 0 | 5 |
| Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca | Sep 16, 2023 | Instruction FollowingLarge Language Model | CodeCode Available | 0 | 5 |
| Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs | Jan 22, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification | Jun 28, 2024 | Fact CheckingFact Verification | CodeCode Available | 0 | 5 |
| A Quick, trustworthy spectral knowledge Q&A system leveraging retrieval-augmented generation on LLM | Aug 21, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 | 5 |
| CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation | Nov 7, 2024 | Large Language Model | CodeCode Available | 0 | 5 |
| MOPI-HFRS: A Multi-objective Personalized Health-aware Food Recommendation System with LLM-enhanced Interpretation | Dec 12, 2024 | DescriptiveFood recommendation | CodeCode Available | 0 | 5 |
| APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Nov 26, 2024 | Few-Shot LearningLarge Language Model | CodeCode Available | 0 | 5 |
| Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Jul 15, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 | 5 |
| Multi-FAct: Assessing Factuality of Multilingual LLMs using FActScore | Feb 28, 2024 | DiversityForm | CodeCode Available | 0 | 5 |
| Mitigating the Bias of Large Language Model Evaluation | Sep 25, 2024 | Instruction FollowingLanguage Model Evaluation | CodeCode Available | 0 | 5 |
| Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks | Apr 17, 2025 | Epistemic ReasoningLarge Language Model | CodeCode Available | 0 | 5 |
| Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM | May 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement | Sep 13, 2023 | DiversityLanguage Modeling | CodeCode Available | 0 | 5 |
| MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding | Sep 10, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 0 | 5 |
| Agentic Society: Merging skeleton from real world and texture from Large Language Model | Sep 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Can LLM-Augmented autonomous agents cooperate?, An evaluation of their cooperative capabilities through Melting Pot | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research | Feb 7, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 | 5 |
| Accelerating LLM Inference with Flexible N:M Sparsity via A Fully Digital Compute-in-Memory Accelerator | Apr 19, 2025 | Large Language Model | CodeCode Available | 0 | 5 |
| MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios | Dec 27, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 0 | 5 |
| Can Language Models Evaluate Human Written Text? Case Study on Korean Student Writing for Education | Jul 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |