| LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking | May 31, 2024 | In-Context Learning, Information Retrieval | Code Available | 0 | 5 |
| DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization | Feb 11, 2025 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| LLM-enhanced Self-training for Cross-domain Constituency Parsing | Nov 5, 2023 | Constituency Parsing, Language Modeling | Code Available | 0 | 5 |
| LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress? | May 7, 2025 | Large Language Model, Mixture-of-Experts | Code Available | 0 | 5 |
| DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models | Dec 17, 2024 | Grammatical Error Correction, Language Modeling | Code Available | 0 | 5 |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Apr 22, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| LLM-GEm: Large Language Model-Guided Prediction of People’s Empathy Levels towards Newspaper Article | Mar 19, 2024 | Articles, Language Modeling | Code Available | 0 | 5 |
| Benchmarking Large Language Model Uncertainty for Prompt Optimization | Sep 16, 2024 | Benchmarking, Diversity | Code Available | 0 | 5 |
| Both Matter: Enhancing the Emotional Intelligence of Large Language Models without Compromising the General Intelligence | Feb 15, 2024 | Emotional Intelligence, Language Modeling | Code Available | 0 | 5 |
| Detecting the Clinical Features of Difficult-to-Treat Depression using Synthetic Data from Large Language Models | Feb 12, 2024 | Language Modelling, Large Language Model | Code Available | 0 | 5 |
| Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models | Jun 26, 2025 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem | Dec 6, 2023 | AI Agent, Language Modelling | Code Available | 0 | 5 |
| Detecting Manipulated Contents Using Knowledge-Grounded Inference | Apr 29, 2025 | Claim Verification, Fact Checking | Code Available | 0 | 5 |
| NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors | Jun 12, 2025 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model | May 3, 2024 | Image Captioning, Instruction Following | Code Available | 0 | 5 |
| LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery | Feb 26, 2024 | Continual Learning, Exemplar-Free | Code Available | 0 | 5 |
| Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors | Jun 18, 2024 | Hallucination, Language Modeling | Code Available | 0 | 5 |
| Detecting AI-Generated Texts in Cross-Domains | Oct 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| AIOS: LLM Agent Operating System | Mar 25, 2024 | AI Agent, Language Modelling | Code Available | 0 | 5 |
| LLM-as-a-Fuzzy-Judge: Fine-Tuning Large Language Models as a Clinical Evaluation Judge with Fuzzy Logic | Jun 12, 2025 | Large Language Model, Prompt Engineering | Code Available | 0 | 5 |
| LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback | Jun 5, 2024 | Few-Shot Learning, Language Modeling | Code Available | 0 | 5 |
| DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence? | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Aug 9, 2024 | Diversity, Instruction Following | Code Available | 0 | 5 |
| PoliTune: Analyzing the Impact of Data Selection and Fine-Tuning on Economic and Political Biases in Large Language Models | Apr 10, 2024 | Decision Making, Large Language Model | Code Available | 0 | 5 |
| Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model | Jul 31, 2024 | Benchmarking, Large Language Model | Code Available | 0 | 5 |