| Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Oct 15, 2024 | HallucinationLarge Language Model | CodeCode Available | 0 |
| An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | May 16, 2025 | FormLanguage Modeling | CodeCode Available | 0 |
| Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| WaterDrum: Watermarking for Data-centric Unlearning Metric | May 8, 2025 | Large Language Model | CodeCode Available | 0 |
| TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability | Jun 4, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Conversations in Galician: a Large Language Model for an Underrepresented Language | Nov 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Vamos: Versatile Action Models for Video Understanding | Nov 22, 2023 | EgoSchemaHard Attention | CodeCode Available | 0 |
| Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models | Apr 2, 2024 | Distractor GenerationIn-Context Learning | CodeCode Available | 0 |
| PeriGuru: A Peripheral Robotic Mobile App Operation Assistant based on GUI Image Understanding and Prompting with LLM | Sep 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Leveraging Content and Acoustic Representations for Speech Emotion Recognition | Sep 9, 2024 | Emotion RecognitionLanguage Modelling | CodeCode Available | 0 |
| Can a Large Language Model Learn Matrix Functions In Context? | Nov 24, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Jun 29, 2024 | DiversityImage Generation | CodeCode Available | 0 |
| Can a large language model be a gaslighter? | Oct 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Conversational Feedback in Scripted versus Spontaneous Dialogues: A Comparative Analysis | Sep 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Can AI Relate: Testing Large Language Model Response for Mental Health Support | May 20, 2024 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| SemEval-2017 Task 4: Sentiment Analysis in Twitter using BERT | Jan 15, 2024 | Binary ClassificationClassification | CodeCode Available | 0 |
| TULUN: Transparent and Adaptable Low-resource Machine Translation | May 24, 2025 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 |
| A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error Detection | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Exploiting the Vulnerability of Large Language Models via Defense-Aware Architectural Backdoor | Sep 3, 2024 | Backdoor AttackLarge Language Model | CodeCode Available | 0 |
| Personalized LLM for Generating Customized Responses to the Same Query from Different Users | Dec 16, 2024 | Contrastive LearningDiversity | CodeCode Available | 0 |
| Automated Privacy Information Annotation in Large Language Model Interactions | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Feb 9, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 0 |
| Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct Features | May 3, 2024 | DiagnosticLanguage Modelling | CodeCode Available | 0 |
| Scaling Reasoning can Improve Factuality in Large Language Models | May 16, 2025 | Knowledge GraphsLarge Language Model | CodeCode Available | 0 |
| Computational Reasoning of Large Language Models | Apr 29, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| Length Optimization in Conformal Prediction | Jun 27, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 0 |
| Can (A)I Change Your Mind? | Mar 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model | Jun 6, 2024 | Chinese Word SegmentationLanguage Modeling | CodeCode Available | 0 |
| Conversational AI Powered by Large Language Models Amplifies False Memories in Witness Interviews | Aug 8, 2024 | ChatbotLanguage Modelling | CodeCode Available | 0 |
| LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia Learning | May 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Controlling Large Language Model with Latent Actions | Mar 27, 2025 | CoLALanguage Modeling | CodeCode Available | 0 |
| Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists | Jan 21, 2025 | Image GenerationLarge Language Model | CodeCode Available | 0 |
| Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis | Apr 26, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels | Jun 25, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| The impact of responding to patient messages with large language model assistance | Oct 26, 2023 | ChatbotDecision Making | CodeCode Available | 0 |
| Physics Event Classification Using Large Language Models | Apr 5, 2024 | ChatbotClassification | CodeCode Available | 0 |
| Variance Control via Weight Rescaling in LLM Pre-training | Mar 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LEAVS: An LLM-based Labeler for Abdominal CT Supervision | Mar 17, 2025 | AnatomyLarge Language Model | CodeCode Available | 0 |
| Learning to Verify Summary Facts with Fine-Grained LLM Feedback | Dec 14, 2024 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |
| PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation | Nov 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TutorGym: A Testbed for Evaluating AI Agents as Tutors and Students | May 2, 2025 | GSM8KIn-Context Learning | CodeCode Available | 0 |
| Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters | May 28, 2024 | GPULanguage Modeling | CodeCode Available | 0 |
| Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset | Oct 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Controlled LLM Decoding via Discrete Auto-regressive Biasing | Feb 6, 2025 | Large Language ModelText Generation | CodeCode Available | 0 |
| Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | May 30, 2025 | 3D geometryLarge Language Model | CodeCode Available | 0 |
| TwinBooster: Synergising Large Language Models with Barlow Twins and Gradient Boosting for Enhanced Molecular Property Prediction | Jan 9, 2024 | Drug DiscoveryLanguage Modeling | CodeCode Available | 0 |
| Expanding the Vocabulary of BERT for Knowledge Base Construction | Oct 12, 2023 | Knowledge Base ConstructionKnowledge Base Population | CodeCode Available | 0 |
| A multimodal LLM for the non-invasive decoding of spoken text from brain recordings | Sep 29, 2024 | Large Language Model | CodeCode Available | 0 |
| Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion | Jun 5, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |