| Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | Diagnostic, Large Language Model | Code Available | 1 | 5 |
| On Diversified Preferences of Large Language Model Alignment | Dec 12, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Democratizing Reasoning Ability: Tailored Learning from Large Language Model | Oct 20, 2023 | Instruction Following, Language Modeling | Code Available | 1 | 5 |
| Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation | Oct 22, 2024 | Large Language Model, Multimodal Large Language Model | Code Available | 1 | 5 |
| A Study of Generative Large Language Model for Medical Research and Healthcare | May 22, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language Model, Navigate | Code Available | 1 | 5 |
| AuditWen: An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer Generation, Language Modeling | Code Available | 1 | 5 |
| Measuring General Intelligence with Generated Games | May 12, 2025 | In-Context Learning, Large Language Model | Code Available | 1 | 5 |
| CONFLARE: CONFormal LArge language model REtrieval | Apr 4, 2024 | Conformal Prediction, Language Modeling | Code Available | 1 | 5 |
| OntoChatGPT Information System: Ontology-Driven Structured Prompts for ChatGPT Meta-Learning | Jul 11, 2023 | Chatbot, Information Retrieval | Code Available | 1 | 5 |
| DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | May 31, 2025 | Large Language Model | Code Available | 1 | 5 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| AstroAgents: A Multi-Agent AI for Hypothesis Generation from Mass Spectrometry Data | Mar 29, 2025 | Large Language Model | Code Available | 1 | 5 |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Jun 17, 2024 | Fairness, Language Modeling | Code Available | 1 | 5 |
| Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers | Apr 25, 2025 | Large Language Model | Code Available | 1 | 5 |
| Extensive Self-Contrast Enables Feedback-Free Language Model Alignment | Mar 31, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks | Jun 20, 2024 | General Knowledge, Human Dynamics | Code Available | 1 | 5 |
| Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation | Mar 19, 2024 | Gloss-free Sign Language Translation, Language Modeling | Code Available | 1 | 5 |
| Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning | Jun 10, 2025 | Large Language Model, reinforcement-learning | Code Available | 1 | 5 |
| ConSmax: Hardware-Friendly Alternative Softmax with Learnable Parameters | Jan 31, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model | Mar 31, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledge | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Jan 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent | Dec 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |