| Differentially Private Steering for Large Language Model Alignment | Jan 30, 2025 | HallucinationInference Attack | CodeCode Available | 0 |
| Diet-ODIN: A Novel Framework for Opioid Misuse Detection with Interpretable Dietary Patterns | Feb 21, 2024 | Graph LearningLanguage Modelling | CodeCode Available | 0 |
| A Role-specific Guided Large Language Model for Ophthalmic Consultation Based on Stylistic Differentiation | Jul 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration | Jun 3, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| How to Protect Copyright Data in Optimization of Large Language Models? | Aug 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Reliable Academic Conference Question Answering: A Study Based on Large Language Model | Oct 19, 2023 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System | Jun 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ChatVis: Automating Scientific Visualization with a Large Language Model | Oct 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Equitable Representation in Text-to-Image Synthesis Models with the Cross-Cultural Understanding Benchmark (CCUB) Dataset | Jan 28, 2023 | Cultural Vocal Bursts Intensity PredictionImage Generation | CodeCode Available | 0 |
| How to Leverage Personal Textual Knowledge for Personalized Conversational Information Retrieval | Jul 23, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| tcrLM: a lightweight protein language model for predicting T cell receptor and epitope binding specificity | Jun 24, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Oct 14, 2024 | Density Ratio EstimationGSM8K | CodeCode Available | 0 |
| How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities | Mar 20, 2025 | General KnowledgeLanguage Modeling | CodeCode Available | 0 |
| A large language model-assisted education tool to provide feedback on open-ended responses | Jul 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines | Nov 16, 2024 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| From Supervised to Generative: A Novel Paradigm for Tabular Deep Learning with Large Language Models | Oct 11, 2023 | In-Context LearningInstruction Following | CodeCode Available | 0 |
| How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench | May 24, 2023 | DiversityLanguage Modeling | CodeCode Available | 0 |
| ChatGraph: Interpretable Text Classification by Converting ChatGPT Knowledge to Graphs | May 3, 2023 | ClassificationDecision Making | CodeCode Available | 0 |
| Personalized Abstractive Summarization by Tri-agent Generation Pipeline | May 4, 2023 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language Models | Jul 16, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| ChatGPT-guided Semantics for Zero-shot Learning | Oct 18, 2023 | AttributeLanguage Modelling | CodeCode Available | 0 |
| Developing Safe and Responsible Large Language Model : Can We Balance Bias Reduction and Language Understanding in Large Language Models? | Apr 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Reproducing NevIR: Negation in Neural Information Retrieval | Feb 19, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Detecting the Clinical Features of Difficult-to-Treat Depression using Synthetic Data from Large Language Models | Feb 12, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World | Mar 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Harnessing Large Language Models for Comprehension of Conversational Grounding | Jun 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards | Feb 1, 2024 | Answer SelectionLanguage Modeling | CodeCode Available | 0 |
| Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models | Jun 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Detecting Manipulated Contents Using Knowledge-Grounded Inference | Apr 29, 2025 | Claim VerificationFact Checking | CodeCode Available | 0 |
| Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors | Jun 18, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Detecting AI-Generated Texts in Cross-Domains | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Reshaping Free-Text Radiology Notes Into Structured Reports With Generative Transformers | Mar 27, 2024 | Generative Question AnsweringInformation Retrieval | CodeCode Available | 0 |
| How Far Are LLMs from Believable AI? A Benchmark for Evaluating the Believability of Human Behavior Simulation | Dec 28, 2023 | AI AgentLanguage Modelling | CodeCode Available | 0 |
| Chaining thoughts and LLMs to learn DNA structural biophysics | Mar 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation | Aug 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CellTypeAgent: Trustworthy cell type annotation with Large Language Models | May 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Resolving References in Visually-Grounded Dialogue via Text Generation | Sep 23, 2023 | Image RetrievalLanguage Modeling | CodeCode Available | 0 |
| DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence? | Jun 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales | Mar 19, 2024 | Hate Speech DetectionLanguage Modeling | CodeCode Available | 0 |
| How Benchmark Prediction from Fewer Data Misses the Mark | Jun 9, 2025 | Large Language ModelPrediction | CodeCode Available | 0 |
| Summarisation of German Judgments in conjunction with a Class-based Evaluation | May 9, 2025 | DecoderLanguage Modeling | CodeCode Available | 0 |
| SumRec: A Framework for Recommendation using Open-Domain Dialogue | Feb 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| HORAE: A Domain-Agnostic Language for Automated Service Regulation | Jun 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Design Principle Transfer in Neural Architecture Search via Large Language Models | Aug 21, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration | Nov 5, 2024 | Collaborative InferenceLarge Language Model | CodeCode Available | 0 |
| HLAT: High-quality Large Language Model Pre-trained on AWS Trainium | Apr 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR Correction | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts? | Oct 17, 2024 | AllLanguage Modeling | CodeCode Available | 0 |
| Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models | Aug 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| When is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective | Nov 23, 2023 | Large Language ModelMulti-Armed Bandits | CodeCode Available | 0 |