| Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding | Jun 3, 2024 | counterfactualDistractor Generation | —Unverified | 0 |
| The Geometry of Categorical and Hierarchical Concepts in Large Language Models | Jun 3, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| LLM and GNN are Complementary: Distilling LLM for Multimodal Graph Learning | Jun 3, 2024 | Graph LearningLanguage Modeling | —Unverified | 0 |
| VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model | Jun 3, 2024 | Image OutpaintingLanguage Modeling | CodeCode Available | 1 |
| Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study | Jun 3, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| HBTP: Heuristic Behavior Tree Planning with Large Language Model Reasoning | Jun 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards a copilot in BIM authoring tool using a large language model-based agent for intelligent human-machine interaction | Jun 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Inverse Constitutional AI: Compressing Preferences into Principles | Jun 2, 2024 | ChatbotLanguage Modelling | CodeCode Available | 1 |
| LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models | Jun 2, 2024 | Continual PretrainingInformation Retrieval | —Unverified | 0 |
| Large Language Model Confidence Estimation via Black-Box Access | Jun 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning | Jun 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Controlling Large Language Model Agents with Entropic Activation Steering | Jun 1, 2024 | Decision MakingIn-Context Learning | —Unverified | 0 |
| On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots | Jun 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HonestLLM: Toward an Honest and Helpful Large Language Model | Jun 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation | Jun 1, 2024 | feature selectionLanguage Modeling | CodeCode Available | 1 |
| RAG Does Not Work for Enterprises | May 31, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Query2CAD: Generating CAD models using natural language queries | May 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking | May 31, 2024 | In-Context LearningInformation Retrieval | CodeCode Available | 0 |
| MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | May 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| FineRadScore: A Radiology Report Line-by-Line Evaluation Technique Generating Corrections with Severity Scores | May 31, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Large Language Model Biases in Persona-Steered Generation | May 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation | May 30, 2024 | DiversityDrug Design | CodeCode Available | 3 |
| Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Ontology-Enhanced Representation Learning for Large Language Models | May 30, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 |
| GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning | May 30, 2024 | Graph Question AnsweringKnowledge Graphs | CodeCode Available | 3 |
| Automated Generation and Tagging of Knowledge Components from Multiple-Choice Questions | May 30, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems | May 30, 2024 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model | May 30, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths | May 30, 2024 | GSM8KHumanEval | —Unverified | 0 |
| LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics | May 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Efficient Indirect LLM Jailbreak via Multimodal-LLM Jailbreak | May 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback | May 30, 2024 | GPUKnowledge Graphs | —Unverified | 0 |
| Large Language Model Watermark Stealing With Mixed Integer Programming | May 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution | May 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Full-duplex Speech Dialogue Scheme Based On Large Language Models | May 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adaptive In-conversation Team Building for Language Model Agents | May 29, 2024 | DiversityLanguage Modeling | CodeCode Available | 7 |
| Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study | May 29, 2024 | Answer GenerationHallucination | —Unverified | 0 |
| LLaMA-Reg: Using LLaMA 2 for Unsupervised Medical Image Registration | May 29, 2024 | DecoderImage Registration | —Unverified | 0 |
| To FP8 and Back Again: Quantifying Reduced Precision Effects on LLM Training Stability | May 29, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| X-VILA: Cross-Modality Alignment for Large Language Model | May 29, 2024 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Multi-Modal Generative Embedding Model | May 29, 2024 | Caption GenerationCross-Modal Retrieval | —Unverified | 0 |
| Gemini & Physical World: Large Language Models Can Estimate the Intensity of Earthquake Shaking from Multi-Modal Social Media Posts | May 29, 2024 | Disaster ResponseLanguage Modelling | —Unverified | 0 |
| Voice Jailbreak Attacks Against GPT-4o | May 29, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Learning from Litigation: Graphs and LLMs for Retrieval and Reasoning in eDiscovery | May 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | May 29, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | May 29, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series | May 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | May 28, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise Inference | May 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |