| MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models | Feb 2, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Feb 2, 2024 | GPU, Language Modeling | Code Available | 1 |
| Style Vectors for Steering Generative Large Language Model | Feb 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning Attack | Feb 2, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| PeaTMOSS: A Dataset and Initial Analysis of Pre-Trained Models in Open-Source Software | Feb 1, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| ConSmax: Hardware-Friendly Alternative Softmax with Learnable Parameters | Jan 31, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| Contextualization Distillation from Large Language Model for Knowledge Graph Completion | Jan 28, 2024 | Articles, Knowledge Graph Completion | Code Available | 1 |
| ProtAgents: Protein discovery via large language model multi-agent collaborations combining physics and machine learning | Jan 27, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Can Large Language Models Write Parallel Code? | Jan 23, 2024 | Code Completion, Code Generation | Code Available | 1 |
| LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations | Jan 23, 2024 | counterfactual, Fact Checking | Code Available | 1 |
| How well can a large language model explain business processes as perceived by users? | Jan 23, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| The Radiation Oncology NLP Database | Jan 19, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Jan 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Excuse me, sir? Your language model is leaking (information) | Jan 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Vlogger: Make Your Dream A Vlog | Jan 17, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| Walert: Putting Conversational Search Knowledge into Action by Building and Evaluating a Large Language Model-Powered Chatbot | Jan 14, 2024 | Chatbot, Conversational Search | Code Available | 1 |
| ModaVerse: Efficiently Transforming Modalities with LLMs | Jan 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search | Jan 9, 2024 | Code Generation, Code Search | Code Available | 1 |
| VLLaVO: Mitigating Visual Gap through LLMs | Jan 6, 2024 | Domain Adaptation, Domain Generalization | Code Available | 1 |
| Can Large Language Models Understand Molecules? | Jan 5, 2024 | Drug Discovery, Language Modelling | Code Available | 1 |
| PLLaMa: An Open-source Large Language Model for Plant Science | Jan 3, 2024 | Articles, Language Modeling | Code Available | 1 |
| Quokka: An Open-source Large Language Model ChatBot for Material Science | Jan 2, 2024 | Articles, Chatbot | Code Available | 1 |
| SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection | Dec 31, 2023 | Data Augmentation, Intent Detection | Code Available | 1 |
| AllSpark: A Multimodal Spatio-Temporal General Intelligence Model with Ten Modalities via Language as a Reference Framework | Dec 31, 2023 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| GeoGalactica: A Scientific Large Language Model in Geoscience | Dec 31, 2023 | Document Classification, General Knowledge | Code Available | 1 |
| A Simple LLM Framework for Long-Range Video Question-Answering | Dec 28, 2023 | EgoSchema, Language Modelling | Code Available | 1 |
| MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation | Dec 28, 2023 | GSM8K, Language Model Evaluation | Code Available | 1 |
| DrugAssist: A Large Language Model for Molecule Optimization | Dec 28, 2023 | Drug Discovery, Language Modeling | Code Available | 1 |
| RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation | Dec 26, 2023 | In-Context Learning, Language Modeling | Code Available | 1 |
| LLM-SAP: Large Language Models Situational Awareness Based Planning | Dec 26, 2023 | Decision Making, Language Modelling | Code Available | 1 |
| Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical Study | Dec 23, 2023 | Keyphrase Extraction, Language Modeling | Code Available | 1 |
| InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | Dec 21, 2023 | Image Retrieval, Image-to-Text Retrieval | Code Available | 1 |
| Context-aware Decoding Reduces Hallucination in Query-focused Summarization | Dec 21, 2023 | Hallucination, Language Modelling | Code Available | 1 |
| ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Dec 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Fine-tuning Large Language Models for Adaptive Machine Translation | Dec 20, 2023 | In-Context Learning, Language Modelling | Code Available | 1 |
| ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation | Dec 20, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| Sparse is Enough in Fine-tuning Pre-trained Large Language Models | Dec 19, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| "Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation | Dec 18, 2023 | Hallucination, Language Modelling | Code Available | 1 |
| A Unified Framework for Multi-Domain CTR Prediction via Large Language Models | Dec 17, 2023 | Click-Through Rate Prediction, Language Modelling | Code Available | 1 |
| Personalized Autonomous Driving with Large Language Models: Field Experiments | Dec 14, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 |
| TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning | Dec 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent | Dec 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Hallucination Augmented Contrastive Learning for Multimodal Large Language Model | Dec 12, 2023 | Contrastive Learning, Hallucination | Code Available | 1 |
| On Diversified Preferences of Large Language Model Alignment | Dec 12, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes | Dec 11, 2023 | Federated Learning, Large Language Model | Code Available | 1 |
| History Matters: Temporal Knowledge Editing in Large Language Model | Dec 9, 2023 | knowledge editing, Language Modeling | Code Available | 1 |
| SparQ Attention: Bandwidth-Efficient LLM Inference | Dec 8, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| TypeFly: Flying Drones with Large Language Model | Dec 8, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Auto-Vocabulary Semantic Segmentation | Dec 7, 2023 | Language Modeling, Language Modelling | Code Available | 1 |