| Memory, Consciousness and Large Language Model | Jan 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Demonstration of an Adversarial Attack Against a Multimodal Vision Language Model for Pathology Imaging | Jan 4, 2024 | Adversarial AttackDomain Adaptation | CodeCode Available | 0 |
| Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model | Jan 4, 2024 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 3 |
| Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models | Jan 4, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model | Jan 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Multi-modal vision-language model for generalizable annotation-free pathology localization and clinical diagnosis | Jan 4, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| Understanding LLMs: A Comprehensive Overview from Training to Inference | Jan 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives | Jan 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning | Jan 4, 2024 | Data VisualizationDecision Making | CodeCode Available | 2 |
| TinyLlama: An Open-Source Small Language Model | Jan 4, 2024 | Computational EfficiencyLanguage Modeling | CodeCode Available | 11 |
| Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition | Jan 3, 2024 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication | Jan 3, 2024 | ClassificationICU Admission | CodeCode Available | 0 |
| Cross-target Stance Detection by Exploiting Target Analytical Perspectives | Jan 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity | Jan 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| PLLaMa: An Open-source Large Language Model for Plant Science | Jan 3, 2024 | ArticlesLanguage Modeling | CodeCode Available | 1 |
| Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Jan 3, 2024 | Data Augmentationfill-mask | —Unverified | 0 |
| Efficient Parallel Audio Generation using Group Masked Language Modeling | Jan 2, 2024 | Audio GenerationComputational Efficiency | —Unverified | 0 |
| Quokka: An Open-source Large Language Model ChatBot for Material Science | Jan 2, 2024 | ArticlesChatbot | CodeCode Available | 1 |
| Discovering Significant Topics from Legal Decisions with Selective Inference | Jan 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Imperio: Language-Guided Backdoor Attacks for Arbitrary Model Control | Jan 2, 2024 | Backdoor AttackImage Classification | —Unverified | 0 |
| Cheetah: Natural Language Generation for 517 African Languages | Jan 2, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| On Scaling Up a Multilingual Vision and Language Model | Jan 1, 2024 | document understandingIn-Context Learning | —Unverified | 0 |
| Edge-Aware 3D Instance Segmentation Network with Intelligent Semantic Prior | Jan 1, 2024 | 3D Instance SegmentationInstance Segmentation | —Unverified | 0 |
| LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge | Jan 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Discovering Syntactic Interaction Clues for Human-Object Interaction Detection | Jan 1, 2024 | DecoderHuman-Object Interaction Detection | —Unverified | 0 |
| Pixel-Aligned Language Model | Jan 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| General Point Model Pretraining with Autoencoding and Autoregressive | Jan 1, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 |
| CoDi-2: In-Context Interleaved and Interactive Any-to-Any Generation | Jan 1, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Few-Shot Object Detection with Foundation Models | Jan 1, 2024 | Few-Shot LearningFew-Shot Object Detection | —Unverified | 0 |
| Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model | Jan 1, 2024 | AllAttribute | —Unverified | 0 |
| PeVL: Pose-Enhanced Vision-Language Model for Fine-Grained Human Action Recognition | Jan 1, 2024 | Action RecognitionContrastive Learning | —Unverified | 0 |
| Querying as Prompt: Parameter-Efficient Learning for Multimodal Language Model | Jan 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AssistGUI: Task-Oriented PC Graphical User Interface Automation | Jan 1, 2024 | Action GenerationLanguage Modeling | —Unverified | 0 |
| Predicting Anti-microbial Resistance using Large Language Models | Jan 1, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| Digger: Detecting Copyright Content Mis-usage in Large Language Model Training | Jan 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large language model for Bible sentiment analysis: Sermon on the Mount | Jan 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Searching, fast and slow, through product catalogs | Jan 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DocLLM: A layout-aware generative language model for multimodal document understanding | Dec 31, 2023 | document understandingLanguage Modeling | —Unverified | 0 |
| Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition | Dec 31, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws | Dec 31, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HSC-GPT: A Large Language Model for Human Settlements Construction | Dec 31, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GeoGalactica: A Scientific Large Language Model in Geoscience | Dec 31, 2023 | Document ClassificationGeneral Knowledge | CodeCode Available | 1 |
| SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection | Dec 31, 2023 | Data AugmentationIntent Detection | CodeCode Available | 1 |
| Neural Networks Against (and For) Self-Training: Classification with Small Labeled and Large Unlabeled Sets | Dec 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Trace and Edit Relation Associations in GPT | Dec 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Boosting Large Language Model for Speech Synthesis: An Empirical Study | Dec 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Open-TI: Open Traffic Intelligence with Augmented Language Model | Dec 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Differentially Private Low-Rank Adaptation of Large Language Model Using Federated Learning | Dec 29, 2023 | Federated LearningLanguage Modeling | —Unverified | 0 |
| MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining | Dec 29, 2023 | GPULanguage Modeling | CodeCode Available | 2 |
| Principled Gradient-based Markov Chain Monte Carlo for Text Generation | Dec 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |