| Awes, Laws, and Flaws From Today's LLM Research | Aug 27, 2024 | Ethics, Language Modeling | Unverified | 0 |
| XG-NID: Dual-Modality Network Intrusion Detection using a Heterogeneous Graph Neural Network and Large Language Model | Aug 27, 2024 | Graph Neural Network, Intrusion Detection | Code Available | 1 |
| Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis | Aug 27, 2024 | Instruction Following, Language Modeling | Unverified | 0 |
| Project SHADOW: Symbolic Higher-order Associative Deductive reasoning On Wikidata using LM probing | Aug 27, 2024 | Knowledge Base Construction, Language Modeling | Code Available | 0 |
| The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Aug 27, 2024 | GPU, Language Modeling | Code Available | 3 |
| RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models | Aug 27, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline | Aug 27, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet | Aug 27, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Unifying Multitrack Music Arrangement via Reconstruction Fine-Tuning and Efficient Tokenization | Aug 27, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models | Aug 27, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild | Aug 27, 2024 | Cross-Modal Retrieval, Image Retrieval | Unverified | 0 |
| How transformers learn structured data: insights from hierarchical filtering | Aug 27, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| AAVENUE: Detecting LLM Biases on NLU Tasks in AAVE via a Novel Benchmark | Aug 27, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model for Patent Concept Generation | Aug 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Predictability and Causality in Spanish and English Natural Language Generation | Aug 26, 2024 | Causal Language Modeling, Language Modeling | Unverified | 0 |
| An Evaluation of Explanation Methods for Black-Box Detectors of Machine-Generated Text | Aug 26, 2024 | Feature Importance, Language Modeling | Code Available | 0 |
| Social perception of faces in a vision-language model | Aug 26, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents | Aug 26, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning | Aug 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models | Aug 26, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Investigating Language-Specific Calibration For Pruning Multilingual Large Language Models | Aug 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Question answering system of bridge design specification based on large language model | Aug 26, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework | Aug 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| StockTime: A Time Series Specialized Large Language Model Architecture for Stock Price Prediction | Aug 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models | Aug 25, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback | Aug 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhancing SQL Query Generation with Neurosymbolic Reasoning | Aug 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Language Model Empowered Spatio-Temporal Forecasting via Physics-Aware Reprogramming | Aug 24, 2024 | energy management, Language Modeling | Unverified | 0 |
| GNN: Graph Neural Network and Large Language Model for Data Discovery | Aug 24, 2024 | Attribute, Graph Neural Network | Unverified | 0 |
| LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Aug 24, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Learning to Plan Long-Term for Language Modeling | Aug 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Knowledge Graph Modeling-Driven Large Language Model Operating System (LLM OS) for Task Automation in Process Engineering Problem-Solving | Aug 23, 2024 | Domain Adaptation, Information Retrieval | Unverified | 0 |
| In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting | Aug 23, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction | Aug 23, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| DrugAgent: Multi-Agent Large Language Model-Based Reasoning for Drug-Target Interaction Prediction | Aug 23, 2024 | AI Agent, Drug Discovery | Unverified | 0 |
| SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks | Aug 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Predicting Affective States from Screen Text Sentiment | Aug 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities | Aug 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Context-Aware Temporal Embedding of Objects in Video Data | Aug 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition | Aug 23, 2024 | Contrastive Learning, few-shot-ner | Unverified | 0 |
| FIDAVL: Fake Image Detection and Attribution using Vision-Language Model | Aug 22, 2024 | Attribute, Fake Image Detection | Code Available | 0 |
| TRRG: Towards Truthful Radiology Report Generation With Cross-modal Disease Clue Enhanced Large Language Model | Aug 22, 2024 | Contrastive Learning, Language Modeling | Unverified | 0 |
| SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection | Aug 22, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards | Aug 22, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation | Aug 22, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Multi-tool Integration Application for Math Reasoning Using Large Language Model | Aug 22, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhancing Multi-hop Reasoning through Knowledge Erasure in Large Language Model Editing | Aug 22, 2024 | knowledge editing, Language Modeling | Unverified | 0 |
| Evidence-backed Fact Checking using RAG and Few-Shot In-Context Learning with LLMs | Aug 22, 2024 | Fact Checking, In-Context Learning | Code Available | 0 |
| Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese | Aug 22, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Implicit Sentiment Analysis Based on Chain of Thought Prompting | Aug 22, 2024 | Common Sense Reasoning, Language Modeling | Unverified | 0 |