| ClaimBrush: A Novel Framework for Automated Patent Claim Refinement Based on Large Language Models | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling | Oct 8, 2024 | document understandingLanguage Modeling | CodeCode Available | 2 |
| Multi-Session Client-Centered Treatment Outcome Evaluation in Psychotherapy | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ParallelSpec: Parallel Drafter for Efficient Speculative Decoding | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning | Oct 8, 2024 | GSM8KHallucination | —Unverified | 0 |
| Wireless-Friendly Window Position Optimization for RIS-Aided Outdoor-to-Indoor Networks based on Multi-Modal Large Language Model | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatVis: Automating Scientific Visualization with a Large Language Model | Oct 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths | Oct 7, 2024 | AttributeGSM8K | —Unverified | 0 |
| RespLLM: Unifying Audio and Text with Multimodal LLMs for Generalized Respiratory Health Prediction | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Filtering Discomforting Recommendations with Large Language Models | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards the generation of hierarchical attack models from cybersecurity vulnerabilities using language models | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Falcon Mamba: The First Competitive Attention-free 7B Language Model | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transformers learn variable-order Markov chains in-context | Oct 7, 2024 | Data CompressionIn-Context Learning | —Unverified | 0 |
| Neural machine translation system for Lezgian, Russian and Azerbaijani languages | Oct 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Chain and Causal Attention for Efficient Entity Tracking | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Differential Transformer | Oct 7, 2024 | HallucinationIn-Context Learning | CodeCode Available | 2 |
| LPZero: Language Model Zero-cost Proxy Search from Zero | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Inference for Large Language Model-based Generative Recommendation | Oct 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DEPT: Decoupled Embeddings for Pre-training Language Models | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Synthesizing Interpretable Control Policies through Large Language Model Guided Search | Oct 7, 2024 | Combinatorial OptimizationEvolutionary Algorithms | CodeCode Available | 0 |
| Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law | Oct 7, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality | Oct 7, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models | Oct 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Fine-Tuning CLIP's Last Visual Projector: A Few-Shot Cornucopia | Oct 7, 2024 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks | Oct 7, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens | Oct 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe | Oct 7, 2024 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Activation Scaling for Steering and Interpreting Language Models | Oct 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Driving with Regulation: Interpretable Decision-Making for Autonomous Vehicles with Retrieval-Augmented Reasoning via LLM | Oct 7, 2024 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| On the Reliability of Large Language Models to Misinformed and Demographically-Informed Prompts | Oct 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis | Oct 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OD-Stega: LLM-Based Near-Imperceptible Steganography via Optimized Distributions | Oct 6, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective | Oct 6, 2024 | CPUGPU | CodeCode Available | 1 |
| ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large Language Model | Oct 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference | Oct 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| GenSim: A General Social Simulation Platform with Large Language Model based Agents | Oct 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Assessing the Performance of Human-Capable LLMs -- Are LLMs Coming for Your Job? | Oct 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Language Model Character Traits | Oct 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RoQLlama: A Lightweight Romanian Adapted Language Model | Oct 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Persona Knowledge-Aligned Prompt Tuning Method for Online Debate | Oct 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations | Oct 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Oct 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Language Model-Driven Data Pruning Enables Efficient Active Learning | Oct 5, 2024 | Active LearningLanguage Modeling | —Unverified | 0 |
| SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Oct 5, 2024 | ClusteringLanguage Modeling | CodeCode Available | 2 |
| Leveraging Social Determinants of Health in Alzheimer's Research Using LLM-Augmented Literature Mining and Knowledge Graphs | Oct 4, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| A Large Language Model-based Framework for Semi-Structured Tender Document Retrieval-Augmented Generation | Oct 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens | Oct 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| You Know What I'm Saying: Jailbreak Attack via Implicit Reference | Oct 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Searching for Best Practices in Medical Transcription with Large Language Model | Oct 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Understanding Large Language Models in Your Pockets: Performance Study on COTS Mobile Devices | Oct 4, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |