| PLeak: Prompt Leaking Attacks against Large Language Model Applications | May 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora | May 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LMD3: Language Model Data Density Dependence | May 10, 2024 | Density EstimationLanguage Modeling | —Unverified | 0 |
| Large Language Model in Financial Regulatory Interpretation | May 10, 2024 | EthicsLanguage Modeling | —Unverified | 0 |
| CANAL -- Cyber Activity News Alerting Language Model: Empirical Approach vs. Expensive LLM | May 10, 2024 | ArticlesFew-Shot Learning | —Unverified | 0 |
| Value Augmented Sampling for Language Model Alignment and Personalization | May 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Memory Mosaics | May 10, 2024 | DisentanglementIn-Context Learning | CodeCode Available | 2 |
| Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation | May 10, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 |
| Correlation Dimension of Natural Language in a Statistical Manifold | May 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| State-Free Inference of State-Space Models: The Transfer Function Approach | May 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit | May 9, 2024 | BenchmarkingComputational Efficiency | CodeCode Available | 4 |
| Can Perplexity Reflect Large Language Model's Ability in Long Text Understanding? | May 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Boosting Large Language Models with Continual Learning for Aspect-based Sentiment Analysis | May 9, 2024 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | —Unverified | 0 |
| Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft | May 9, 2024 | AllLanguage Modeling | —Unverified | 0 |
| Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language | May 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Parameter-Efficient Fine-Tuning With Adapters | May 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HMT: Hierarchical Memory Transformer for Long Context Language Processing | May 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 1 |
| Large Language Model-Aided Evolutionary Search for Constrained Multiobjective Optimization | May 9, 2024 | Evolutionary AlgorithmsLanguage Modeling | —Unverified | 0 |
| BiasKG: Adversarial Knowledge Graphs to Induce Bias in Large Language Models | May 8, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| SemiCD-VL: Visual-Language Model Guidance Makes Better Semi-supervised Change Detector | May 8, 2024 | Change DetectionLanguage Modeling | CodeCode Available | 2 |
| Large Language Model Enhanced Machine Learning Estimators for Classification | May 8, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 0 |
| LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and Navigation | May 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization | May 8, 2024 | DiversityKnowledge Distillation | —Unverified | 0 |
| Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models | May 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Information Extraction from Historical Well Records Using A Large Language Model | May 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | May 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Impact of Tone-Aware Explanations in Recommender Systems | May 8, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| ChuXin: 1.6B Technical Report | May 8, 2024 | Continual PretrainingLanguage Modeling | —Unverified | 0 |
| AirGapAgent: Protecting Privacy-Conscious Conversational Agents | May 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples | May 8, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming | May 8, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs | May 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio | May 8, 2024 | Audio Deepfake DetectionAudio Generation | CodeCode Available | 2 |
| DrugLLM: Open Large Language Model for Few-shot Molecule Generation | May 7, 2024 | Drug DesignDrug Discovery | —Unverified | 0 |
| SUTRA: Scalable Multilingual Language Model Architecture | May 7, 2024 | Computational EfficiencyHallucination | —Unverified | 0 |
| Language Modeling Using Tensor Trains | May 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Enhancing Knowledge Retrieval with Topic Modeling for Knowledge-Grounded Dialogue | May 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore | May 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization | May 7, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Representation Learning of Daily Movement Data Using Text Encoders | May 7, 2024 | ClusteringLanguage Modeling | CodeCode Available | 0 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative FilteringLanguage Modeling | CodeCode Available | 1 |
| DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model | May 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| Deception in Reinforced Autonomous Agents | May 7, 2024 | Deception DetectionHallucination | —Unverified | 0 |
| xLSTM: Extended Long Short-Term Memory | May 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| FlashBack:Efficient Retrieval-Augmented Language Modeling for Long Context Inference | May 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Transformer with Stack Attention | May 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing | May 7, 2024 | Image ManipulationLanguage Modeling | CodeCode Available | 4 |
| SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering | May 6, 2024 | Bug fixingLanguage Modeling | CodeCode Available | 11 |
| AntiFold: Improved antibody structure-based design using inverse folding | May 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |