| Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Estimating Contribution Quality in Online Deliberations Using a Large Language Model | Aug 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What are the limits of cross-lingual dense passage retrieval for low-resource languages? | Aug 21, 2024 | Answer GenerationLanguage Modeling | —Unverified | 0 |
| EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model | Aug 21, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| LARR: Large Language Model Aided Real-time Scene Recommendation with Semantic Understanding | Aug 21, 2024 | Click-Through Rate PredictionContrastive Learning | —Unverified | 0 |
| WeQA: A Benchmark for Retrieval Augmented Generation in Wind Energy Domain | Aug 21, 2024 | Answer GenerationBenchmarking | —Unverified | 0 |
| Video Emotion Open-vocabulary Recognition Based on Multimodal Large Language Model | Aug 21, 2024 | Emotion RecognitionLanguage Modeling | —Unverified | 0 |
| Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework | Aug 21, 2024 | geo-localizationLanguage Modeling | —Unverified | 0 |
| FocusLLM: Precise Understanding of Long Context by Dynamic Condensing | Aug 21, 2024 | 8kDecoder | CodeCode Available | 1 |
| Improving Speech Recognition Error Prediction for Modern and Off-the-shelf Speech Recognizers | Aug 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automating Thought of Search: A Journey Towards Soundness and Completeness | Aug 21, 2024 | Code GenerationLanguage Modeling | —Unverified | 0 |
| GeoReasoner: Reasoning On Geospatially Grounded Context For Natural Language Understanding | Aug 21, 2024 | Entity TypingLanguage Modeling | —Unverified | 0 |
| Great Memory, Shallow Reasoning: Limits of kNN-LMs | Aug 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs | Aug 21, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding | Aug 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Aug 21, 2024 | Image GenerationImage Retrieval | CodeCode Available | 1 |
| Approaching Deep Learning through the Spectral Dynamics of Weights | Aug 21, 2024 | Deep Learningimage-classification | CodeCode Available | 1 |
| Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles | Aug 20, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Analysis of Plan-based Retrieval for Grounded Text Generation | Aug 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unconditional Truthfulness: Learning Conditional Dependency for Uncertainty Quantification of Large Language Models | Aug 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HMoE: Heterogeneous Mixture of Experts for Language Modeling | Aug 20, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolution | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ColBERT Retrieval and Ensemble Response Scoring for Language Model Question Answering | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval | Aug 20, 2024 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Aug 20, 2024 | Image EnhancementLanguage Modeling | CodeCode Available | 1 |
| Fine-Tuning a Local LLaMA-3 Large Language Model for Automated Privacy-Preserving Physician Letter Generation in Radiation Oncology | Aug 20, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups | Aug 20, 2024 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Large Language Model Driven Recommendation | Aug 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Minor SFT loss for LLM fine-tune to increase performance and reduce model deviation | Aug 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Relevant Documents: A Knowledge-Intensive Approach for Query-Focused Summarization using Large Language Models | Aug 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| MoDeGPT: Modular Decomposition for Large Language Model Compression | Aug 19, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Development of an AI Anti-Bullying System Using Large Language Model Key Topic Detection | Aug 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cross-composition Feature Disentanglement for Compositional Zero-shot Learning | Aug 19, 2024 | AttributeCompositional Zero-Shot Learning | —Unverified | 0 |
| MePT: Multi-Representation Guided Prompt Tuning for Vision-Language Model | Aug 19, 2024 | Domain GeneralizationLanguage Modeling | —Unverified | 0 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| MAPLE: Enhancing Review Generation with Multi-Aspect Prompt LEarning in Explainable Recommendation | Aug 19, 2024 | DiversityExplainable Recommendation | —Unverified | 0 |
| SSDTrain: An Activation Offloading Framework to SSDs for Faster Large Language Model Training | Aug 19, 2024 | GPULanguage Modeling | —Unverified | 0 |
| IDEA: Enhancing the Rule Learning Ability of Large Language Model Agent through Induction, Deduction, and Abduction | Aug 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| R2GenCSR: Retrieving Context Samples for Large Language Model based X-ray Medical Report Generation | Aug 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Comparison of Large Language Model and Human Performance on Random Number Generation Tasks | Aug 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Aug 19, 2024 | DescriptiveFace Swapping | CodeCode Available | 1 |
| AutoML-guided Fusion of Entity and LLM-based Representations for Document Classification | Aug 19, 2024 | AutoMLClassification | CodeCode Available | 0 |
| CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Aug 19, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| MSDiagnosis: A Benchmark for Evaluating Large Language Models in Multi-Step Clinical Diagnosis | Aug 19, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |