| MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Estimating Contribution Quality in Online Deliberations Using a Large Language Model | Aug 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What are the limits of cross-lingual dense passage retrieval for low-resource languages? | Aug 21, 2024 | Answer GenerationLanguage Modeling | —Unverified | 0 |
| Improving Speech Recognition Error Prediction for Modern and Off-the-shelf Speech Recognizers | Aug 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WeQA: A Benchmark for Retrieval Augmented Generation in Wind Energy Domain | Aug 21, 2024 | Answer GenerationBenchmarking | —Unverified | 0 |
| FocusLLM: Precise Understanding of Long Context by Dynamic Condensing | Aug 21, 2024 | 8kDecoder | CodeCode Available | 1 |
| SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs | Aug 21, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| LARR: Large Language Model Aided Real-time Scene Recommendation with Semantic Understanding | Aug 21, 2024 | Click-Through Rate PredictionContrastive Learning | —Unverified | 0 |
| ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding | Aug 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model | Aug 21, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Great Memory, Shallow Reasoning: Limits of kNN-LMs | Aug 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GeoReasoner: Reasoning On Geospatially Grounded Context For Natural Language Understanding | Aug 21, 2024 | Entity TypingLanguage Modeling | —Unverified | 0 |
| Automating Thought of Search: A Journey Towards Soundness and Completeness | Aug 21, 2024 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Video Emotion Open-vocabulary Recognition Based on Multimodal Large Language Model | Aug 21, 2024 | Emotion RecognitionLanguage Modeling | —Unverified | 0 |
| Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework | Aug 21, 2024 | geo-localizationLanguage Modeling | —Unverified | 0 |
| UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Aug 21, 2024 | Image GenerationImage Retrieval | CodeCode Available | 1 |
| Approaching Deep Learning through the Spectral Dynamics of Weights | Aug 21, 2024 | Deep Learningimage-classification | CodeCode Available | 1 |
| Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Aug 20, 2024 | Image EnhancementLanguage Modeling | CodeCode Available | 1 |
| Unconditional Truthfulness: Learning Conditional Dependency for Uncertainty Quantification of Large Language Models | Aug 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval | Aug 20, 2024 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| Analysis of Plan-based Retrieval for Grounded Text Generation | Aug 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HMoE: Heterogeneous Mixture of Experts for Language Modeling | Aug 20, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |