Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models | Jul 31, 2024 | Dictionary Learning, Language Modeling | Code Available | 1
Finch: Prompt-guided Key-Value Cache Compression | Jul 31, 2024 | GPU, Language Modeling | Unverified | 0
SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving | Jul 31, 2024 | Autonomous Driving, Language Modeling | Unverified | 0
Learning Video Context as Interleaved Multimodal Sequences | Jul 31, 2024 | Language Modeling, Language Modelling | Code Available | 1
Interpreting and learning voice commands with a Large Language Model for a robot system | Jul 31, 2024 | Decision Making, Language Modeling | Unverified | 0
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment | Jul 31, 2024 | Contrastive Learning, Decoder | Unverified | 0
Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs | Jul 31, 2024 | Hallucination, Image Comprehension | Code Available | 1
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection | Jul 31, 2024 | Language Modelling, Object | Code Available | 1
The Llama 3 Herd of Models | Jul 31, 2024 | answerability prediction, Language Modeling | Code Available | 4
KemenkeuGPT: Leveraging a Large Language Model on Indonesia's Government Financial Data and Regulations to Enhance Decision Making | Jul 31, 2024 | Benchmarking, Decision Making | Unverified | 0
Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins | Jul 31, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts | Jul 31, 2024 | Causal Inference, Language Modelling | Unverified | 0
Decomposed Prompting to Answer Questions on a Course Discussion Board | Jul 30, 2024 | Language Modeling, Language Modelling | Code Available | 0
Entropy, Thermodynamics and the Geometrization of the Language Model | Jul 30, 2024 | Language Modeling, Language Modelling | Unverified | 0
Meltemi: The first open Large Language Model for Greek | Jul 30, 2024 | Language Modeling, Language Modelling | Unverified | 0
Knesset-DictaBERT: A Hebrew Language Model for Parliamentary Proceedings | Jul 30, 2024 | Language Modeling, Language Modelling | Unverified | 0
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning | Jul 30, 2024 | Efficient Exploration, Language Modeling | Unverified | 0
Industrial-Grade Smart Troubleshooting through Causal Technical Language Processing: a Proof of Concept | Jul 30, 2024 | Language Modeling, Language Modelling | Unverified | 0
Mimicking the Mavens: Agent-based Opinion Synthesis and Emotion Prediction for Social Media Influencers | Jul 30, 2024 | Language Modelling, Large Language Model | Unverified | 0
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions | Jul 30, 2024 | Audio Generation, Image to Video Generation | Code Available | 1
Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach | Jul 30, 2024 | image-classification, Image Classification | Code Available | 0
Accelerating Large Language Model Inference with Self-Supervised Early Exits | Jul 30, 2024 | Language Modeling, Language Modelling | Unverified | 0
CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Jul 30, 2024 | Contrastive Learning, Diagnostic | Code Available | 1
UniProcessor: A Text-induced Unified Low-level Image Processor | Jul 30, 2024 | Image Enhancement, Image Restoration | Code Available | 1
Harvesting Textual and Structured Data from the HAL Publication Repository | Jul 30, 2024 | Articles, Authorship Attribution | Unverified | 0
Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification | Jul 30, 2024 | Language Modelling | Unverified | 0
A federated large language model for long-term time series forecasting | Jul 30, 2024 | Language Modeling, Language Modelling | Unverified | 0
Label-Guided Prompt for Multi-label Few-shot Aspect Category Detection | Jul 30, 2024 | Aspect Category Detection, Language Modeling | Unverified | 0
Gender, Race, and Intersectional Bias in Resume Screening via Language Model Retrieval | Jul 29, 2024 | Fairness, Language Modeling | Unverified | 0
Apple Intelligence Foundation Language Models | Jul 29, 2024 | Language Modeling, Language Modelling | Unverified | 0
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks | Jul 29, 2024 | Benchmarking, Language Model Evaluation | Unverified | 0
AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs | Jul 29, 2024 | Bilevel Optimization, Language Modelling | Code Available | 1
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks | Jul 29, 2024 | Deep Learning, Domain Generalization | Unverified | 0
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale | Jul 29, 2024 | Language Modeling, Language Modelling | Code Available | 3
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Jul 29, 2024 | Denoising, Diversity | Code Available | 0
Normality Addition via Normality Detection in Industrial Image Anomaly Detection Models | Jul 29, 2024 | Anomaly Detection, Language Modeling | Unverified | 0
Improving Retrieval Augmented Language Model with Self-Reasoning | Jul 29, 2024 | Fact Verification, Language Modeling | Unverified | 0
Prometheus Chatbot: Knowledge Graph Collaborative Large Language Model for Computer Components Recommendation | Jul 29, 2024 | Chatbot, Knowledge Graphs | Code Available | 0
ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2 | Jul 29, 2024 | Language Modeling, Language Modelling | Unverified | 0
Harnessing Large Vision and Language Models in Agriculture: A Review | Jul 29, 2024 | Language Modelling, Large Language Model | Unverified | 0
A Bayesian Flow Network Framework for Chemistry Tasks | Jul 28, 2024 | Diversity, Language Modeling | Code Available | 1
MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training | Jul 28, 2024 | Contrastive Learning, Language Modeling | Code Available | 0
VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary | Jul 28, 2024 | Attribute, Fairness | Code Available | 0
LawLLM: Law Large Language Model for the US Legal System | Jul 27, 2024 | In-Context Learning, Information Retrieval | Unverified | 0
FarSSiBERT: A Novel Transformer-based Model for Semantic Similarity Measurement of Persian Social Networks Informal Texts | Jul 27, 2024 | Language Modeling, Language Modelling | Unverified | 0
GP-VLS: A general-purpose vision language model for surgery | Jul 27, 2024 | Language Modeling, Language Modelling | Unverified | 0
LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models | Jul 27, 2024 | Language Modeling, Language Modelling | Unverified | 0
Large Language Model Agent in Financial Trading: A Survey | Jul 26, 2024 | Language Modeling, Language Modelling | Unverified | 0
ChipExpert: The Open-Source Integrated-Circuit-Design-Specific Large Language Model | Jul 26, 2024 | Language Modeling, Language Modelling | Unverified | 0
Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing | Jul 26, 2024 | Attribute, Language Modelling | Code Available | 1