A Dutch Financial Large Language Model Oct 3, 2024 Language Modeling Language Modelling
Code Code Available 0Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0Selective Attention Improves Transformer Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0SCA: Improve Semantic Consistent in Unrestricted Adversarial Attacks via DDPM Inversion Oct 3, 2024 Adversarial Attack Denoising
Code Code Available 0Large Language Model Aided Multi-objective Evolutionary Algorithm: a Low-cost Adaptive Approach Oct 3, 2024 Evolutionary Algorithms global-optimization
— Unverified 0LLMCO2: Advancing Accurate Carbon Footprint Prediction for LLM Inferences Oct 3, 2024 GPU Graph Neural Network
— Unverified 0UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation Oct 3, 2024 Chunking Language Modeling
— Unverified 0Multi-modal clothing recommendation model based on large model and VAE enhancement Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0NL-Eye: Abductive NLI for Images Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0Neutral residues: revisiting adapters for model extension Oct 3, 2024 Domain Adaptation Language Modelling
— Unverified 0SEAL: SEmantic-Augmented Imitation Learning via Language Model Oct 3, 2024 Decision Making Imitation Learning
— Unverified 0On the Proper Treatment of Tokenization in Psycholinguistics Oct 3, 2024 Language Modeling Language Modelling
Code Code Available 0MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation Oct 3, 2024 Diagnostic Image Segmentation
— Unverified 0GPT-4o as the Gold Standard: A Scalable and General Purpose Approach to Filter Language Model Pretraining Data Oct 3, 2024 Active Learning Language Modeling
— Unverified 0Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning Oct 3, 2024 Benchmarking Language Modeling
— Unverified 0LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model Oct 3, 2024 image-classification Image Classification
— Unverified 0Morphological evaluation of subwords vocabulary used by BETO language model Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0Real-World Cooking Robot System from Recipes Based on Food State Recognition Using Foundation Models and PDDL Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0Learning the Latent Rules of a Game from Data: A Chess Story Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0LLM-Augmented Symbolic Reinforcement Learning with Landmark-Based Task Decomposition Oct 2, 2024 Common Sense Reasoning Inductive logic programming
— Unverified 0Leveraging Large Language Models to Enhance Personalized Recommendations in E-commerce Oct 2, 2024 Diversity Language Modeling
— Unverified 0OCC-MLLM-Alpha:Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning Oct 2, 2024 3D Generation Language Modeling
— Unverified 0LS-HAR: Language Supervised Human Action Recognition with Salient Fusion, Construction Sites as a Use-Case Oct 2, 2024 Action Recognition Language Modeling
— Unverified 0TypedThinker: Typed Thinking Improves Large Language Model Reasoning Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 0Racing Thoughts: Explaining Contextualization Errors in Large Language Models Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0OCC-MLLM:Empowering Multimodal Large Language Model For the Understanding of Occluded Objects Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Spoken Grammar Assessment Using LLM Oct 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Long-range gene expression prediction with token alignment of large language model Oct 2, 2024 In-Context Learning Language Modeling
— Unverified 0Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 0SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics Oct 2, 2024 Classification Language Modeling
Code Code Available 0Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Generate then Refine: Data Augmentation for Zero-shot Intent Detection Oct 2, 2024 Data Augmentation Diversity
Code Code Available 0Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Investigating on RLHF methodology Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Agent-Driven Large Language Models for Mandarin Lyric Generation Oct 2, 2024 In-Context Learning Language Modeling
— Unverified 0Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0A Two-Stage Proactive Dialogue Generator for Efficient Clinical Information Collection Using Large Language Model Oct 2, 2024 Diagnostic Dialogue Generation
— Unverified 0Elaborative Subtopic Query Reformulation for Broad and Indirect Queries in Travel Destination Recommendation Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 0Automatic deductive coding in discourse analysis: an application of large language models in learning analytics Oct 2, 2024 Feature Engineering Language Modeling
Code Code Available 0Integrating Protein Sequence and Expression Level to Analysis Molecular Characterization of Breast Cancer Subtypes Oct 2, 2024 Clustering Feature Importance
— Unverified 0FARM: Functional Group-Aware Representations for Small Molecules Oct 2, 2024 Contrastive Learning Drug Discovery
— Unverified 0ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving Oct 2, 2024 Benchmarking Document Summarization
— Unverified 0When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Unveiling Language Skills via Path-Level Circuit Discovery Oct 2, 2024 Disentanglement In-Context Learning
Code Code Available 0ViDAS: Vision-based Danger Assessment and Scoring Oct 1, 2024 Fixed Few Shot Prompting Fixed Few Shot Prompting Danger Assessment
— Unverified 0ReXplain: Translating Radiology into Patient-Friendly Video Reports Oct 1, 2024 Anatomy Image Segmentation
— Unverified 0Removing Distributional Discrepancies in Captions Improves Image-Text Alignment Oct 1, 2024 Language Modeling Language Modelling
— Unverified 0ScVLM: Enhancing Vision-Language Model for Safety-Critical Event Understanding Oct 1, 2024 Contrastive Learning Hallucination
Code Code Available 0