How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics Jun 20, 2024 Language Modeling Language Modelling
— Unverified 0Exploring Spatial Representations in the Historical Lake District Texts with LLM-based Relation Extraction Jun 20, 2024 Language Modeling Language Modelling
— Unverified 0CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks Jun 20, 2024 General Knowledge Human Dynamics
Code Code Available 1AspirinSum: an Aspect-based utility-preserved de-identification Summarization framework Jun 20, 2024 De-identification Language Modelling
— Unverified 0Revealing Vision-Language Integration in the Brain with Multimodal Networks Jun 20, 2024 Contrastive Learning Language Modelling
Code Code Available 0LiveMind: Low-latency Large Language Models with Simultaneous Inference Jun 20, 2024 Collaborative Inference Language Modeling
Code Code Available 1Enhancing the LLM-Based Robot Manipulation Through Human-Robot Collaboration Jun 20, 2024 Language Modeling Language Modelling
— Unverified 0APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking Jun 20, 2024 Information Retrieval Language Modeling
— Unverified 0Information Guided Regularization for Fine-tuning Language Models Jun 20, 2024 Language Modeling Language Modelling
Code Code Available 0VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model Jun 20, 2024 Language Modeling Language Modelling
Code Code Available 1ReaL: Efficient RLHF Training of Large Language Models with Parameter Reallocation Jun 20, 2024 Language Modelling Large Language Model
— Unverified 0SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages Jun 20, 2024 Language Modelling Large Language Model
— Unverified 0Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding Jun 19, 2024 Language Modeling Language Modelling
Code Code Available 0Transferable speech-to-text large language model alignment module Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0Enhancing Collaborative Semantics of Language Model-Driven Recommendations via Graph-Aware Learning Jun 19, 2024 In-Context Learning Language Modeling
— Unverified 0PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Model Jun 19, 2024 Feature Engineering Language Modelling
Code Code Available 0From Single Agent to Multi-Agent: Improving Traffic Signal Control Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0Enhancing Travel Choice Modeling with Large Language Models: A Prompt-Learning Approach Jun 19, 2024 Discrete Choice Models Language Modeling
— Unverified 0VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models Jun 19, 2024 GPU Language Modeling
Code Code Available 3The Impact of Auxiliary Patient Data on Automated Chest X-Ray Report Generation and How to Incorporate It Jun 19, 2024 Diagnostic Language Modeling
— Unverified 0Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis Jun 19, 2024 Image Segmentation Language Modeling
Code Code Available 1In-Context Former: Lightning-fast Compressing Context for Large Language Model Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0On AI-Inspired UI-Design Jun 19, 2024 Language Modeling Language Modelling
Code Code Available 1Elliptical Attention Jun 19, 2024 Image Segmentation Language Modeling
Code Code Available 0Improving Visual Commonsense in Language Models via Multiple Image Generation Jun 19, 2024 Common Sense Reasoning Image Generation
Code Code Available 1Towards Holistic Language-video Representation: the language model-enhanced MSR-Video to Text Dataset Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts Jun 19, 2024 Language Modeling Language Modelling
Code Code Available 3Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration Jun 19, 2024 Benchmarking Distractor Generation
— Unverified 0LIT: Large Language Model Driven Intention Tracking for Proactive Human-Robot Collaboration -- A Robot Sous-Chef Application Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0Large Language Models are Biased Because They Are Large Language Models Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0Block-level Text Spotting with LLMs Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0VELO: A Vector Database-Assisted Cloud-Edge Collaborative LLM QoS Optimization Framework Jun 19, 2024 Language Modelling Large Language Model
— Unverified 0BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation Jun 19, 2024 Knowledge Distillation Language Modeling
Code Code Available 1Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks Jun 19, 2024 Decoder Language Modeling
Code Code Available 2Investigating Low-Cost LLM Annotation for~Spoken Dialogue Understanding Datasets Jun 19, 2024 Dialogue Understanding Language Modeling
— Unverified 0WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia Jun 19, 2024 Language Modelling RAG
— Unverified 0GPT Czech Poet: Generation of Czech Poetic Strophes with Language Models Jun 18, 2024 Language Modeling Language Modelling
— Unverified 0Improving Text-To-Audio Models with Synthetic Captions Jun 18, 2024 AudioCaps Audio captioning
Code Code Available 5MaskPure: Improving Defense Against Text Adversaries with Stochastic Purification Jun 18, 2024 Adversarial Defense Denoising
Code Code Available 0MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction Jun 18, 2024 Drug Discovery Graph Neural Network
Code Code Available 1Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors Jun 18, 2024 Hallucination Language Modeling
Code Code Available 0Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages Jun 18, 2024 Cross-Lingual Transfer Language Modeling
— Unverified 0Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment Jun 18, 2024 All Language Modeling
Code Code Available 0LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization Jun 18, 2024 Document Summarization Language Modelling
— Unverified 0AgentReview: Exploring Peer Review Dynamics with LLM Agents Jun 18, 2024 Language Modeling Language Modelling
Code Code Available 2Applying Ensemble Methods to Model-Agnostic Machine-Generated Text Detection Jun 18, 2024 Language Modeling Language Modelling
— Unverified 0UrbanLLM: Autonomous Urban Activity Planning and Management with Large Language Models Jun 18, 2024 Language Modeling Language Modelling
— Unverified 0QOG:Question and Options Generation based on Language Model Jun 18, 2024 Information Retrieval Language Modeling
— Unverified 0DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence? Jun 18, 2024 Language Modeling Language Modelling
Code Code Available 0Stealth edits to large language models Jun 18, 2024 Language Modelling Model Editing
Code Code Available 0