Towards Evaluating Generalist Agents: An Automated Benchmark in Open World Oct 12, 2023 Benchmarking Diversity
Code Code Available 1Mapping Memes to Words for Multimodal Hateful Meme Classification Oct 12, 2023 Hateful Meme Classification Language Modeling
Code Code Available 1Prometheus: Inducing Fine-grained Evaluation Capability in Language Models Oct 12, 2023 Language Modelling Large Language Model
Code Code Available 7QASiNa: Religious Domain Question Answering using Sirah Nabawiyah Oct 12, 2023 Language Modelling Large Language Model
Code Code Available 0Large Language Models for Scientific Synthesis, Inference and Explanation Oct 12, 2023 Code Generation Language Modeling
Code Code Available 1Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques Oct 12, 2023 In-Context Learning Language Modelling
— Unverified 0Toward Joint Language Modeling for Speech Units and Text Oct 12, 2023 Language Modeling Language Modelling
— Unverified 0Language Models are Universal Embedders Oct 12, 2023 Code Search Language Modeling
Code Code Available 1Multimodal Large Language Model for Visual Navigation Oct 12, 2023 Language Modeling Language Modelling
— Unverified 0GameGPT: Multi-agent Collaborative Framework for Game Development Oct 12, 2023 Code Generation Hallucination
— Unverified 0Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability Oct 12, 2023 Causal Language Modeling In-Context Learning
Code Code Available 0Expanding the Vocabulary of BERT for Knowledge Base Construction Oct 12, 2023 Knowledge Base Construction Knowledge Base Population
Code Code Available 0Context Compression for Auto-regressive Transformers with Sentinel Tokens Oct 12, 2023 Language Modeling Language Modelling
Code Code Available 1Harnessing Large Language Models' Empathetic Response Generation Capabilities for Online Mental Health Counselling Support Oct 12, 2023 Empathetic Response Generation Language Modeling
— Unverified 0GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models Oct 12, 2023 Answer Generation Hallucination
Code Code Available 0Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning Oct 12, 2023 Image Captioning Image-text Retrieval
— Unverified 0Towards Robust Multi-Modal Reasoning via Model Selection Oct 12, 2023 Language Modelling Large Language Model
Code Code Available 1DistillSpec: Improving Speculative Decoding via Knowledge Distillation Oct 12, 2023 Knowledge Distillation Language Modelling
— Unverified 0HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science Oct 12, 2023 Language Modeling Language Modelling
Code Code Available 1On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models Oct 11, 2023 Language Modeling Language Modelling
Code Code Available 0Crosslingual Structural Priming and the Pre-Training Dynamics of Bilingual Language Models Oct 11, 2023 Language Modeling Language Modelling
— Unverified 0Measuring Feature Sparsity in Language Models Oct 11, 2023 Language Modeling Language Modelling
— Unverified 0LangNav: Language as a Perceptual Representation for Navigation Oct 11, 2023 Image Captioning Language Modeling
— Unverified 0Language Models As Semantic Indexers Oct 11, 2023 Contrastive Learning Information Retrieval
Code Code Available 1MatChat: A Large Language Model and Application Service Platform for Materials Science Oct 11, 2023 Language Modeling Language Modelling
— Unverified 0LLark: A Multimodal Instruction-Following Language Model for Music Oct 11, 2023 Instruction Following Language Modeling
Code Code Available 2From Supervised to Generative: A Novel Paradigm for Tabular Deep Learning with Large Language Models Oct 11, 2023 In-Context Learning Instruction Following
Code Code Available 0PHALM: Building a Knowledge Graph from Scratch by Prompting Humans and a Language Model Oct 11, 2023 Language Modeling Language Modelling
Code Code Available 1MatFormer: Nested Transformer for Elastic Inference Oct 11, 2023 Decoder Language Modelling
Code Code Available 1Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction Oct 11, 2023 Language Modelling Large Language Model
Code Code Available 1Ferret: Refer and Ground Anything Anywhere at Any Granularity Oct 11, 2023 Hallucination Language Modeling
Code Code Available 5ClausewitzGPT Framework: A New Frontier in Theoretical Large Language Model Enhanced Information Operations Oct 11, 2023 Language Modeling Language Modelling
— Unverified 0Fast-ELECTRA for Efficient Pre-training Oct 11, 2023 Language Modeling Language Modelling
— Unverified 0CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving Oct 11, 2023 Language Modeling Language Modelling
Code Code Available 5Cognate Transformer for Automated Phonological Reconstruction and Cognate Reflex Prediction Oct 11, 2023 Language Modeling Language Modelling
Code Code Available 0A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation Oct 11, 2023 Caption Generation Decoder
— Unverified 0A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models Oct 11, 2023 Language Modeling Language Modelling
Code Code Available 0Acoustic Model Fusion for End-to-end Speech Recognition Oct 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Answer Candidate Type Selection: Text-to-Text Language Model for Closed Book Question Answering Meets Knowledge Graphs Oct 10, 2023 Graph Question Answering Knowledge Graphs
— Unverified 0Prosody Analysis of Audiobooks Oct 10, 2023 Attribute Language Modeling
Code Code Available 0The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets Oct 10, 2023 Language Modeling Language Modelling
Code Code Available 1Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning Oct 10, 2023 Language Modeling Language Modelling
Code Code Available 2Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder Oct 10, 2023 Language Modeling Language Modelling
Code Code Available 0SWE-bench: Can Language Models Resolve Real-World GitHub Issues? Oct 10, 2023 Bug fixing Code Generation
Code Code Available 4Mistral 7B Oct 10, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 6MuseChat: A Conversational Music Recommendation System for Videos Oct 10, 2023 Language Modeling Language Modelling
Code Code Available 0Making Large Language Models Perform Better in Knowledge Graph Completion Oct 10, 2023 In-Context Learning Knowledge Graph Completion
Code Code Available 2SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA Oct 10, 2023 Diversity In-Context Learning
Code Code Available 0Jailbreak and Guard Aligned Language Models with Only Few In-Context Demonstrations Oct 10, 2023 In-Context Learning Language Modelling
— Unverified 0Bridging Items and Language: A Transition Paradigm for Large Language Model-Based Recommendation Oct 10, 2023 Attribute Language Modeling
— Unverified 0