| AspirinSum: an Aspect-based utility-preserved de-identification Summarization framework | Jun 20, 2024 | De-identification, Language Modelling | Unverified | 0 |
| How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics | Jun 20, 2024 | Language Modelling | Unverified | 0 |
| Exploring Spatial Representations in the Historical Lake District Texts with LLM-based Relation Extraction | Jun 20, 2024 | Language Modelling | Unverified | 0 |
| CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks | Jun 20, 2024 | General Knowledge, Human Dynamics | Code Available | 1 |
| LiveMind: Low-latency Large Language Models with Simultaneous Inference | Jun 20, 2024 | Collaborative Inference, Language Modeling | Code Available | 1 |
| Ranking LLMs by compression | Jun 20, 2024 | Coreference Resolution | Unverified | 0 |
| SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages | Jun 20, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| ReaL: Efficient RLHF Training of Large Language Models with Parameter Reallocation | Jun 20, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking | Jun 20, 2024 | Information Retrieval, Language Modeling | Unverified | 0 |
| Detecting hallucinations in large language models using semantic entropy | Jun 19, 2024 | Large Language Model, Question Answering | Code Available | 3 |