| Data Efficient Evaluation of Large Language Models and Text-to-Image Models via Adaptive Sampling | Jun 21, 2024 | ClusteringMMLU | —Unverified | 0 |
| DEM: Distribution Edited Model for Training with Mixed Data Distributions | Jun 21, 2024 | DiversityInstruction Following | —Unverified | 0 |
| Pistis-RAG: Enhancing Retrieval-Augmented Generation with Human Feedback | Jun 21, 2024 | Information RetrievalLearning-To-Rank | —Unverified | 0 |
| Optimised Grouped-Query Attention Mechanism for Transformers | Jun 21, 2024 | MMLU | —Unverified | 0 |
| Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation | Jun 20, 2024 | GSM8KLanguage Model Evaluation | CodeCode Available | 0 |
| Understanding Finetuning for Factual Knowledge Extraction | Jun 20, 2024 | MMLUQuestion Answering | —Unverified | 0 |
| Input Conditioned Graph Generation for Language Agents | Jun 17, 2024 | Graph GenerationMMLU | CodeCode Available | 0 |
| The Base-Rate Effect on LLM Benchmark Performance: Disambiguating Test-Taking Strategies from Benchmark Performance | Jun 17, 2024 | counterfactualMMLU | —Unverified | 0 |
| Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting | Jun 17, 2024 | EthicsMMLU | —Unverified | 0 |
| ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation | Jun 16, 2024 | Continual LearningGSM8K | CodeCode Available | 0 |