| SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Abrupt Learning in Transformers: A Case Study on Matrix Completion | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding | Oct 29, 2024 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Learning and Unlearning of Fabricated Knowledge in Language Models | Oct 29, 2024 | Data PoisoningLanguage Modeling | —Unverified | 0 |
| FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation | Oct 29, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| From melodic note sequences to pitches using word2vec | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Are VLMs Really Blind | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Actor-Critic Approach to Boosting Text-to-SQL Large Language Model | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Energy-Based Diffusion Language Models for Text Generation | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |