| ATTNChecker: Highly-Optimized Fault Tolerant Attention for Large Language Model Training | Oct 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing | Oct 14, 2024 | All, Binary Classification | Unverified | 0 |
| Skill Learning Using Process Mining for Large Language Model Plan Generation | Oct 14, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Character-aware audio-visual subtitling in context | Oct 14, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| PRACTIQ: A Practical Conversational Text-to-SQL dataset with Ambiguous and Unanswerable Queries | Oct 14, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Oct 14, 2024 | Benchmarking, Large Language Model | Code Available | 3 |
| Diagnosing Hate Speech Classification: Where Do Humans and Machines Disagree, and Why? | Oct 14, 2024 | Diagnostic, Large Language Model | Unverified | 0 |
| Large Language Model Evaluation via Matrix Nuclear-Norm | Oct 14, 2024 | Computational Efficiency, Data Compression | Code Available | 0 |
| Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios? | Oct 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Oct 14, 2024 | Density Ratio Estimation, GSM8K | Code Available | 0 |