| FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation | Oct 29, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Improving In-Context Learning with Small Language Model Ensembles | Oct 29, 2024 | Domain LabellingIn-Context Learning | CodeCode Available | 0 |
| Democratizing Reward Design for Personal and Representative Value-Alignment | Oct 29, 2024 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Abrupt Learning in Transformers: A Case Study on Matrix Completion | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents | Oct 29, 2024 | Decision MakingIntent Discovery | —Unverified | 0 |
| From melodic note sequences to pitches using word2vec | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning and Unlearning of Fabricated Knowledge in Language Models | Oct 29, 2024 | Data PoisoningLanguage Modeling | —Unverified | 0 |
| Discrete Modeling via Boundary Conditional Diffusion Processes | Oct 29, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Anticipating Future with Large Language Model for Simultaneous Machine Translation | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Are VLMs Really Blind | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CurateGPT: A flexible language-model assisted biocuration tool | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reliable Semantic Understanding for Real World Zero-shot Object Goal Navigation | Oct 29, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Rethinking Code Refinement: Learning to Judge Code Efficiency | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration | Oct 29, 2024 | GPULanguage Modeling | —Unverified | 0 |
| MARCO: Multi-Agent Real-time Chat Orchestration | Oct 29, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Multimodal Quantum Natural Language Processing: A Novel Framework for using Quantum Methods to Analyse Real Data | Oct 29, 2024 | Data IntegrationImage-text Classification | CodeCode Available | 0 |
| Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PerSRV: Personalized Sticker Retrieval with Vision-Language Model | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding | Oct 29, 2024 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Visualizing attention zones in machine reading comprehension models | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PepDoRA: A Unified Peptide Language Model via Weight-Decomposed Low-Rank Adaptation | Oct 28, 2024 | Activity PredictionContrastive Learning | —Unverified | 0 |
| Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training | Oct 28, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense | Oct 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games | Oct 28, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |