| Improved Regret in Stochastic Decision-Theoretic Online Learning under Differential Privacy | Feb 16, 2025 | 2k | —Unverified | 0 |
| Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains | Jan 24, 2025 | 2kLegal Reasoning | —Unverified | 0 |
| TimeLogic: A Temporal Logic Benchmark for Video QA | Jan 13, 2025 | 2kAction Segmentation | —Unverified | 0 |
| LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Jan 9, 2025 | 2k8k | —Unverified | 0 |
| Toward Corpus Size Requirements for Training and Evaluating Depression Risk Models Using Spoken Language | Dec 31, 2024 | 2k | —Unverified | 0 |
| Social-LLaVA: Enhancing Robot Navigation through Human-Language Reasoning in Social Spaces | Dec 30, 2024 | 2kRobot Navigation | —Unverified | 0 |
| Multimodal Preference Data Synthetic Alignment with Reward Model | Dec 23, 2024 | 2kCaption Generation | CodeCode Available | 0 |
| AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models | Dec 17, 2024 | 2kCode Generation | —Unverified | 0 |
| Block-Based Multi-Scale Image Rescaling | Dec 16, 2024 | 2k4k | —Unverified | 0 |
| Do Large Language Models Show Biases in Causal Learning? | Dec 13, 2024 | 2kMisinformation | —Unverified | 0 |