| PARSE-Ego4D: Personal Action Recommendation Suggestions for Egocentric Videos | Jun 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | Jun 14, 2024 | CPULanguage Modeling | —Unverified | 0 |
| OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst | Jun 14, 2024 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data | Jun 14, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting | Jun 14, 2024 | Dialogue GenerationForm | CodeCode Available | 0 |
| Datasets for Multilingual Answer Sentence Selection | Jun 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners | Jun 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models | Jun 14, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| Automatically Labeling $200B Life-Saving Datasets: A Large Clinical Trial Outcome Benchmark | Jun 13, 2024 | Drug DiscoveryLarge Language Model | —Unverified | 0 |