| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation | May 19, 2025 | Binary ClassificationDeepFake Detection | CodeCode Available | 1 | 5 |
| FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Sep 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning | Dec 15, 2024 | Decision MakingLarge Language Model | CodeCode Available | 1 | 5 |
| LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport | Jan 16, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 | 5 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Feb 9, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| Large Language Model Unlearning via Embedding-Corrupted Prompts | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Oct 10, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Mar 2, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |