| A Realistic Threat Model for Large Language Model Jailbreaks | Oct 21, 2024 | Language Modeling | Code Available | 1 |
| SeisLM: a Foundation Model for Seismic Waveforms | Oct 21, 2024 | Event Detection, Language Modeling | Code Available | 1 |
| M-RewardBench: Evaluating Reward Models in Multilingual Settings | Oct 20, 2024 | Language Modeling | Code Available | 1 |
| Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning | Oct 18, 2024 | Hallucination, Knowledge Base Question Answering | Code Available | 1 |
| MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts | Oct 18, 2024 | Language Modeling | Code Available | 1 |
| MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task Automation | Oct 17, 2024 | Decision Making, Language Modeling | Code Available | 1 |
| FIRE: Fact-checking with Iterative Retrieval and Verification | Oct 17, 2024 | Claim Verification, Fact Checking | Code Available | 1 |
| MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems | Oct 17, 2024 | Answer Generation, Language Modeling | Code Available | 1 |
| HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims | Oct 16, 2024 | Fact Checking, Language Modeling | Code Available | 1 |
| VividMed: Vision Language Model with Versatile Visual Grounding for Medicine | Oct 16, 2024 | Language Modeling | Code Available | 1 |