| NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results | Apr 17, 2024 | Formvalid | CodeCode Available | 2 |
| KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Apr 3, 2024 | Fact CheckingForm | CodeCode Available | 2 |
| VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Mar 15, 2024 | EgoSchemaForm | CodeCode Available | 2 |
| RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering | Feb 26, 2024 | FormOpen-Domain Question Answering | CodeCode Available | 2 |
| PEDANTS: Cheap but Effective and Interpretable Answer Equivalence | Feb 17, 2024 | BenchmarkingForm | CodeCode Available | 2 |
| KVQ: Kwai Video Quality Assessment for Short-form Videos | Feb 11, 2024 | FormVideo Quality Assessment | CodeCode Available | 2 |
| ChemDFM: A Large Language Foundation Model for Chemistry | Jan 26, 2024 | Formmodel | CodeCode Available | 2 |
| VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models | Jul 12, 2023 | FormLanguage Modelling | CodeCode Available | 2 |
| LEACE: Perfect linear concept erasure in closed form | Jun 6, 2023 | FairnessForm | CodeCode Available | 2 |
| FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation | May 23, 2023 | FormLanguage Modelling | CodeCode Available | 2 |