| VividMed: Vision Language Model with Versatile Visual Grounding for Medicine | Oct 16, 2024 | Language Modeling | Code Available | 1 |
| DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models | Oct 15, 2024 | Language Modeling | Code Available | 1 |
| FVEval: Understanding Language Model Capabilities in Formal Verification of Digital Hardware | Oct 15, 2024 | Code Generation, Language Modeling | Code Available | 1 |
| Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited Responses | Oct 15, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| TopoLM: brain-like spatio-functional organization in a topographic language model | Oct 15, 2024 | Language Modeling | Code Available | 1 |
| EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs | Oct 13, 2024 | Language Modeling | Code Available | 1 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Oct 13, 2024 | Language Modeling | Code Available | 1 |
| PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning | Oct 11, 2024 | Data Poisoning, Language Modeling | Code Available | 1 |
| Retraining-Free Merging of Sparse MoE via Hierarchical Clustering | Oct 11, 2024 | Clustering, Language Modeling | Code Available | 1 |
| Zeroth-Order Fine-Tuning of LLMs in Random Subspaces | Oct 11, 2024 | Language Modeling | Code Available | 1 |