| WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher Learning | Jan 15, 2025 | cross-modal alignmentLanguage Modeling | CodeCode Available | 1 |
| Gandalf the Red: Adaptive Security for LLMs | Jan 14, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification | Jan 11, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Merging Feed-Forward Sublayers for Compressed Transformers | Jan 10, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Jan 6, 2025 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model | Jan 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band GapLanguage Modeling | CodeCode Available | 1 |
| Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding | Jan 3, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding | Jan 1, 2025 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |