| An Egocentric Vision-Language Model based Portable Real-time Smart Assistant | Mar 6, 2025 | Language Modeling | Code Available | 2 |
| Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities | Mar 6, 2025 | Language Modeling | Code Available | 2 |
| Scaling Rich Style-Prompted Text-to-Speech Datasets | Mar 6, 2025 | Language Modeling | Code Available | 2 |
| Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization | Mar 5, 2025 | Language Modeling | Code Available | 2 |
| MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments | Mar 4, 2025 | 2D Panoptic Segmentation, Graph Generation | Code Available | 2 |
| OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFD | Mar 3, 2025 | Language Modeling | Code Available | 2 |
| Forgetting Transformer: Softmax Attention with a Forget Gate | Mar 3, 2025 | Language Modeling | Code Available | 2 |
| LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement | Mar 1, 2025 | Language Modeling | Code Available | 2 |
| AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms | Feb 26, 2025 | Language Modeling | Code Available | 2 |
| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Feb 25, 2025 | Decision Making, Diagnostic | Code Available | 2 |