| Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models | Jul 24, 2024 | ARCInductive Bias | CodeCode Available | 1 |
| TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback | Jul 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model | Jul 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| dMel: Speech Tokenization made Simple | Jul 22, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Jul 22, 2024 | Data AugmentationLanguage Modeling | CodeCode Available | 1 |
| Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning | Jul 20, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| ViLLa: Video Reasoning Segmentation with Large Language Model | Jul 18, 2024 | Image SegmentationLanguage Modeling | CodeCode Available | 1 |
| EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Jul 18, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| Analyzing the Generalization and Reliability of Steering Vectors | Jul 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Jul 16, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |