| Libra: Building Decoupled Vision System on Large Language Models | May 16, 2024 | Image to textLanguage Modeling | CodeCode Available | 2 |
| Grounded 3D-LLM with Referent Tokens | May 16, 2024 | Dense CaptioningDiversity | CodeCode Available | 2 |
| Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model | May 15, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| PLeak: Prompt Leaking Attacks against Large Language Model Applications | May 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| State-Free Inference of State-Space Models: The Transfer Function Approach | May 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Memory Mosaics | May 10, 2024 | DisentanglementIn-Context Learning | CodeCode Available | 2 |
| HMT: Hierarchical Memory Transformer for Long Context Language Processing | May 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | May 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio | May 8, 2024 | Audio Deepfake DetectionAudio Generation | CodeCode Available | 2 |
| SemiCD-VL: Visual-Language Model Guidance Makes Better Semi-supervised Change Detector | May 8, 2024 | Change DetectionLanguage Modeling | CodeCode Available | 2 |
| AntiFold: Improved antibody structure-based design using inverse folding | May 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom | May 6, 2024 | Fake News DetectionLanguage Modeling | CodeCode Available | 2 |
| A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | May 3, 2024 | Decision MakingFew-Shot Learning | CodeCode Available | 2 |
| WorldGPT: Empowering LLM as Multimodal World Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Paint by Inpaint: Learning to Add Image Objects by Removing Them First | Apr 28, 2024 | Image InpaintingLanguage Modeling | CodeCode Available | 2 |
| PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games | Apr 26, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| REBEL: Reinforcement Learning via Regressing Relative Rewards | Apr 25, 2024 | continuous-controlContinuous Control | CodeCode Available | 2 |
| SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Apr 22, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | Apr 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| σ-GPTs: A New Approach to Autoregressive Models | Apr 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Compression Represents Intelligence Linearly | Apr 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning | Apr 14, 2024 | Dense Video CaptioningDescriptive | CodeCode Available | 2 |
| LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning | Apr 12, 2024 | Image SegmentationLanguage Modeling | CodeCode Available | 2 |
| HGRN2: Gated Linear RNNs with State Expansion | Apr 11, 2024 | Image ClassificationLanguage Modeling | CodeCode Available | 2 |
| Behavior Trees Enable Structured Programming of Language Model Agents | Apr 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |