| Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Apr 29, 2024 | document understandingGPU | CodeCode Available | 0 |
| CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Apr 29, 2024 | Data VisualizationDecision Making | CodeCode Available | 1 |
| Mamba-FETrack: Frame-Event Tracking via State Space Model | Apr 28, 2024 | GPUMamba | CodeCode Available | 4 |
| Deep Learning for Low-Latency, Quantum-Ready RF Sensing | Apr 27, 2024 | CPUDeep Learning | —Unverified | 0 |
| Child Speech Recognition in Human-Robot Interaction: Problem Solved? | Apr 26, 2024 | GPUspeech-recognition | —Unverified | 0 |
| Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection | Apr 26, 2024 | Classify murmursGPU | —Unverified | 0 |
| Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Apr 25, 2024 | GPU | CodeCode Available | 3 |
| NeRF-XL: Scaling NeRFs with Multiple GPUs | Apr 24, 2024 | GPUNeRF | —Unverified | 0 |
| BASS: Batched Attention-optimized Speculative Sampling | Apr 24, 2024 | GPUHumanEval | —Unverified | 0 |
| CORM: Cache Optimization with Recent Message for Large Language Model Inference | Apr 24, 2024 | GPULanguage Modeling | —Unverified | 0 |