| Distilling Tiny and Ultra-fast Deep Neural Networks for Autonomous Navigation on Nano-UAVs | Jul 17, 2024 | Autonomous NavigationCollision Avoidance | CodeCode Available | 4 |
| Halu-J: Critique-Based Hallucination Judge | Jul 17, 2024 | Evidence SelectionHallucination | CodeCode Available | 4 |
| Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference | Jul 16, 2024 | | CodeCode Available | 4 |
| When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world Environments | Jul 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Deep-TEMPEST: Using Deep Learning to Eavesdrop on HDMI from its Unintended Electromagnetic Emanations | Jul 12, 2024 | | CodeCode Available | 4 |
| MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine | Jul 11, 2024 | Contrastive LearningLanguage Modelling | CodeCode Available | 4 |
| SEED-Story: Multimodal Long Story Generation with Large Language Model | Jul 11, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 4 |
| A Survey on Deep Stereo Matching in the Twenties | Jul 10, 2024 | Stereo MatchingSurvey | CodeCode Available | 4 |
| A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends | Jul 10, 2024 | Data Poisoning | CodeCode Available | 4 |
| OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training | Jul 10, 2024 | | CodeCode Available | 4 |
| The GeometricKernels Package: Heat and Matérn Kernels for Geometric Learning on Manifolds, Meshes, and Graphs | Jul 10, 2024 | Gaussian ProcessesUncertainty Quantification | CodeCode Available | 4 |
| Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Jul 9, 2024 | Retrieval-augmented Generation | CodeCode Available | 4 |
| Wavelet Convolutions for Large Receptive Fields | Jul 8, 2024 | 2D Object Detection2D Semantic Segmentation | CodeCode Available | 4 |
| MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Jul 8, 2024 | Video AlignmentVideo Generation | CodeCode Available | 4 |
| ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation | Jul 8, 2024 | multimodal generationText Generation | CodeCode Available | 4 |
| MUSE: Machine Unlearning Six-Way Evaluation for Language Models | Jul 8, 2024 | ArticlesMachine Unlearning | CodeCode Available | 4 |
| TALENT: A Tabular Analytics and Learning Toolbox | Jul 4, 2024 | | CodeCode Available | 4 |
| Modern Neighborhood Components Analysis: A Deep Tabular Baseline Two Decades Later | Jul 3, 2024 | | CodeCode Available | 4 |
| MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis | Jul 2, 2024 | AttributeImage Generation | CodeCode Available | 4 |
| Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models | Jul 2, 2024 | Mixture-of-Expertsparameter-efficient fine-tuning | CodeCode Available | 4 |
| Tiny-PULP-Dronets: Squeezing Neural Networks for Faster and Lighter Inference on Multi-Tasking Autonomous Nano-Drones | Jul 2, 2024 | Autonomous Navigation | CodeCode Available | 4 |
| Kolmogorov-Arnold Convolutions: Design Principles and Empirical Studies | Jul 1, 2024 | image-classificationImage Classification | CodeCode Available | 4 |
| fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence | Jul 1, 2024 | GPUPoint cloud reconstruction | CodeCode Available | 4 |
| FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds | Jul 1, 2024 | Audio GenerationVideo Alignment | CodeCode Available | 4 |
| A Closer Look at Deep Learning Methods on Tabular Datasets | Jul 1, 2024 | AttributeDeep Learning | CodeCode Available | 4 |