| PGN: The RNN's New Successor is Effective for Long-Range Time Series Forecasting | Sep 26, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 2 |
| Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation | Sep 26, 2024 | Image GenerationObject | CodeCode Available | 2 |
| E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding | Sep 26, 2024 | Question AnsweringVideo Understanding | CodeCode Available | 2 |
| A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications | Sep 26, 2024 | EEGSelf-Supervised Learning | CodeCode Available | 2 |
| Control Industrial Automation System with Large Language Model Agents | Sep 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Mamba Meets Financial Markets: A Graph-Mamba Approach for Stock Price Prediction | Sep 26, 2024 | MambaPrediction | CodeCode Available | 2 |
| Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Sep 26, 2024 | Image SegmentationNavigate | CodeCode Available | 2 |
| Neural Light Spheres for Implicit Image Stitching and View Synthesis | Sep 26, 2024 | Image Stitching | CodeCode Available | 2 |
| Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection | Sep 26, 2024 | Event DetectionRepresentation Learning | CodeCode Available | 2 |
| Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction | Sep 26, 2024 | 4D reconstructionObject | CodeCode Available | 2 |
| From News to Forecast: Integrating Event Analysis in LLM-Based Time Series Forecasting with Reflection | Sep 26, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 2 |
| Event-based Stereo Depth Estimation: A Survey | Sep 26, 2024 | Depth EstimationNavigate | CodeCode Available | 2 |
| MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models | Sep 26, 2024 | Large Language ModelModel Compression | CodeCode Available | 2 |
| SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion | Sep 26, 2024 | DescriptiveGeneralized Referring Expression Comprehension | CodeCode Available | 2 |
| FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Sep 26, 2024 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation | Sep 26, 2024 | Image SegmentationMamba | CodeCode Available | 2 |
| General Detection-based Text Line Recognition | Sep 25, 2024 | HTROptical Character Recognition (OCR) | CodeCode Available | 2 |
| INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Sep 25, 2024 | GPUQuantization | CodeCode Available | 2 |
| Empirical Asset Pricing with Large Language Model Agents | Sep 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Attention Prompting on Image for Large Vision-Language Models | Sep 25, 2024 | MM-VetVisual Prompting | CodeCode Available | 2 |
| Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction | Sep 25, 2024 | GPUToken Reduction | CodeCode Available | 2 |
| E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQL | Sep 25, 2024 | Natural Language QueriesText to SQL | CodeCode Available | 2 |
| Source-Free Domain Adaptation for YOLO Object Detection | Sep 25, 2024 | Domain AdaptationModel Selection | CodeCode Available | 2 |
| Game4Loc: A UAV Geo-Localization Benchmark from Game Data | Sep 25, 2024 | Drone-view target localizationgeo-localization | CodeCode Available | 2 |
| ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and Analysis | Sep 25, 2024 | ECG DigitizationTime Series | CodeCode Available | 2 |