| Act Now: A Novel Online Forecasting Framework for Large-Scale Streaming Data | Nov 28, 2024 | GPU | Code Available | 1 |
| Global Tensor Motion Planning | Nov 28, 2024 | Dataset Generation, Diversity | Code Available | 1 |
| ADAF: An Artificial Intelligence Data Assimilation Framework for Weather Forecasting | Nov 25, 2024 | GPU, Weather Forecasting | Code Available | 1 |
| Quantization without Tears | Nov 21, 2024 | GPU, Quantization | Code Available | 1 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPU, Language Modeling | Code Available | 1 |
| GPU-Accelerated Inverse Lithography Towards High Quality Curvy Mask Generation | Nov 11, 2024 | GPU | Code Available | 1 |
| Diffusion Sampling Correction via Approximately 10 Parameters | Nov 10, 2024 | GPU | Code Available | 1 |
| HRDecoder: High-Resolution Decoder Network for Fundus Image Lesion Segmentation | Nov 6, 2024 | Decoder, GPU | Code Available | 1 |
| LiVOS: Light Video Object Segmentation with Gated Linear Matching | Nov 5, 2024 | GPU, Semantic Segmentation | Code Available | 1 |
| Fast and Memory-Efficient Video Diffusion Using Streamlined Inference | Nov 2, 2024 | GPU, Video Generation | Code Available | 1 |
| KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation | Oct 28, 2024 | GPU, Knowledge Distillation | Code Available | 1 |
| LOGO -- Long cOntext aliGnment via efficient preference Optimization | Oct 24, 2024 | GPU, Language Modeling | Code Available | 1 |
| KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing | Oct 24, 2024 | GPU | Code Available | 1 |
| syren-new: Precise formulae for the linear and nonlinear matter power spectra with massive neutrinos and dynamical dark energy | Oct 18, 2024 | CPU, GPU | Code Available | 1 |
| xPerT: Extended Persistence Transformer | Oct 18, 2024 | GPU | Code Available | 1 |
| EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything | Oct 17, 2024 | Diagnostic, GPU | Code Available | 1 |
| SPA: 3D Spatial-Awareness Enables Effective Embodied Representation | Oct 10, 2024 | GPU, Neural Rendering | Code Available | 1 |
| Neural Reasoning Networks: Efficient Interpretable Neural Networks With Automatic Textual Explanations | Oct 10, 2024 | Fairness, Feature Importance | Code Available | 1 |
| PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing | Oct 7, 2024 | GPU | Code Available | 1 |
| Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective | Oct 6, 2024 | CPU, GPU | Code Available | 1 |
| LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services | Oct 3, 2024 | Benchmarking, GPU | Code Available | 1 |
| Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices | Oct 2, 2024 | GPU, Language Modeling | Code Available | 1 |
| TorchSISSO: A PyTorch-Based Implementation of the Sure Independence Screening and Sparsifying Operator for Efficient and Interpretable Model Discovery | Oct 2, 2024 | GPU, Model Discovery | Code Available | 1 |
| STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting | Oct 1, 2024 | GPU | Code Available | 1 |
| Analog In-Memory Computing Attention Mechanism for Fast and Energy-Efficient Large Language Models | Sep 28, 2024 | GPU | Code Available | 1 |