| Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies | Oct 24, 2024 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| Sort-free Gaussian Splatting via Weighted Sum Rendering | Oct 24, 2024 | 3DGS3D Scene Reconstruction | —Unverified | 0 |
| ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | Oct 23, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs | Oct 23, 2024 | GPUScheduling | —Unverified | 0 |
| POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inference | Oct 23, 2024 | GPU | CodeCode Available | 0 |
| Trajectory Optimization for Spatial Microstructure Control in Electron Beam Metal Additive Manufacturing | Oct 23, 2024 | GPU | —Unverified | 0 |
| CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation | Oct 23, 2024 | GPULanguage Modeling | —Unverified | 0 |
| FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs | Oct 22, 2024 | CPUGPU | —Unverified | 0 |
| Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication Scheduling | Oct 22, 2024 | AllGPU | —Unverified | 0 |
| Semantic-guided Search for Efficient Program Repair with Large Language Models | Oct 22, 2024 | GPUHumanEval | —Unverified | 0 |
| AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower Cost | Oct 22, 2024 | CPUGPU | —Unverified | 0 |
| Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small | Oct 21, 2024 | GPU | —Unverified | 0 |
| Mean-Field Simulation-Based Inference for Cosmological Initial Conditions | Oct 21, 2024 | GPUNavigate | —Unverified | 0 |
| Fully Explicit Dynamic Gaussian Splatting | Oct 21, 2024 | GPUNovel View Synthesis | —Unverified | 0 |
| CompAct: Compressed Activations for Memory-Efficient LLM Training | Oct 20, 2024 | GPU | —Unverified | 0 |
| A Remedy to Compute-in-Memory with Dynamic Random Access Memory: 1FeFET-1C Technology for Neuro-Symbolic AI | Oct 20, 2024 | GPU | —Unverified | 0 |
| SemiHVision: Enhancing Medical Multimodal Models with a Semi-Human Annotated Dataset and Fine-Tuned Instruction Generation | Oct 19, 2024 | DiagnosticGPU | CodeCode Available | 0 |
| Accelerate Coastal Ocean Circulation Model with AI Surrogate | Oct 19, 2024 | CPUDisaster Response | —Unverified | 0 |
| Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language Benchmarks | Oct 18, 2024 | Code GenerationGPU | CodeCode Available | 0 |
| Parallel Backpropagation for Inverse of a Convolution with Application to Normalizing Flows | Oct 18, 2024 | DeblurringGPU | CodeCode Available | 0 |
| AC-Mix: Self-Supervised Adaptation for Low-Resource Automatic Speech Recognition using Agnostic Contrastive Mixup | Oct 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization | Oct 18, 2024 | GPUPortrait Animation | —Unverified | 0 |
| Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching | Oct 17, 2024 | GPUQuantization | —Unverified | 0 |
| Shavette: Low Power Neural Network Acceleration via Algorithm-level Error Detection and Undervolting | Oct 17, 2024 | GPU | CodeCode Available | 0 |
| MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes | Oct 17, 2024 | AttributeGPU | —Unverified | 0 |
| FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling | Oct 17, 2024 | Decision MakingDenoising | CodeCode Available | 0 |
| Optimization and Application of Cloud-based Deep Learning Architecture for Multi-Source Data Prediction | Oct 16, 2024 | Deep LearningDistributed Computing | —Unverified | 0 |
| RapidDock: Unlocking Proteome-scale Molecular Docking | Oct 16, 2024 | Drug DiscoveryGPU | —Unverified | 0 |
| Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats | Oct 16, 2024 | GPU | —Unverified | 0 |
| CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment | Oct 16, 2024 | CPUGPU | —Unverified | 0 |
| Learning Representations for Reasoning: Generalizing Across Diverse Structures | Oct 16, 2024 | GPUKnowledge Graphs | —Unverified | 0 |
| LR-SQL: A Supervised Fine-Tuning Method for Text2SQL Tasks under Low-Resource Scenarios | Oct 15, 2024 | GPU | CodeCode Available | 0 |
| Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Oct 14, 2024 | Autonomous DrivingGPU | CodeCode Available | 0 |
| ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera | Oct 14, 2024 | 3D Semantic Scene CompletionDecision Making | —Unverified | 0 |
| Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models | Oct 14, 2024 | DiversityGPU | CodeCode Available | 0 |
| Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models | Oct 14, 2024 | GPUImage Generation | —Unverified | 0 |
| PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs | Oct 14, 2024 | GPURecommendation Systems | —Unverified | 0 |
| MoIN: Mixture of Introvert Experts to Upcycle an LLM | Oct 13, 2024 | GPULanguage Modeling | —Unverified | 0 |
| VIBES -- Vision Backbone Efficient Selection | Oct 11, 2024 | GPU | —Unverified | 0 |
| ActNAS : Generating Efficient YOLO Models using Activation NAS | Oct 11, 2024 | CPUGPU | —Unverified | 0 |
| Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models | Oct 11, 2024 | CPUGPU | CodeCode Available | 0 |
| Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation | Oct 11, 2024 | GPUImage Segmentation | —Unverified | 0 |
| CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features | Oct 10, 2024 | Cross-Modal RetrievalGPU | —Unverified | 0 |
| HM-DF SNN: Transcending Conventional Online Learning with Advanced Training and Deployment | Oct 10, 2024 | GPU | —Unverified | 0 |
| Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models | Oct 9, 2024 | GPU | —Unverified | 0 |
| TinyClick: Single-Turn Agent for Empowering GUI Automation | Oct 9, 2024 | Data AugmentationGPU | —Unverified | 0 |
| Do better language models have crisper vision? | Oct 9, 2024 | DecoderGPU | —Unverified | 0 |
| QuAILoRA: Quantization-Aware Initialization for LoRA | Oct 9, 2024 | Causal Language ModelingGPU | —Unverified | 0 |
| PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches | Oct 8, 2024 | GPUGSM8K | —Unverified | 0 |
| Automated Quality Control System for Canned Tuna Production using Artificial Vision | Oct 8, 2024 | GPUOptical Character Recognition (OCR) | —Unverified | 0 |