| The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation | Apr 7, 2025 | Inference OptimizationReferring Video Object Segmentation | CodeCode Available | 5 | 5 |
| Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Apr 15, 2025 | GPUInference Optimization | CodeCode Available | 4 | 5 |
| SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL | Apr 15, 2025 | Inference Optimization | CodeCode Available | 3 | 5 |
| A Survey on Inference Optimization Techniques for Mixture of Experts Models | Dec 18, 2024 | Computational EfficiencyDistributed Computing | CodeCode Available | 3 | 5 |
| Inference Performance Optimization for Large Language Models on CPUs | Jul 10, 2024 | CPUGPU | CodeCode Available | 3 | 5 |
| CycleBNN: Cyclic Precision Training in Binary Neural Networks | Sep 28, 2024 | Inference Optimization | CodeCode Available | 2 | 5 |
| Painterly Image Harmonization using Diffusion Model | Aug 4, 2023 | Generative Adversarial NetworkImage Harmonization | CodeCode Available | 1 | 5 |
| A Novel 1D State Space for Efficient Music Rhythmic Analysis | Nov 1, 2021 | Inference OptimizationOnline Beat Tracking | CodeCode Available | 1 | 5 |
| Adaptive Deep Neural Network Inference Optimization with EENet | Jan 15, 2023 | Inference OptimizationScheduling | CodeCode Available | 1 | 5 |
| ADJUST: A Dictionary-Based Joint Reconstruction and Unmixing Method for Spectral Tomography | Dec 21, 2021 | 3D ReconstructionComputed Tomography (CT) | CodeCode Available | 1 | 5 |
| Easy and Efficient Transformer : Scalable Inference Solution For large NLP model | Apr 26, 2021 | DecoderGPU | CodeCode Available | 1 | 5 |
| A General Method for Amortizing Variational Filtering | Nov 13, 2018 | Inference OptimizationVariational Inference | CodeCode Available | 0 | 5 |
| Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging | Jun 29, 2025 | Inference OptimizationMixture-of-Experts | CodeCode Available | 0 | 5 |
| Representing Edge Flows on Graphs via Sparse Cell Complexes | Sep 4, 2023 | Inference OptimizationRepresentation Learning | CodeCode Available | 0 | 5 |
| Brevity is the soul of sustainability: Characterizing LLM response lengths | Jun 10, 2025 | DecoderInference Optimization | CodeCode Available | 0 | 5 |
| A Temporal Linear Network for Time Series Forecasting | Oct 28, 2024 | Computational EfficiencyInference Optimization | CodeCode Available | 0 | 5 |
| Input Convex Neural Networks | Sep 22, 2016 | ImputationInference Optimization | CodeCode Available | 0 | 5 |
| Iterative Amortized Inference | Jul 24, 2018 | Inference OptimizationVariational Inference | CodeCode Available | 0 | 5 |
| Enhanced graph-learning schemes driven by similar distributions of motifs | Jul 11, 2022 | Graph LearningInference Optimization | CodeCode Available | 0 | 5 |
| LLaSA: Large Language and E-Commerce Shopping Assistant | Aug 4, 2024 | Inference OptimizationSpecificity | CodeCode Available | 0 | 5 |
| LLM-Rank: A Graph Theoretical Approach to Pruning Large Language Models | Oct 17, 2024 | Inference OptimizationNetwork Pruning | CodeCode Available | 0 | 5 |
| Patched MOA: optimizing inference for diverse software development tasks | Jul 26, 2024 | Inference Optimization | CodeCode Available | 0 | 5 |
| CRVI: Convex Relaxation for Variational Inference | Jul 1, 2018 | Inference Optimizationregression | —Unverified | 0 | 0 |
| Residual-Based Error Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems | Jun 21, 2023 | Inference Optimization | —Unverified | 0 | 0 |
| Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification | Feb 23, 2025 | ClassificationInference Optimization | —Unverified | 0 | 0 |