| SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models | May 23, 2024 | Natural Language Understanding, Quantization | Code Available | 2 |
| An L-BFGS-B approach for linear and nonlinear system identification under $\ell_1$ and group-Lasso regularization | Mar 6, 2024 | State Space Models, subspace methods | Code Available | 2 |
| Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey | May 1, 2024 | Quantization | Code Available | 2 |
| Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion | Mar 18, 2022 | 3D Object Detection, Data Augmentation | Code Available | 2 |
| Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems | Feb 16, 2025 | Open-Domain Question Answering, Question Answering | Code Available | 2 |
| Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning from Leading Indicators | Jan 31, 2024 | Multivariate Time Series Forecasting, Time Series | Code Available | 2 |
| BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities | Oct 18, 2024 | Conditional Image Generation, Image Generation | Code Available | 2 |
| MegaScenes: Scene-Level View Synthesis at Scale | Jun 17, 2024 | Novel View Synthesis | Code Available | 2 |
| PointDreamer: Zero-shot 3D Textured Mesh Reconstruction from Colored Point Cloud | Jun 22, 2024 | Image Inpainting | Code Available | 2 |
| Unicorn: Text-Only Data Synthesis for Vision Language Model Training | Mar 28, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Tactile-Augmented Radiance Fields | May 7, 2024 | | Code Available | 2 |
| DeGCN: Deformable Graph Convolutional Networks for Skeleton-Based Action Recognition | Mar 25, 2024 | Action Recognition, Skeleton Based Action Recognition | Code Available | 2 |
| Polis: Scaling Deliberation by Mapping High Dimensional Opinion Spaces | Jul 22, 2021 | Data Visualization, Decision Making | Code Available | 2 |
| Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward | Apr 1, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding | Nov 28, 2022 | Time Series, Time Series Analysis | Code Available | 2 |
| Dense Text Retrieval based on Pretrained Language Models: A Survey | Nov 27, 2022 | Retrieval, Survey | Code Available | 2 |
| DynIBaR: Neural Dynamic Image-Based Rendering | Nov 20, 2022 | Dynamic Reconstruction | Code Available | 2 |
| V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer | Jan 9, 2025 | | Code Available | 2 |
| Iterative Geometry Encoding Volume for Stereo Matching | Mar 12, 2023 | Omnidirectional Stereo Depth Estimation, Stereo Matching | Code Available | 2 |
| FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition | May 22, 2024 | Image Generation | Code Available | 2 |
| Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline | Jan 29, 2023 | Data Augmentation, Lightweight Deployment | Code Available | 2 |
| DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning | Apr 20, 2025 | Attribute, Face Swapping | Code Available | 2 |
| Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jul 3, 2024 | 3DGS, 3D Reconstruction | Code Available | 2 |
| From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation | Apr 23, 2024 | Image Generation | Code Available | 2 |
| E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQL | Sep 25, 2024 | Natural Language Queries, Text to SQL | Code Available | 2 |
| CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models | Mar 28, 2025 | GPU, GSM8K | Code Available | 2 |
| Diffusion Predictive Control with Constraints | Dec 12, 2024 | Denoising | Code Available | 2 |
| HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation | Nov 30, 2020 | 3D human pose and shape estimation, 3D Human Pose Estimation | Code Available | 2 |
| OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows | Dec 2, 2024 | Audio Synthesis, Image Generation | Code Available | 2 |
| DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection | Dec 11, 2023 | Anomaly Detection, Denoising | Code Available | 2 |
| AI-Generated Video Detection via Spatio-Temporal Anomaly Learning | Mar 25, 2024 | Optical Flow Estimation | Code Available | 2 |
| DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? | Sep 12, 2024 | | Code Available | 2 |
| BakedSDF: Meshing Neural SDFs for Real-Time View Synthesis | Feb 28, 2023 | Novel View Synthesis | Code Available | 2 |
| Hyperbolic Vision Transformers: Combining Improvements in Metric Learning | Mar 21, 2022 | Metric Learning | Code Available | 2 |
| Sequential Multivariate Change Detection with Calibrated and Memoryless False Detection Rates | Aug 2, 2021 | Change Detection | Code Available | 2 |
| TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification Tasks | Sep 9, 2024 | Classification, Language Modeling | Code Available | 2 |
| Efficient Long-Range Attention Network for Image Super-resolution | Mar 13, 2022 | Image Super-Resolution, Super-Resolution | Code Available | 2 |
| Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey | Aug 18, 2023 | Deblurring, Image Restoration | Code Available | 2 |
| Honegumi: An Interface for Accelerating the Adoption of Bayesian Optimization in the Experimental Sciences | Feb 4, 2025 | Bayesian Optimization, Experimental Design | Code Available | 2 |
| PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization | Dec 18, 2019 | Abstractive Text Summarization, Decoder | Code Available | 2 |
| Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era | Mar 13, 2024 | | Code Available | 2 |
| TransNeXt: Robust Foveal Visual Perception for Vision Transformers | Nov 28, 2023 | Classification, Domain Generalization | Code Available | 2 |
| UniDrive: Towards Universal Driving Perception Across Camera Configurations | Oct 17, 2024 | Autonomous Driving | Code Available | 2 |
| PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization | Oct 25, 2023 | Navigate | Code Available | 2 |
| Continual Learning on Graphs: Challenges, Solutions, and Opportunities | Feb 18, 2024 | Continual Learning, Graph Learning | Code Available | 2 |
| Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Mar 22, 2022 | Object Tracking, Relation | Code Available | 2 |
| Adapting a Language Model While Preserving its General Knowledge | Jan 21, 2023 | Continual Learning, General Knowledge | Code Available | 2 |
| DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design | Feb 26, 2024 | Avg, Drug Design | Code Available | 2 |
| NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality | May 9, 2022 | Sentence, Speech Synthesis | Code Available | 2 |
| General Detection-based Text Line Recognition | Sep 25, 2024 | HTR, Optical Character Recognition (OCR) | Code Available | 2 |