| DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization | May 18, 2025 | Mathematical Reasoning | Code Available | 2 | 5 |
| Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems | Feb 24, 2025 | Computational Efficiency, PDE Surrogate Modeling | Code Available | 2 | 5 |
| TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Aug 25, 2024 | Autonomous Driving, Denoising | Code Available | 2 | 5 |
| Mamba Meets Financial Markets: A Graph-Mamba Approach for Stock Price Prediction | Sep 26, 2024 | Mamba, Prediction | Code Available | 2 | 5 |
| Audio-Synchronized Visual Animation | Mar 8, 2024 | | Code Available | 2 | 5 |
| InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning | Mar 8, 2023 | Semantic Segmentation | Code Available | 2 | 5 |
| SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design | Jan 29, 2024 | CPU, GPU | Code Available | 2 | 5 |
| LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding | Jun 29, 2023 | 16k, Image Captioning | Code Available | 2 | 5 |
| Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens | Nov 23, 2024 | Hallucination | Code Available | 2 | 5 |
| MaskBit: Embedding-free Image Generation via Bit Tokens | Sep 24, 2024 | Conditional Image Generation, Image Generation | Code Available | 2 | 5 |
| True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning | Jan 25, 2024 | Decision Making, Reinforcement Learning (RL) | Code Available | 2 | 5 |
| Emulating Self-attention with Convolution for Efficient Image Super-Resolution | Mar 9, 2025 | Computational Efficiency, Image Super-Resolution | Code Available | 2 | 5 |
| GuardReasoner: Towards Reasoning-based LLM Safeguards | Jan 30, 2025 | | Code Available | 2 | 5 |
| RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction | Mar 8, 2024 | Audio Generation, Computational Efficiency | Code Available | 2 | 5 |
| PPSURF: Combining Patches and Point Convolutions for Detailed Surface Reconstruction | Jan 16, 2024 | Surface Reconstruction | Code Available | 2 | 5 |
| Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse | Sep 17, 2024 | In-Context Learning, RAG | Code Available | 2 | 5 |
| Matryoshka Query Transformer for Large Vision-Language Models | May 29, 2024 | Language Modelling, Representation Learning | Code Available | 2 | 5 |
| Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery | Apr 14, 2024 | Change Detection, Edge Detection | Code Available | 2 | 5 |
| DiffusionInst: Diffusion Model for Instance Segmentation | Dec 6, 2022 | Denoising, Instance Segmentation | Code Available | 2 | 5 |
| Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues | Apr 12, 2024 | Data Augmentation, Face Anti-Spoofing | Code Available | 2 | 5 |
| Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Jan 1, 2024 | Segmentation, Semantic Segmentation | Code Available | 2 | 5 |
| MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation | Mar 17, 2023 | Decoder, Image Segmentation | Code Available | 2 | 5 |
| Non-stationary Transformers: Exploring the Stationarity in Time Series Forecasting | May 28, 2022 | Time Series, Time Series Analysis | Code Available | 2 | 5 |
| In-Context Language Learning: Architectures and Algorithms | Jan 23, 2024 | In-Context Learning, Language Modeling | Code Available | 2 | 5 |
| Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement | Aug 2, 2024 | Image Enhancement, Low-Light Image Enhancement | Code Available | 2 | 5 |
| Fin-GAN: forecasting and classifying financial time series via generative adversarial networks | Jan 31, 2024 | Generative Adversarial Network, Probabilistic Time Series Forecasting | Code Available | 2 | 5 |
| INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models | Jun 7, 2023 | | Code Available | 2 | 5 |
| SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Apr 22, 2024 | Decoder, Language Modeling | Code Available | 2 | 5 |
| Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator | Dec 20, 2023 | Data Augmentation, object-detection | Code Available | 2 | 5 |
| Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering | Nov 25, 2024 | Question Answering, Visual Question Answering | Code Available | 2 | 5 |
| Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction | Jul 30, 2021 | Click-Through Rate Prediction | Code Available | 2 | 5 |
| Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data | Jun 6, 2024 | Denoising, Language Modeling | Code Available | 2 | 5 |
| KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application | May 28, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| When Attention Sink Emerges in Language Models: An Empirical View | Oct 14, 2024 | Quantization | Code Available | 2 | 5 |
| SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding | Aug 28, 2024 | Instruction Following, scientific discovery | Code Available | 2 | 5 |
| CFAT: Unleashing Triangular Windows for Image Super-resolution | Jan 1, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| Towards Fast, Accurate and Stable 3D Dense Face Alignment | Sep 21, 2020 | 3D Face Modelling, 3D Face Reconstruction | Code Available | 2 | 5 |
| Diffusion Models for Adversarial Purification | May 16, 2022 | Adversarial Purification | Code Available | 2 | 5 |
| Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations | Feb 27, 2024 | Recommendation Systems | Code Available | 2 | 5 |
| RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions | Dec 31, 2024 | Diversity, RAG | Code Available | 2 | 5 |
| Samba: A Unified Mamba-based Framework for General Salient Object Detection | Jan 1, 2025 | Mamba, object-detection | Code Available | 2 | 5 |
| Centralized Feature Pyramid for Object Detection | Oct 5, 2022 | Object, object-detection | Code Available | 2 | 5 |
| DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models | Oct 17, 2022 | Diversity, Text Generation | Code Available | 2 | 5 |
| An End-to-End Structure with Novel Position Mechanism and Improved EMD for Stock Forecasting | Mar 25, 2024 | Position, Time Series | Code Available | 2 | 5 |
| PartCraft: Crafting Creative Objects by Parts | Jul 5, 2024 | | Code Available | 2 | 5 |
| Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters | Aug 7, 2024 | GPU | Code Available | 2 | 5 |
| A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images | Apr 25, 2021 | Decoder, Segmentation | Code Available | 2 | 5 |
| Large Language Models Are Zero-Shot Time Series Forecasters | Oct 11, 2023 | Imputation, Time Series | Code Available | 2 | 5 |
| TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Nov 26, 2024 | image-classification, Image Classification | Code Available | 2 | 5 |
| On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices | Feb 5, 2025 | Denoising, Model Optimization | Code Available | 2 | 5 |