| Diffusion Models and Representation Learning: A Survey | Jun 30, 2024 | DenoisingRepresentation Learning | CodeCode Available | 2 | 5 |
| HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors | Jul 26, 2024 | Depth EstimationGPU | CodeCode Available | 2 | 5 |
| XMainframe: A Large Language Model for Mainframe Modernization | Aug 5, 2024 | Code SummarizationLanguage Modeling | CodeCode Available | 2 | 5 |
| Learning Generative Interactive Environments By Trained Agent Exploration | Sep 10, 2024 | | CodeCode Available | 2 | 5 |
| Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration | Oct 7, 2024 | Image RestorationNavigate | CodeCode Available | 2 | 5 |
| DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models | Nov 22, 2024 | | CodeCode Available | 2 | 5 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 | 5 |
| 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification | Dec 1, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 | 5 |
| B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners | Dec 23, 2024 | Mathematical Reasoning | CodeCode Available | 2 | 5 |
| Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models | Apr 3, 2025 | | CodeCode Available | 2 | 5 |
| ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation | Dec 2, 2023 | 3D GenerationObject | CodeCode Available | 2 | 5 |
| MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations | Jul 1, 2024 | Benchmarkingdocument understanding | CodeCode Available | 2 | 5 |
| Saving 77% of the Parameters in Large Language Models Technical Report | Feb 9, 2025 | GPUText Generation | CodeCode Available | 2 | 5 |
| Tensor field networks: Rotation- and translation-equivariant neural networks for 3D point clouds | Feb 22, 2018 | Data AugmentationTranslation | CodeCode Available | 2 | 5 |
| RARE: Retrieval-Augmented Reasoning Modeling | Mar 30, 2025 | HallucinationMemorization | CodeCode Available | 2 | 5 |
| Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | May 2, 2024 | Model CompressionNeural Network Compression | CodeCode Available | 2 | 5 |
| Data Science Education in Undergraduate Physics: Lessons Learned from a Community of Practice | Mar 1, 2024 | | CodeCode Available | 2 | 5 |
| Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language Understanding | Jan 1, 2024 | Attribute | CodeCode Available | 2 | 5 |
| Adaptive Multi-Agent Reasoning via Automated Workflow Generation | Jul 18, 2025 | | CodeCode Available | 2 | 5 |
| JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Nov 16, 2023 | CPUGPU | CodeCode Available | 2 | 5 |
| CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Mar 7, 2024 | Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA) | CodeCode Available | 2 | 5 |
| LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments | Mar 13, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 | 5 |
| Interactive and Explainable Region-guided Radiology Report Generation | Apr 17, 2023 | Medical Report Generation | CodeCode Available | 2 | 5 |
| LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation | Sep 30, 2024 | AttributeCollaborative Filtering | CodeCode Available | 2 | 5 |
| COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act | Oct 10, 2024 | BenchmarkingFairness | CodeCode Available | 2 | 5 |