| Coercing LLMs to do and reveal (almost) anything | Feb 21, 2024 | | CodeCode Available | 2 |
| Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models | Jan 1, 2024 | Survey | CodeCode Available | 2 |
| Learning-Rate-Free Learning by D-Adaptation | Jan 18, 2023 | | CodeCode Available | 2 |
| Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning | Dec 17, 2024 | Denoising | CodeCode Available | 2 |
| X-Pose: Detecting Any Keypoints | Oct 12, 2023 | 2D Human Pose Estimation2D Pose Estimation | CodeCode Available | 2 |
| A Survey on Detection of LLMs-Generated Content | Oct 24, 2023 | Survey | CodeCode Available | 2 |
| Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting | Jul 21, 2023 | ImputationProbabilistic Time Series Forecasting | CodeCode Available | 2 |
| COLMAP-Free 3D Gaussian Splatting | Dec 12, 2023 | 3DGSCamera Pose Estimation | CodeCode Available | 2 |
| Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement | Mar 3, 2023 | 3D ReconstructionNeRF | CodeCode Available | 2 |
| Super Monotonic Alignment Search | Sep 12, 2024 | CPUGPU | CodeCode Available | 2 |
| RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search | May 21, 2024 | Quantization | CodeCode Available | 2 |
| M^3CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought | May 26, 2024 | | CodeCode Available | 2 |
| Hello Again! LLM-powered Personalized Agent for Long-term Dialogue | Jun 9, 2024 | Response GenerationRetrieval | CodeCode Available | 2 |
| Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment | Jan 1, 2024 | cross-modal alignmentCross-Modal Retrieval | CodeCode Available | 2 |
| Diffusion Models and Representation Learning: A Survey | Jun 30, 2024 | DenoisingRepresentation Learning | CodeCode Available | 2 |
| HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors | Jul 26, 2024 | Depth EstimationGPU | CodeCode Available | 2 |
| XMainframe: A Large Language Model for Mainframe Modernization | Aug 5, 2024 | Code SummarizationLanguage Modeling | CodeCode Available | 2 |
| Learning Generative Interactive Environments By Trained Agent Exploration | Sep 10, 2024 | | CodeCode Available | 2 |
| Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration | Oct 7, 2024 | Image RestorationNavigate | CodeCode Available | 2 |
| DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models | Nov 22, 2024 | | CodeCode Available | 2 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification | Dec 1, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners | Dec 23, 2024 | Mathematical Reasoning | CodeCode Available | 2 |
| Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models | Apr 3, 2025 | | CodeCode Available | 2 |
| ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation | Dec 2, 2023 | 3D GenerationObject | CodeCode Available | 2 |
| MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations | Jul 1, 2024 | Benchmarkingdocument understanding | CodeCode Available | 2 |
| Saving 77% of the Parameters in Large Language Models Technical Report | Feb 9, 2025 | GPUText Generation | CodeCode Available | 2 |
| Tensor field networks: Rotation- and translation-equivariant neural networks for 3D point clouds | Feb 22, 2018 | Data AugmentationTranslation | CodeCode Available | 2 |
| RARE: Retrieval-Augmented Reasoning Modeling | Mar 30, 2025 | HallucinationMemorization | CodeCode Available | 2 |
| Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | May 2, 2024 | Model CompressionNeural Network Compression | CodeCode Available | 2 |
| Data Science Education in Undergraduate Physics: Lessons Learned from a Community of Practice | Mar 1, 2024 | | CodeCode Available | 2 |
| Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language Understanding | Jan 1, 2024 | Attribute | CodeCode Available | 2 |
| Adaptive Multi-Agent Reasoning via Automated Workflow Generation | Jul 18, 2025 | | CodeCode Available | 2 |
| JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Nov 16, 2023 | CPUGPU | CodeCode Available | 2 |
| CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Mar 7, 2024 | Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA) | CodeCode Available | 2 |
| LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments | Mar 13, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| LightGNN: Simple Graph Neural Network for Recommendation | Jan 6, 2025 | Computational EfficiencyGraph Neural Network | CodeCode Available | 2 |
| Interactive and Explainable Region-guided Radiology Report Generation | Apr 17, 2023 | Medical Report Generation | CodeCode Available | 2 |
| LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation | Sep 30, 2024 | AttributeCollaborative Filtering | CodeCode Available | 2 |
| COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act | Oct 10, 2024 | BenchmarkingFairness | CodeCode Available | 2 |
| Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models | Jul 17, 2024 | BenchmarkingRed Teaming | CodeCode Available | 2 |
| Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 | Mar 31, 2025 | Logical ReasoningMultiple-choice | CodeCode Available | 2 |
| Scattertext: a Browser-Based Tool for Visualizing how Corpora Differ | Jul 1, 2017 | | CodeCode Available | 2 |
| ScaleKD: Strong Vision Transformers Could Be Excellent Teachers | Nov 11, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats | May 29, 2024 | De-identificationFairness | CodeCode Available | 2 |
| NeRF-RPN: A general framework for object detection in NeRFs | Nov 21, 2022 | NeRFobject-detection | CodeCode Available | 2 |
| HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Oct 23, 2023 | DiagnosticHallucination | CodeCode Available | 2 |
| Automatic Differentiation-based Full Waveform Inversion with Flexible Workflows | Nov 30, 2024 | Dynamic Time Warping | CodeCode Available | 2 |
| AirMorph: Topology-Preserving Deep Learning for Pulmonary Airway Analysis | Dec 15, 2024 | AnatomyDeep Learning | CodeCode Available | 2 |
| Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey | Feb 14, 2024 | Survey | CodeCode Available | 2 |