| OptiMUS: Optimization Modeling Using MIP Solvers and large language models | Oct 9, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Fast Calibrated Explanations: Efficient and Uncertainty-Aware Explanations for Machine Learning Models | Oct 28, 2024 | Computational Efficiency, Feature Importance | Code Available | 2 |
| GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control | Dec 15, 2024 | Autonomous Driving | Code Available | 2 |
| ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical Feedback | Feb 15, 2024 | Computational chemistry, Graph Neural Network | Code Available | 2 |
| InteractVLM: 3D Interaction Reasoning from 2D Foundational Models | Apr 7, 2025 | 3D Reconstruction, Object | Code Available | 2 |
| Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models | Jun 13, 2024 | Math, Quantization | Code Available | 2 |
| ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings | May 19, 2023 | In-Context Learning, Question Answering | Code Available | 2 |
| PointAvatar: Deformable Point-based Head Avatars from Videos | Dec 16, 2022 | | Code Available | 2 |
| QuasiSim: Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer | Apr 11, 2024 | | Code Available | 2 |
| Large Scale Radio Frequency Signal Classification | Jul 20, 2022 | Diversity, General Classification | Code Available | 2 |
| CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a \$10,000 Budget; An Extra \$4,000 Unlocks 81.8% Accuracy | Jun 27, 2023 | | Code Available | 2 |
| ICAFusion: Iterative Cross-Attention Guided Feature Fusion for Multispectral Object Detection | Aug 15, 2023 | Multispectral Object Detection, object-detection | Code Available | 2 |
| Temporal Action Segmentation: An Analysis of Modern Techniques | Oct 19, 2022 | Action Segmentation, Segmentation | Code Available | 2 |
| HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting | Dec 5, 2024 | 3DGS, Novel View Synthesis | Code Available | 2 |
| Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | May 26, 2024 | 3D Reconstruction, Simultaneous Localization and Mapping | Code Available | 2 |
| SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions | Jul 16, 2024 | In-Context Learning, Knowledge Base Question Answering | Code Available | 2 |
| Recent Advances of Multimodal Continual Learning: A Comprehensive Survey | Oct 7, 2024 | Continual Learning, Survey | Code Available | 2 |
| KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches | Jul 1, 2024 | Book summarization, Quantization | Code Available | 2 |
| Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation | Mar 29, 2022 | Instance Segmentation, NeRF | Code Available | 2 |
| Spatial Mental Modeling from Limited Views | Jun 26, 2025 | | Code Available | 2 |
| Quamba: A Post-Training Quantization Recipe for Selective State Space Models | Oct 17, 2024 | Computational Efficiency, Mamba | Code Available | 2 |
| SegViT: Semantic Segmentation with Plain Vision Transformers | Oct 12, 2022 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Process Reward Models for LLM Agents: Practical Framework and Directions | Feb 14, 2025 | | Code Available | 2 |
| Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation | Oct 13, 2022 | Fairness | Code Available | 2 |
| Slideflow: Deep Learning for Digital Histopathology with Real-Time Whole-Slide Visualization | Apr 9, 2023 | Deep Learning, Histopathological Image Classification | Code Available | 2 |
| Dereflection Any Image with Diffusion Priors and Diversified Data | Mar 21, 2025 | Diversity, Reflection Removal | Code Available | 2 |
| pyABC: Efficient and robust easy-to-use approximate Bayesian computation | Mar 24, 2022 | | Code Available | 2 |
| Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio | Jun 12, 2024 | Clustering | Code Available | 2 |
| Generating Images with Multimodal Language Models | May 26, 2023 | Decoder, Image Generation | Code Available | 2 |
| Sparse Autoencoders for Hypothesis Generation | Feb 5, 2025 | | Code Available | 2 |
| Open-World Semantic Segmentation Including Class Similarity | Mar 12, 2024 | Anomaly Segmentation, Autonomous Vehicles | Code Available | 2 |
| Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery | Mar 14, 2025 | Management | Code Available | 2 |
| Tamil-Llama: A New Tamil Language Model Based on Llama 2 | Nov 10, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| PnP-Flow: Plug-and-Play Image Restoration with Flow Matching | Oct 3, 2024 | Deblurring, Denoising | Code Available | 2 |
| VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis | Oct 10, 2024 | Medical Image Analysis, Question Answering | Code Available | 2 |
| MotifBench: A standardized protein design benchmark for motif-scaffolding problems | Feb 18, 2025 | Protein Design, Protein Structure Prediction | Code Available | 2 |
| Real-Time Metric-Semantic Mapping for Autonomous Navigation in Outdoor Environments | Nov 30, 2024 | Autonomous Navigation, GPU | Code Available | 2 |
| Generative Artificial Intelligence for Navigating Synthesizable Chemical Space | Oct 4, 2024 | Drug Discovery, Navigate | Code Available | 2 |
| Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos | Dec 5, 2024 | Robot Manipulation | Code Available | 2 |
| Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations | Apr 26, 2024 | Imitation Learning | Code Available | 2 |
| Powershap: A Power-full Shapley Feature Selection Method | Jun 16, 2022 | feature selection | Code Available | 2 |
| Skinned Motion Retargeting with Residual Perception of Motion Semantics & Geometry | Mar 15, 2023 | motion retargeting | Code Available | 2 |
| Effective Data Augmentation With Diffusion Models | Feb 7, 2023 | Data Augmentation, Diversity | Code Available | 2 |
| Ref-GS: Directional Factorization for 2D Gaussian Splatting | Dec 1, 2024 | | Code Available | 2 |
| DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation | Jul 13, 2024 | Denoising, Image Segmentation | Code Available | 2 |
| HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition | Jan 11, 2024 | Contrastive Learning, Dynamic Facial Expression Recognition | Code Available | 2 |
| DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Nov 18, 2024 | Autonomous Driving, Decision Making | Code Available | 2 |
| GenoTEX: An LLM Agent Benchmark for Automated Gene Expression Data Analysis | Jun 21, 2024 | AI Agent, AutoML | Code Available | 2 |
| V-Max: A Reinforcement Learning Framework for Autonomous Driving | Mar 11, 2025 | Autonomous Driving, Decision Making | Code Available | 2 |
| Do As I Can, Not As I Say: Grounding Language in Robotic Affordances | Apr 4, 2022 | Decision Making, Language Modeling | Code Available | 2 |