| GASP: Gaussian Splatting for Physic-Based Simulations | Sep 9, 2024 | | CodeCode Available | 2 |
| GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization | Nov 4, 2022 | Depression DetectionDomain Generalization | CodeCode Available | 2 |
| ActiveGS: Active Scene Reconstruction Using Gaussian Splatting | Dec 23, 2024 | | CodeCode Available | 2 |
| RL-ADN: A High-Performance Deep Reinforcement Learning Environment for Optimal Energy Storage Systems Dispatch in Active Distribution Networks | Aug 7, 2024 | Computational EfficiencyData Augmentation | CodeCode Available | 2 |
| LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch | Oct 17, 2024 | Code GenerationCombinatorial Optimization | CodeCode Available | 2 |
| Actions Speak Louder Than Goals: Valuing Player Actions in Soccer | Feb 18, 2018 | Football Action Valuation | CodeCode Available | 2 |
| Spatial-Temporal Identity: A Simple yet Effective Baseline for Multivariate Time Series Forecasting | Aug 10, 2022 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 2 |
| Bridging the Divide: Reconsidering Softmax and Linear Attention | Dec 9, 2024 | | CodeCode Available | 2 |
| HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing | Nov 30, 2021 | | CodeCode Available | 2 |
| Efficient and Systematic Partitioning of Large and Deep Neural Networks for Parallelization | Aug 25, 2021 | | CodeCode Available | 2 |
| Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation | Aug 5, 2024 | RhythmSelf-Supervised Learning | CodeCode Available | 2 |
| SpectralGPT: Spectral Remote Sensing Foundation Model | Nov 13, 2023 | Change Detectionmodel | CodeCode Available | 2 |
| A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning | Aug 5, 2022 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| Discrete Event, Continuous Time RNNs | Oct 11, 2017 | Inductive BiasRetrieval | CodeCode Available | 2 |
| Implicit Neural Representation for Cooperative Low-light Image Enhancement | Mar 21, 2023 | Image EnhancementLanguage Modeling | CodeCode Available | 2 |
| NICE-SLAM: Neural Implicit Scalable Encoding for SLAM | Dec 22, 2021 | Simultaneous Localization and Mapping | CodeCode Available | 2 |
| OccFusion: Multi-Sensor Fusion Framework for 3D Semantic Occupancy Prediction | Mar 3, 2024 | 3D Semantic Occupancy PredictionAutonomous Driving | CodeCode Available | 2 |
| LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action | Jul 10, 2022 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Detecting Multimedia Generated by Large AI Models: A Survey | Jan 22, 2024 | Survey | CodeCode Available | 2 |
| GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation | Apr 9, 2024 | Go to AnyThingNavigate | CodeCode Available | 2 |
| A Controlled Study on Long Context Extension and Generalization in LLMs | Sep 18, 2024 | In-Context Learning | CodeCode Available | 2 |
| Involution: Inverting the Inherence of Convolution for Visual Recognition | Mar 10, 2021 | Image Classification | CodeCode Available | 2 |
| GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers | Feb 29, 2024 | GSM8KMath | CodeCode Available | 2 |
| Towards Learning a Generalist Model for Embodied Navigation | Dec 4, 2023 | 3D Question Answering (3D-QA)Embodied Question Answering | CodeCode Available | 2 |
| FlowDB a large scale precipitation, river, and flash flood dataset | Dec 21, 2020 | Multivariate Time Series Forecasting | CodeCode Available | 2 |
| Wavelet Diffusion Models are fast and scalable Image Generators | Nov 29, 2022 | BlockingImage Generation | CodeCode Available | 2 |
| Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach | Dec 16, 2024 | Representation LearningRetrieval | CodeCode Available | 2 |
| Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram | Oct 25, 2019 | Generative Adversarial NetworkGPU | CodeCode Available | 2 |
| Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning | Jul 29, 2024 | Chart Question AnsweringQuestion Answering | CodeCode Available | 2 |
| $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources | Oct 30, 2024 | GPU | CodeCode Available | 2 |
| SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding | Feb 15, 2025 | Question AnsweringStreaming video understanding | CodeCode Available | 2 |
| Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding | Nov 4, 2020 | Multi-Task LearningScene Understanding | CodeCode Available | 2 |
| Hardware-Efficient Attention for Fast Decoding | May 27, 2025 | | CodeCode Available | 2 |
| DeepGCNs: Making GCNs Go as Deep as CNNs | Oct 15, 2019 | 3D Point Cloud Classification3D Semantic Segmentation | CodeCode Available | 2 |
| LOVA3: Learning to Visual Question Answering, Asking and Assessment | May 23, 2024 | Question AnsweringVisual Question Answering | CodeCode Available | 2 |
| Video Diffusion Models are Training-free Motion Interpreter and Controller | May 23, 2024 | Video Generation | CodeCode Available | 2 |
| ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation | Oct 29, 2024 | Drug Discovery | CodeCode Available | 2 |
| EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks | Jan 31, 2019 | Data AugmentationGeneral Classification | CodeCode Available | 2 |
| Computer Vision for Road Imaging and Pothole Detection: A State-of-the-Art Review of Systems and Algorithms | Apr 28, 2022 | ArticlesSegmentation | CodeCode Available | 2 |
| Causal Context Adjustment Loss for Learned Image Compression | Oct 7, 2024 | Image Compression | CodeCode Available | 2 |
| Beyond Pinball Loss: Quantile Methods for Calibrated Uncertainty Quantification | Nov 18, 2020 | regressionUncertainty Quantification | CodeCode Available | 2 |
| UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control | Mar 4, 2024 | DiversityVideo Generation | CodeCode Available | 2 |
| 3D Dynamic Scene Graphs: Actionable Spatial Perception with Places, Objects, and Humans | Feb 15, 2020 | 3D ReconstructionHuman Detection | CodeCode Available | 2 |
| WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models | Nov 8, 2024 | Task PlanningZero-shot Generalization | CodeCode Available | 2 |
| MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis | Nov 16, 2022 | Image GenerationRepresentation Learning | CodeCode Available | 2 |
| Compositional Transformers for Scene Generation | Nov 17, 2021 | DisentanglementScene Generation | CodeCode Available | 2 |
| PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance | Jun 13, 2024 | Motion GenerationPosition | CodeCode Available | 2 |
| PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor | Mar 30, 2023 | Object | CodeCode Available | 2 |
| Pre-Trained Language Models for Interactive Decision-Making | Feb 3, 2022 | Decision MakingImitation Learning | CodeCode Available | 2 |
| Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data | May 7, 2020 | Voice Conversion | CodeCode Available | 2 |