| Deep Learning Based Automatic Modulation Recognition: Models, Datasets, and Challenges | Jul 20, 2022 | Automatic Modulation RecognitionDeep Learning | CodeCode Available | 2 |
| Robust Human Matting via Semantic Guidance | Oct 11, 2022 | Image MattingSegmentation | CodeCode Available | 2 |
| InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation | Jul 8, 2024 | | CodeCode Available | 2 |
| Salesforce CausalAI Library: A Fast and Scalable Framework for Causal Analysis of Time Series and Tabular Data | Jan 25, 2023 | Causal DiscoveryCausal Inference | CodeCode Available | 2 |
| PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM | Jun 5, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| GraphGPT: Graph Instruction Tuning for Large Language Models | Oct 19, 2023 | Data AugmentationGraph Learning | CodeCode Available | 2 |
| Making LLaMA SEE and Draw with SEED Tokenizer | Oct 2, 2023 | multimodal generation | CodeCode Available | 2 |
| Unlocking Feature Visualization for Deeper Networks with MAgnitude Constrained Optimization | Jun 11, 2023 | | CodeCode Available | 2 |
| Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Aug 20, 2024 | Multi-agent Reinforcement LearningMulti-Task Learning | CodeCode Available | 2 |
| RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head Avatars | Mar 17, 2025 | | CodeCode Available | 2 |
| ReliableSwap: Boosting General Face Swapping Via Reliable Supervision | Jun 8, 2023 | Face ReenactmentFace Swapping | CodeCode Available | 2 |
| Structure-Aware Transformer for Graph Representation Learning | Feb 7, 2022 | Emotion Recognition in ConversationGraph Representation Learning | CodeCode Available | 2 |
| High-Order Control Barrier Functions: Insights and a Truncated Taylor-Based Formulation | Mar 19, 2025 | Collision Avoidance | CodeCode Available | 2 |
| Contrastive Learning of Asset Embeddings from Financial Time Series | Jul 26, 2024 | Contrastive LearningManagement | CodeCode Available | 2 |
| Pix2NeRF: Unsupervised Conditional p-GAN for Single Image to Neural Radiance Fields Translation | Jan 1, 2022 | 3D-Aware Image SynthesisImage Generation | CodeCode Available | 2 |
| CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language Models | Jun 14, 2024 | Multiple-choiceQuestion Answering | CodeCode Available | 2 |
| Graph Neural Network-based surrogate model for granular flows | May 9, 2023 | Graph Neural Network | CodeCode Available | 2 |
| Chasing Low-Carbon Electricity for Practical and Sustainable DNN Training | Mar 4, 2023 | | CodeCode Available | 2 |
| One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning | Jun 13, 2023 | AllDomain Generalization | CodeCode Available | 2 |
| GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting | Dec 28, 2024 | Camera LocalizationPose Estimation | CodeCode Available | 2 |
| Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion | Jun 5, 2024 | 3D Generation3D Reconstruction | CodeCode Available | 2 |
| SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models | Aug 31, 2023 | DecoderLanguage Modeling | CodeCode Available | 2 |
| Video Polyp Segmentation: A Deep Learning Perspective | Mar 27, 2022 | AttributeDeep Learning | CodeCode Available | 2 |
| P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds | Jul 7, 2024 | 3D Single Object TrackingGPU | CodeCode Available | 2 |
| PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models | May 15, 2024 | Benchmarking | CodeCode Available | 2 |
| mdCATH: A Large-Scale MD Dataset for Data-Driven Computational Biophysics | Jul 20, 2024 | | CodeCode Available | 2 |
| BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning | Feb 27, 2024 | Drug DiscoveryForward reaction prediction | CodeCode Available | 2 |
| Melting Pot 2.0 | Nov 24, 2022 | Artificial LifeNavigate | CodeCode Available | 2 |
| A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation | Feb 29, 2024 | Anomaly DetectionDecoder | CodeCode Available | 2 |
| SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation | Feb 12, 2025 | Earth Observationobject-detection | CodeCode Available | 2 |
| SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama | Aug 18, 2024 | Script GenerationVideo Captioning | CodeCode Available | 2 |
| GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Sep 24, 2024 | 3D geometry3DGS | CodeCode Available | 2 |
| TraDiffusion: Trajectory-Based Training-Free Image Generation | Aug 19, 2024 | Image Generation | CodeCode Available | 2 |
| Mephisto: A Framework for Portable, Reproducible, and Iterative Crowdsourcing | Jan 12, 2023 | | CodeCode Available | 2 |
| Attention-based Deep Multiple Instance Learning | Feb 13, 2018 | Aerial Scene ClassificationMultiple Instance Learning | CodeCode Available | 2 |
| Interacting Attention Graph for Single Image Two-Hand Reconstruction | Mar 17, 2022 | 3D Interacting Hand Pose EstimationVocal Bursts Valence Prediction | CodeCode Available | 2 |
| Frequency-domain MLPs are More Effective Learners in Time Series Forecasting | Nov 10, 2023 | Time SeriesTime Series Forecasting | CodeCode Available | 2 |
| REALY: Rethinking the Evaluation of 3D Face Reconstruction | Mar 18, 2022 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 2 |
| SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training | Jan 25, 2022 | DenoisingRepresentation Learning | CodeCode Available | 2 |
| Does Image Anonymization Impact Computer Vision Training? | Jun 8, 2023 | Face AnonymizationInstance Segmentation | CodeCode Available | 2 |
| NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario | May 24, 2023 | Autonomous DrivingQuestion Answering | CodeCode Available | 2 |
| In-Context Imitation Learning via Next-Token Prediction | Aug 28, 2024 | Imitation LearningPrediction | CodeCode Available | 2 |
| A Hybrid Transformer-Mamba Network for Single Image Deraining | Aug 31, 2024 | MambaRain Removal | CodeCode Available | 2 |
| Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization | Apr 8, 2025 | MathMathematical Reasoning | CodeCode Available | 2 |
| LViT: Language meets Vision Transformer in Medical Image Segmentation | Jun 29, 2022 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| gRNAde: Geometric Deep Learning for 3D RNA inverse design | May 24, 2023 | 3D geometryDeep Learning | CodeCode Available | 2 |
| Uni3D: Exploring Unified 3D Representation at Scale | Oct 10, 2023 | 3D Object ClassificationRetrieval | CodeCode Available | 2 |
| OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting | Jan 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation | Jul 17, 2024 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 2 |
| Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning | Apr 21, 2025 | AllForm | CodeCode Available | 2 |