| Can Graph Learning Improve Planning in LLM-based Agents? | May 29, 2024 | Decision MakingGraph Learning | CodeCode Available | 2 | 5 |
| Universal Segmentation at Arbitrary Granularity with Language Instruction | Dec 4, 2023 | Referring Expression SegmentationSegmentation | CodeCode Available | 2 | 5 |
| UniPCGC: Towards Practical Point Cloud Geometry Compression via an Efficient Unified Approach | Mar 24, 2025 | Data Compression | CodeCode Available | 2 | 5 |
| Do Transformers Really Perform Bad for Graph Representation? | Jun 9, 2021 | Graph ClassificationGraph Property Prediction | CodeCode Available | 2 | 5 |
| A Comprehensive Survey on Continual Learning in Generative Models | Jun 16, 2025 | Continual LearningSurvey | CodeCode Available | 2 | 5 |
| ConvMAE: Masked Convolution Meets Masked Autoencoders | May 8, 2022 | Computational Efficiencyimage-classification | CodeCode Available | 2 | 5 |
| Hyperion - A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM | Jul 9, 2024 | Simultaneous Localization and Mapping | CodeCode Available | 2 | 5 |
| Learning Spatiotemporal Features with 3D Convolutional Networks | Dec 2, 2014 | Action RecognitionAction Recognition In Videos | CodeCode Available | 2 | 5 |
| FlowDec: A flow-based full-band general audio codec with high perceptual quality | Mar 3, 2025 | FAD | CodeCode Available | 2 | 5 |
| Map-free Visual Relocalization: Metric Pose Relative to a Single Image | Oct 11, 2022 | Depth EstimationDepth Prediction | CodeCode Available | 2 | 5 |
| Towards Comprehensive Detection of Chinese Harmful Memes | Oct 3, 2024 | | CodeCode Available | 2 | 5 |
| Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X Communications | Jul 9, 2024 | Deep Reinforcement Learning | CodeCode Available | 2 | 5 |
| TerDiT: Ternary Diffusion Models with Transformers | May 23, 2024 | Image GenerationQuantization | CodeCode Available | 2 | 5 |
| Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs | Jun 13, 2024 | Arithmetic ReasoningFact Verification | CodeCode Available | 2 | 5 |
| URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics | Jan 8, 2025 | MathMathematical Reasoning | CodeCode Available | 2 | 5 |
| Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking | Dec 28, 2024 | Knowledge DistillationVisual Tracking | CodeCode Available | 2 | 5 |
| ICP-Flow: LiDAR Scene Flow Estimation with ICP | Feb 27, 2024 | Autonomous DrivingScene Flow Estimation | CodeCode Available | 2 | 5 |
| Large Selective Kernel Network for Remote Sensing Object Detection | Mar 16, 2023 | Objectobject-detection | CodeCode Available | 2 | 5 |
| Crafting Better Contrastive Views for Siamese Representation Learning | Feb 7, 2022 | Contrastive LearningObject Localization | CodeCode Available | 2 | 5 |
| GeoDiff: a Geometric Diffusion Model for Molecular Conformation Generation | Mar 6, 2022 | Drug Discovery | CodeCode Available | 2 | 5 |
| LGRNet: Local-Global Reciprocal Network for Uterine Fibroid Segmentation in Ultrasound Videos | Jul 8, 2024 | SegmentationVideo Polyp Segmentation | CodeCode Available | 2 | 5 |
| CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning | Apr 18, 2022 | ChatbotOffline RL | CodeCode Available | 2 | 5 |
| 3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh Generation | Oct 21, 2024 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 2 | 5 |
| Learning charges and long-range interactions from energies and forces | Dec 19, 2024 | | CodeCode Available | 2 | 5 |
| PubLayNet: largest dataset ever for document layout analysis | Aug 16, 2019 | ArticlesDocument Layout Analysis | CodeCode Available | 2 | 5 |
| A Simple Image Segmentation Framework via In-Context Examples | Oct 7, 2024 | DecoderImage Segmentation | CodeCode Available | 2 | 5 |
| IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model | May 16, 2024 | Image EnhancementImage Reconstruction | CodeCode Available | 2 | 5 |
| Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis | Jul 13, 2024 | Mambaspeech-recognition | CodeCode Available | 2 | 5 |
| Large Language Models are Zero-Shot Rankers for Recommender Systems | May 15, 2023 | Recommendation Systems | CodeCode Available | 2 | 5 |
| Learning to Prompt with Text Only Supervision for Vision-Language Models | Jan 4, 2024 | Prompt Engineering | CodeCode Available | 2 | 5 |
| MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment | Sep 19, 2017 | Music Generation | CodeCode Available | 2 | 5 |
| Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection | Mar 29, 2025 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| Koopman neural operator as a mesh-free solver of non-linear partial differential equations | Jan 24, 2023 | Precipitation Forecasting | CodeCode Available | 2 | 5 |
| Analytic Federated Learning | May 25, 2024 | Federated Learning | CodeCode Available | 2 | 5 |
| EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce | Aug 14, 2023 | DiversityInstruction Following | CodeCode Available | 2 | 5 |
| Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages | Jul 3, 2024 | Language Modellingvalid | CodeCode Available | 2 | 5 |
| Concept Bottleneck Language Models For protein design | Nov 9, 2024 | Decision MakingDrug Discovery | CodeCode Available | 2 | 5 |
| LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model | Jun 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas | Mar 3, 2025 | Spatial Reasoning | CodeCode Available | 2 | 5 |
| LibCity: An Open Library for Traffic Prediction | Nov 4, 2021 | Multivariate Time Series ForecastingPrediction | CodeCode Available | 2 | 5 |
| Cartesian atomic cluster expansion for machine learning interatomic potentials | Feb 12, 2024 | | CodeCode Available | 2 | 5 |
| Skill Expansion and Composition in Parameter Space | Feb 9, 2025 | D4RL | CodeCode Available | 2 | 5 |
| Training-Free Activation Sparsity in Large Language Models | Aug 26, 2024 | Quantization | CodeCode Available | 2 | 5 |
| TexPainter: Generative Mesh Texturing with Multi-view Consistency | May 17, 2024 | Denoising | CodeCode Available | 2 | 5 |
| Benchmarking the Robustness of LiDAR Semantic Segmentation Models | Jan 3, 2023 | Autonomous DrivingBenchmarking | CodeCode Available | 2 | 5 |
| What Matters in Transformers? Not All Attention is Needed | Jun 22, 2024 | AllMMLU | CodeCode Available | 2 | 5 |
| Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision | Mar 14, 2024 | MathReinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| Rulebook: bringing co-routines to reinforcement learning environments | Apr 28, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 | 5 |
| Generative Semantic Segmentation | Mar 20, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom | May 6, 2024 | Fake News DetectionLanguage Modeling | CodeCode Available | 2 | 5 |