| Parsel🐍: Algorithmic Reasoning with Language Models by Composing Decompositions | Sep 21, 2023 | | CodeCode Available | 2 | 5 |
| SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution | Feb 27, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 | 5 |
| Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context | Sep 21, 2023 | | CodeCode Available | 2 | 5 |
| Memorize What Matters: Emergent Scene Decomposition from Multitraverse | May 27, 2024 | 3D ReconstructionNeural Rendering | CodeCode Available | 2 | 5 |
| Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Sep 26, 2024 | Image SegmentationNavigate | CodeCode Available | 2 | 5 |
| Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework | Sep 19, 2024 | Autonomous VehiclesDecision Making | CodeCode Available | 2 | 5 |
| Return of Unconditional Generation: A Self-supervised Representation Generation Method | Dec 6, 2023 | Conditional Image GenerationImage Generation | CodeCode Available | 2 | 5 |
| Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images | Nov 22, 2023 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 2 | 5 |
| SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection | Apr 27, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 2 | 5 |
| A Content-Driven Micro-Video Recommendation Dataset at Scale | Sep 27, 2023 | BenchmarkingRecommendation Systems | CodeCode Available | 2 | 5 |
| A Systematic Survey of Chemical Pre-trained Models | Oct 29, 2022 | Drug Designmolecular representation | CodeCode Available | 2 | 5 |
| MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation | Oct 15, 2024 | HallucinationLanguage Modeling | CodeCode Available | 2 | 5 |
| Renate: A Library for Real-World Continual Learning | Apr 24, 2023 | Continual Learning | CodeCode Available | 2 | 5 |
| State-Free Inference of State-Space Models: The Transfer Function Approach | May 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment | Feb 16, 2024 | Entity AlignmentGraph Neural Network | CodeCode Available | 2 | 5 |
| DeMo: Decoupled Momentum Optimization | Nov 29, 2024 | 10-shot image generation1 Image, 2*2 Stitchi | CodeCode Available | 2 | 5 |
| Enhancing Large Vision Language Models with Self-Training on Image Comprehension | May 30, 2024 | Image ComprehensionVisual Question Answering | CodeCode Available | 2 | 5 |
| CamI2V: Camera-Controlled Image-to-Video Diffusion Model | Oct 21, 2024 | | CodeCode Available | 2 | 5 |
| AutoRE: Document-Level Relation Extraction with Large Language Models | Mar 21, 2024 | Document-level Relation ExtractionRelation | CodeCode Available | 2 | 5 |
| Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding | Apr 11, 2024 | 3D geometryparameter-efficient fine-tuning | CodeCode Available | 2 | 5 |
| MogaNet: Multi-order Gated Aggregation Network | Nov 7, 2022 | 3D Human Pose EstimationImage Classification | CodeCode Available | 2 | 5 |
| WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs | Jun 26, 2024 | | CodeCode Available | 2 | 5 |
| Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation | Jun 4, 2024 | Image GenerationText to Image Generation | CodeCode Available | 2 | 5 |
| Ask Me Anything: A simple strategy for prompting language models | Oct 5, 2022 | Coreference ResolutionNatural Language Inference | CodeCode Available | 2 | 5 |
| An Ensemble Method to Produce High-Quality Word Embeddings (2016) | Apr 6, 2016 | Vocal Bursts Intensity PredictionWord Embeddings | CodeCode Available | 2 | 5 |
| Multistain Pretraining for Slide Representation Learning in Pathology | Aug 5, 2024 | Representation LearningSelf-Supervised Learning | CodeCode Available | 2 | 5 |
| CViT: Continuous Vision Transformer for Operator Learning | May 22, 2024 | Operator learning | CodeCode Available | 2 | 5 |
| In-Hand Object Rotation via Rapid Motor Adaptation | Oct 10, 2022 | ObjectReinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution | May 8, 2024 | Image Super-ResolutionMamba | CodeCode Available | 2 | 5 |
| Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations | Apr 29, 2024 | RetrievalText Retrieval | CodeCode Available | 2 | 5 |
| LLaGA: Large Language and Graph Assistant | Feb 13, 2024 | | CodeCode Available | 2 | 5 |
| BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with Transformer | Jan 1, 2025 | Data Augmentation | CodeCode Available | 2 | 5 |
| PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models | Apr 11, 2025 | ClusteringLanguage Modeling | CodeCode Available | 2 | 5 |
| One-shot 3D Object Canonicalization based on Geometric and Semantic Consistency | Jan 1, 2025 | Object | CodeCode Available | 2 | 5 |
| DD-Ranking: Rethinking the Evaluation of Dataset Distillation | May 19, 2025 | Data AugmentationData Compression | CodeCode Available | 2 | 5 |
| TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese | Jan 30, 2024 | Text Generation | CodeCode Available | 2 | 5 |
| BraTS orchestrator : Democratizing and Disseminating state-of-the-art brain tumor image analysis | Jun 13, 2025 | Brain Tumor SegmentationTumor Segmentation | CodeCode Available | 2 | 5 |
| Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration | May 6, 2024 | | CodeCode Available | 2 | 5 |
| Local Feature Matching Using Deep Learning: A Survey | Jan 31, 2024 | 3D ReconstructionDeep Learning | CodeCode Available | 2 | 5 |
| Outlier-robust Kalman Filtering through Generalised Bayes | May 9, 2024 | Bayesian InferenceComputational Efficiency | CodeCode Available | 2 | 5 |
| SoulChat: Improving LLMs' Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations | Nov 1, 2023 | | CodeCode Available | 2 | 5 |
| PortaSpeech: Portable and High-Quality Generative Text-to-Speech | Sep 30, 2021 | text-to-speechText to Speech | CodeCode Available | 2 | 5 |
| Perspective Fields for Single Image Camera Calibration | Dec 6, 2022 | Camera Calibration | CodeCode Available | 2 | 5 |
| SGFormer: Simplifying and Empowering Transformers for Large-Graph Representations | Jun 19, 2023 | Node Property PredictionPhilosophy | CodeCode Available | 2 | 5 |
| GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks | Nov 28, 2024 | BenchmarkingObject Counting | CodeCode Available | 2 | 5 |
| DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer | Jul 10, 2022 | FormInductive Bias | CodeCode Available | 2 | 5 |
| What Makes a Scene ? Scene Graph-based Evaluation and Feedback for Controllable Generation | Nov 23, 2024 | Image GenerationScene Generation | CodeCode Available | 2 | 5 |
| A Noise-Robust Turn-Taking System for Real-World Dialogue Robots: A Field Experiment | Mar 8, 2025 | speech-recognitionSpeech Recognition | CodeCode Available | 2 | 5 |
| CoMoSVC: Consistency Model-based Singing Voice Conversion | Jan 3, 2024 | GPUmodel | CodeCode Available | 2 | 5 |
| Speech Model Pre-training for End-to-End Spoken Language Understanding | Apr 7, 2019 | Speech-to-TextSpoken Language Understanding | CodeCode Available | 2 | 5 |