| Flaming-hot Initiation with Regular Execution Sampling for Large Language Models | Oct 28, 2024 | DiversityMath | CodeCode Available | 2 | 5 |
| aMUSEd: An Open MUSE Reproduction | Jan 3, 2024 | Image GenerationText to Image Generation | CodeCode Available | 2 | 5 |
| SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Jan 30, 2023 | Feature Upsamplingimage-classification | CodeCode Available | 2 | 5 |
| Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption | Jun 2, 2024 | AllImage Compression | CodeCode Available | 2 | 5 |
| Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution | Apr 15, 2025 | Image Super-ResolutionKnowledge Distillation | CodeCode Available | 2 | 5 |
| Generative Modeling for Mathematical Discovery | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents | Oct 18, 2024 | | CodeCode Available | 2 | 5 |
| Physics-based Deep Learning | Sep 11, 2021 | Deep LearningPhysical Simulations | CodeCode Available | 2 | 5 |
| LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks | Oct 28, 2024 | Quantization | CodeCode Available | 2 | 5 |
| Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting | Dec 20, 2023 | 3D GenerationImage Generation | CodeCode Available | 2 | 5 |
| Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation | Nov 8, 2023 | Style TransferVoice Conversion | CodeCode Available | 2 | 5 |
| MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving | Jul 27, 2023 | Autonomous DrivingNeRF | CodeCode Available | 2 | 5 |
| Text-to-3D using Gaussian Splatting | Sep 28, 2023 | 3D GenerationText to 3D | CodeCode Available | 2 | 5 |
| UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing | Nov 25, 2024 | | CodeCode Available | 2 | 5 |
| Segment Anything with Multiple Modalities | Aug 17, 2024 | SegmentationSensor Fusion | CodeCode Available | 2 | 5 |
| JudgeBench: A Benchmark for Evaluating LLM-based Judges | Oct 16, 2024 | Math | CodeCode Available | 2 | 5 |
| Global Features are All You Need for Image Retrieval and Reranking | Aug 14, 2023 | AllImage Retrieval | CodeCode Available | 2 | 5 |
| Self-supervised Anomaly Detection Pretraining Enhances Long-tail ECG Diagnosis | Aug 30, 2024 | Anomaly DetectionDiagnostic | CodeCode Available | 2 | 5 |
| BianQue: Balancing the Questioning and Suggestion Ability of Health LLMs with Multi-turn Health Conversations Polished by ChatGPT | Oct 24, 2023 | | CodeCode Available | 2 | 5 |
| In-Context Retrieval-Augmented Language Models | Jan 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| MIST: A Simple and Scalable End-To-End 3D Medical Imaging Segmentation Framework | Jul 31, 2024 | 3D Medical Imaging SegmentationMedical Image Segmentation | CodeCode Available | 2 | 5 |
| Self-Harmonized Chain of Thought | Sep 6, 2024 | | CodeCode Available | 2 | 5 |
| Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | Jan 18, 2022 | Robot Task PlanningWorld Knowledge | CodeCode Available | 2 | 5 |
| CRS-Diff: Controllable Remote Sensing Image Generation with Diffusion Model | Mar 18, 2024 | Image Generation | CodeCode Available | 2 | 5 |
| MFA-KWS: Effective Keyword Spotting with Multi-head Frame-asynchronous Decoding | May 26, 2025 | Keyword Spotting | CodeCode Available | 2 | 5 |
| TEXTure: Text-Guided Texturing of 3D Shapes | Feb 3, 2023 | Image Generationtext-guided-generation | CodeCode Available | 2 | 5 |
| OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | May 30, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 | 5 |
| LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos | Nov 29, 2024 | Boundary DetectionDense Video Captioning | CodeCode Available | 2 | 5 |
| IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos | Nov 18, 2024 | Pose EstimationSemantic Segmentation | CodeCode Available | 2 | 5 |
| MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks | Oct 14, 2024 | | CodeCode Available | 2 | 5 |
| MMA-Diffusion: MultiModal Attack on Diffusion Models | Nov 29, 2023 | | CodeCode Available | 2 | 5 |
| Dual-Camera Smooth Zoom on Mobile Phones | Apr 7, 2024 | | CodeCode Available | 2 | 5 |
| Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge | Jan 23, 2025 | SchedulingStreaming video understanding | CodeCode Available | 2 | 5 |
| CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra | Sep 6, 2023 | CoLAGaussian Processes | CodeCode Available | 2 | 5 |
| Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment | Apr 28, 2024 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 2 | 5 |
| Harmonizing Visual Text Comprehension and Generation | Jul 23, 2024 | multimodal generationReading Comprehension | CodeCode Available | 2 | 5 |
| TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder | Sep 12, 2024 | Diffusion PersonalizationDisentanglement | CodeCode Available | 2 | 5 |
| LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models | Mar 21, 2023 | Layout DesignLayout Generation | CodeCode Available | 2 | 5 |
| Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation | Feb 13, 2023 | Loop Closure DetectionPose Estimation | CodeCode Available | 2 | 5 |
| Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models | Mar 4, 2024 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 2 | 5 |
| Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts | Jan 1, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 | 5 |
| TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument | Feb 13, 2025 | Audio GenerationDecoder | CodeCode Available | 2 | 5 |
| OpenChemIE: An Information Extraction Toolkit For Chemistry Literature | Apr 1, 2024 | | CodeCode Available | 2 | 5 |
| DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut | Jun 5, 2024 | Image SegmentationSegmentation | CodeCode Available | 2 | 5 |
| Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Jun 17, 2024 | DecoderSegmentation | CodeCode Available | 2 | 5 |
| Optimizing tiny colorless feedback delay networks | Feb 17, 2024 | | CodeCode Available | 2 | 5 |
| Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation | Apr 17, 2025 | GPUObject Recognition | CodeCode Available | 2 | 5 |
| Detector-Free Structure from Motion | Jun 27, 2023 | Keypoint Detection | CodeCode Available | 2 | 5 |
| UMERegRobust -- Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration | Aug 22, 2024 | Point Cloud Registration | CodeCode Available | 2 | 5 |