| Disentangling Length from Quality in Direct Preference Optimization | Mar 28, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 2 | 5 |
| WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency | Sep 16, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation | Apr 5, 2024 | Image Generation | Code Available | 2 | 5 |
| SymbolFit: Automatic Parametric Modeling with Symbolic Regression | Nov 15, 2024 | Form, regression | Code Available | 2 | 5 |
| Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios | Jan 30, 2024 | Benchmarking | Code Available | 2 | 5 |
| An open dataset for oracle bone script recognition and decipherment | Jan 27, 2024 | Decipherment | Code Available | 2 | 5 |
| CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Dec 19, 2023 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 2 | 5 |
| Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems | Jan 17, 2025 | Response Generation | Code Available | 2 | 5 |
| EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis | Sep 10, 2024 | Contrastive Learning, Cross-Modal Retrieval | Code Available | 2 | 5 |
| Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling | Oct 10, 2024 | Protein Folding | Code Available | 2 | 5 |
| LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation | May 31, 2024 | Recommendation Systems, Sequential Recommendation | Code Available | 2 | 5 |
| VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos | May 29, 2024 | EgoSchema, MME | Code Available | 2 | 5 |
| Reward Design with Language Models | Feb 27, 2023 | Language Modelling, Large Language Model | Code Available | 2 | 5 |
| Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation | Apr 13, 2025 | Domain Adaptation, Language Modeling | Code Available | 2 | 5 |
| Relevance-guided Supervision for OpenQA with ColBERT | Jul 1, 2020 | Natural Questions, Open-Domain Question Answering | Code Available | 2 | 5 |
| Modern Evolution Strategies for Creativity: Fitting Concrete Images and Abstract Concepts | Sep 18, 2021 | Evolutionary Algorithms | Code Available | 2 | 5 |
| Mixed-curvature decision trees and random forests | Oct 3, 2024 | Link Prediction, regression | Code Available | 2 | 5 |
| XLB: A differentiable massively parallel lattice Boltzmann library in Python | Nov 27, 2023 | CPU, GPU | Code Available | 2 | 5 |
| OmniXAI: A Library for Explainable AI | Jun 1, 2022 | counterfactual, Counterfactual Explanation | Code Available | 2 | 5 |
| Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis | Jun 12, 2024 | Time Series, Time Series Analysis | Code Available | 2 | 5 |
| AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research | May 17, 2025 | scientific discovery | Code Available | 2 | 5 |
| Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems | Oct 13, 2022 | Recommendation Systems | Code Available | 2 | 5 |
| Learned Image Compression with Dictionary-based Entropy Model | Apr 1, 2025 | Image Compression, model | Code Available | 2 | 5 |
| Context Autoencoder for Self-Supervised Representation Learning | Feb 7, 2022 | Decoder, Instance Segmentation | Code Available | 2 | 5 |
| Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector | Feb 5, 2024 | Cross-Domain Few-Shot, Cross-Domain Few-Shot Object Detection | Code Available | 2 | 5 |
| SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation | Nov 13, 2022 | Earth Observation, Multi-Label Image Classification | Code Available | 2 | 5 |
| Audio-FLAN: A Preliminary Release | Feb 23, 2025 | Zero-Shot Learning | Code Available | 2 | 5 |
| FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models | Feb 21, 2024 | Question Answering | Code Available | 2 | 5 |
| VCoder: Versatile Vision Encoders for Multimodal Large Language Models | Dec 21, 2023 | Image Captioning, Image Generation | Code Available | 2 | 5 |
| RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance | May 23, 2024 | Image Generation, Personalized Image Generation | Code Available | 2 | 5 |
| 3D Gaussian Splatting with Deferred Reflection | Apr 29, 2024 | Novel View Synthesis | Code Available | 2 | 5 |
| Centroid-Based Efficient Minimum Bayes Risk Decoding | Feb 17, 2024 | de-en, Translation | Code Available | 2 | 5 |
| VectorMapNet: End-to-end Vectorized HD Map Learning | Jun 17, 2022 | 3D Lane Detection, Autonomous Driving | Code Available | 2 | 5 |
| SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target Detection | Jan 28, 2024 | | Code Available | 2 | 5 |
| Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization | Mar 19, 2024 | Quantization | Code Available | 2 | 5 |
| TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models | Aug 7, 2023 | Hallucination, Object Hallucination | Code Available | 2 | 5 |
| Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance | Sep 2, 2024 | | Code Available | 2 | 5 |
| Measuring Re-identification Risk | Apr 12, 2023 | | Code Available | 2 | 5 |
| DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents | Jan 2, 2022 | Image Generation, Vocal Bursts Intensity Prediction | Code Available | 2 | 5 |
| RingFormer: A Neural Vocoder with Ring Attention and Convolution-Augmented Transformer | Jan 2, 2025 | Audio Generation, text-to-speech | Code Available | 2 | 5 |
| Transformer-Based Visual Segmentation: A Survey | Apr 19, 2023 | Autonomous Driving, Point Cloud Segmentation | Code Available | 2 | 5 |
| Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process | Sep 29, 2023 | Change Data Generation, Change Detection | Code Available | 2 | 5 |
| MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning | Jul 23, 2024 | Benchmarking, Decision Making | Code Available | 2 | 5 |
| CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention | Mar 13, 2023 | image-classification, Image Classification | Code Available | 2 | 5 |
| YOLOPoint Joint Keypoint and Object Detection | Feb 6, 2024 | Object, object-detection | Code Available | 2 | 5 |
| chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics | Aug 28, 2024 | | Code Available | 2 | 5 |
| VeriThinker: Learning to Verify Makes Reasoning Model Efficient | May 23, 2025 | model | Code Available | 2 | 5 |
| Colar: Effective and Efficient Online Action Detection by Consulting Exemplars | Mar 2, 2022 | Action Detection, Online Action Detection | Code Available | 2 | 5 |
| InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models | Dec 18, 2024 | Reasoning Segmentation, Segmentation | Code Available | 2 | 5 |
| MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking | Jul 28, 2023 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 2 | 5 |