| xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | May 22, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Automated Bioinformatics Analysis via AutoBA | Sep 6, 2023 | AI Agent, Language Modeling | Code Available | 2 |
| All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio | Jul 31, 2023 | All, Downbeat Tracking | Code Available | 2 |
| DataDream: Few-shot Guided Dataset Generation | Jul 15, 2024 | Classification, Dataset Generation | Code Available | 2 |
| UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems | Jun 29, 2024 | Combinatorial Optimization, Graph Neural Network | Code Available | 2 |
| Partial-to-Partial Shape Matching with Geometric Consistency | Apr 18, 2024 | | Code Available | 2 |
| Learning to Reason for Long-Form Story Generation | Mar 28, 2025 | Form, Math | Code Available | 2 |
| Post-Training Sparse Attention with Double Sparsity | Aug 11, 2024 | | Code Available | 2 |
| Group Robust Preference Optimization in Reward-free RLHF | May 30, 2024 | | Code Available | 2 |
| FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin | Nov 18, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions | Jun 29, 2025 | Computational Efficiency, GPU | Code Available | 2 |
| Neighborhood Attention Transformer | Apr 14, 2022 | image-classification, Image Classification | Code Available | 2 |
| The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs | Jul 15, 2025 | Code Generation, Safety Alignment | Code Available | 2 |
| CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Jul 18, 2024 | Decision Making, Diagnostic | Code Available | 2 |
| Exploring Radar Data Representations in Autonomous Driving: A Comprehensive Review | Dec 8, 2023 | Autonomous Driving, Retrieval | Code Available | 2 |
| mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval | Jan 31, 2025 | Instruction Following, Retrieval | Code Available | 2 |
| GRID: A Platform for General Robot Intelligence Development | Oct 2, 2023 | | Code Available | 2 |
| 4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling | Nov 29, 2023 | | Code Available | 2 |
| Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | Jun 26, 2023 | Hallucination, Visual Question Answering | Code Available | 2 |
| Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey | Apr 17, 2024 | Survey | Code Available | 2 |
| OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization | Oct 25, 2024 | Imitation Learning | Code Available | 2 |
| LiDAR Snowfall Simulation for Robust 3D Object Detection | Mar 28, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Accelerated Hierarchical Density Clustering | May 20, 2017 | Clustering | Code Available | 2 |
| EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech | Jun 12, 2024 | Emotional Speech Synthesis, text-to-speech | Code Available | 2 |
| UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes | May 29, 2025 | Texture Synthesis | Code Available | 2 |
| Compressing Large Language Models using Low Rank and Low Precision Decomposition | May 29, 2024 | Quantization | Code Available | 2 |
| DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction | Apr 21, 2023 | In-Context Learning, Text to SQL | Code Available | 2 |
| Towards Robust Multimodal Sentiment Analysis with Incomplete Data | Sep 30, 2024 | Multimodal Sentiment Analysis, Sentiment Analysis | Code Available | 2 |
| Repo2Run: Automated Building Executable Environment for Code Repository at Scale | Feb 19, 2025 | | Code Available | 2 |
| A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions | May 26, 2025 | Speech Enhancement | Code Available | 2 |
| Video ChatCaptioner: Towards Enriched Spatiotemporal Descriptions | Apr 9, 2023 | Video Captioning | Code Available | 2 |
| VidToMe: Video Token Merging for Zero-Shot Video Editing | Dec 17, 2023 | Video Editing, Video Generation | Code Available | 2 |
| RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments | Oct 23, 2023 | | Code Available | 2 |
| Leveraging Instance Features for Label Aggregation in Programmatic Weak Supervision | Oct 6, 2022 | Variational Inference | Code Available | 2 |
| A Survey on Graph Neural Networks for Remaining Useful Life Prediction: Methodologies, Evaluation and Future Trends | Sep 29, 2024 | Benchmarking, graph construction | Code Available | 2 |
| OncoReg: Medical Image Registration for Oncological Challenges | Mar 29, 2025 | Image Registration, Medical Image Registration | Code Available | 2 |
| DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion | Mar 13, 2023 | Denoising | Code Available | 2 |
| A 3D Generative Model for Structure-Based Drug Design | Mar 20, 2022 | Drug Design, valid | Code Available | 2 |
| TriDet: Temporal Action Detection with Relative Boundary Modeling | Mar 13, 2023 | Action Detection, Temporal Action Localization | Code Available | 2 |
| Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation | Oct 17, 2024 | | Code Available | 2 |
| Pop2Piano : Pop Audio-based Piano Cover Generation | Nov 2, 2022 | | Code Available | 2 |
| Tensor4D : Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering | Nov 21, 2022 | Dynamic Reconstruction, Tensor Decomposition | Code Available | 2 |
| Seq vs Seq: An Open Suite of Paired Encoders and Decoders | Jul 15, 2025 | Decoder, Large Language Model | Code Available | 2 |
| QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models | Oct 25, 2023 | GPU, Mixture-of-Experts | Code Available | 2 |
| TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Apr 29, 2024 | Denoising, Image Generation | Code Available | 2 |
| Metadata Conditioning Accelerates Language Model Pre-training | Jan 3, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models | Oct 17, 2024 | Contrastive Learning, Diversity | Code Available | 2 |
| SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning | Feb 6, 2025 | Benchmarking, Data Poisoning | Code Available | 2 |
| Large-Scale Pre-training for Person Re-identification with Noisy Labels | Mar 30, 2022 | Contrastive Learning, Multi-Object Tracking | Code Available | 2 |
| RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection | Aug 18, 2022 | Object, object-detection | Code Available | 2 |