| Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series Data | Feb 22, 2024 | Irregular Time SeriesMissing Values | CodeCode Available | 2 | 5 |
| Triad: Empowering LMM-based Anomaly Detection with Vision Expert-guided Visual Tokenizer and Manufacturing Process | Mar 17, 2025 | Anomaly Detection | CodeCode Available | 2 | 5 |
| OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFD | Mar 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision | Sep 25, 2023 | Image Quality Assessment | CodeCode Available | 2 | 5 |
| Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation | Jul 13, 2023 | RetrievalVideo Generation | CodeCode Available | 2 | 5 |
| PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training | Nov 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| KAGNNs: Kolmogorov-Arnold Networks meet Graph Learning | Jun 26, 2024 | Graph ClassificationGraph Learning | CodeCode Available | 2 | 5 |
| TRAK: Attributing Model Behavior at Scale | Mar 24, 2023 | model | CodeCode Available | 2 | 5 |
| Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction | Apr 13, 2023 | 3D-Aware Image Synthesis3D Generation | CodeCode Available | 2 | 5 |
| DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion | Jan 23, 2023 | Image-text ClassificationNode Classification | CodeCode Available | 2 | 5 |
| Long-Context Language Modeling with Parallel Context Encoding | Feb 26, 2024 | In-Context LearningInstruction Following | CodeCode Available | 2 | 5 |
| EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera | May 14, 2024 | Depth EstimationSurface Reconstruction | CodeCode Available | 2 | 5 |
| Diffusion Guidance Is a Controllable Policy Improvement Operator | May 29, 2025 | Offline RL | CodeCode Available | 2 | 5 |
| FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing | Jul 25, 2024 | Text-based Image Editing | CodeCode Available | 2 | 5 |
| OpenForest: A data catalogue for machine learning in forest monitoring | Nov 1, 2023 | | CodeCode Available | 2 | 5 |
| Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models | Jan 16, 2023 | Audio ClassificationFew-Shot Learning | CodeCode Available | 2 | 5 |
| Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging | Mar 26, 2025 | Prompt EngineeringReinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs | May 16, 2025 | Retrieval | CodeCode Available | 2 | 5 |
| Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model | Mar 6, 2025 | General KnowledgeImage Captioning | CodeCode Available | 2 | 5 |
| LoQT: Low-Rank Adapters for Quantized Pretraining | May 26, 2024 | GPULanguage Modeling | CodeCode Available | 2 | 5 |
| EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World | Mar 24, 2024 | Action AnticipationAction Quality Assessment | CodeCode Available | 2 | 5 |
| NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM | Feb 16, 2025 | NavigateRAG | CodeCode Available | 2 | 5 |
| Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | May 27, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 | 5 |
| Visual Speech Recognition for Multiple Languages in the Wild | Feb 26, 2022 | Hyperparameter OptimizationLipreading | CodeCode Available | 2 | 5 |
| LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving | Dec 26, 2023 | Autonomous Driving | CodeCode Available | 2 | 5 |
| Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | Oct 5, 2024 | Image Super-ResolutionKnowledge Distillation | CodeCode Available | 2 | 5 |
| Efficient Reinforcement Finetuning via Adaptive Curriculum Learning | Apr 7, 2025 | MathMathematical Reasoning | CodeCode Available | 2 | 5 |
| Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Feb 29, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 | 5 |
| Writing in the Margins: Better Inference Pattern for Long Context Retrieval | Aug 27, 2024 | RAGRetrieval | CodeCode Available | 2 | 5 |
| chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations | Jun 4, 2025 | Graph Neural Network | CodeCode Available | 2 | 5 |
| Envision3D: One Image to 3D with Anchor Views Interpolation | Mar 13, 2024 | Image to 3D | CodeCode Available | 2 | 5 |
| Full-Atom Peptide Design based on Multi-modal Flow Matching | Jun 2, 2024 | Drug Discovery | CodeCode Available | 2 | 5 |
| Inter-subject Contrastive Learning for Subject Adaptive EEG-based Visual Recognition | Feb 7, 2022 | Contrastive LearningEEG | CodeCode Available | 2 | 5 |
| HF-NeuS: Improved Surface Reconstruction Using High-Frequency Details | Jun 15, 2022 | Neural RenderingSurface Reconstruction | CodeCode Available | 2 | 5 |
| SKIPP'D: a SKy Images and Photovoltaic Power Generation Dataset for Short-term Solar Forecasting | Jul 2, 2022 | | CodeCode Available | 2 | 5 |
| LayoutGPT: Compositional Visual Planning and Generation with Large Language Models | May 24, 2023 | Image GenerationIndoor Scene Synthesis | CodeCode Available | 2 | 5 |
| SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation | Dec 28, 2023 | Real-Time Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| Neural-Fly Enables Rapid Learning for Agile Flight in Strong Winds | May 13, 2022 | Meta-Learning | CodeCode Available | 2 | 5 |
| Neural Cloth Simulation | Dec 13, 2022 | Physical Simulations | CodeCode Available | 2 | 5 |
| AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator | Feb 15, 2024 | BenchmarkingDiagnostic | CodeCode Available | 2 | 5 |
| Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection | Aug 8, 2024 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| A Case Study in CUDA Kernel Fusion: Implementing FlashAttention-2 on NVIDIA Hopper Architecture using the CUTLASS Library | Dec 19, 2023 | GPU | CodeCode Available | 2 | 5 |
| EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector | Nov 4, 2024 | DecoderEmotional Speech Synthesis | CodeCode Available | 2 | 5 |
| Striped Attention: Faster Ring Attention for Causal Transformers | Nov 15, 2023 | | CodeCode Available | 2 | 5 |
| 4D Contrastive Superflows are Dense 3D Representation Learners | Jul 8, 2024 | Autonomous DrivingContrastive Learning | CodeCode Available | 2 | 5 |
| Symbol as Points: Panoptic Symbol Spotting via Point-based Representation | Jan 19, 2024 | Point Cloud SegmentationVector Graphics | CodeCode Available | 2 | 5 |
| Personalized Large Language Models | Feb 14, 2024 | Emotion RecognitionHate Speech Detection | CodeCode Available | 2 | 5 |
| Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search | Jun 14, 2022 | | CodeCode Available | 2 | 5 |
| Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning | Jun 2, 2025 | Fact VerificationLanguage Modeling | CodeCode Available | 2 | 5 |
| STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery | Jun 13, 2024 | Graph GenerationObject | CodeCode Available | 2 | 5 |