| Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration | Apr 19, 2024 | Ensemble Learning | CodeCode Available | 2 |
| Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert | Mar 29, 2023 | Contrastive LearningFace Generation | CodeCode Available | 2 |
| MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning | Nov 15, 2023 | Chart Understanding | CodeCode Available | 2 |
| VBR: A Vision Benchmark in Rome | Apr 17, 2024 | Autonomous VehiclesBenchmarking | CodeCode Available | 2 |
| Eliminating Warping Shakes for Unsupervised Online Video Stitching | Mar 11, 2024 | Image StitchingVideo Stabilization | CodeCode Available | 2 |
| Querying Databases with Function Calling | Jan 23, 2025 | | CodeCode Available | 2 |
| Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task | Oct 24, 2022 | | CodeCode Available | 2 |
| Query-Centric Trajectory Prediction | Jan 1, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| Understanding Why ViT Trains Badly on Small Datasets: An Intuitive Perspective | Feb 7, 2023 | Image Classification | CodeCode Available | 2 |
| Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation | May 6, 2025 | Boundary DetectionDecoder | CodeCode Available | 2 |
| ConDistFL: Conditional Distillation for Federated Learning from Partially Annotated Data | Aug 8, 2023 | Federated LearningKnowledge Distillation | CodeCode Available | 2 |
| Exploring the Potential of Large Language Models (LLMs) in Learning on Graphs | Jul 7, 2023 | General KnowledgeNode Classification | CodeCode Available | 2 |
| CNOS: A Strong Baseline for CAD-based Novel Object Segmentation | Jul 20, 2023 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers | Jan 11, 2023 | DenoisingInductive Bias | CodeCode Available | 2 |
| Hardware-Rasterized Ray-Based Gaussian Splatting | Mar 24, 2025 | Mixed RealityNovel View Synthesis | CodeCode Available | 2 |
| BSD: a Bayesian framework for parametric models of neural spectra | Oct 28, 2024 | Bayesian InferenceEEG | CodeCode Available | 2 |
| Execution Guided Line-by-Line Code Generation | Jun 12, 2025 | Code Generation | CodeCode Available | 2 |
| SEMv3: A Fast and Robust Approach to Table Separation Line Detection | May 20, 2024 | Line Detection | CodeCode Available | 2 |
| GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction | Sep 5, 2023 | 3D Reconstructionglobal-optimization | CodeCode Available | 2 |
| Interpreting the Weight Space of Customized Diffusion Models | Jun 13, 2024 | | CodeCode Available | 2 |
| HugNLP: A Unified and Comprehensive Library for Natural Language Processing | Feb 28, 2023 | | CodeCode Available | 2 |
| UNeXt: MLP-based Rapid Medical Image Segmentation Network | Mar 9, 2022 | DecoderImage Segmentation | CodeCode Available | 2 |
| FrontierNet: Learning Visual Cues to Explore | Jan 8, 2025 | Object Discovery | CodeCode Available | 2 |
| POTATO: The Portable Text Annotation Tool | Dec 16, 2022 | Active Learningtext annotation | CodeCode Available | 2 |
| LLaVAction: evaluating and training multi-modal large language models for action recognition | Mar 24, 2025 | Action RecognitionAction Understanding | CodeCode Available | 2 |
| Multi-Agent Reinforcement Learning is a Sequence Modeling Problem | May 30, 2022 | Decision MakingMuJoCo | CodeCode Available | 2 |
| Convergence Analysis of Probability Flow ODE for Score-based Generative Models | Apr 15, 2024 | | CodeCode Available | 2 |
| From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens | Feb 26, 2025 | | CodeCode Available | 2 |
| E3x: E(3)-Equivariant Deep Learning Made Easy | Jan 15, 2024 | Deep Learning | CodeCode Available | 2 |
| An Economic Framework for 6-DoF Grasp Detection | Jul 11, 2024 | Robotic Grasping | CodeCode Available | 2 |
| Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution | Oct 25, 2023 | DenoisingLanguage Modeling | CodeCode Available | 2 |
| REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models | Jan 4, 2025 | Computational Efficiency | CodeCode Available | 2 |
| Fine-tuned In-Context Learning Transformers are Excellent Tabular Data Classifiers | May 22, 2024 | In-Context Learning | CodeCode Available | 2 |
| Three Bricks to Consolidate Watermarks for Large Language Models | Jul 26, 2023 | valid | CodeCode Available | 2 |
| NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions | Sep 27, 2023 | | CodeCode Available | 2 |
| SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning | Mar 14, 2024 | Deep Reinforcement LearningDictionary Learning | CodeCode Available | 2 |
| High-Performance Transformers for Table Structure Recognition Need Early Convolutions | Nov 9, 2023 | DecoderRepresentation Learning | CodeCode Available | 2 |
| mbrs: A Library for Minimum Bayes Risk Decoding | Aug 8, 2024 | Text Generation | CodeCode Available | 2 |
| DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based Mapping | Feb 2, 2024 | 3D ReconstructionEarth Observation | CodeCode Available | 2 |
| Text2Light: Zero-Shot Text-Driven HDR Panorama Generation | Sep 20, 2022 | 4kinverse tone mapping | CodeCode Available | 2 |
| MTLoRA: Low-Rank Adaptation Approach for Efficient Multi-Task Learning | Jan 1, 2024 | Multi-Task Learningparameter-efficient fine-tuning | CodeCode Available | 2 |
| Zeus: Understanding and Optimizing GPU Energy Consumption of DNN Training | Aug 12, 2022 | GPU | CodeCode Available | 2 |
| Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation | Oct 11, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices | Jun 12, 2024 | Navigate | CodeCode Available | 2 |
| Blind Video Deflickering by Neural Filtering with a Flawed Atlas | Mar 14, 2023 | Video GenerationVideo Temporal Consistency | CodeCode Available | 2 |
| 4Hammer: a board-game reinforcement learning environment for the hour long time frame | May 19, 2025 | Board Gamesreinforcement-learning | CodeCode Available | 2 |
| WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit | Oct 30, 2022 | Keyword Spotting | CodeCode Available | 2 |
| OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation | Nov 29, 2023 | Hallucination | CodeCode Available | 2 |
| RoCo: Dialectic Multi-Robot Collaboration with Large Language Models | Jul 10, 2023 | Trajectory Planning | CodeCode Available | 2 |
| Towards a Unified Multi-Dimensional Evaluator for Text Generation | Oct 13, 2022 | nlg evaluationQuestion Answering | CodeCode Available | 2 |