| Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models | Oct 24, 2023 | Audio ClassificationAudio Tagging | CodeCode Available | 2 | 5 |
| Denoising Diffusion Restoration Models | Jan 27, 2022 | ColorizationDeblurring | CodeCode Available | 2 | 5 |
| AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients | Oct 15, 2020 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Jul 5, 2024 | Action RecognitionFew-Shot Image Classification | CodeCode Available | 2 | 5 |
| Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis | Jul 9, 2024 | | CodeCode Available | 2 | 5 |
| The Power of Noise: Redefining Retrieval for RAG Systems | Jan 26, 2024 | Information RetrievalRAG | CodeCode Available | 2 | 5 |
| BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving | Mar 5, 2025 | Autonomous DrivingMotion Planning | CodeCode Available | 2 | 5 |
| CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition | Mar 20, 2023 | RetrievalScene Understanding | CodeCode Available | 2 | 5 |
| Class-Incremental Learning with CLIP: Adaptive Representation Adjustment and Parameter Fusion | Jul 19, 2024 | class-incremental learningClass Incremental Learning | CodeCode Available | 2 | 5 |
| One Configuration to Rule Them All? Towards Hyperparameter Transfer in Topic Models using Multi-Objective Bayesian Optimization | Feb 15, 2022 | AllBayesian Optimization | CodeCode Available | 2 | 5 |
| NeuRAD: Neural Rendering for Autonomous Driving | Nov 26, 2023 | Autonomous DrivingData Augmentation | CodeCode Available | 2 | 5 |
| ControlVideo: Conditional Control for One-shot Text-driven Video Editing and Beyond | May 26, 2023 | Text-to-Video EditingVideo Editing | CodeCode Available | 2 | 5 |
| MDETR - Modulated Detection for End-to-End Multi-Modal Understanding | Jan 1, 2021 | Phrase GroundingQuestion Answering | CodeCode Available | 2 | 5 |
| A Multi-task Learning Model for Chinese-oriented Aspect Polarity Classification and Aspect Term Extraction | Dec 17, 2019 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 2 | 5 |
| vSHARP: variable Splitting Half-quadratic Admm algorithm for Reconstruction of inverse-Problems | Sep 18, 2023 | compressed sensingMRI Reconstruction | CodeCode Available | 2 | 5 |
| ReGenNet: Towards Human Action-Reaction Synthesis | Mar 18, 2024 | Decoder | CodeCode Available | 2 | 5 |
| Scalable 3D Registration via Truncated Entry-wise Absolute Residuals | Apr 1, 2024 | | CodeCode Available | 2 | 5 |
| CRA5: Extreme Compression of ERA5 for Portable Global Climate and Weather Research via an Efficient Variational Transformer | May 6, 2024 | Weather Forecasting | CodeCode Available | 2 | 5 |
| EffiBench: Benchmarking the Efficiency of Automatically Generated Code | Feb 3, 2024 | BenchmarkingCode Completion | CodeCode Available | 2 | 5 |
| Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation | Jul 15, 2024 | | CodeCode Available | 2 | 5 |
| FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models | Dec 30, 2024 | Question AnsweringToken Reduction | CodeCode Available | 2 | 5 |
| Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding | Apr 14, 2023 | 3D Object DetectionScene Understanding | CodeCode Available | 2 | 5 |
| SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing | Dec 18, 2023 | DecoderImage Generation | CodeCode Available | 2 | 5 |
| Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems | Feb 26, 2025 | Instruction Following | CodeCode Available | 2 | 5 |
| DRLE: Decentralized Reinforcement Learning at the Edge for Traffic Light Control in the IoV | Sep 3, 2020 | Edge-computingManagement | CodeCode Available | 2 | 5 |
| Perception Test: A Diagnostic Benchmark for Multimodal Models | Oct 19, 2022 | DiagnosticMultiple-choice | CodeCode Available | 2 | 5 |
| Log-based Anomaly Detection with Deep Learning: How Far Are We? | Feb 9, 2022 | Anomaly DetectionDeep Learning | CodeCode Available | 2 | 5 |
| InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Mar 15, 2022 | Boundary DetectionHuman Parsing | CodeCode Available | 2 | 5 |
| BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network | Sep 6, 2023 | Generative Adversarial NetworkSpeech Synthesis | CodeCode Available | 2 | 5 |
| Generating Diverse and Natural 3D Human Motions From Text | Jan 1, 2022 | Motion Synthesis | CodeCode Available | 2 | 5 |
| DiGress: Discrete Denoising diffusion for graph generation | Sep 29, 2022 | DenoisingEdge Classification | CodeCode Available | 2 | 5 |
| InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment | Feb 13, 2024 | Hallucination | CodeCode Available | 2 | 5 |
| SceneTracker: Long-term Scene Flow Estimation Network | Mar 29, 2024 | 3D Object TrackingObject Tracking | CodeCode Available | 2 | 5 |
| MoVA: Adapting Mixture of Vision Experts to Multimodal Context | Apr 19, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 | 5 |
| The ArtBench Dataset: Benchmarking Generative Models with Artworks | Jun 22, 2022 | BenchmarkingConditional Image Generation | CodeCode Available | 2 | 5 |
| Identifying and Combating Bias in Segmentation Networks by leveraging multiple resolutions | Jun 29, 2022 | | CodeCode Available | 2 | 5 |
| Space-time 2D Gaussian Splatting for Accurate Surface Reconstruction under Complex Dynamic Scenes | Sep 27, 2024 | Human-Object Interaction DetectionSurface Reconstruction | CodeCode Available | 2 | 5 |
| Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap | Jan 18, 2024 | Code GenerationEvolutionary Algorithms | CodeCode Available | 2 | 5 |
| A Generalizable Anomaly Detection Method in Dynamic Graphs | Dec 21, 2024 | Anomaly DetectionDiversity | CodeCode Available | 2 | 5 |
| Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine | Jan 20, 2023 | Machine TranslationSentence | CodeCode Available | 2 | 5 |
| What Matters In The Structured Pruning of Generative Language Models? | Feb 7, 2023 | Text Generation | CodeCode Available | 2 | 5 |
| Efficient 3D Semantic Segmentation with Superpoint Transformer | Jun 13, 2023 | 3D Semantic SegmentationGPU | CodeCode Available | 2 | 5 |
| A differentiable brain simulator bridging brain simulation and brain-inspired computing | Nov 9, 2023 | | CodeCode Available | 2 | 5 |
| StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation | Dec 22, 2022 | Speech DereverberationSpeech Enhancement | CodeCode Available | 2 | 5 |
| Plenoxels: Radiance Fields without Neural Networks | Dec 9, 2021 | | CodeCode Available | 2 | 5 |
| Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models | Jan 6, 2024 | | CodeCode Available | 2 | 5 |
| Birbal: An efficient 7B instruct-model fine-tuned with curated datasets | Mar 4, 2024 | GPU | CodeCode Available | 2 | 5 |
| Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation | Mar 12, 2024 | Cross-Modal RetrievalGPU | CodeCode Available | 2 | 5 |
| InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior | Feb 7, 2024 | BenchmarkingDecoder | CodeCode Available | 2 | 5 |