| Autoregressive Visual Tracking | Jan 1, 2023 | ObjectObject Tracking | CodeCode Available | 2 | 5 |
| OpenCOLE: Towards Reproducible Automatic Graphic Design Generation | Jun 12, 2024 | | CodeCode Available | 2 | 5 |
| COSMIC: COmmonSense knowledge for eMotion Identification in Conversations | Oct 6, 2020 | Emotion RecognitionEmotion Recognition in Conversation | CodeCode Available | 2 | 5 |
| GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information | Apr 19, 2023 | In-Context LearningRetrieval | CodeCode Available | 2 | 5 |
| Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models | Mar 29, 2024 | Question AnsweringVisual Question Answering | CodeCode Available | 2 | 5 |
| Decoupled-and-Coupled Networks: Self-Supervised Hyperspectral Image Super-Resolution with Subpixel Fusion | May 7, 2022 | Hyperspectral Image Super-ResolutionImage Super-Resolution | CodeCode Available | 2 | 5 |
| Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions | Mar 22, 2022 | Vision and Language Navigation | CodeCode Available | 2 | 5 |
| Lossless Image Compression through Super-Resolution | Apr 6, 2020 | Image CompressionSuper-Resolution | CodeCode Available | 2 | 5 |
| DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Iterative Diffusion-Based Refinement | Jan 1, 2024 | DiversityScene Flow Estimation | CodeCode Available | 2 | 5 |
| DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models | Jun 26, 2023 | | CodeCode Available | 2 | 5 |
| CLIP-CLOP: CLIP-Guided Collage and Photomontage | May 6, 2022 | Prompt Engineering | CodeCode Available | 2 | 5 |
| A Survey on In-context Learning | Dec 31, 2022 | In-Context LearningSurvey | CodeCode Available | 2 | 5 |
| Learn to Reason Efficiently with Adaptive Length-based Reward Shaping | May 21, 2025 | Reinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| Using Large Language Models to Tackle Fundamental Challenges in Graph Learning: A Comprehensive Survey | May 24, 2025 | Graph Learning | CodeCode Available | 2 | 5 |
| Spiking Transformers Need High Frequency Information | May 24, 2025 | Avg | CodeCode Available | 2 | 5 |
| HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges | Jun 18, 2025 | Combinatorial Optimization | CodeCode Available | 2 | 5 |
| ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration | Nov 25, 2024 | AI AgentVisual Question Answering | CodeCode Available | 2 | 5 |
| COVID-19 Image Data Collection | Mar 25, 2020 | COVID-19 Diagnosis | CodeCode Available | 2 | 5 |
| Against The Achilles' Heel: A Survey on Red Teaming for Generative Models | Mar 31, 2024 | Red TeamingSurvey | CodeCode Available | 2 | 5 |
| OmniCaptioner: One Captioner to Rule Them All | Apr 9, 2025 | AllImage Captioning | CodeCode Available | 2 | 5 |
| DeepDTA: Deep Drug-Target Binding Affinity Prediction | Jan 30, 2018 | Binary ClassificationDrug Discovery | CodeCode Available | 2 | 5 |
| Hierarchical NeuroSymbolic Approach for Comprehensive and Explainable Action Quality Assessment | Mar 20, 2024 | Action Quality AssessmentAction Quality Assessment Report Generation | CodeCode Available | 2 | 5 |
| Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series Forecasting | Feb 2, 2023 | DecoderMultivariate Time Series Forecasting | CodeCode Available | 2 | 5 |
| FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models | Jan 28, 2024 | DecoderStyle Transfer | CodeCode Available | 2 | 5 |
| WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Feb 22, 2024 | Image-level Supervised Instance Segmentationobject-detection | CodeCode Available | 2 | 5 |
| DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training | Mar 6, 2024 | DenoisingDiversity | CodeCode Available | 2 | 5 |
| Generative Diffusion-based Downscaling for Climate | Apr 27, 2024 | Super-Resolution | CodeCode Available | 2 | 5 |
| MambaVC: Learned Visual Compression with Selective State Spaces | May 24, 2024 | Long-range modelingState Space Models | CodeCode Available | 2 | 5 |
| Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts | Oct 13, 2022 | Atari GamesDecision Making | CodeCode Available | 2 | 5 |
| Matte Anything: Interactive Natural Image Matting with Segment Anything Models | Jun 7, 2023 | Image Matting | CodeCode Available | 2 | 5 |
| AmadeusGPT: a natural language interface for interactive animal behavioral analysis | Jul 10, 2023 | Descriptive | CodeCode Available | 2 | 5 |
| FocalFormer3D: Focusing on Hard Instance for 3D Object Detection | Jan 1, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task | Oct 4, 2024 | Translation | CodeCode Available | 2 | 5 |
| The Neural Hype and Comparisons Against Weak Baselines | Dec 1, 2018 | Ad-Hoc Information RetrievalCultural Vocal Bursts Intensity Prediction | CodeCode Available | 2 | 5 |
| Residual Quantization with Implicit Neural Codebooks | Jan 26, 2024 | Data CompressionQuantization | CodeCode Available | 2 | 5 |
| ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks | Dec 9, 2024 | GPUImitation Learning | CodeCode Available | 2 | 5 |
| AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results | Sep 15, 2020 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 | 5 |
| User Behavior Simulation with Large Language Model based Agents | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future | Apr 12, 2025 | | CodeCode Available | 2 | 5 |
| Nemo: First Glimpse of a New Rule Engine | Aug 30, 2023 | Knowledge Graphs | CodeCode Available | 2 | 5 |
| Softpick: No Attention Sink, No Massive Activations with Rectified Softmax | Apr 29, 2025 | Quantization | CodeCode Available | 2 | 5 |
| Interpretability at Scale: Identifying Causal Mechanisms in Alpaca | May 15, 2023 | | CodeCode Available | 2 | 5 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision MakingScene Segmentation | CodeCode Available | 2 | 5 |
| Point Segment and Count: A Generalized Framework for Object Counting | Jan 1, 2024 | Few-shot Object Counting and DetectionKnowledge Distillation | CodeCode Available | 2 | 5 |
| XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| QQQ: Quality Quattuor-Bit Quantization for Large Language Models | Jun 14, 2024 | Quantization | CodeCode Available | 2 | 5 |
| CV-Cities: Advancing Cross-View Geo-Localization in Global Cities | Nov 19, 2024 | Cross-View Geo-LocalisationDrone-view target localization | CodeCode Available | 2 | 5 |
| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive LearningKnowledge Distillation | CodeCode Available | 2 | 5 |
| Differentiable Reward Optimization for LLM based TTS system | Jul 8, 2025 | text-to-speechText to Speech | CodeCode Available | 2 | 5 |
| SF-V: Single Forward Video Generation Model | Jun 6, 2024 | Denoisingmodel | CodeCode Available | 2 | 5 |