| Optimizing Model Selection for Compound AI Systems | Feb 20, 2025 | model, Model Selection | Code Available | 2 | 5 |
| MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing | Feb 28, 2025 | Image Generation, Transfer Learning | Code Available | 2 | 5 |
| A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context Reasoning | Oct 16, 2024 | In-Context Learning, Knowledge Graphs | Code Available | 2 | 5 |
| Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven Cyberattacks | Oct 28, 2024 | | Code Available | 2 | 5 |
| GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Mar 13, 2025 | Diversity, Language Modeling | Code Available | 2 | 5 |
| What Limits LLM-based Human Simulation: LLMs or Our Design? | Jan 15, 2025 | | Code Available | 2 | 5 |
| Zero-Shot Vision Encoder Grafting via LLM Surrogates | May 28, 2025 | Decoder, Language Modeling | Code Available | 2 | 5 |
| OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching | Apr 19, 2022 | Graph Neural Network | Code Available | 2 | 5 |
| Omni-Kernel Network for Image Restoration | Mar 24, 2024 | Deblurring, Image Defocus Deblurring | Code Available | 2 | 5 |
| FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation | Jul 4, 2023 | Autonomous Driving, Prediction Of Occupancy Grid Maps | Code Available | 2 | 5 |
| StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map Construction | Aug 24, 2023 | Autonomous Driving | Code Available | 2 | 5 |
| Trends, Applications, and Challenges in Human Attention Modelling | Feb 28, 2024 | Language Modelling | Code Available | 2 | 5 |
| MixFormerV2: Efficient Fully Transformer Tracking | May 25, 2023 | CPU, GPU | Code Available | 2 | 5 |
| PartSTAD: 2D-to-3D Part Segmentation Task Adaptation | Jan 11, 2024 | 3D Part Segmentation, Foreground Segmentation | Code Available | 2 | 5 |
| Tri^2-plane: Thinking Head Avatar via Feature Pyramid | Jan 17, 2024 | | Code Available | 2 | 5 |
| nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model | Feb 5, 2024 | 3D Medical Imaging Segmentation, Image Segmentation | Code Available | 2 | 5 |
| LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation | Jan 30, 2024 | Hallucination, Knowledge Distillation | Code Available | 2 | 5 |
| Interpretable Pre-Trained Transformers for Heart Time-Series Data | Jul 30, 2024 | Decoder, Electrocardiography (ECG) | Code Available | 2 | 5 |
| Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft Dataset | Apr 1, 2022 | 3D Object Detection, Benchmarking | Code Available | 2 | 5 |
| RigNet: Neural Rigging for Articulated Characters | May 1, 2020 | Skeleton Rig Prediction | Code Available | 2 | 5 |
| Building Cooperative Embodied Agents Modularly with Large Language Models | Jul 5, 2023 | Text Generation | Code Available | 2 | 5 |
| Pretrained Transformers for Text Ranking: BERT and Beyond | Oct 13, 2020 | Information Retrieval, Reranking | Code Available | 2 | 5 |
| Global Convergence and Generalization Bound of Gradient-Based Meta-Learning with Deep Neural Nets | Jun 25, 2020 | Few-Shot Learning, Meta-Learning | Code Available | 2 | 5 |
| MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering | Mar 27, 2022 | Diversity, Multiple-choice | Code Available | 2 | 5 |
| Balanced MSE for Imbalanced Visual Regression | Mar 30, 2022 | Age Estimation, Fairness | Code Available | 2 | 5 |
| A Review of Safe Reinforcement Learning: Methods, Theory and Applications | May 20, 2022 | Autonomous Driving, Decision Making | Code Available | 2 | 5 |
| A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks | Jun 17, 2022 | text similarity | Code Available | 2 | 5 |
| 3D Object Detection for Autonomous Driving: A Comprehensive Survey | Jun 19, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 | 5 |
| Is Attention All That NeRF Needs? | Jul 27, 2022 | All, Generalizable Novel View Synthesis | Code Available | 2 | 5 |
| Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022 | Jul 4, 2022 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Place Recognition: A Comprehensive Review, Current Challenges and Future Directions | May 20, 2025 | 3D Place Recognition, Cross-modal place recognition | Code Available | 2 | 5 |
| Reporting Eye-Tracking Data Quality: Towards a New Standard | Mar 31, 2024 | | Code Available | 2 | 5 |
| TEACH: Temporal Action Composition for 3D Humans | Sep 9, 2022 | Motion Synthesis, Sentence | Code Available | 2 | 5 |
| ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech | Sep 15, 2022 | Gesture Generation | Code Available | 2 | 5 |
| GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts | Sep 19, 2023 | Red Teaming | Code Available | 2 | 5 |
| SPARF: Neural Radiance Fields from Sparse and Noisy Poses | Nov 21, 2022 | NeRF, Novel View Synthesis | Code Available | 2 | 5 |
| FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Jul 9, 2024 | Decision Making | Code Available | 2 | 5 |
| Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models | Feb 26, 2024 | Language Modelling | Code Available | 2 | 5 |
| Single channel voice separation for unknown number of speakers under reverberant and noisy settings | Nov 4, 2020 | Classification, General Classification | Code Available | 2 | 5 |
| Reconstructing Hands in 3D with Transformers | Dec 8, 2023 | 3D Hand Pose Estimation | Code Available | 2 | 5 |
| PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces | May 9, 2023 | NeRF, Surface Reconstruction | Code Available | 2 | 5 |
| BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via Adaptive Block-Based Gaussian Splatting | Apr 12, 2025 | 3DGS, Novel View Synthesis | Code Available | 2 | 5 |
| Person Re-Identification | Apr 27, 2022 | Person Re-Identification | Code Available | 2 | 5 |
| Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior | Mar 6, 2025 | Image Retrieval | Code Available | 2 | 5 |
| You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction | May 30, 2022 | Exposure Correction, Image Enhancement | Code Available | 2 | 5 |
| HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance | Apr 8, 2025 | Image Generation | Code Available | 2 | 5 |
| AIM: Adapting Image Models for Efficient Video Action Recognition | Feb 6, 2023 | Action Classification, Action Recognition | Code Available | 2 | 5 |
| Shifts 2.0: Extending The Dataset of Real Distributional Shifts | Jun 30, 2022 | Autonomous Driving, image-classification | Code Available | 2 | 5 |
| SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM | Mar 12, 2024 | Semantic Segmentation, Semantic SLAM | Code Available | 2 | 5 |
| Leveraging Procedural Generation to Benchmark Reinforcement Learning | Dec 3, 2019 | Procgen Hard (100M), reinforcement-learning | Code Available | 2 | 5 |