| Few-Shot Bearing Fault Diagnosis Via Ensembling Transformer-Based Model With Mahalanobis Distance Metric Learning From Multiscale Features | Mar 25, 2024 | ClassificationFault Diagnosis | CodeCode Available | 2 |
| DGFont++: Robust Deformable Generative Networks for Unsupervised Font Generation | Dec 30, 2022 | Font GenerationImage-to-Image Translation | CodeCode Available | 2 |
| YOLOv5-6D: Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries | Mar 22, 2024 | 6D Pose Estimation using RGBGPU | CodeCode Available | 2 |
| Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| Analysing the Residual Stream of Language Models Under Knowledge Conflicts | Oct 21, 2024 | | CodeCode Available | 2 |
| JAILJUDGE: A Comprehensive Jailbreak Judge Benchmark with Multi-Agent Enhanced Explanation Evaluation Framework | Oct 11, 2024 | | CodeCode Available | 2 |
| Hypergraph Neural Networks | Sep 25, 2018 | Object RecognitionRepresentation Learning | CodeCode Available | 2 |
| Peeling Back the Layers: An In-Depth Evaluation of Encoder Architectures in Neural News Recommenders | Oct 2, 2024 | Model SelectionNews Recommendation | CodeCode Available | 2 |
| Efficient Non-stationary Online Learning by Wavelets with Applications to Online Distribution Shift Adaptation | Jul 21, 2024 | | CodeCode Available | 2 |
| ViSpeak: Visual Instruction Feedback in Streaming Videos | Mar 17, 2025 | Streaming video understandingVideo Understanding | CodeCode Available | 2 |
| SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery | Jul 17, 2022 | Land Cover ClassificationSemantic Segmentation | CodeCode Available | 2 |
| Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 Model | Sep 14, 2024 | Medical Image SegmentationPolyp Segmentation | CodeCode Available | 2 |
| Detection Transformer with Stable Matching | Apr 10, 2023 | DecoderPosition | CodeCode Available | 2 |
| Chain-of-Thought Reasoning Without Prompting | Feb 15, 2024 | Prompt Engineering | CodeCode Available | 2 |
| Domain Adaptation with a Single Vision-Language Embedding | Oct 28, 2024 | Domain AdaptationOne-shot Unsupervised Domain Adaptation | CodeCode Available | 2 |
| An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval | Jun 13, 2024 | Contrastive LearningImage Retrieval | CodeCode Available | 2 |
| HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation | Apr 15, 2025 | Benchmarkingscientific discovery | CodeCode Available | 2 |
| Prototype-based Cross-Modal Object Tracking | Dec 22, 2023 | ObjectObject Tracking | CodeCode Available | 2 |
| BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer | Jul 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models | Aug 1, 2024 | | CodeCode Available | 2 |
| 1st Place Solution of Multiview Egocentric Hand Tracking Challenge ECCV2024 | Sep 28, 2024 | Position | CodeCode Available | 2 |
| C^2LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation | Dec 6, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 2 |
| Region Rebalance for Long-Tailed Semantic Segmentation | Apr 5, 2022 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| NLLB-CLIP -- train performant multilingual image retrieval model on a budget | Sep 4, 2023 | Image RetrievalRetrieval | CodeCode Available | 2 |
| TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis | May 2, 2023 | Moment RetrievalMotion Generation | CodeCode Available | 2 |
| Gaussian Processes for Big Data | Sep 26, 2013 | Gaussian ProcessesVariational Inference | CodeCode Available | 2 |
| DetGPT: Detect What You Need via Reasoning | May 23, 2023 | Autonomous DrivingObject | CodeCode Available | 2 |
| HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling | Aug 27, 2024 | Domain GeneralizationPrompt Engineering | CodeCode Available | 2 |
| GAIA: a benchmark for General AI Assistants | Nov 21, 2023 | Philosophy | CodeCode Available | 2 |
| WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects | Feb 18, 2025 | Machine Translation | CodeCode Available | 2 |
| Seeing through Satellite Images at Street Views | May 22, 2025 | | CodeCode Available | 2 |
| Large Language Models are In-Context Molecule Learners | Mar 7, 2024 | Cross-Modal RetrievalIn-Context Learning | CodeCode Available | 2 |
| Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models | Dec 19, 2023 | DenoisingNeural Architecture Search | CodeCode Available | 2 |
| Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent | May 12, 2025 | RAGReinforcement Learning (RL) | CodeCode Available | 2 |
| Deduplicating Training Data Mitigates Privacy Risks in Language Models | Feb 14, 2022 | | CodeCode Available | 2 |
| RandAugment: Practical automated data augmentation with a reduced search space | Sep 30, 2019 | Data AugmentationDomain Generalization | CodeCode Available | 2 |
| Mamba-R: Vision Mamba ALSO Needs Registers | May 23, 2024 | MambaSemantic Segmentation | CodeCode Available | 2 |
| The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs) | May 26, 2023 | BenchmarkingBrain Tumor Segmentation | CodeCode Available | 2 |
| Structured Denoising Diffusion Models in Discrete State-Spaces | Jul 7, 2021 | DenoisingText Generation | CodeCode Available | 2 |
| Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models | May 30, 2024 | | CodeCode Available | 2 |
| Neural Responding Machine for Short-Text Conversation | Mar 9, 2015 | DecoderRetrieval | CodeCode Available | 2 |
| Neural Lander: Stable Drone Landing Control using Learned Dynamics | Nov 19, 2018 | | CodeCode Available | 2 |
| Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate | Jan 29, 2025 | Instruction FollowingMath | CodeCode Available | 2 |
| Scaling up Differentially Private Deep Learning with Fast Per-Example Gradient Clipping | Sep 7, 2020 | GPU | CodeCode Available | 2 |
| Interpreting the Latent Space of GANs for Semantic Face Editing | Jul 25, 2019 | AttributeDisentanglement | CodeCode Available | 2 |
| Improving RetinaNet for CT Lesion Detection with Dense Masks from Weak RECIST Labels | Jun 5, 2019 | Computed Tomography (CT)Lesion Detection | CodeCode Available | 2 |
| NeuralUQ: A comprehensive library for uncertainty quantification in neural differential equations and operators | Aug 25, 2022 | Uncertainty Quantification | CodeCode Available | 2 |
| Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Jun 29, 2023 | Audio Synthesis | CodeCode Available | 2 |
| Double Difference Earthquake Location with Graph Neural Networks | Oct 25, 2024 | Graph Neural Network | CodeCode Available | 2 |
| A Library for Representing Python Programs as Graphs for Machine Learning | Aug 15, 2022 | | CodeCode Available | 2 |