| V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Dec 21, 2023 | Visual Question AnsweringWorld Knowledge | CodeCode Available | 2 | 5 |
| SimPhony: A Device-Circuit-Architecture Cross-Layer Modeling and Simulation Framework for Heterogeneous Electronic-Photonic AI System | Nov 20, 2024 | | CodeCode Available | 2 | 5 |
| Abstractive Summarization of Spoken andWritten Instructions with BERT | Aug 21, 2020 | Abstractive Text SummarizationArticles | CodeCode Available | 2 | 5 |
| Controlling Length in Image Captioning | May 29, 2020 | Image Captioning | CodeCode Available | 2 | 5 |
| An Inverse Scaling Law for CLIP Training | May 11, 2023 | | CodeCode Available | 2 | 5 |
| Recipro-CAM: Fast gradient-free visual explanations for convolutional neural networks | Sep 28, 2022 | Explainable Artificial Intelligence (XAI) | CodeCode Available | 2 | 5 |
| Focal Loss for Dense Object Detection | Aug 7, 2017 | 2D Object DetectionDense Object Detection | CodeCode Available | 2 | 5 |
| A Synthetic Dataset for Personal Attribute Inference | Jun 11, 2024 | AttributeAuthor Profiling | CodeCode Available | 2 | 5 |
| Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video | Jan 24, 2025 | 3D ReconstructionBenchmarking | CodeCode Available | 2 | 5 |
| Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Mar 28, 2025 | DescriptiveImage Quality Assessment | CodeCode Available | 2 | 5 |
| FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models | Jan 1, 2024 | DecoderDenoising | CodeCode Available | 2 | 5 |
| ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning | Jul 30, 2024 | ARCreinforcement-learning | CodeCode Available | 2 | 5 |
| Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes | Jan 28, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 2 | 5 |
| Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection | Sep 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at Scale | Jul 17, 2024 | GPULAMBADA | CodeCode Available | 2 | 5 |
| LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Oct 23, 2024 | ChunkingQuestion Answering | CodeCode Available | 2 | 5 |
| Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Mar 25, 2021 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Self-Supervised Contrastive Pre-Training For Time Series via Time-Frequency Consistency | Jun 17, 2022 | Activity RecognitionDomain Adaptation | CodeCode Available | 2 | 5 |
| FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting | Feb 26, 2025 | Model SelectionTime Series | CodeCode Available | 2 | 5 |
| HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection | Dec 16, 2024 | 3D Object Detection3D Object Detection on View-of-Delft (val) | CodeCode Available | 2 | 5 |
| ABodyBuilder3: Improved and scalable antibody structure predictions | May 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| TrustRAG: Enhancing Robustness and Trustworthiness in RAG | Jan 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Scaling Language-Image Pre-training via Masking | Dec 1, 2022 | Diversity | CodeCode Available | 2 | 5 |
| LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology | Feb 6, 2024 | AllBenchmarking | CodeCode Available | 2 | 5 |
| TODS: An Automated Time Series Outlier Detection System | Sep 18, 2020 | Outlier DetectionTime Series | CodeCode Available | 2 | 5 |
| LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error | Mar 7, 2024 | Continual LearningIn-Context Learning | CodeCode Available | 2 | 5 |
| A Survey of Machine Unlearning | Sep 6, 2022 | AttributeMachine Unlearning | CodeCode Available | 2 | 5 |
| Dynamic Spatial Propagation Network for Depth Completion | Feb 20, 2022 | Depth Completion | CodeCode Available | 2 | 5 |
| OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Perception Test: A Diagnostic Benchmark for Multimodal Video Models | May 23, 2023 | DiagnosticGrounded Video Question Answering | CodeCode Available | 2 | 5 |
| RITA: a Study on Scaling Up Generative Protein Sequence Models | May 11, 2022 | PredictionProtein Design | CodeCode Available | 2 | 5 |
| Multi-target stain normalization for histology slides | Jun 4, 2024 | | CodeCode Available | 2 | 5 |
| MedS^3: Towards Medical Small Language Models with Self-Evolved Slow Thinking | Jan 21, 2025 | Multiple-choice | CodeCode Available | 2 | 5 |
| Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline | Mar 9, 2024 | Object TrackingRgb-T Tracking | CodeCode Available | 2 | 5 |
| ChaCha for Online AutoML | Jun 9, 2021 | AutoMLScheduling | CodeCode Available | 2 | 5 |
| Graph-based Topology Reasoning for Driving Scenes | Apr 11, 2023 | 3D Lane DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| SegFix: Model-Agnostic Boundary Refinement for Segmentation | Jul 8, 2020 | modelSegmentation | CodeCode Available | 2 | 5 |
| Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction | May 31, 2022 | Surface Reconstruction | CodeCode Available | 2 | 5 |
| TrafficGPT: An LLM Approach for Open-Set Encrypted Traffic Classification | Aug 6, 2024 | Traffic Classification | CodeCode Available | 2 | 5 |
| Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models | Feb 5, 2024 | Data AugmentationData Poisoning | CodeCode Available | 2 | 5 |
| DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models | Sep 28, 2023 | 10-shot image generation1 Image, 2*2 Stitchi | CodeCode Available | 2 | 5 |
| Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data | Jul 11, 2024 | Autonomous NavigationPrediction | CodeCode Available | 2 | 5 |
| Probability density estimation for sets of large graphs with respect to spectral information using stochastic block models | Jul 5, 2022 | Density Estimation | CodeCode Available | 2 | 5 |
| MTAD: Tools and Benchmarks for Multivariate Time Series Anomaly Detection | Jan 10, 2024 | Anomaly DetectionTime Series | CodeCode Available | 2 | 5 |
| One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts | Dec 28, 2023 | AllAnatomy | CodeCode Available | 2 | 5 |
| OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation | Apr 14, 2025 | 3D Shape Generation | CodeCode Available | 2 | 5 |
| LeanDojo: Theorem Proving with Retrieval-Augmented Language Models | Jun 27, 2023 | Automated Theorem ProvingGPU | CodeCode Available | 2 | 5 |
| LongVLM: Efficient Long Video Understanding via Large Language Models | Apr 4, 2024 | Question AnsweringVideo Question Answering | CodeCode Available | 2 | 5 |
| Geometry-Informed Neural Networks | Feb 21, 2024 | Diversity | CodeCode Available | 2 | 5 |
| MOROCCO: Model Resource Comparison Framework | Nov 16, 2021 | Computational Efficiencymodel | CodeCode Available | 2 | 5 |