| The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models | Aug 12, 2020 | counterfactual, Sentiment Analysis | Code Available | 2 |
| Neural Combinatorial Optimization Algorithms for Solving Vehicle Routing Problems: A Comprehensive Survey with Perspectives | Jun 1, 2024 | Combinatorial Optimization | Code Available | 2 |
| Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models | Feb 22, 2024 | All, Mixture-of-Experts | Code Available | 2 |
| Jailbreaking Attack against Multimodal Large Language Model | Feb 4, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation | May 29, 2022 | Decoder, Optical Flow Estimation | Code Available | 2 |
| AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation | May 31, 2025 | | Code Available | 2 |
| PromptIR: Prompting for All-in-One Blind Image Restoration | Jun 22, 2023 | All, Blind All-in-One Image Restoration | Code Available | 2 |
| Convolutional Neural Operators for robust and accurate learning of PDEs | Feb 2, 2023 | Operator learning, PDE Surrogate Modeling | Code Available | 2 |
| Grappa -- A Machine Learned Molecular Mechanics Force Field | Mar 25, 2024 | Computational Efficiency | Code Available | 2 |
| The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants | Aug 31, 2023 | Belebele, Cross-Lingual Transfer | Code Available | 2 |
| A Machine Learning Approach That Beats Large Rubik's Cubes | Feb 18, 2025 | Rubik's Cube | Code Available | 2 |
| Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping | Sep 27, 2022 | NeRF, Visual Odometry | Code Available | 2 |
| CAPO: Cost-Aware Prompt Optimization | Apr 22, 2025 | Arithmetic Reasoning, AutoML | Code Available | 2 |
| BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis | Nov 9, 2023 | Face Reenactment, NeRF | Code Available | 2 |
| MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained Monocular Backbones | Jul 26, 2022 | object-detection, Object Detection | Code Available | 2 |
| Artificial Intelligence of Things: A Survey | Oct 25, 2024 | Survey | Code Available | 2 |
| BianCang: A Traditional Chinese Medicine Large Language Model | Nov 17, 2024 | Diagnostic, Language Modeling | Code Available | 2 |
| Fast Dynamic Radiance Fields with Time-Aware Neural Voxels | May 30, 2022 | NeRF | Code Available | 2 |
| Automatically Bounding the Taylor Remainder Series: Tighter Bounds and New Applications | Dec 22, 2022 | global-optimization, Numerical Integration | Code Available | 2 |
| LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations | Dec 11, 2024 | Attribute, Image Generation | Code Available | 2 |
| Fraud Dataset Benchmark and Applications | Aug 30, 2022 | AutoML, Feature Engineering | Code Available | 2 |
| Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Jul 8, 2024 | Action Quality Assessment, Descriptive | Code Available | 2 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Jun 5, 2020 | Common Sense Reasoning, Coreference Resolution | Code Available | 2 |
| Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | May 29, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| Streaming Active Learning with Deep Neural Networks | Mar 5, 2023 | Active Learning, Diversity | Code Available | 2 |
| StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Translation | Feb 24, 2022 | Style Transfer, Translation | Code Available | 2 |
| Frequency-Adaptive Dilated Convolution for Semantic Segmentation | Mar 8, 2024 | object-detection, Object Detection | Code Available | 2 |
| LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features | Feb 12, 2025 | Pose Estimation, Visual Odometry | Code Available | 2 |
| On Meta-Prompting | Dec 11, 2023 | In-Context Learning | Code Available | 2 |
| Reducing Hallucinations in Vision-Language Models via Latent Space Steering | Oct 21, 2024 | Hallucination | Code Available | 2 |
| Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes | Oct 14, 2024 | Motion Generation, Motion Synthesis | Code Available | 2 |
| Mapping Global Floods with 10 Years of Satellite Radar Data | Nov 3, 2024 | Disaster Response | Code Available | 2 |
| EchoTracker: Advancing Myocardial Point Tracking in Echocardiography | May 14, 2024 | Diagnostic, Motion Estimation | Code Available | 2 |
| Getting it Right: Improving Spatial Consistency in Text-to-Image Models | Apr 1, 2024 | Spatial Reasoning | Code Available | 2 |
| Compression Represents Intelligence Linearly | Apr 15, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Image Inversion: A Survey from GANs to Diffusion and Beyond | Feb 17, 2025 | Generative Adversarial Network, Style Transfer | Code Available | 2 |
| CoSER: Coordinating LLM-Based Persona Simulation of Established Roles | Feb 13, 2025 | | Code Available | 2 |
| EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding | Dec 5, 2024 | Prediction, Scene Understanding | Code Available | 2 |
| DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents | Jul 3, 2024 | Image Generation, Molecular Docking | Code Available | 2 |
| PatternRank: Leveraging Pretrained Language Models and Part of Speech for Unsupervised Keyphrase Extraction | Oct 11, 2022 | Keyphrase Extraction | Code Available | 2 |
| Snuffy: Efficient Whole Slide Image Classifier | Aug 15, 2024 | Breast Cancer Detection, Lung Cancer Diagnosis | Code Available | 2 |
| Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models | Nov 12, 2023 | | Code Available | 2 |
| Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning | Jan 7, 2019 | image-classification, Image Classification | Code Available | 2 |
| TreeRL: LLM Reinforcement Learning with On-Policy Tree Search | Jun 13, 2025 | Math, reinforcement-learning | Code Available | 2 |
| CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels | Nov 25, 2022 | image-classification, Image Classification | Code Available | 2 |
| Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders | Oct 28, 2024 | Denoising | Code Available | 2 |
| A User's Guide to KSig: GPU-Accelerated Computation of the Signature Kernel | Jan 13, 2025 | GPU | Code Available | 2 |
| RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering | Feb 26, 2024 | Form, Open-Domain Question Answering | Code Available | 2 |
| SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents | Mar 13, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Towards Localized Fine-Grained Control for Facial Expression Generation | Jul 25, 2024 | Anatomy, Face Generation | Code Available | 2 |