| MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention | Dec 14, 2023 | Image Segmentation, Lesion Segmentation | Code Available | 2 |
| ResAD: A Simple Framework for Class Generalizable Anomaly Detection | Oct 26, 2024 | Anomaly Detection | Code Available | 2 |
| RENO: Real-Time Neural Compression for 3D LiDAR Point Clouds | Mar 16, 2025 | GPU | Code Available | 2 |
| AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error | Jan 31, 2024 | Denoising | Code Available | 2 |
| A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images | Feb 28, 2023 | 3D Face Reconstruction, Disentanglement | Code Available | 2 |
| Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection | Jun 14, 2024 | Decoder, speech-recognition | Code Available | 2 |
| Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jul 23, 2024 | Image Generation, Point Tracking | Code Available | 2 |
| Preserving Fairness Generalization in Deepfake Detection | Feb 27, 2024 | DeepFake Detection, Disentanglement | Code Available | 2 |
| Merging Context Clustering with Visual State Space Models for Medical Image Segmentation | Jan 3, 2025 | Clustering, Image Segmentation | Code Available | 2 |
| Phantom of Latent for Large Language and Vision Models | Sep 23, 2024 | Visual Question Answering | Code Available | 2 |
| PosSAM: Panoptic Open-vocabulary Segment Anything | Mar 14, 2024 | Decoder, Open Vocabulary Panoptic Segmentation | Code Available | 2 |
| DPO-Shift: Shifting the Distribution of Direct Preference Optimization | Feb 11, 2025 | | Code Available | 2 |
| Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning | Sep 30, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| On the Guidance of Flow Matching | Feb 4, 2025 | Decision Making, Image Generation | Code Available | 2 |
| Quantformer: from attention to profit with a quantitative transformer trading strategy | Mar 30, 2024 | Sentiment Analysis, Transfer Learning | Code Available | 2 |
| Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks | Jan 6, 2025 | Decoder, Quantization | Code Available | 2 |
| Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model | Feb 3, 2022 | Multi-Armed Bandits, Off-policy evaluation | Code Available | 2 |
| Fake News Detection on Social Media: A Data Mining Perspective | Aug 7, 2017 | Fake News Detection | Code Available | 2 |
| FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching | Jan 9, 2025 | Audio Super-Resolution, Computational Efficiency | Code Available | 2 |
| Tests for model misspecification in simulation-based inference: from local distortions to global model checks | Dec 19, 2024 | Anomaly Detection, model | Code Available | 2 |
| Underwater Image Enhancement by Diffusion Model with Customized CLIP-Classifier | May 25, 2024 | Image Enhancement, Image Generation | Code Available | 2 |
| LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement | Jan 31, 2024 | Autonomous Driving, Language Modeling | Code Available | 2 |
| ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction | Dec 2, 2021 | Information Retrieval, Open-Domain Question Answering | Code Available | 2 |
| A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference | Oct 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Non-rigid Point Cloud Registration with Neural Deformation Pyramid | May 25, 2022 | Point Cloud Registration | Code Available | 2 |
| Matting Anything | Jun 8, 2023 | Image Matting, Referring Image Matting | Code Available | 2 |
| Learning What Not to Segment: A New Perspective on Few-Shot Segmentation | Mar 15, 2022 | Few-Shot Semantic Segmentation, Meta-Learning | Code Available | 2 |
| GPT4Point: A Unified Framework for Point-Language Understanding and Generation | Dec 5, 2023 | 3D Generation, Image Generation | Code Available | 2 |
| Retrieval-Augmented Perception: High-Resolution Image Perception Meets Visual RAG | Mar 3, 2025 | RAG, Retrieval | Code Available | 2 |
| Controlling Vision-Language Models for Multi-Task Image Restoration | Oct 2, 2023 | Image Dehazing, Image Denoising | Code Available | 2 |
| ATFNet: Adaptive Time-Frequency Ensembled Network for Long-term Time Series Forecasting | Apr 8, 2024 | Time Series, Time Series Forecasting | Code Available | 2 |
| GFT: Graph Foundation Model with Transferable Tree Vocabulary | Nov 9, 2024 | Drug Discovery, Graph Learning | Code Available | 2 |
| ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Oct 17, 2024 | 3D Semantic Segmentation, Image Generation | Code Available | 2 |
| Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies | Jan 4, 2025 | Edge-computing, Knowledge Distillation | Code Available | 2 |
| AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model | Jun 5, 2025 | Decoder, Image Generation | Code Available | 2 |
| ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning | Jan 4, 2024 | Data Visualization, Decision Making | Code Available | 2 |
| Short-Term Density Forecasting of Low-Voltage Load using Bernstein-Polynomial Normalizing Flows | Apr 29, 2022 | Decision Making, Load Forecasting | Code Available | 2 |
| DiffusionLight: Light Probes for Free by Painting a Chrome Ball | Dec 14, 2023 | Diversity, Lighting Estimation | Code Available | 2 |
| RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors | May 13, 2024 | Adversarial Robustness, Text Detection | Code Available | 2 |
| From Generalist to Specialist: A Survey of Large Language Models for Chemistry | Dec 28, 2024 | scientific discovery, Survey | Code Available | 2 |
| UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation | Dec 14, 2023 | Motion Compensation, Multi-Object Tracking | Code Available | 2 |
| SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks | Jun 13, 2025 | Benchmarking, Large Language Model | Code Available | 2 |
| FlowIE: Efficient Image Enhancement via Rectified Flow | Jun 1, 2024 | Image Enhancement | Code Available | 2 |
| EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images | Feb 21, 2023 | | Code Available | 2 |
| PyGlove: Efficiently Exchanging ML Ideas as Code | Feb 3, 2023 | | Code Available | 2 |
| Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora | Apr 10, 2025 | | Code Available | 2 |
| Detecting Everything in the Open World: Towards Universal Object Detection | Mar 21, 2023 | object-detection, Object Detection | Code Available | 2 |
| Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving | Oct 3, 2023 | Action Generation, Autonomous Driving | Code Available | 2 |
| InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization | Apr 6, 2024 | valid | Code Available | 2 |
| SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation | Apr 6, 2025 | Multi-Object Tracking, Object | Code Available | 2 |