| G3Reg: Pyramid Graph-based Global Registration using Gaussian Ellipsoid Model | Aug 22, 2023 | | CodeCode Available | 2 |
| Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification | Oct 26, 2020 | Few-Shot Text ClassificationGeneral Classification | CodeCode Available | 2 |
| U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation | Jun 5, 2024 | Image SegmentationKolmogorov-Arnold Networks | CodeCode Available | 2 |
| YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition | Aug 5, 2024 | Action Detection | CodeCode Available | 2 |
| ConceptNet 5.5: An Open Multilingual Graph of General Knowledge | Dec 12, 2016 | General KnowledgeWord Embeddings | CodeCode Available | 2 |
| Efficient One-Pass End-to-End Entity Linking for Questions | Oct 6, 2020 | CPUEntity Linking | CodeCode Available | 2 |
| On the Emergence of Thinking in LLMs I: Searching for the Right Intuition | Feb 10, 2025 | Math | CodeCode Available | 2 |
| 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions | Apr 7, 2024 | 3D Reconstruction | CodeCode Available | 2 |
| SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry | Jul 5, 2024 | Benchmarkingobject-detection | CodeCode Available | 2 |
| A Better Variant of Self-Critical Sequence Training | Mar 22, 2020 | Image Captioning | CodeCode Available | 2 |
| DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation | May 24, 2024 | 3D ReconstructionCamera Calibration | CodeCode Available | 2 |
| Pedagogical Alignment of Large Language Models | Feb 7, 2024 | Synthetic Data Generation | CodeCode Available | 2 |
| Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet | Jan 28, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting | Jan 21, 2024 | 3D Reconstruction | CodeCode Available | 2 |
| Preble: Efficient Distributed Prompt Scheduling for LLM Serving | May 8, 2024 | GPUScheduling | CodeCode Available | 2 |
| Deep Learning-based Compression Detection for explainable Face Image Quality Assessment | Jan 7, 2025 | Face Image QualityFace Image Quality Assessment | CodeCode Available | 2 |
| Debiasing Multimodal Large Language Models | Mar 8, 2024 | FairnessQuestion Answering | CodeCode Available | 2 |
| Representation Learning and Identity Adversarial Training for Facial Behavior Understanding | Jul 15, 2024 | Facial Action Unit DetectionFacial Expression Recognition (FER) | CodeCode Available | 2 |
| GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment | Oct 17, 2023 | AttributeObject | CodeCode Available | 2 |
| Streaming Anomaly Detection | Jan 30, 2023 | Anomaly DetectionDenoising | CodeCode Available | 2 |
| FP8-LM: Training FP8 Large Language Models | Oct 27, 2023 | GPU | CodeCode Available | 2 |
| Causal structure learning with momentum: Sampling distributions over Markov Equivalence Classes of DAGs | Oct 9, 2023 | Causal DiscoveryGraph Sampling | CodeCode Available | 2 |
| A New Era in Software Security: Towards Self-Healing Software via Large Language Models and Formal Verification | May 24, 2023 | C++ codeMathematical Proofs | CodeCode Available | 2 |
| ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents | Jun 28, 2024 | | CodeCode Available | 2 |
| Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention | Feb 19, 2025 | image-classificationImage Classification | CodeCode Available | 2 |
| Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency | Jun 2, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation | Jul 19, 2024 | Data AugmentationDepth Estimation | CodeCode Available | 2 |
| Dual Aggregation Transformer for Image Super-Resolution | Aug 7, 2023 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| Zooming Out on Zooming In: Advancing Super-Resolution for Remote Sensing | Nov 29, 2023 | Super-Resolution | CodeCode Available | 2 |
| Curiosity-driven Red-teaming for Large Language Models | Feb 29, 2024 | Red TeamingReinforcement Learning (RL) | CodeCode Available | 2 |
| Gradient Alignment for Cross-Domain Face Anti-Spoofing | Feb 29, 2024 | Domain GeneralizationFace Anti-Spoofing | CodeCode Available | 2 |
| MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage | Mar 15, 2024 | Music Transcription | CodeCode Available | 2 |
| Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving | Mar 26, 2024 | Adversarial AttackAutonomous Driving | CodeCode Available | 2 |
| Event-based Stereo Depth Estimation: A Survey | Sep 26, 2024 | Depth EstimationNavigate | CodeCode Available | 2 |
| Aesthetic Text Logo Synthesis via Content-aware Layout Inferring | Apr 6, 2022 | Layout DesignLayout Generation | CodeCode Available | 2 |
| Neural 3D Scene Reconstruction with the Manhattan-world Assumption | May 5, 2022 | 2D Semantic Segmentation3D Reconstruction | CodeCode Available | 2 |
| Text2Performer: Text-Driven Human Video Generation | Apr 17, 2023 | Video Generation | CodeCode Available | 2 |
| Large language models can be zero-shot anomaly detectors for time series? | May 23, 2024 | Anomaly DetectionLanguage Modeling | CodeCode Available | 2 |
| JADE: A Linguistics-based Safety Evaluation Platform for Large Language Models | Nov 1, 2023 | Natural Questions | CodeCode Available | 2 |
| Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior | Jan 17, 2024 | 3D GenerationText to 3D | CodeCode Available | 2 |
| Koopa: Learning Non-stationary Time Series Dynamics with Koopman Predictors | May 30, 2023 | Time Series | CodeCode Available | 2 |
| StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing | Feb 20, 2024 | Voice Cloning | CodeCode Available | 2 |
| Learning to Solve Job Shop Scheduling under Uncertainty | Mar 4, 2024 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 2 |
| MatchTime: Towards Automatic Soccer Game Commentary Generation | Jun 26, 2024 | | CodeCode Available | 2 |
| LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages | Jul 8, 2024 | Data AugmentationTranslation | CodeCode Available | 2 |
| Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems | Jul 23, 2024 | ColorizationDeblurring | CodeCode Available | 2 |
| u-μP: The Unit-Scaled Maximal Update Parametrization | Jul 24, 2024 | | CodeCode Available | 2 |
| StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models | Sep 4, 2024 | DenoisingImage Generation | CodeCode Available | 2 |
| Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Aug 27, 2024 | Autonomous DrivingNeural Rendering | CodeCode Available | 2 |
| Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation | Sep 2, 2024 | GPU | CodeCode Available | 2 |