| FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence | Jan 21, 2020 | Image ClassificationPseudo Label | CodeCode Available | 2 |
| Mask-Free Video Instance Segmentation | Mar 28, 2023 | Instance SegmentationOptical Flow Estimation | CodeCode Available | 2 |
| Advanced Unstructured Data Processing for ESG Reports: A Methodology for Structured Transformation and Enhanced Analysis | Jan 4, 2024 | | CodeCode Available | 2 |
| AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning | Apr 13, 2024 | Few-Shot Learning | CodeCode Available | 2 |
| ARF: Artistic Radiance Fields | Jun 13, 2022 | | CodeCode Available | 2 |
| A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution | Apr 24, 2024 | Blind Super-ResolutionImage Restoration | CodeCode Available | 2 |
| Bolt: Accelerated Data Mining with Fast Vector Compression | Jun 30, 2017 | Quantization | CodeCode Available | 2 |
| Learning Dense Representations of Phrases at Scale | Dec 23, 2020 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D | Aug 13, 2020 | Autonomous VehiclesBird's-Eye View Semantic Segmentation | CodeCode Available | 2 |
| GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis | Nov 5, 2024 | Code Generation | CodeCode Available | 2 |
| FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting | Dec 1, 2023 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis | Oct 8, 2024 | Autonomous DrivingContrastive Learning | CodeCode Available | 2 |
| Simple Guidance Mechanisms for Discrete Diffusion Models | Dec 13, 2024 | Image Generation | CodeCode Available | 2 |
| One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Sep 29, 2024 | AllImage Segmentation | CodeCode Available | 2 |
| Geoopt: Riemannian Optimization in PyTorch | May 6, 2020 | Riemannian optimization | CodeCode Available | 2 |
| RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models | Apr 17, 2024 | Graph Neural Network | CodeCode Available | 2 |
| Visual Generation Without Guidance | Jan 26, 2025 | Diversity | CodeCode Available | 2 |
| A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence | May 24, 2023 | Dense Pixel Correspondence EstimationRepresentation Learning | CodeCode Available | 2 |
| Fast convolutional neural networks on FPGAs with hls4ml | Jan 13, 2021 | Model CompressionQuantization | CodeCode Available | 2 |
| Conformal prediction under ambiguous ground truth | Jul 18, 2023 | Conformal PredictionPrediction | CodeCode Available | 2 |
| Torsional Diffusion for Molecular Conformer Generation | Jun 1, 2022 | BIG-bench Machine LearningComputational chemistry | CodeCode Available | 2 |
| Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging | Apr 30, 2024 | Pose Estimation | CodeCode Available | 2 |
| TSFEL: Time Series Feature Extraction Library | Mar 21, 2020 | Feature EngineeringTime Series | CodeCode Available | 2 |
| BEACON: Benchmark for Comprehensive RNA Tasks and Language Models | Jun 14, 2024 | Language Modelling | CodeCode Available | 2 |
| MathPile: A Billion-Token-Scale Pretraining Corpus for Math | Dec 28, 2023 | Language IdentificationMath | CodeCode Available | 2 |
| Fast R-CNN | Apr 30, 2015 | ObjectObject Detection | CodeCode Available | 2 |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Sep 26, 2019 | Common Sense ReasoningGPU | CodeCode Available | 2 |
| Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment | May 27, 2025 | Adversarial AttackClustering | CodeCode Available | 2 |
| TableBank: Table Benchmark for Image-based Table Detection and Recognition | May 1, 2020 | Table Detection | CodeCode Available | 2 |
| Towards Garment Sewing Pattern Reconstruction from a Single Image | Nov 7, 2023 | Garment ReconstructionTexture Synthesis | CodeCode Available | 2 |
| Deep TEN: Texture Encoding Network | Dec 8, 2016 | Dictionary LearningMaterial Recognition | CodeCode Available | 2 |
| 3D Human Mesh Estimation from Virtual Markers | Mar 21, 2023 | 3D Human Pose Estimation3D Pose Estimation | CodeCode Available | 2 |
| TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing | Feb 28, 2020 | Knowledge DistillationReading Comprehension | CodeCode Available | 2 |
| DensePose From WiFi | Dec 31, 2022 | 3D Human Pose EstimationBody Detection | CodeCode Available | 2 |
| MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction | Apr 1, 2024 | DecoderOnline Vectorized HD Map Construction | CodeCode Available | 2 |
| Stand-Alone Self-Attention in Vision Models | Jun 13, 2019 | object-detectionObject Detection | CodeCode Available | 2 |
| FASTER: Fast and Safe Trajectory Planner for Navigation in Unknown Environments | Jan 9, 2020 | Motion PlanningTrajectory Planning | CodeCode Available | 2 |
| CLUENER2020: Fine-grained Named Entity Recognition Dataset and Benchmark for Chinese | Jan 13, 2020 | Chinese Named Entity Recognitionnamed-entity-recognition | CodeCode Available | 2 |
| Data-Free Learning of Student Networks | Apr 2, 2019 | Neural Network Compression | CodeCode Available | 2 |
| The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models | Apr 24, 2024 | DiversityNavigate | CodeCode Available | 2 |
| Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs | Jun 28, 2024 | Code GenerationCode Translation | CodeCode Available | 2 |
| PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System | Sep 7, 2021 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 2 |
| Binary Neural Networks: A Survey | Mar 31, 2020 | Binarizationimage-classification | CodeCode Available | 2 |
| Is Space-Time Attention All You Need for Video Understanding? | Feb 9, 2021 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models | Apr 7, 2025 | Dialogue EvaluationFairness | CodeCode Available | 2 |
| Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement | Oct 6, 2024 | Mathematical ReasoningMeta-Learning | CodeCode Available | 2 |
| rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch | Sep 3, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 2 |
| Training Graph Neural Networks with 1000 Layers | Jun 14, 2021 | GPUGraph Sampling | CodeCode Available | 2 |
| Construction of a Japanese Financial Benchmark for Large Language Models | Mar 22, 2024 | | CodeCode Available | 2 |
| JAX MD: A Framework for Differentiable Physics | Dec 1, 2020 | GPU | CodeCode Available | 2 |