| Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving | Jun 4, 2025 | Autonomous DrivingImitation Learning | CodeCode Available | 0 |
| Boundary Aware U-Net for Glacier Segmentation | Jan 26, 2023 | SegmentationSelf-Learning | CodeCode Available | 0 |
| Less Attention is More: Prompt Transformer for Generalized Category Discovery | Jan 1, 2025 | Contrastive LearningSelf-Learning | CodeCode Available | 0 |
| A Deep Q-Learning Agent for the L-Game with Variable Batch Training | Feb 17, 2018 | Q-LearningSelf-Learning | CodeCode Available | 0 |
| Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models | May 15, 2025 | Large Language ModelMath | CodeCode Available | 0 |
| Self-Learning Cloud Controllers: Fuzzy Q-Learning for Knowledge Evolution | Jul 2, 2015 | Q-LearningSelf-Learning | CodeCode Available | 0 |
| Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search | Nov 28, 2022 | AI AgentSelf-Learning | CodeCode Available | 0 |
| KBAlign: Efficient Self Adaptation on Specific Knowledge Bases | Nov 22, 2024 | Question AnsweringRAG | CodeCode Available | 0 |
| How to Control Hydrodynamic Force on Fluidic Pinball via Deep Reinforcement Learning | Apr 23, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| A Conceptual Bio-Inspired Framework for the Evolution of Artificial General Intelligence | Mar 25, 2019 | Self-LearningSelf-Supervised Learning | CodeCode Available | 0 |
| Self-Learning Exploration and Mapping for Mobile Robots via Deep Reinforcement Learning | Jan 6, 2019 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 0 |
| Do We Really Need Fully Unsupervised Cross-Lingual Embeddings? | Sep 4, 2019 | Bilingual Lexicon InductionSelf-Learning | CodeCode Available | 0 |
| Cost-effective Object Detection: Active Sample Mining with Switchable Selection Criteria | Jun 30, 2018 | Active Learningobject-detection | CodeCode Available | 0 |
| GenRadar: Self-supervised Probabilistic Camera Synthesis based on Radar Frequencies | Jul 19, 2021 | Decision MakingSelf-Learning | CodeCode Available | 0 |
| Active Learning for Abstractive Text Summarization | Jan 9, 2023 | Abstractive Text SummarizationActive Learning | CodeCode Available | 0 |
| Reinforcement Learning of Self Enhancing Camera Image and Signal Processing | Nov 15, 2021 | BlockingData Augmentation | CodeCode Available | 0 |
| Self-learning for weakly supervised Gleason grading of local patterns | May 21, 2021 | PrognosisSelf-Learning | CodeCode Available | 0 |
| Sim-Env: Decoupling OpenAI Gym Environments from Simulation Models | Feb 19, 2021 | OpenAI Gymreinforcement-learning | CodeCode Available | 0 |
| General solutions for nonlinear differential equations: a rule-based self-learning approach using deep reinforcement learning | May 13, 2018 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Beneficial and Harmful Explanatory Machine Learning | Sep 9, 2020 | BIG-bench Machine LearningSelf-Learning | CodeCode Available | 0 |
| Domain Adaptation by Class Centroid Matching and Local Manifold Self-Learning | Mar 20, 2020 | Domain AdaptationSelf-Learning | CodeCode Available | 0 |
| A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition | Feb 25, 2025 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 |
| Self-learning Machines based on Hamiltonian Echo Backpropagation | Mar 8, 2021 | Self-Learning | CodeCode Available | 0 |
| SMART: Self-learning Meta-strategy Agent for Reasoning Tasks | Oct 21, 2024 | GSM8KSelf-Learning | CodeCode Available | 0 |
| Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic | Aug 6, 2024 | Multi-Agent Path FindingSelf-Learning | CodeCode Available | 0 |