ktrain: A Low-Code Library for Augmented Machine Learning Apr 19, 2020 BIG-bench Machine Learning Classification
The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability Jan 28, 2020 BIG-bench Machine Learning Fact Checking
Reformer: The Efficient Transformer Jan 13, 2020 D4RL Image Generation
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Oct 23, 2019 Answer Generation Common Sense Reasoning
A Pilot Study for Chinese SQL Semantic Parsing Sep 29, 2019 Cross-Lingual Word Embeddings Question Answering
OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction Sep 28, 2019 Information Retrieval Question Answering
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations Sep 26, 2019 Common Sense Reasoning GPU
Unified Vision-Language Pre-Training for Image Captioning and VQA Sep 24, 2019 Decoder Image Captioning
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism Sep 17, 2019 GPU LAMBADA
The Replica Dataset: A Digital Replica of Indoor Spaces Jun 13, 2019 3D Scene Reconstruction Instruction Following
Synthetic QA Corpora Generation with Roundtrip Consistency Jun 12, 2019 Question Answering Question Generation
Document Expansion by Query Prediction Apr 17, 2019 Passage Re-Ranking Prediction
Habitat: A Platform for Embodied AI Research Apr 2, 2019 Benchmarking GPU
Knowledge Representation Learning: A Quantitative Review Dec 28, 2018 General Classification Information Retrieval
Training RNNs as Fast as CNNs Jan 1, 2018 General Classification Language Modeling
Simple Recurrent Units for Highly Parallelizable Recurrence Sep 8, 2017 General Classification Machine Translation
Tracking the World State with Recurrent Entity Networks Dec 12, 2016 Procedural Text Understanding Question Answering
Dialogue Learning With Human-In-The-Loop Nov 29, 2016 Question Answering reinforcement-learning
End-To-End Memory Networks Mar 31, 2015 Language Modeling Language Modelling
Describe Anything Model for Visual Question Answering on Text-rich Images Jul 16, 2025 Descriptive Language Modeling
Warehouse Spatial Question Answering with LLM Agent Jul 14, 2025 Question Answering Spatial Reasoning
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder Jun 28, 2025 Image Segmentation Large Language Model
Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective Jun 22, 2025 In-Context Learning Large Language Model
SeqPE: Transformer with Sequential Position Encoding Jun 16, 2025 image-classification Image Classification
SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative Refinement Jun 16, 2025 document understanding Question Answering
Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification Jun 8, 2025 Question Answering Visual Question Answering
ECoRAG: Evidentiality-guided Compression for Long Context RAG Jun 5, 2025 Answer Generation Open-Domain Question Answering
OThink-R1: Intrinsic Fast/Slow Thinking Mode Switching for Over-Reasoning Mitigation Jun 3, 2025 Question Answering
PAKTON: A Multi-Agent Framework for Question Answering in Long Legal Agreements May 31, 2025 Privacy Preserving Question Answering
VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software May 30, 2025 Question Answering Spatial Reasoning
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning May 29, 2025 Diagnostic Question Answering
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint May 29, 2025 Image Captioning Question Answering
Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration May 27, 2025 Multi-hop Question Answering Question Answering
MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding May 26, 2025 Question Answering Visual Question Answering
KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing May 26, 2025 Knowledge Tracing Multi-hop Question Answering
NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering May 26, 2025 Chunking Large Language Model
MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents May 26, 2025 Benchmarking Minecraft
Visualized Text-to-Image Retrieval May 26, 2025 Image Retrieval Question Answering
Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering May 25, 2025 Anatomy Benchmarking
SATORI-R1: Incentivizing Multimodal Reasoning with Spatial Grounding and Verifiable Rewards May 25, 2025 Image Captioning Multimodal Reasoning
VEAttack: Downstream-agnostic Vision Encoder Attack against Large Vision Language Models May 23, 2025 Question Answering Visual Question Answering
MetaGen Blended RAG: Higher Accuracy for Domain-Specific Q&A Without Fine-Tuning May 23, 2025 Few-Shot Learning Question Answering
Benchmarking Retrieval-Augmented Multimomal Generation for Document Question Answering May 22, 2025 Benchmarking Evidence Selection
Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression May 22, 2025 Hallucination Image Description
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning May 22, 2025 Form Question Answering
O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering May 22, 2025 Answer Generation Open-Ended Question Answering
HopWeaver: Synthesizing Authentic Multi-Hop Questions Across Text Corpora May 21, 2025 Multi-hop Question Answering Question Answering
The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation May 21, 2025 Answer Generation In-Context Learning
From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning May 21, 2025 Question Answering Reinforcement Learning (RL)
Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues? May 19, 2025 Logical Reasoning Optical Character Recognition