| Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Feb 6, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 3 |
| AutoTimes: Autoregressive Time Series Forecasters via Large Language Models | Feb 4, 2024 | DecoderIn-Context Learning | CodeCode Available | 3 |
| In-Context Learning for Extreme Multi-Label Classification | Jan 22, 2024 | ClassificationExtreme Multi-Label Classification | CodeCode Available | 3 |
| Generative Multimodal Models are In-Context Learners | Dec 20, 2023 | In-Context LearningPersonalized Image Generation | CodeCode Available | 3 |
| AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models | Aug 29, 2023 | Anomaly DetectionIn-Context Learning | CodeCode Available | 3 |
| Fine-Tuning Language Models with Just Forward Passes | May 27, 2023 | GPUIn-Context Learning | CodeCode Available | 3 |
| Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision | May 4, 2023 | DiversityIn-Context Learning | CodeCode Available | 3 |
| Large Language Models Are Human-Level Prompt Engineers | Nov 3, 2022 | Few-Shot LearningIn-Context Learning | CodeCode Available | 3 |
| ConTextTab: A Semantics-Aware Tabular In-Context Learner | Jun 12, 2025 | In-Context LearningWorld Knowledge | CodeCode Available | 2 |
| CausalPFN: Amortized Causal Effect Estimation via In-Context Learning | Jun 9, 2025 | Decision MakingHeterogeneous Treatment Effect Estimation | CodeCode Available | 2 |
| Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap | Mar 27, 2025 | Autonomous DrivingIn-Context Learning | CodeCode Available | 2 |
| OntologyRAG: Better and Faster Biomedical Code Mapping with Retrieval-Augmented Generation (RAG) Leveraging Ontology Knowledge Graphs and Large Language Models | Feb 26, 2025 | In-Context LearningKnowledge Graphs | CodeCode Available | 2 |
| NeoBERT: A Next-Generation BERT | Feb 26, 2025 | In-Context LearningMTEB Benchmark | CodeCode Available | 2 |
| JoLT: Joint Probabilistic Predictions on Tabular Data Using LLMs | Feb 17, 2025 | ImputationIn-Context Learning | CodeCode Available | 2 |
| Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Jan 13, 2025 | In-Context LearningRAG | CodeCode Available | 2 |
| X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models | Dec 2, 2024 | Image GenerationIn-Context Learning | CodeCode Available | 2 |
| KV Shifting Attention Enhances Language Modeling | Nov 29, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers | Nov 17, 2024 | In-Context LearningMeta-Learning | CodeCode Available | 2 |
| Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization | Nov 1, 2024 | Computational EfficiencyIn-Context Learning | CodeCode Available | 2 |
| What is Wrong with Perplexity for Long-context Language Modeling? | Oct 31, 2024 | Document SummarizationIn-Context Learning | CodeCode Available | 2 |
| TabDPT: Scaling Tabular Foundation Models | Oct 23, 2024 | In-Context LearningSelf-Supervised Learning | CodeCode Available | 2 |
| A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context Reasoning | Oct 16, 2024 | In-Context LearningKnowledge Graphs | CodeCode Available | 2 |
| Differential Transformer | Oct 7, 2024 | HallucinationIn-Context Learning | CodeCode Available | 2 |
| A Simple Image Segmentation Framework via In-Context Examples | Oct 7, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Small Language Models: Survey, Measurements, and Insights | Sep 24, 2024 | BenchmarkingDecoder | CodeCode Available | 2 |
| A Controlled Study on Long Context Extension and Generalization in LLMs | Sep 18, 2024 | In-Context Learning | CodeCode Available | 2 |
| Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse | Sep 17, 2024 | In-Context LearningRAG | CodeCode Available | 2 |
| xGen-MM (BLIP-3): A Family of Open Large Multimodal Models | Aug 16, 2024 | In-Context Learning | CodeCode Available | 2 |
| Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Aug 7, 2024 | AttributeIn-Context Learning | CodeCode Available | 2 |
| SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions | Jul 16, 2024 | In-Context LearningKnowledge Base Question Answering | CodeCode Available | 2 |
| Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning | Jul 15, 2024 | In-Context Learning | CodeCode Available | 2 |
| SpreadsheetLLM: Encoding Spreadsheets for Large Language Models | Jul 12, 2024 | In-Context LearningTable Detection | CodeCode Available | 2 |
| Just read twice: closing the recall gap for recurrent language models | Jul 7, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| MG-Verilog: Multi-grained Dataset Towards Enhanced LLM-assisted Verilog Generation | Jul 2, 2024 | In-Context Learning | CodeCode Available | 2 |
| Finding Transformer Circuits with Edge Pruning | Jun 24, 2024 | In-Context LearningLanguage Modelling | CodeCode Available | 2 |
| Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study | Jun 20, 2024 | In-Context LearningKnowledge Distillation | CodeCode Available | 2 |
| InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales | Jun 19, 2024 | DenoisingIn-Context Learning | CodeCode Available | 2 |
| In-Context Editing: Learning Knowledge from Self-Induced Distributions | Jun 17, 2024 | Image EditingIn-Context Learning | CodeCode Available | 2 |
| CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation | Jun 15, 2024 | In-Context LearningText Generation | CodeCode Available | 2 |
| Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Jun 3, 2024 | Audio GenerationIn-Context Learning | CodeCode Available | 2 |
| Knowledge Circuits in Pretrained Transformers | May 28, 2024 | In-Context Learningknowledge editing | CodeCode Available | 2 |
| FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction | May 28, 2024 | In-Context LearningPrediction | CodeCode Available | 2 |
| NoteLLM-2: Multimodal Large Representation Models for Recommendation | May 27, 2024 | In-Context Learning | CodeCode Available | 2 |
| Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation | May 24, 2024 | In-Context LearningText to SQL | CodeCode Available | 2 |
| Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Fine-tuned In-Context Learning Transformers are Excellent Tabular Data Classifiers | May 22, 2024 | In-Context Learning | CodeCode Available | 2 |
| Many-Shot In-Context Learning in Multimodal Foundation Models | May 16, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Memory Mosaics | May 10, 2024 | DisentanglementIn-Context Learning | CodeCode Available | 2 |
| Linearizing Large Language Models | May 10, 2024 | In-Context LearningMamba | CodeCode Available | 2 |
| PropertyGPT: LLM-driven Formal Verification of Smart Contracts through Retrieval-Augmented Property Generation | May 4, 2024 | In-Context LearningRetrieval | CodeCode Available | 2 |