| What is Wrong with Perplexity for Long-context Language Modeling? | Oct 31, 2024 | Document Summarization, In-Context Learning | Code Available | 2 |
| Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts | Feb 12, 2024 | Continual Pretraining, GSM8K | Code Available | 2 |
| Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification | Oct 26, 2020 | Few-Shot Text Classification, General Classification | Code Available | 2 |
| GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model | Jun 3, 2024 | geo-localization, Language Modeling | Code Available | 2 |
| AutoGRAMS: Autonomous Graphical Agent Modeling Software | Jul 14, 2024 | Language Modeling | Code Available | 2 |
| Fine-Grained Human Feedback Gives Better Rewards for Language Model Training | Jun 2, 2023 | Language Modeling | Code Available | 2 |
| AutoFlow: Automated Workflow Generation for Large Language Model Agents | Jul 1, 2024 | AI Agent, Language Modeling | Code Available | 2 |
| Automated Bioinformatics Analysis via AutoBA | Sep 6, 2023 | AI Agent, Language Modeling | Code Available | 2 |
| With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text Generation | Jan 21, 2024 | Language Modeling | Code Available | 2 |
| Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Mar 13, 2024 | Constituency Grammar Induction, Language Modeling | Code Available | 2 |
| Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation | May 22, 2024 | Informativeness, Language Modeling | Code Available | 2 |
| Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Jun 3, 2024 | Audio Generation, In-Context Learning | Code Available | 2 |
| Generative Region-Language Pretraining for Open-Ended Object Detection | Mar 15, 2024 | Language Modeling | Code Available | 2 |
| xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | May 22, 2024 | Language Modeling | Code Available | 2 |
| Generate rather than Retrieve: Large Language Models are Strong Context Generators | Sep 21, 2022 | Language Modeling | Code Available | 2 |
| Generating Benchmarks for Factuality Evaluation of Language Models | Jul 13, 2023 | Language Modeling | Code Available | 2 |
| FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Jun 21, 2024 | Information Retrieval, Language Modeling | Code Available | 2 |
| Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Mar 20, 2025 | Language Modeling | Code Available | 2 |
| A Training-free LLM-based Approach to General Chinese Character Error Correction | Feb 21, 2025 | Language Modeling | Code Available | 2 |
| Generalized Interpolating Discrete Diffusion | Mar 6, 2025 | Language Modeling | Code Available | 2 |
| Generative Modeling for Mathematical Discovery | Mar 14, 2025 | Language Modeling | Code Available | 2 |
| GenSim: A General Social Simulation Platform with Large Language Model based Agents | Oct 6, 2024 | Language Modeling | Code Available | 2 |
| Frontiers in Intelligent Colonoscopy | Oct 22, 2024 | Image Captioning | Code Available | 2 |
| From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples | Apr 11, 2024 | Language Modeling | Code Available | 2 |
| Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities | Mar 6, 2025 | Language Modeling | Code Available | 2 |
| Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Jun 20, 2024 | Autonomous Driving, Language Modeling | Code Available | 2 |
| G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | May 19, 2025 | Language Modeling | Code Available | 2 |
| A Survey of Multimodal Large Language Model from A Data-centric Perspective | May 26, 2024 | Language Modeling | Code Available | 2 |
| Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training | Jan 20, 2025 | Language Modeling | Code Available | 2 |
| A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | May 3, 2024 | Decision Making, Few-Shot Learning | Code Available | 2 |
| Formal Mathematics Statement Curriculum Learning | Feb 3, 2022 | Automated Theorem Proving, Language Modeling | Code Available | 2 |
| A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Mar 7, 2025 | Information Retrieval, Language Modeling | Code Available | 2 |
| AgentReview: Exploring Peer Review Dynamics with LLM Agents | Jun 18, 2024 | Language Modeling | Code Available | 2 |
| AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders | Jan 28, 2025 | Language Modeling | Code Available | 2 |
| A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference | Oct 18, 2024 | Language Modeling | Code Available | 2 |
| Forgetting Transformer: Softmax Attention with a Forget Gate | Mar 3, 2025 | Language Modeling | Code Available | 2 |
| AgentSims: An Open-Source Sandbox for Large Language Model Evaluation | Aug 8, 2023 | Language Model Evaluation, Language Modeling | Code Available | 2 |
| A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models | Jul 24, 2023 | Image Generation, Image-text matching | Code Available | 2 |
| AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms | Feb 26, 2025 | Language Modeling | Code Available | 2 |
| A Touch, Vision, and Language Dataset for Multimodal Alignment | Feb 20, 2024 | Language Modeling | Code Available | 2 |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question Answering, Instruction Following | Code Available | 2 |
| GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Nov 24, 2023 | Instruction Following, Language Modeling | Code Available | 2 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | Attribute, Common Sense Reasoning | Code Available | 2 |
| Implicit Neural Representation for Cooperative Low-light Image Enhancement | Mar 21, 2023 | Image Enhancement, Language Modeling | Code Available | 2 |
| Linear Transformers with Learnable Kernel Functions are Better In-Context Models | Feb 16, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| PhoneLM: an Efficient and Capable Small Language Model Family through Principled Pre-training | Nov 7, 2024 | Language Modeling | Code Available | 2 |
| PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Jan 21, 2025 | Hallucination, Image Captioning | Code Available | 1 |
| A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability | Mar 12, 2023 | Code Generation, Language Modeling | Code Available | 1 |
| FIRE: Fact-checking with Iterative Retrieval and Verification | Oct 17, 2024 | Claim Verification, Fact Checking | Code Available | 1 |
| Masked Structural Growth for 2x Faster Language Model Pre-training | May 4, 2023 | Language Modeling | Code Available | 1 |