| Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions | Jun 9, 2025 | Large Language ModelReinforcement Learning (RL) | CodeCode Available | 2 |
| LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Nov 14, 2024 | Earth ObservationInstruction Following | CodeCode Available | 2 |
| CMMLU: Measuring massive multitask language understanding in Chinese | Jun 15, 2023 | Large Language Model | CodeCode Available | 2 |
| CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling | Sep 28, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| LaVy: Vietnamese Multimodal Large Language Model | Apr 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Libra: Building Decoupled Vision System on Large Language Models | May 16, 2024 | Image to textLanguage Modeling | CodeCode Available | 2 |
| Large Language Model with Region-guided Referring and Grounding for CT Report Generation | Nov 23, 2024 | Computed Tomography (CT)Diagnostic | CodeCode Available | 2 |
| Alignment faking in large language models | Dec 18, 2024 | Large Language Model | CodeCode Available | 2 |
| FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Feb 16, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Large language models can be zero-shot anomaly detectors for time series? | May 23, 2024 | Anomaly DetectionLanguage Modeling | CodeCode Available | 2 |
| Aligning to Thousands of Preferences via System Message Generalization | May 28, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | Dec 19, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| User Behavior Simulation with Large Language Model based Agents | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Large Scale Transfer Learning for Tabular Data via Language Modeling | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| RegMix: Data Mixture as Regression for Language Model Pre-training | Jul 1, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 2 |
| Large Language Model Guided Tree-of-Thought | May 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models | Jan 30, 2024 | Data CompressionLanguage Modelling | CodeCode Available | 2 |
| Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement | May 13, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 2 |
| ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model | Apr 13, 2025 | DiagnosticLanguage Modeling | CodeCode Available | 2 |
| Large Language Model Safety: A Holistic Survey | Dec 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial Attacks | Jan 27, 2024 | Adversarial AttackComputational Efficiency | CodeCode Available | 2 |
| ARAGOG: Advanced RAG Output Grading | Apr 1, 2024 | Document EmbeddingLanguage Modeling | CodeCode Available | 2 |
| FLAME: Financial Large-Language Model Assessment and Metrics Evaluation | Jan 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LifeGPT: Topology-Agnostic Generative Pretrained Transformer Model for Cellular Automata | Sep 3, 2024 | Large Language Model | CodeCode Available | 2 |
| ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical Feedback | Feb 15, 2024 | Computational chemistryGraph Neural Network | CodeCode Available | 2 |
| ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles | May 22, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data | Dec 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Algorithm Evolution Using Large Language Model | Nov 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| AgentSims: An Open-Source Sandbox for Large Language Model Evaluation | Aug 8, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 2 |
| 500xCompressor: Generalized Prompt Compression for Large Language Models | Aug 6, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Feb 13, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks | Jun 4, 2024 | Image CaptioningLanguage Modelling | CodeCode Available | 2 |
| AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms | Feb 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples | Apr 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application | May 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Language Models can Solve Computer Tasks | Mar 30, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes | Aug 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation | Jan 11, 2025 | Chart UnderstandingCode Generation | CodeCode Available | 2 |
| ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG | Feb 13, 2025 | Knowledge GraphsLarge Language Model | CodeCode Available | 2 |
| Generate rather than Retrieve: Large Language Models are Strong Context Generators | Sep 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model | Mar 6, 2025 | General KnowledgeImage Captioning | CodeCode Available | 2 |
| KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion | Feb 4, 2024 | In-Context LearningKnowledge Graph Completion | CodeCode Available | 2 |
| KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction | Mar 12, 2024 | Code GenerationLanguage Modelling | CodeCode Available | 2 |
| SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model | Apr 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning | Sep 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |