| Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts | Feb 12, 2024 | Continual PretrainingGSM8K | CodeCode Available | 2 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question AnsweringInstruction Following | CodeCode Available | 2 |
| UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction | Feb 10, 2024 | graph constructionKnowledge Graph Completion | CodeCode Available | 2 |
| On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference | Feb 9, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| ScreenAI: A Vision-Language Model for UI and Infographics Understanding | Feb 7, 2024 | Chart Question AnsweringLanguage Modeling | CodeCode Available | 2 |
| Can Large Language Model Agents Simulate Human Trust Behavior? | Feb 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion | Feb 4, 2024 | In-Context LearningKnowledge Graph Completion | CodeCode Available | 2 |
| GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Jailbreaking Attack against Multimodal Large Language Model | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Towards Efficient Exact Optimization of Language Model Alignment | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement | Jan 31, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 2 |
| EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning | Jan 31, 2024 | AudioCapsAudio captioning | CodeCode Available | 2 |
| LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation | Jan 30, 2024 | HallucinationKnowledge Distillation | CodeCode Available | 2 |
| EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain | Jan 30, 2024 | Image ComprehensionInstruction Following | CodeCode Available | 2 |
| L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial Attacks | Jan 27, 2024 | Adversarial AttackComputational Efficiency | CodeCode Available | 2 |
| Towards 3D Molecule-Text Interpretation in Language Models | Jan 25, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation | Jan 25, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| ChatterBox: Multi-round Multimodal Referring and Grounding | Jan 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| In-Context Language Learning: Architectures and Algorithms | Jan 23, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| DsDm: Model-Aware Dataset Selection with Datamodels | Jan 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text Generation | Jan 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning | Jan 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model | Jan 18, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap | Jan 18, 2024 | Code GenerationEvolutionary Algorithms | CodeCode Available | 2 |
| Spatial-Temporal Large Language Model for Traffic Prediction | Jan 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Graph Language Models | Jan 13, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 2 |
| Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| TechGPT-2.0: A large language model project to solve the task of knowledge graph construction | Jan 9, 2024 | graph constructionLanguage Modeling | CodeCode Available | 2 |
| Malla: Demystifying Real-world Large Language Model Integrated Malicious Services | Jan 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning | Jan 4, 2024 | Data VisualizationDecision Making | CodeCode Available | 2 |
| A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity | Jan 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge | Jan 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining | Dec 29, 2023 | GPULanguage Modeling | CodeCode Available | 2 |
| Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference | Dec 23, 2023 | GPUHigh-Level Synthesis | CodeCode Available | 2 |
| LingoQA: Visual Question Answering for Autonomous Driving | Dec 21, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Dec 4, 2023 | Dense CaptioningHighlight Detection | CodeCode Available | 2 |
| CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation | Nov 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars | Nov 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| YUAN 2.0: A Large Language Model with Localized Filtering-based Attention | Nov 27, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| LLMGA: Multimodal Large Language Model based Generation Assistant | Nov 27, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 2 |
| Algorithm Evolution Using Large Language Model | Nov 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Nov 24, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Controlled Text Generation via Language Model Arithmetic | Nov 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Survey of Graph Meets Large Language Model: Progress and Future Directions | Nov 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Meta Prompting for AI Systems | Nov 20, 2023 | Data InteractionGSM8K | CodeCode Available | 2 |
| HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs | Nov 16, 2023 | Domain AdaptationLanguage Modeling | CodeCode Available | 2 |
| REST: Retrieval-Based Speculative Decoding | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |