| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Oct 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Autoregressive Action Sequence Learning for Robotic Manipulation | Oct 4, 2024 | ChunkingLanguage Modeling | CodeCode Available | 2 |
| NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator | Oct 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" | Sep 30, 2024 | counterfactualHallucination | CodeCode Available | 2 |
| LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation | Sep 30, 2024 | AttributeCollaborative Filtering | CodeCode Available | 2 |
| DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data | Sep 30, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning | Sep 30, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Sep 29, 2024 | AllImage Segmentation | CodeCode Available | 2 |
| Control Industrial Automation System with Large Language Model Agents | Sep 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Empirical Asset Pricing with Large Language Model Agents | Sep 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Towards Large-Scale EEG Model | Sep 24, 2024 | EEGElectroencephalogram (EEG) | CodeCode Available | 2 |
| Small Language Models: Survey, Measurements, and Insights | Sep 24, 2024 | BenchmarkingDecoder | CodeCode Available | 2 |
| MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding | Sep 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and Management | Sep 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning | Sep 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| AutoVerus: Automated Proof Generation for Rust Code | Sep 19, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization | Sep 19, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework | Sep 19, 2024 | Autonomous VehiclesDecision Making | CodeCode Available | 2 |
| LLaQo: Towards a Query-Based Coach in Expressive Music Performance Assessment | Sep 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Sep 11, 2024 | Autonomous DrivingFeature Engineering | CodeCode Available | 2 |
| DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks | Sep 10, 2024 | Contrastive LearningImage Reconstruction | CodeCode Available | 2 |
| TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification Tasks | Sep 9, 2024 | ClassificationLanguage Modeling | CodeCode Available | 2 |
| The AdEMAMix Optimizer: Better, Faster, Older | Sep 5, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Language Model Powered Digital Biology with BRAD | Sep 4, 2024 | ChatbotCode Generation | CodeCode Available | 2 |
| Sample-Efficient Diffusion for Text-To-Speech Synthesis | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MemLong: Memory-Augmented Retrieval for Long Text Modeling | Aug 30, 2024 | 4kDecoder | CodeCode Available | 2 |
| Law of Vision Representation in MLLMs | Aug 29, 2024 | cross-modal alignmentLanguage Modeling | CodeCode Available | 2 |
| Efficient LLM Scheduling by Learning to Rank | Aug 28, 2024 | BlockingChatbot | CodeCode Available | 2 |
| LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet | Aug 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents | Aug 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis | Aug 16, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 2 |
| Text2BIM: Generating Building Models Using a Large Language Model-based Multi-Agent Framework | Aug 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area | Aug 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Causal Agent based on Large Language Model | Aug 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP | Aug 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| XMainframe: A Large Language Model for Mainframe Modernization | Aug 5, 2024 | Code SummarizationLanguage Modeling | CodeCode Available | 2 |
| Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning | Aug 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Aug 1, 2024 | ArticlesHallucination | CodeCode Available | 2 |
| DiffArtist: Towards Structure and Appearance Controllable Image Stylization | Jul 22, 2024 | DisentanglementImage Stylization | CodeCode Available | 2 |
| T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation | Jul 19, 2024 | AttributeLanguage Modeling | CodeCode Available | 2 |
| RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering | Jul 19, 2024 | Domain GeneralizationForm | CodeCode Available | 2 |
| Longhorn: State Space Models are Amortized Online Learners | Jul 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Beyond Next Token Prediction: Patch-Level Training for Large Language Models | Jul 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction | Jul 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation | Jul 15, 2024 | Information RetrievalKnowledge Graphs | CodeCode Available | 2 |
| AutoGRAMS: Autonomous Graphical Agent Modeling Software | Jul 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |