| BianCang: A Traditional Chinese Medicine Large Language Model | Nov 17, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 2 |
| Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning | Nov 17, 2024 | Image CaptioningLanguage Modeling | CodeCode Available | 0 |
| Analyzing Pokémon and Mario Streamers' Twitch Chat with LLM-based User Embeddings | Nov 17, 2024 | ClusteringLanguage Modeling | —Unverified | 0 |
| Improving training time and GPU utilization in geo-distributed language model training | Nov 16, 2024 | GPULanguage Modeling | —Unverified | 0 |
| VayuBuddy: an LLM-Powered Chatbot to Democratize Air Quality Insights | Nov 16, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| A Novel Approach to Eliminating Hallucinations in Large Language Model-Assisted Causal Discovery | Nov 16, 2024 | Causal DiscoveryHallucination | —Unverified | 0 |
| GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding | Nov 16, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model | Nov 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines | Nov 16, 2024 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection | Nov 16, 2024 | DiagnosticInstruction Following | CodeCode Available | 0 |
| MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Nov 16, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Large Vision-Language Models for Remote Sensing Visual Question Answering | Nov 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Evolutionary Algorithms for Recommender Systems: Benchmarks and Algorithm Comparisons | Nov 16, 2024 | Evolutionary AlgorithmsLanguage Modeling | —Unverified | 0 |
| Take Package as Language: Anomaly Detection Using Transformer | Nov 15, 2024 | Anomaly DetectionIntrusion Detection | CodeCode Available | 0 |
| Debias your Large Multi-Modal Model at Test-Time with Non-Contrastive Visual Attribute Steering | Nov 15, 2024 | AttributeLanguage Modeling | —Unverified | 0 |
| Chain of Alignment: Integrating Public Will with Expert Intelligence for Language Model Alignment | Nov 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging large language models for efficient representation learning for entity resolution | Nov 15, 2024 | BlockingContrastive Learning | —Unverified | 0 |
| Xmodel-1.5: An 1B-scale Multilingual LLM | Nov 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SlimLM: An Efficient Small Language Model for On-Device Document Assistance | Nov 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization | Nov 15, 2024 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Layer Importance and Hallucination Analysis in Large Language Models via Enhanced Activation Variance-Sparsity | Nov 15, 2024 | Contrastive LearningHallucination | —Unverified | 0 |
| Increasing the Accessibility of Causal Domain Knowledge via Causal Information Extraction Methods: A Case Study in the Semiconductor Manufacturing Industry | Nov 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning | Nov 15, 2024 | Image Quality AssessmentLanguage Modeling | CodeCode Available | 2 |
| Jal Anveshak: Prediction of fishing zones using fine-tuned LlaMa 2 | Nov 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models | Nov 15, 2024 | GPULanguage Modeling | —Unverified | 0 |
| CART: Compositional Auto-Regressive Transformer for Image Generation | Nov 15, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving | Nov 15, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| BabyLM Challenge: Exploring the Effect of Variation Sets on Language Model Training Efficiency | Nov 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MARM: Unlocking the Future of Recommendation Systems through Memory Augmentation and Scalable Complexity | Nov 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adaptive Decoding via Latent Preference Optimization | Nov 14, 2024 | GSM8KInstruction Following | —Unverified | 0 |
| Local deployment of large-scale music AI models on commodity hardware | Nov 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | Nov 14, 2024 | Depth EstimationImage Inpainting | —Unverified | 0 |
| Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment | Nov 14, 2024 | BIRLImitation Learning | —Unverified | 0 |
| Reducing Reasoning Costs: The Path of Optimization for Chain of Thought via Sparse Attention Mechanism | Nov 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MagicQuill: An Intelligent Interactive Image Editing System | Nov 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Nov 14, 2024 | Earth ObservationInstruction Following | CodeCode Available | 2 |
| How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception | Nov 14, 2024 | DeepFake DetectionFace Swapping | —Unverified | 0 |
| On the Limits of Language Generation: Trade-Offs Between Hallucination and Mode Collapse | Nov 14, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution | Nov 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| High fitness paths can connect proteins with low sequence overlap | Nov 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhanced Classroom Dialogue Sequences Analysis with a Hybrid AI Agent: Merging Expert Rule-Base with Large Language Models | Nov 13, 2024 | AI AgentLanguage Modeling | —Unverified | 0 |
| VALTEST: Automated Validation of Language Model Generated Test Cases | Nov 13, 2024 | HumanEvalLanguage Modeling | —Unverified | 0 |
| Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers | Nov 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Theoretical Analysis of Byte-Pair Encoding | Nov 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging LLMs for Predictive Insights in Food Policy and Behavioral Interventions | Nov 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Generation Framework with Strict Constraints for Crystal Materials Design | Nov 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Optimizing a Retrieval Augmented Generation using Large Language Model on Academic Data | Nov 13, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| A System Level Performance Evaluation for Superconducting Digital Systems | Nov 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language-Model Prior Overcomes Cold-Start Items | Nov 13, 2024 | Collaborative FilteringLanguage Modeling | CodeCode Available | 0 |
| Polymetis:Large Language Modeling for Multiple Material Domains | Nov 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |