| MERLOT: A Distilled LLM-based Mixture-of-Experts Framework for Scalable Encrypted Traffic Classification | Nov 20, 2024 | Decoder, Language Modeling | Unverified | 0 |
| Patience Is The Key to Large Language Model Reasoning | Nov 20, 2024 | GSM8K, Language Modeling | Unverified | 0 |
| Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders | Nov 20, 2024 | compressed sensing, Language Modeling | Unverified | 0 |
| Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning | Nov 20, 2024 | Knowledge Distillation, Large Language Model | Unverified | 0 |
| Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat | Nov 19, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| StreetviewLLM: Extracting Geographic Information Using a Chain-of-Thought Multimodal Large Language Model | Nov 19, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| DIETS: Diabetic Insulin Management System in Everyday Life | Nov 19, 2024 | Large Language Model, Management | Unverified | 0 |
| Med-2E3: A 2D-Enhanced 3D Medical Multimodal Large Language Model | Nov 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting | Nov 19, 2024 | 3D Generation, GPU | Unverified | 0 |
| Strengthening Fake News Detection: Leveraging SVM and Sophisticated Text Vectorization Techniques. Defying BERT? | Nov 19, 2024 | Fake News Detection, Language Modeling | Unverified | 0 |
| CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model | Nov 19, 2024 | Information Retrieval, Language Modeling | Unverified | 0 |
| A Layered Architecture for Developing and Enhancing Capabilities in Large Language Model-based Software Systems | Nov 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction | Nov 19, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| HouseLLM: LLM-Assisted Two-Phase Text-to-Floorplan Generation | Nov 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model for Qualitative Research -- A Systematic Mapping Study | Nov 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ByteScience: Bridging Unstructured Scientific Literature and Structured Data with Auto Fine-tuned Large Language Model in Token Granularity | Nov 18, 2024 | Articles, Language Modeling | Unverified | 0 |
| Does Unlearning Truly Unlearn? A Black Box Evaluation of LLM Unlearning Methods | Nov 18, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | Attribute, Compositional Zero-Shot Learning | Code Available | 1 |
| Topology-aware Preemptive Scheduling for Co-located LLM Workloads | Nov 18, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| OASIS: Open Agent Social Interaction Simulations with One Million Agents | Nov 18, 2024 | Large Language Model, Recommendation Systems | Code Available | 7 |
| TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection | Nov 18, 2024 | Anomaly Detection, Large Language Model | Code Available | 1 |
| Large corpora and large language models: a replicable method for automating grammatical annotation | Nov 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development | Nov 18, 2024 | Instance Segmentation, Large Language Model | Unverified | 0 |
| AddrLLM: Address Rewriting via Large Language Model on Nationwide Logistics Data | Nov 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning | Nov 17, 2024 | Image Captioning, Language Modeling | Code Available | 0 |
| BianCang: A Traditional Chinese Medicine Large Language Model | Nov 17, 2024 | Diagnostic, Language Modeling | Code Available | 2 |
| Analyzing Pokémon and Mario Streamers' Twitch Chat with LLM-based User Embeddings | Nov 17, 2024 | Clustering, Language Modeling | Unverified | 0 |
| FastDraft: How to Train Your Draft | Nov 17, 2024 | Benchmarking, Code Completion | Unverified | 0 |
| VayuBuddy: an LLM-Powered Chatbot to Democratize Air Quality Insights | Nov 16, 2024 | Chatbot, Language Modeling | Unverified | 0 |
| A Novel Approach to Eliminating Hallucinations in Large Language Model-Assisted Causal Discovery | Nov 16, 2024 | Causal Discovery, Hallucination | Unverified | 0 |
| Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model | Nov 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines | Nov 16, 2024 | Chatbot, Language Modeling | Code Available | 0 |
| Leveraging large language models for efficient representation learning for entity resolution | Nov 15, 2024 | Blocking, Contrastive Learning | Unverified | 0 |
| Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving | Nov 15, 2024 | Autonomous Driving, Decision Making | Unverified | 0 |
| Jal Anveshak: Prediction of fishing zones using fine-tuned LlaMa 2 | Nov 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Xmodel-1.5: An 1B-scale Multilingual LLM | Nov 15, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models | Nov 15, 2024 | GPU, Language Modeling | Unverified | 0 |
| Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization | Nov 15, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| Refined and Segmented Price Sentiment Indices from Survey Comments | Nov 15, 2024 | Large Language Model, Survey | Unverified | 0 |
| VMID: A Multimodal Fusion LLM Framework for Detecting and Identifying Misinformation of Short Videos | Nov 15, 2024 | Fake News Detection, Large Language Model | Unverified | 0 |
| Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | Nov 14, 2024 | Depth Estimation, Image Inpainting | Unverified | 0 |
| Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment | Nov 14, 2024 | BIRL, Imitation Learning | Unverified | 0 |
| Squeezed Attention: Accelerating Long Context Length LLM Inference | Nov 14, 2024 | Code Generation, Large Language Model | Code Available | 2 |
| Local deployment of large-scale music AI models on commodity hardware | Nov 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception | Nov 14, 2024 | DeepFake Detection, Face Swapping | Unverified | 0 |
| MagicQuill: An Intelligent Interactive Image Editing System | Nov 14, 2024 | Language Modeling, Language Modelling | Code Available | 7 |
| Reducing Reasoning Costs: The Path of Optimization for Chain of Thought via Sparse Attention Mechanism | Nov 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Nov 14, 2024 | Earth Observation, Instruction Following | Code Available | 2 |
| A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQL | Nov 13, 2024 | Diversity, In-Context Learning | Code Available | 4 |
| Leveraging LLMs for Predictive Insights in Food Policy and Behavioral Interventions | Nov 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |