| VideoLLM-online: Online Video Large Language Model for Streaming Video | Jun 17, 2024 | GPU, Language Modeling | Unverified | 0 |
| Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression | Jun 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft | Jun 17, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels | Jun 17, 2024 | Dataset Generation, Information Retrieval | Unverified | 0 |
| Reframing linguistic bootstrapping as joint inference using visually-grounded grammar induction models | Jun 17, 2024 | Language Acquisition, Language Modeling | Code Available | 0 |
| CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| HARE: HumAn pRiors, a key to small language model Efficiency | Jun 17, 2024 | Diversity, Language Modeling | Unverified | 0 |
| Large Language Models and Knowledge Graphs for Astronomical Entity Disambiguation | Jun 17, 2024 | Clustering, Entity Disambiguation | Unverified | 0 |
| LiLiuM: eBay's Large Language Models for e-commerce | Jun 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Jun 17, 2024 | Code Generation, Code Search | Code Available | 0 |
| Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language Models | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Generative Visual Instruction Tuning | Jun 17, 2024 | Image Generation, Image-text matching | Code Available | 0 |
| A General Framework for Load Forecasting based on Pre-trained Large Language Model | Jun 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection | Jun 17, 2024 | Fake News Detection, Language Modeling | Code Available | 0 |
| SLEGO: A Collaborative Data Analytics System with LLM Recommender for Diverse Users | Jun 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| A Personalised Learning Tool for Physics Undergraduate Students Built On a Large Language Model for Symbolic Regression | Jun 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Avoiding Copyright Infringement via Large Language Model Unlearning | Jun 16, 2024 | General Knowledge, Language Modeling | Code Available | 0 |
| Large Language Models for Dysfluency Detection in Stuttered Speech | Jun 16, 2024 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics | Jun 16, 2024 | Classification, Informativeness | Code Available | 0 |
| Optimization of Armv9 architecture general large language model inference performance based on Llama.cpp | Jun 16, 2024 | Compiler Optimization, Language Modeling | Code Available | 0 |
| Logit Separability-Driven Samples and Multiple Class-Related Words Selection for Advancing In-Context Learning | Jun 16, 2024 | In-Context Learning, Language Modeling | Code Available | 0 |
| Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens | Jun 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning | Jun 16, 2024 | knowledge editing, Language Modeling | Code Available | 0 |
| ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation | Jun 16, 2024 | Continual Learning, GSM8K | Code Available | 0 |
| MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data | Jun 15, 2024 | Generative Adversarial Network, Language Modeling | Unverified | 0 |
| VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It | Jun 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | Jun 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Reactor Mk.1 performances: MMLU, HumanEval and BBH test results | Jun 15, 2024 | Benchmarking, HumanEval | Unverified | 0 |
| CancerLLM: A Large Language Model in Cancer Domain | Jun 15, 2024 | GPU, Language Modeling | Unverified | 0 |
| Large Language Model Enhanced Clustering for News Event Detection | Jun 15, 2024 | Clustering, Event Detection | Unverified | 0 |
| A Probability–Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Datasets for Multilingual Answer Sentence Selection | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Group and Shuffle: Efficient Structured Orthogonal Parametrization | Jun 14, 2024 | Computational Efficiency, Language Modeling | Code Available | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | Jun 14, 2024 | CPU, Language Modeling | Unverified | 0 |
| Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation | Jun 14, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| 3D-RPE: Enhancing Long-Context Modeling Through 3D Rotary Position Encoding | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| PARSE-Ego4D: Personal Action Recommendation Suggestions for Egocentric Videos | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | Jun 14, 2024 | Image Quality Assessment, Language Modeling | Unverified | 0 |
| OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst | Jun 14, 2024 | Image Captioning, Language Modeling | Unverified | 0 |
| Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting | Jun 14, 2024 | Dialogue Generation, Form | Code Available | 0 |
| OpenECAD: An Efficient Visual Language Model for Editable 3D-CAD Design | Jun 14, 2024 | 3D Object Reconstruction, Language Modeling | Unverified | 0 |
| TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages | Jun 14, 2024 | Cross-Lingual Transfer, Language Modeling | Code Available | 0 |
| Unlearning with Control: Assessing Real-world Utility for Large Language Model Unlearning | Jun 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Multi-Modal Retrieval For Large Language Model Based Speech Recognition | Jun 13, 2024 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models | Jun 13, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Transformers meet Neural Algorithmic Reasoners | Jun 13, 2024 | Graph Neural Network, Language Modeling | Unverified | 0 |
| Autonomous Multi-Objective Optimization Using Large Language Model | Jun 13, 2024 | Evolutionary Algorithms, Language Modeling | Unverified | 0 |
| RH-SQL: Refined Schema and Hardness Prompt for Text-to-SQL | Jun 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |