| Unseen Attack Detection in Software-Defined Networking Using a BERT-Based Large Language Model | Dec 9, 2024 | feature selectionLanguage Modeling | —Unverified | 0 |
| BatchTopK Sparse Autoencoders | Dec 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance | Dec 9, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| MAVias: Mitigate any Visual Bias | Dec 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Simulating Human-like Daily Activities with Desire-driven Autonomy | Dec 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Dec 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Gated Delta Networks: Improving Mamba2 with Delta Rule | Dec 9, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 4 |
| Pre-trained protein language model for codon optimization | Dec 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model | Dec 8, 2024 | Graph Neural NetworkLanguage Modeling | —Unverified | 0 |
| Enhanced Computationally Efficient Long LoRA Inspired Perceiver Architectures for Auto-Regressive Language Modeling | Dec 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cooperative SQL Generation for Segmented Databases By Using Multi-functional LLM Agents | Dec 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Trust No AI: Prompt Injection Along The CIA Security Triad | Dec 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LVP-CLIP:Revisiting CLIP for Continual Learning with Label Vector Pool | Dec 8, 2024 | Continual LearningIncremental Learning | —Unverified | 0 |
| Confidence Diagram of Nonparametric Ranking for Uncertainty Assessment in Large Language Models Evaluation | Dec 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ULMRec: User-centric Large Language Model for Sequential Recommendation | Dec 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision | Dec 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Text-to-3D Gaussian Splatting with Physics-Grounded Motion Generation | Dec 7, 2024 | 3D GenerationLanguage Modeling | —Unverified | 0 |
| RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts | Dec 7, 2024 | Change DetectionImage Comprehension | CodeCode Available | 1 |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Dec 6, 2024 | counterfactualLanguage Model Evaluation | CodeCode Available | 1 |
| Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System | Dec 6, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| C^2LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation | Dec 6, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 2 |
| Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora | Dec 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PETapter: Leveraging PET-style classification heads for modular few-shot parameter-efficient fine-tuning | Dec 6, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| From Voice to Value: Leveraging AI to Enhance Spoken Online Reviews on the Go | Dec 6, 2024 | AI AgentLanguage Modeling | —Unverified | 0 |
| CigTime: Corrective Instruction Generation Through Inverse Motion Editing | Dec 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation | Dec 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Transformers Can Navigate Mazes With Multi-Step Prediction | Dec 6, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Generative Humanization for Therapeutic Antibodies | Dec 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Smoothie: Label Free Language Model Routing | Dec 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling | Dec 6, 2024 | document understandingHallucination | —Unverified | 0 |
| Adaptive Optimization for Enhanced Efficiency in Large-Scale Language Model Training | Dec 6, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generation | Dec 6, 2024 | Code GenerationCode Translation | —Unverified | 0 |
| Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model | Dec 6, 2024 | EgoSchemaLanguage Modeling | —Unverified | 0 |
| QueEn: A Large Language Model for Quechua-English Translation | Dec 6, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| A Survey of Large Language Model-Based Generative AI for Text-to-SQL: Benchmarks, Applications, Use Cases, and Challenges | Dec 6, 2024 | Domain GeneralizationLanguage Modeling | —Unverified | 0 |
| KaLM: Knowledge-aligned Autoregressive Language Modeling via Dual-view Knowledge Graph Contrastive Learning | Dec 6, 2024 | Contrastive LearningGraph Question Answering | —Unverified | 0 |
| Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference | Dec 6, 2024 | GPULanguage Modeling | —Unverified | 0 |
| LinVT: Empower Your Image-level Large Language Model to Understand Videos | Dec 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Practical Examination of AI-Generated Text Detectors for Large Language Models | Dec 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding Hidden Computations in Chain-of-Thought Reasoning | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Establishing Task Scaling Laws via Compute-Efficient Model Ladders | Dec 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios | Dec 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language Model | Dec 5, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| MISR: Measuring Instrumental Self-Reasoning in Frontier Models | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ALMA: Alignment with Minimal Annotation | Dec 5, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| Aligned Music Notation and Lyrics Transcription | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios | Dec 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Liquid: Language Models are Scalable Multi-modal Generators | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM | Dec 5, 2024 | Image ManipulationLanguage Modeling | —Unverified | 0 |
| A large language model-type architecture for high-dimensional molecular potential energy surfaces | Dec 5, 2024 | Computational chemistryLanguage Modeling | —Unverified | 0 |