| CamemBERT 2.0: A Smarter French Language Model Aged to Perfection | Nov 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| TIPO: Text to Image with Text Presampling for Prompt Optimization | Nov 12, 2024 | Image Generation, Language Modeling | Code Available | 2 |
| Retrieval, Reasoning, Re-ranking: A Context-Enriched Framework for Knowledge Graph Completion | Nov 12, 2024 | Knowledge Graph Completion, Language Modeling | Unverified | 0 |
| Likelihood as a Performance Gauge for Retrieval-Augmented Generation | Nov 12, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Towards Low-bit Communication for Tensor Parallel LLM Inference | Nov 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| World Models: The Safety Perspective | Nov 12, 2024 | AI Agent, Language Modeling | Unverified | 0 |
| Contrastive Language Prompting to Ease False Positives in Medical Anomaly Detection | Nov 12, 2024 | Anomaly Detection, Language Modeling | Code Available | 0 |
| Tucano: Advancing Neural Text Generation for Portuguese | Nov 12, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LLM App Squatting and Cloning | Nov 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Language Models as Causal Effect Generators | Nov 12, 2024 | Causal Inference, counterfactual | Code Available | 1 |
| TIPS: Threat Actor Informed Prioritization of Applications using SecEncoder | Nov 12, 2024 | Decoder, Language Modeling | Unverified | 0 |
| JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation | Nov 12, 2024 | Language Modeling, Language Modelling | Code Available | 11 |
| ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization | Nov 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Model Stealing for Any Low-Rank Language Model | Nov 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Training Data for Large Language Model | Nov 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| A Clinical Trial Design Approach to Auditing Language Models in Healthcare Setting | Nov 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPU, Language Modeling | Code Available | 1 |
| Building a Taiwanese Mandarin Spoken Language Model: A First Attempt | Nov 11, 2024 | Decoder, Language Modeling | Code Available | 0 |
| OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model | Nov 11, 2024 | GPU, Language Modeling | Unverified | 0 |
| Automatically Detecting Online Deceptive Patterns in Real-time | Nov 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| The Surprising Effectiveness of Test-Time Training for Few-Shot Learning | Nov 11, 2024 | ARC, Few-Shot Learning | Code Available | 3 |
| Reverse Prompt Engineering | Nov 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations | Nov 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models | Nov 11, 2024 | Attribute, Dialogue Generation | Code Available | 0 |
| Large Language Model in Medical Informatics: Direct Classification and Enhanced Text Representations for Automatic ICD Coding | Nov 11, 2024 | Classification, Code Classification | Unverified | 0 |
| Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training | Nov 11, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Training Neural Networks as Recognizers of Formal Languages | Nov 11, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Towards Characterizing Cyber Networks with Large Language Models | Nov 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| The Super Weight in Large Language Models | Nov 11, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Nov 11, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| What Should Baby Models Read? Exploring Sample-Efficient Data Composition on Model Performance | Nov 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| More Expressive Attention with Negative Weights | Nov 11, 2024 | Decoder, Image Generation | Code Available | 0 |
| A Text Classification Model Combining Adversarial Training with Pre-trained Language Model and neural networks: A Case Study on Telecom Fraud Incident Texts | Nov 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| The Backpropagation of the Wave Network | Nov 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Model Fusion through Bayesian Optimization in Language Model Fine-Tuning | Nov 11, 2024 | Bayesian Optimization, Language Modeling | Code Available | 0 |
| CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models | Nov 11, 2024 | 2D Pose Estimation, Category-Agnostic Pose Estimation | Unverified | 0 |
| CTC-Assisted LLM-Based Contextual ASR | Nov 10, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator | Nov 10, 2024 | GPU, Language Modeling | Unverified | 0 |
| LProtector: An LLM-driven Vulnerability Detection System | Nov 10, 2024 | Binary Classification, Language Modeling | Unverified | 0 |
| Hermes: A Large Language Model Framework on the Journey to Autonomous Networks | Nov 10, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| A Survey of Emerging Approaches and Advances in Video Generation | Nov 9, 2024 | Image to Video Generation, Language Modeling | Unverified | 0 |
| TourSynbio-Search: A Large Language Model Driven Agent Framework for Unified Search Method for Protein Engineering | Nov 9, 2024 | Information Retrieval, Language Modeling | Code Available | 0 |
| BreakGPT: Leveraging Large Language Models for Predicting Asset Price Surges | Nov 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Concept Bottleneck Language Models For protein design | Nov 9, 2024 | Decision Making, Drug Discovery | Code Available | 2 |
| ViTOC: Vision Transformer and Object-aware Captioner | Nov 9, 2024 | Diversity, Image Captioning | Unverified | 0 |
| Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs | Nov 9, 2024 | Classification, Clustering | Unverified | 0 |
| Target-driven Attack for Large Language Models | Nov 9, 2024 | Adversarial Text, Language Modeling | Unverified | 0 |
| Zyda-2: a 5 Trillion Token High-Quality Dataset | Nov 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Aquila-plus: Prompt-Driven Visual-Language Models for Pixel-Level Remote Sensing Image Understanding | Nov 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Aquila: A Hierarchically Aligned Visual-Language Model for Enhanced Remote Sensing Image Comprehension | Nov 9, 2024 | Image Comprehension, Language Modeling | Unverified | 0 |