| Forecasting Frontier Language Model Agent Capabilities | Feb 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Optimizing Pre-Training Data Mixtures with Mixtures of Data Expert Models | Feb 21, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Enhancing RWKV-based Language Models for Long-Sequence Text Generation | Feb 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ESPnet-SpeechLM: An Open Speech Language Model Toolkit | Feb 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LEDD: Large Language Model-Empowered Data Discovery in Data Lakes | Feb 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations | Feb 21, 2025 | ArticlesFraud Detection | CodeCode Available | 0 |
| PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System | Feb 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training | Feb 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Understand User Opinions of Large Language Models via LLM-Powered In-the-Moment User Experience Interviews | Feb 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Identifying Features that Shape Perceived Consciousness in Large Language Model-based AI: A Quantitative Study of Human Responses | Feb 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models | Feb 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Optimizing Singular Spectrum for Large Language Model Compression | Feb 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Show Me Your Code! Kill Code Poisoning: A Lightweight Method Based on Code Naturalness | Feb 20, 2025 | Backdoor AttackLanguage Modeling | —Unverified | 0 |
| Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation | Feb 20, 2025 | Generative Adversarial NetworkLanguage Modeling | CodeCode Available | 0 |
| TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Feb 20, 2025 | BenchmarkingCode Generation | CodeCode Available | 2 |
| Rapid Word Learning Through Meta In-Context Learning | Feb 20, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| LabTOP: A Unified Model for Lab Test Outcome Prediction on Electronic Health Records | Feb 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| HPS: Hard Preference Sampling for Human Preference Alignment | Feb 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling | Feb 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Prompt-to-Leaderboard | Feb 20, 2025 | ChatbotLanguage Modeling | CodeCode Available | 3 |
| STeCa: Step-level Trajectory Calibration for LLM Agent Learning | Feb 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| SR-LLM: Rethinking the Structured Representation in Large Language Model | Feb 20, 2025 | Abstract Meaning RepresentationLanguage Modeling | —Unverified | 0 |
| Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison | Feb 20, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Feb 20, 2025 | GPULanguage Modeling | CodeCode Available | 0 |
| Megrez-Omni Technical Report | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems | Feb 19, 2025 | Collaborative FilteringConversational Recommendation | CodeCode Available | 1 |
| Slamming: Training a Speech Language Model on One GPU in a Day | Feb 19, 2025 | GPULanguage Modeling | CodeCode Available | 3 |
| A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models | Feb 19, 2025 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval | Feb 19, 2025 | Information RetrievalKnowledge Graphs | —Unverified | 0 |
| TESS 2: A Large-Scale Generalist Diffusion Language Model | Feb 19, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments | Feb 19, 2025 | Event SegmentationLanguage Modeling | —Unverified | 0 |
| Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder | Feb 19, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| UniKnow: A Unified Framework for Reliable Language Model Behavior across Parametric and External Knowledge | Feb 19, 2025 | InformativenessLanguage Modeling | —Unverified | 0 |
| Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning | Feb 19, 2025 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| TALKPLAY: Multimodal Music Recommendation with Large Language Models | Feb 19, 2025 | Conversational RecommendationInstruction Following | —Unverified | 0 |
| What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis | Feb 19, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | Feb 19, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models | Feb 19, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs | Feb 19, 2025 | Data AugmentationGraph Learning | CodeCode Available | 0 |
| LLM should think and action as a human | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reproducing NevIR: Negation in Neural Information Retrieval | Feb 19, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Complex Ontology Matching with Large Language Model Embeddings | Feb 19, 2025 | Graph MatchingLanguage Modeling | —Unverified | 0 |
| Autellix: An Efficient Serving Engine for LLM Agents as General Programs | Feb 19, 2025 | BlockingLanguage Modeling | —Unverified | 0 |
| Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model | Feb 19, 2025 | Drug DiscoveryGeneral Knowledge | —Unverified | 0 |
| AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain Recommendations | Feb 19, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning | Feb 19, 2025 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| Flow-based generative models as iterative algorithms in probability space | Feb 19, 2025 | Anomaly DetectionDensity Estimation | —Unverified | 0 |
| Reflection of Episodes: Learning to Play Game from Expert and Self Experiences | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |