| Large Language Model Critics for Execution-Free Evaluation of Code Changes | Jan 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders | Jan 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Document Screenshot Retrievers are Vulnerable to Pixel Poisoning Attacks | Jan 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling | Jan 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Optimizing Large Language Model Training Using FP4 Quantization | Jan 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Safe AI Clinicians: A Comprehensive Study on Large Language Model Jailbreaking in Healthcare | Jan 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VLMaterial: Procedural Material Generation with Large Vision-Language Models | Jan 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| BiFold: Bimanual Cloth Folding with Language Guidance | Jan 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation | Jan 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction | Jan 27, 2025 | Code GenerationInductive Bias | —Unverified | 0 |
| Challenging Assumptions in Learning Generic Text Style Embeddings | Jan 27, 2025 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages | Jan 27, 2025 | DiversityLanguage Identification | CodeCode Available | 0 |
| PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment | Jan 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Classification Error Bound for Low Bayes Error Conditions in Machine Learning | Jan 27, 2025 | Automatic Speech RecognitionClassification | —Unverified | 0 |
| CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification | Jan 27, 2025 | Generalizable Person Re-identificationLanguage Modeling | CodeCode Available | 0 |
| Integration of LLM Quality Assurance into an NLG System | Jan 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MEL: Legal Spanish Language Model | Jan 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SEAL: Speech Embedding Alignment Learning for Speech Large Language Model with Retrieval-Augmented Generation | Jan 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Complete Chess Games Enable LLM Become A Chess Master | Jan 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Network Threat Detection by Knowledge Graph, Large Language Model, and Imbalanced Learning | Jan 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning | Jan 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer | Jan 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Ocean-OCR: Towards General OCR Application via a Vision-Language Model | Jan 26, 2025 | document understandingLanguage Modeling | CodeCode Available | 1 |
| Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency | Jan 26, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets | Jan 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Grained Patch Training for Efficient LLM-based Recommendation | Jan 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ASRank: Zero-Shot Re-Ranking with Answer Scent for Document Retrieval | Jan 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Intent Understanding for Ambiguous prompt: A Human-Machine Co-Adaption Strategy | Jan 25, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding | Jan 25, 2025 | Action UnderstandingEmotion Recognition | —Unverified | 0 |
| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FADLanguage Modeling | —Unverified | 0 |
| LLM4DistReconfig: A Fine-tuned Large Language Model for Power Distribution Network Reconfiguration | Jan 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education | Jan 24, 2025 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval | Jan 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SelfPrompt: Confidence-Aware Semi-Supervised Tuning for Robust Vision-Language Model Adaptation | Jan 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment | Jan 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Locality-aware Fair Scheduling in LLM Serving | Jan 24, 2025 | FairnessLanguage Modeling | —Unverified | 0 |
| Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game | Jan 24, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing | Jan 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques | Jan 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph | Jan 24, 2025 | Community DetectionHallucination | CodeCode Available | 2 |
| HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Jan 24, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 |
| Humanity's Last Exam | Jan 24, 2025 | Humanity's Last ExamLanguage Modeling | —Unverified | 0 |
| Knowledge Graphs Construction from Criminal Court Appeals: Insights from the French Cassation Court | Jan 24, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| ZETA: Leveraging Z-order Curves for Efficient Top-k Attention | Jan 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CAPRAG: A Large Language Model Solution for Customer Service and Automatic Reporting using Vector and Graph Retrieval-Augmented Generation | Jan 23, 2025 | AI AgentLanguage Modeling | —Unverified | 0 |
| Enhancing Biomedical Relation Extraction with Directionality | Jan 23, 2025 | BenchmarkingDocument-level Relation Extraction | CodeCode Available | 1 |
| OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting | Jan 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Communicating Activations Between Language Model Agents | Jan 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model driven Policy Exploration for Recommender Systems | Jan 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |