| LLMs Can Simulate Standardized Patients via Agent Coevolution | Dec 16, 2024 | Diagnostic, Language Modeling | Code Available | 1 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model | Dec 13, 2024 | Autonomous Driving, Decision Making | Code Available | 1 |
| SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations | Dec 12, 2024 | Fairness, Language Modeling | Code Available | 1 |
| NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis | Dec 11, 2024 | Continual Pretraining, Language Modeling | Code Available | 1 |
| Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training | Dec 11, 2024 | Language Model Evaluation, Language Modeling | Code Available | 1 |
| Concept Bottleneck Large Language Models | Dec 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences | Dec 10, 2024 | Bayesian Optimization, Language Modeling | Code Available | 1 |
| LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Dec 9, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts | Dec 7, 2024 | Change Detection, Image Comprehension | Code Available | 1 |
| Transformers Can Navigate Mazes With Multi-Step Prediction | Dec 6, 2024 | GPU, Language Modeling | Code Available | 1 |
| Smoothie: Label Free Language Model Routing | Dec 6, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Dec 6, 2024 | counterfactual, Language Model Evaluation | Code Available | 1 |
| MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language Model | Dec 5, 2024 | Attribute, Language Modeling | Code Available | 1 |
| MISR: Measuring Instrumental Self-Reasoning in Frontier Models | Dec 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Composed Image Retrieval for Training-Free Domain Conversion | Dec 4, 2024 | Image Retrieval, Language Modeling | Code Available | 1 |
| Evaluating Language Models as Synthetic Data Generators | Dec 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension | Dec 4, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity | Dec 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignment, Knowledge Distillation | Code Available | 1 |
| Free and Customizable Code Documentation with LLMs: A Fine-Tuning Approach | Dec 1, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Aligning Knowledge Concepts to Whole Slide Images for Precise Histopathology Image Analysis | Nov 27, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| LongKey: Keyphrase Extraction for Long Documents | Nov 26, 2024 | Keyphrase Extraction, Language Modeling | Code Available | 1 |
| PromptHSI: Universal Hyperspectral Image Restoration with Vision-Language Modulated Frequency Adaptation | Nov 24, 2024 | Image Restoration, Language Modeling | Code Available | 1 |
| VaLiD: Mitigating the Hallucination of Large Vision Language Models by Visual Layer Fusion Contrastive Decoding | Nov 24, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive Learning, Extractive Summarization | Code Available | 1 |
| Revelio: Interpreting and leveraging semantic information in diffusion models | Nov 23, 2024 | Denoising, Language Modeling | Code Available | 1 |
| ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos | Nov 22, 2024 | Language-Based Temporal Localization, Language Modeling | Code Available | 1 |
| Why do language models perform worse for morphologically complex languages? | Nov 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Planning-Driven Programming: A Large Language Model Programming Workflow | Nov 21, 2024 | Code Generation, HumanEval | Code Available | 1 |
| UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages | Nov 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach | Nov 20, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues | Nov 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Selective Attention: Enhancing Transformer through Principled Context Control | Nov 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | Attribute, Compositional Zero-Shot Learning | Code Available | 1 |
| Improved GUI Grounding via Iterative Narrowing | Nov 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model | Nov 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Nov 16, 2024 | image-classification, Image Classification | Code Available | 1 |
| Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers | Nov 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Language Models as Causal Effect Generators | Nov 12, 2024 | Causal Inference, counterfactual | Code Available | 1 |
| LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Nov 11, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPU, Language Modeling | Code Available | 1 |
| Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| DELIFT: Data Efficient Language model Instruction Fine Tuning | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Nov 7, 2024 | AutoML, Hyperparameter Optimization | Code Available | 1 |
| Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset | Nov 5, 2024 | Benchmarking, Language Modeling | Code Available | 1 |
| Training Compute-Optimal Protein Language Models | Nov 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge | Nov 4, 2024 | Diagnostic, Language Modeling | Code Available | 1 |
| Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models | Nov 4, 2024 | Inductive Bias, Language Modeling | Code Available | 1 |
| GraphXForm: Graph transformer for computer-aided molecular design | Nov 3, 2024 | Drug Design, Drug Discovery | Code Available | 1 |