| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive Learning, Extractive Summarization | Code Available | 1 |
| Revelio: Interpreting and leveraging semantic information in diffusion models | Nov 23, 2024 | Denoising, Language Modeling | Code Available | 1 |
| ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos | Nov 22, 2024 | Language-Based Temporal Localization, Language Modeling | Code Available | 1 |
| Why do language models perform worse for morphologically complex languages? | Nov 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Planning-Driven Programming: A Large Language Model Programming Workflow | Nov 21, 2024 | Code Generation, HumanEval | Code Available | 1 |
| UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages | Nov 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach | Nov 20, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues | Nov 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Selective Attention: Enhancing Transformer through Principled Context Control | Nov 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | Attribute, Compositional Zero-Shot Learning | Code Available | 1 |
| Improved GUI Grounding via Iterative Narrowing | Nov 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model | Nov 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Nov 16, 2024 | image-classification, Image Classification | Code Available | 1 |
| Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers | Nov 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Language Models as Causal Effect Generators | Nov 12, 2024 | Causal Inference, counterfactual | Code Available | 1 |
| LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Nov 11, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPU, Language Modeling | Code Available | 1 |
| Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| DELIFT: Data Efficient Language model Instruction Fine Tuning | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Nov 7, 2024 | AutoML, Hyperparameter Optimization | Code Available | 1 |
| Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset | Nov 5, 2024 | Benchmarking, Language Modeling | Code Available | 1 |
| Training Compute-Optimal Protein Language Models | Nov 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge | Nov 4, 2024 | Diagnostic, Language Modeling | Code Available | 1 |
| Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models | Nov 4, 2024 | Inductive Bias, Language Modeling | Code Available | 1 |
| GraphXForm: Graph transformer for computer-aided molecular design | Nov 3, 2024 | Drug Design, Drug Discovery | Code Available | 1 |