| LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Nov 11, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPU, Language Modeling | Code Available | 1 |
| Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| DELIFT: Data Efficient Language model Instruction Fine Tuning | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Nov 7, 2024 | AutoML, Hyperparameter Optimization | Code Available | 1 |
| Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset | Nov 5, 2024 | Benchmarking, Language Modeling | Code Available | 1 |
| Training Compute-Optimal Protein Language Models | Nov 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge | Nov 4, 2024 | Diagnostic, Language Modeling | Code Available | 1 |
| Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models | Nov 4, 2024 | Inductive Bias, Language Modeling | Code Available | 1 |
| GraphXForm: Graph transformer for computer-aided molecular design | Nov 3, 2024 | Drug Design, Drug Discovery | Code Available | 1 |