SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning Aug 10, 2024 Hallucination Optical Character Recognition
Code Code Available 115 FourierKAN outperforms MLP on Text Classification Head Fine-tuning Aug 16, 2024 Classification Kolmogorov-Arnold Networks
Code Code Available 75 AutoTrain: No-code training for state-of-the-art models Oct 21, 2024 Classification image-classification
Code Code Available 75 Interactive Prompt Debugging with Sequence Salience Apr 11, 2024 Sentence text-classification
Code Code Available 75 h2oGPT: Democratizing Large Language Models Jun 13, 2023 Chatbot Fairness
Code Code Available 65 Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective Oct 16, 2022 Coreference Resolution Multiple-choice
Code Code Available 45 MTEB: Massive Text Embedding Benchmark Oct 13, 2022 Benchmarking Information Retrieval
Code Code Available 45 N-Grammer: Augmenting Transformers with latent n-grams Jul 13, 2022 Common Sense Reasoning Coreference Resolution
Code Code Available 45 ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks Mar 27, 2023 text annotation Text Classification
Code Code Available 45 When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes Apr 18, 2024 Contrastive Learning Few-Shot Learning
Code Code Available 35 BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models Aug 23, 2024 Data Poisoning text-classification
Code Code Available 35 RoFormer: Enhanced Transformer with Rotary Position Embedding Apr 20, 2021 Position Semantic Text Matching
Code Code Available 35 Universal Language Model Fine-tuning for Text Classification Jan 18, 2018 General Classification Language Modeling
Code Code Available 35 PyABSA: A Modularized Framework for Reproducible Aspect-based Sentiment Analysis Aug 2, 2022 Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)
Code Code Available 35 Ludwig: a type-based declarative deep learning toolbox Sep 17, 2019 Decoder Deep Learning
Code Code Available 35 Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models Mar 14, 2022 Text Classification
Code Code Available 35 FusionBench: A Comprehensive Benchmark of Deep Model Fusion Jun 5, 2024 image-classification Image Classification
Code Code Available 35 MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts Apr 22, 2024 Common Sense Reasoning GPU
Code Code Available 35 A Survey of Large Language Models in Finance (FinLLMs) Feb 4, 2024 Named Entity Recognition (NER) Question Answering
Code Code Available 35 PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts Oct 17, 2017 General Classification Sentence
Code Code Available 35 BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Oct 11, 2018 Citation Intent Classification Common Sense Reasoning
Code Code Available 35 Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset May 17, 2024 16k Benchmarking
Code Code Available 35 Personalized Benchmarking with the Ludwig Benchmarking Toolkit Nov 8, 2021 Benchmarking Hyperparameter Optimization
Code Code Available 35 Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification Oct 26, 2020 Few-Shot Text Classification General Classification
Code Code Available 25 BAE: BERT-based Adversarial Examples for Text Classification Apr 4, 2020 Adversarial Attack Adversarial Text
Code Code Available 25 Simple Recurrent Units for Highly Parallelizable Recurrence Sep 8, 2017 General Classification Machine Translation
Code Code Available 25 TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP Apr 29, 2020 Adversarial Attack Adversarial Text
Code Code Available 25 QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization Mar 11, 2022 image-classification Image Classification
Code Code Available 25 PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain Oct 22, 2023 Dialogue Generation Dialogue Understanding
Code Code Available 25 TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing Feb 28, 2020 Knowledge Distillation Reading Comprehension
Code Code Available 25 ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers Sep 28, 2023 GPU Instruction Following
Code Code Available 25 LinkBERT: Pretraining Language Models with Document Links Mar 29, 2022 Document Classification Language Modeling
Code Code Available 25 ktrain: A Low-Code Library for Augmented Machine Learning Apr 19, 2020 BIG-bench Machine Learning Classification
Code Code Available 25 In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation Aug 9, 2024 Image to text Object
Code Code Available 25 Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours Aug 2, 2022 Text Classification
Code Code Available 25 Multiscale Positive-Unlabeled Detection of AI-Generated Texts May 29, 2023 Language Modelling text-classification
Code Code Available 25 EMR-Merging: Tuning-Free High-Performance Model Merging May 23, 2024 Image Classification Image Retrieval
Code Code Available 25 EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks Jan 31, 2019 Data Augmentation General Classification
Code Code Available 25 Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators May 23, 2024 image-classification Image Classification
Code Code Available 25 Few-Shot Text Generation with Pattern-Exploiting Training Dec 22, 2020 Headline Generation text-classification
Code Code Available 25 Adaptive Ranking-based Sample Selection for Weakly Supervised Class-imbalanced Text Classification Oct 6, 2022 text-classification Text Classification
Code Code Available 25 CSL: A Large-scale Chinese Scientific Literature Dataset Sep 12, 2022 text-classification Text Classification
Code Code Available 25 DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion Jan 23, 2023 Image-text Classification Node Classification
Code Code Available 25 CAPO: Cost-Aware Prompt Optimization Apr 22, 2025 Arithmetic Reasoning AutoML
Code Code Available 25 MiniRBT: A Two-stage Distilled Small Chinese Pre-trained Model Apr 3, 2023 Machine Reading Comprehension Reading Comprehension
Code Code Available 25 Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition Jan 4, 2024 Attribute Audio Classification
Code Code Available 25 Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach Aug 31, 2019 Articles Benchmarking
Code Code Available 25 CrypTen: Secure Multi-Party Computation Meets Machine Learning Sep 2, 2021 BIG-bench Machine Learning GPU
Code Code Available 25 AEDA: An Easier Data Augmentation Technique for Text Classification Aug 30, 2021 Classification Data Augmentation
Code Code Available 15 Adversarial Training Methods for Semi-Supervised Text Classification May 25, 2016 Classification General Classification
Code Code Available 15