| Spoken Language Modeling with Duration-Penalized Self-Supervised Units | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers | Sep 5, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset | Oct 14, 2021 | Image RetrievalLanguage Modeling | CodeCode Available | 0 |
| Towards a text-based quantitative and explainable histopathology image analysis | Jul 10, 2024 | image-classificationImage Classification | CodeCode Available | 0 |
| Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation | Oct 18, 2024 | Backdoor AttackKnowledge Distillation | CodeCode Available | 0 |
| ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing | Oct 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Multi-Word Lexical Simplification | Dec 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Sentence-level Media Bias Analysis with Event Relation Graph | Apr 2, 2024 | Graph AttentionLanguage Modeling | CodeCode Available | 0 |
| SPRIG: Improving Large Language Model Performance by System Prompt Optimization | Oct 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| On Monotonic Aggregation for Open-domain QA | Aug 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Natural Language Decompositions of Implicit Content Enable Better Text Representations | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| OnlySportsLM: Optimizing Sports-Domain Language Models with SOTA Performance under Billion Parameters | Aug 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving | Nov 1, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation | Jun 16, 2025 | Data VisualizationLanguage Modeling | CodeCode Available | 0 |
| Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model | May 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks | Nov 2, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Sensei: Self-Supervised Sensor Name Segmentation | Jan 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios | Dec 27, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 0 |
| Prompt Engineering for Transformer-based Chemical Similarity Search Identifies Structurally Distinct Functional Analogues | May 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Multi-Task Deep Neural Networks for Natural Language Understanding | Jan 31, 2019 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 |
| Unlocking Efficiency: Adaptive Masking for Gene Transformer Models | Aug 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text | May 18, 2018 | DescriptiveImage Captioning | CodeCode Available | 0 |
| Semi-supervised Sequence Learning | Nov 4, 2015 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semi-supervised Formality Style Transfer using Language Model Discriminator and Mutual Information Maximization | Oct 10, 2020 | Formality Style TransferLanguage Modeling | CodeCode Available | 0 |
| PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference | Mar 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Stable LM 2 1.6B Technical Report | Feb 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping | Apr 15, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Community-Driven Agents for Machine Learning Engineering | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semi-Siamese Bi-encoder Neural Ranking Model Using Lightweight Fine-Tuning | Oct 28, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification Tasks | Oct 1, 2022 | ClassificationLanguage Modeling | CodeCode Available | 0 |
| Semiparametric Token-Sequence Co-Supervision | Mar 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation | Apr 18, 2025 | Anomaly SegmentationLanguage Modeling | CodeCode Available | 0 |
| Stacked AMR Parsing with Silver Data | Nov 1, 2021 | Abstract Meaning RepresentationAMR Parsing | CodeCode Available | 0 |
| Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semi-Automated Construction of Food Composition Knowledge Base | Jan 24, 2023 | Active LearningLanguage Modeling | CodeCode Available | 0 |
| SemGloVe: Semantic Co-occurrences for GloVe from BERT | Dec 30, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning | Mar 22, 2024 | Few-Shot Stance DetectionIn-Context Learning | CodeCode Available | 0 |
| When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? | Nov 25, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 0 |
| Online Normalization for Training Neural Networks | May 15, 2019 | General Classificationimage-classification | CodeCode Available | 0 |
| SemEval-2017 Task 4: Sentiment Analysis in Twitter using BERT | Jan 15, 2024 | Binary ClassificationClassification | CodeCode Available | 0 |
| Semantics or spelling? Probing contextual word embeddings with orthographic noise | Aug 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models | Oct 24, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2 | Apr 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations | May 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment | Nov 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning | Apr 27, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 |
| Stateful Memory-Augmented Transformers for Efficient Dialogue Modeling | Sep 15, 2022 | DecoderDialogue Generation | CodeCode Available | 0 |
| Semantics-aware BERT for Language Understanding | Sep 5, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards | Feb 1, 2024 | Answer SelectionLanguage Modeling | CodeCode Available | 0 |
| Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System | Jun 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |