Effective Attention Sheds Light On Interpretability May 18, 2021 Language Modeling Language Modelling
Code Code Available 1Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language Jul 16, 2021 Language Modeling Language Modelling
Code Code Available 1DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA Dec 6, 2024 counterfactual Language Model Evaluation
Code Code Available 1Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions Aug 20, 2021 Code Generation Diversity
Code Code Available 1DARTS: Differentiable Architecture Search Jun 24, 2018 General Classification image-classification
Code Code Available 1Accurate Prediction of Antibody Function and Structure Using Bio-Inspired Antibody Language Model Aug 31, 2023 Language Modeling Language Modelling
Code Code Available 1Do These LLM Benchmarks Agree? Fixing Benchmark Evaluation with BenchBench Jul 18, 2024 Language Modelling
Code Code Available 1VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer Jul 6, 2021 Image Retrieval Knowledge Distillation
Code Code Available 1Effectiveness of self-supervised pre-training for speech recognition Nov 10, 2019 Language Modelling Quantization
Code Code Available 1Efficient Long Sequence Modeling via State Space Augmented Transformer Dec 15, 2022 Computational Efficiency Decoder
Code Code Available 1BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic Parsing Jun 21, 2022 Decoder Language Modeling
Code Code Available 1ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training Dec 20, 2023 Language Modeling Language Modelling
Code Code Available 1ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling Dec 18, 2024 Language Modeling Language Modelling
Code Code Available 1Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization May 24, 2024 Classification Federated Learning
Code Code Available 1Beheshti-NER: Persian Named Entity Recognition Using BERT Mar 19, 2020 Language Modeling Language Modelling
Code Code Available 1Data Augmentation using Pre-trained Transformer Models Mar 4, 2020 Data Augmentation Diversity
Code Code Available 1EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing Jul 18, 2024 Instruction Following Language Modeling
Code Code Available 1Advancing Beyond Identification: Multi-bit Watermark for Large Language Models Aug 1, 2023 Language Modeling Language Modelling
Code Code Available 1VisorGPT: Learning Visual Prior via Generative Pre-Training May 23, 2023 Image Generation Language Modeling
Code Code Available 1EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs Oct 13, 2024 Language Modeling Language Modelling
Code Code Available 1Revisiting the Role of Language Priors in Vision-Language Models Jun 2, 2023 Image-text matching Image-text Retrieval
Code Code Available 1DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines Nov 17, 2023 Language Modelling Large Language Model
Code Code Available 1Visually-Augmented Language Modeling May 20, 2022 Image Retrieval Language Modeling
Code Code Available 1Visually Grounded Commonsense Knowledge Acquisition Nov 22, 2022 Language Modelling
Code Code Available 1DziriBERT: a Pre-trained Language Model for the Algerian Dialect Sep 25, 2021 Language Modeling Language Modelling
Code Code Available 1BECEL: Benchmark for Consistency Evaluation of Language Models Oct 1, 2022 Language Modeling Language Modelling
Code Code Available 1An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers May 1, 2022 Language Modeling Language Modelling
Code Code Available 1Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs May 1, 2022 Constituency Grammar Induction Language Modeling
Code Code Available 1BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models Apr 5, 2024 Factual probe General Knowledge
Code Code Available 1ViLA: Efficient Video-Language Alignment for Video Question Answering Dec 13, 2023 cross-modal alignment Language Modeling
Code Code Available 1Data Efficient Masked Language Modeling for Vision and Language Sep 5, 2021 Language Modeling Language Modelling
Code Code Available 1A Cheaper and Better Diffusion Language Model with Soft-Masked Noise Apr 10, 2023 Denoising Image Generation
Code Code Available 1Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing Jul 26, 2024 Attribute Language Modelling
Code Code Available 1Dynamic Grained Encoder for Vision Transformers Jan 10, 2023 image-classification Image Classification
Code Code Available 1VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion Mar 8, 2025 3D Semantic Scene Completion Autonomous Driving
Code Code Available 1A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration Oct 3, 2023 Arithmetic Reasoning Code Generation
Code Code Available 1ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core Learning Feb 8, 2022 Benchmarking Language Modelling
Code Code Available 1MedualTime: A Dual-Adapter Language Model for Medical Time Series-Text Multimodal Learning Jun 7, 2024 Contrastive Learning Language Modeling
Code Code Available 1An Efficient Self-Supervised Cross-View Training For Sentence Embedding Nov 6, 2023 Contrastive Learning Language Modeling
Code Code Available 1DUnE: Dataset for Unified Editing Nov 27, 2023 Language Modeling Language Modelling
Code Code Available 1End-to-End Beam Retrieval for Multi-Hop Question Answering Aug 17, 2023 Language Modelling Large Language Model
Code Code Available 1An Efficient Multilingual Language Model Compression through Vocabulary Trimming May 24, 2023 Language Modeling Language Modelling
Code Code Available 1Dual Rectified Linear Units (DReLUs): A Replacement for Tanh Activation Functions in Quasi-Recurrent Neural Networks Jul 25, 2017 Language Modeling Language Modelling
Code Code Available 1DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities Feb 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Do Unlearning Methods Remove Information from Language Model Weights? Oct 11, 2024 Language Modeling Language Modelling
Code Code Available 1Walert: Putting Conversational Search Knowledge into Action by Building and Evaluating a Large Language Model-Powered Chatbot Jan 14, 2024 Chatbot Conversational Search
Code Code Available 1Balanced Data Sampling for Language Model Training with Clustering Feb 22, 2024 Clustering Language Modeling
Code Code Available 1WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models Aug 7, 2024 AI and Safety Benchmarking
Code Code Available 1Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval Jan 1, 2023 Knowledge Distillation Language Modelling
Code Code Available 1DUMA: Reading Comprehension with Transposition Thinking Jan 26, 2020 Language Modeling Language Modelling
Code Code Available 1