Approaching Deep Learning through the Spectral Dynamics of Weights Aug 21, 2024 Deep Learning image-classification
Code Code Available 15 Cross-domain Retrieval in the Legal and Patent Domains: a Reproducibility Study Dec 21, 2020 Information Retrieval Language Modelling
Code Code Available 15 UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling Nov 23, 2021 Image Captioning Image Description
Code Code Available 15 CDLM: Cross-Document Language Modeling Jan 2, 2021 Citation Recommendation Coreference Resolution
Code Code Available 15 InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators Oct 26, 2023 Language Modeling Language Modelling
Code Code Available 15 Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction Oct 31, 2024 Disaster Response Language Modeling
Code Code Available 15 Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment Oct 9, 2022 Language Modeling Language Modelling
Code Code Available 15 Critic-Guided Decoding for Controlled Text Generation Dec 21, 2022 Language Modeling Language Modelling
Code Code Available 15 Injecting Numerical Reasoning Skills into Language Models Apr 9, 2020 Data Augmentation Decoder
Code Code Available 15 CriticEval: Evaluating Large Language Model as Critic Feb 21, 2024 Language Modeling Language Modelling
Code Code Available 15 -former: Infinite Memory Transformer Sep 1, 2021 Dialogue Generation Language Modeling
Code Code Available 15 AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework Aug 26, 2024 Language Modeling Language Modelling
Code Code Available 15 Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias May 9, 2024 Data Visualization Language Modeling
Code Code Available 15 -former: Infinite Memory Transformer May 1, 2022 Dialogue Generation Language Modeling
Code Code Available 15 INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models Feb 22, 2024 Information Retrieval Instruction Following
Code Code Available 15 Creative Agents: Empowering Agents with Imagination for Creative Tasks Dec 5, 2023 Instruction Following Language Modelling
Code Code Available 15 CREAM: Consistency Regularized Self-Rewarding Language Models Oct 16, 2024 Language Modeling Language Modelling
Code Code Available 15 InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation Dec 2, 2021 Language Modeling Language Modelling
Code Code Available 15 InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings Oct 8, 2022 Contrastive Learning Language Modeling
Code Code Available 15 CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model Apr 28, 2024 Language Modeling Language Modelling
Code Code Available 15 MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models Oct 30, 2023 Language Modeling Language Modelling
Code Code Available 15 InforMask: Unsupervised Informative Masking for Language Model Pretraining Oct 21, 2022 Language Modeling Language Modelling
Code Code Available 15 CrAM: A Compression-Aware Minimizer Jul 28, 2022 GPU Image Classification
Code Code Available 15 CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization May 5, 2025 Diversity Language Modeling
Code Code Available 15 Crafting Large Language Models for Enhanced Interpretability Jul 5, 2024 Language Modeling Language Modelling
Code Code Available 15 Agentic Skill Discovery May 23, 2024 Language Modelling
Code Code Available 15 INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model Jul 23, 2024 Language Modeling Language Modelling
Code Code Available 15 RealFormer: Transformer Likes Residual Attention Dec 21, 2020 Language Modeling Language Modelling
Code Code Available 15 CPT: Efficient Deep Neural Network Training via Cyclic Precision Jan 25, 2021 Language Modeling Language Modelling
Code Code Available 15 CPM: A Large-scale Generative Chinese Pre-trained Language Model Dec 1, 2020 Cloze Test Language Modeling
Code Code Available 15 CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation Sep 13, 2021 Decoder Denoising
Code Code Available 15 CoVR-2: Automatic Data Construction for Composed Video Retrieval Aug 28, 2023 Composed Image Retrieval (CoIR) Composed Video Retrieval (CoVR)
Code Code Available 15 Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning May 24, 2023 Language Modeling Language Modelling
Code Code Available 15 CPLLM: Clinical Prediction with Large Language Models Sep 20, 2023 Disease Prediction Language Modeling
Code Code Available 15 Cross-lingual Visual Pre-training for Multimodal Machine Translation Jan 25, 2021 Language Modelling Machine Translation
Code Code Available 15 Inference with Reference: Lossless Acceleration of Large Language Models Apr 10, 2023 Decoder Language Modeling
Code Code Available 15 InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model Mar 4, 2025 es-en Language Modeling
Code Code Available 15 InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training Jul 15, 2020 Contrastive Learning Cross-Lingual Transfer
Code Code Available 15 Counterfactual Data Augmentation for Neural Machine Translation Jun 1, 2021 counterfactual Data Augmentation
Code Code Available 15 IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization Sep 10, 2021 Language Modeling Language Modelling
Code Code Available 15 Cost-effective Instruction Learning for Pathology Vision and Language Analysis Jul 25, 2024 Few-Shot Learning Language Modelling
Code Code Available 15 InferCept: Efficient Intercept Support for Augmented Large Language Model Inference Feb 2, 2024 GPU Language Modeling
Code Code Available 15 Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning Oct 26, 2022 Language Modeling Language Modelling
Code Code Available 15 Incorporating Large Language Models into Production Systems for Enhanced Task Automation and Flexibility Jul 11, 2024 Language Modeling Language Modelling
Code Code Available 15 ApiQ: Finetuning of 2-Bit Quantized Large Language Model Feb 7, 2024 GPU Language Modeling
Code Code Available 15 VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups Jun 1, 2021 Language Modeling Language Modelling
Code Code Available 15 cosFormer: Rethinking Softmax in Attention Feb 17, 2022 D4RL Language Modeling
Code Code Available 15 A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization May 22, 2025 Combinatorial Optimization Language Modeling
Code Code Available 15 TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation Feb 24, 2024 Descriptive Language Modeling
Code Code Available 15 Inductive Entity Representations from Text via Link Prediction Oct 7, 2020 Inductive knowledge graph completion Inductive Link Prediction
Code Code Available 15