| The Context of Crash Occurrence: A Complexity-Infused Approach Integrating Semantic, Contextual, and Kinematic Features | Nov 26, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| MUSE-VL: Modeling Unified VLM through Semantic Discrete Encoding | Nov 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams | Nov 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DapPep: Domain Adaptive Peptide-agnostic Learning for Universal T-cell Receptor-antigen Binding Affinity Prediction | Nov 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation | Nov 26, 2024 | Code GenerationIn-Context Learning | —Unverified | 0 |
| Data-driven development of cycle prediction models for lithium metal batteries using multi modal mining | Nov 26, 2024 | Graph MiningLanguage Modeling | —Unverified | 0 |
| Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Nov 26, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| Scaling Speech-Text Pre-training with Synthetic Interleaved Data | Nov 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 7 |
| LongKey: Keyphrase Extraction for Long Documents | Nov 26, 2024 | Keyphrase ExtractionLanguage Modeling | CodeCode Available | 1 |
| MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension | Nov 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| On the Efficiency of NLP-Inspired Methods for Tabular Deep Learning | Nov 26, 2024 | Computational EfficiencyDeep Learning | CodeCode Available | 3 |
| HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Nov 26, 2024 | Language ModelingLarge Language Model | CodeCode Available | 2 |
| STAR: Synthesis of Tailored Architectures | Nov 26, 2024 | Evolutionary AlgorithmsLanguage Modeling | —Unverified | 0 |
| DocEDA: Automated Extraction and Design of Analog Circuits from Documents with Large Language Model | Nov 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Agentic Schema Refinement | Nov 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tree Transformers are an Ineffective Model of Syntactic Constituency | Nov 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Answer Reliability Through Inter-Model Consensus of Large Language Models | Nov 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| StructFormer: Document Structure-based Masked Attention and its Impact on Language Model Pre-Training | Nov 25, 2024 | document understandingLanguage Modeling | —Unverified | 0 |
| SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text | Nov 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VideoOrion: Tokenizing Object Dynamics in Videos | Nov 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Functionality understanding and segmentation in 3D scenes | Nov 25, 2024 | AI AgentLanguage Modeling | —Unverified | 0 |
| BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment | Nov 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? | Nov 25, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 0 |
| VaLiD: Mitigating the Hallucination of Large Vision Language Models by Visual Layer Fusion Contrastive Decoding | Nov 24, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| PromptHSI: Universal Hyperspectral Image Restoration with Vision-Language Modulated Frequency Adaptation | Nov 24, 2024 | Image RestorationLanguage Modeling | CodeCode Available | 1 |
| Can a Large Language Model Learn Matrix Functions In Context? | Nov 24, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Ensuring Fair LLM Serving Amid Diverse Applications | Nov 24, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| Is Training Data Quality or Quantity More Impactful to Small Language Model Performance? | Nov 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Generative Prompt Internalization | Nov 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Revelio: Interpreting and leveraging semantic information in diffusion models | Nov 23, 2024 | DenoisingLanguage Modeling | CodeCode Available | 1 |
| AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset | Nov 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive Grammars | Nov 23, 2024 | DescriptiveIn-Context Learning | CodeCode Available | 0 |
| Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks | Nov 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive LearningExtractive Summarization | CodeCode Available | 1 |
| MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model | Nov 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Enabling Efficient Serverless Inference Serving for LLM (Large Language Model) in the Cloud | Nov 23, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment | Nov 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Language Model with Region-guided Referring and Grounding for CT Report Generation | Nov 23, 2024 | Computed Tomography (CT)Diagnostic | CodeCode Available | 2 |
| Automatic High-quality Verilog Assertion Generation through Subtask-Focused Fine-Tuned LLMs and Iterative Prompting | Nov 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts | Nov 22, 2024 | AI AgentLanguage Modeling | CodeCode Available | 2 |
| The BS-meter: A ChatGPT-Trained Instrument to Detect Sloppy Language-Games | Nov 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tulu 3: Pushing Frontiers in Open Language Model Post-Training | Nov 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos | Nov 22, 2024 | Language-Based Temporal LocalizationLanguage Modeling | CodeCode Available | 1 |
| ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Nov 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation | Nov 22, 2024 | Causal Language ModelingLanguage Modeling | —Unverified | 0 |
| Astro-HEP-BERT: A bidirectional language model for studying the meanings of concepts in astrophysics and high energy physics | Nov 22, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| Effective SAM Combination for Open-Vocabulary Semantic Segmentation | Nov 22, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Memory Backdoor Attacks on Neural Networks | Nov 21, 2024 | Backdoor AttackFederated Learning | —Unverified | 0 |
| Planning-Driven Programming: A Large Language Model Programming Workflow | Nov 21, 2024 | Code GenerationHumanEval | CodeCode Available | 1 |
| GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Nov 21, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |