| MUSE-VL: Modeling Unified VLM through Semantic Discrete Encoding | Nov 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection | Nov 26, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| The Context of Crash Occurrence: A Complexity-Infused Approach Integrating Semantic, Contextual, and Kinematic Features | Nov 26, 2024 | Autonomous Driving, Language Modeling | Unverified | 0 |
| Data-driven development of cycle prediction models for lithium metal batteries using multi modal mining | Nov 26, 2024 | Graph Mining, Language Modeling | Unverified | 0 |
| DapPep: Domain Adaptive Peptide-agnostic Learning for Universal T-cell Receptor-antigen Binding Affinity Prediction | Nov 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Nov 26, 2024 | Language Modeling, Large Language Model | Code Available | 2 |
| MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension | Nov 26, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LongKey: Keyphrase Extraction for Long Documents | Nov 26, 2024 | Keyphrase Extraction, Language Modeling | Code Available | 1 |
| Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Nov 26, 2024 | GPU, Language Modeling | Code Available | 3 |
| MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation | Nov 26, 2024 | Code Generation, In-Context Learning | Unverified | 0 |
| On the Efficiency of NLP-Inspired Methods for Tabular Deep Learning | Nov 26, 2024 | Computational Efficiency, Deep Learning | Code Available | 3 |
| STAR: Synthesis of Tailored Architectures | Nov 26, 2024 | Evolutionary Algorithms, Language Modeling | Unverified | 0 |
| Scaling Speech-Text Pre-training with Synthetic Interleaved Data | Nov 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 7 |
| DocEDA: Automated Extraction and Design of Analog Circuits from Documents with Large Language Model | Nov 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Towards Agentic Schema Refinement | Nov 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Tree Transformers are an Ineffective Model of Syntactic Constituency | Nov 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhancing Answer Reliability Through Inter-Model Consensus of Large Language Models | Nov 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| StructFormer: Document Structure-based Masked Attention and its Impact on Language Model Pre-Training | Nov 25, 2024 | Document Understanding, Language Modeling | Unverified | 0 |
| VideoOrion: Tokenizing Object Dynamics in Videos | Nov 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text | Nov 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Functionality understanding and segmentation in 3D scenes | Nov 25, 2024 | AI Agent, Language Modeling | Unverified | 0 |
| BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment | Nov 25, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? | Nov 25, 2024 | Knowledge Distillation, Language Modeling | Code Available | 0 |
| PromptHSI: Universal Hyperspectral Image Restoration with Vision-Language Modulated Frequency Adaptation | Nov 24, 2024 | Image Restoration, Language Modeling | Code Available | 1 |
| VaLiD: Mitigating the Hallucination of Large Vision Language Models by Visual Layer Fusion Contrastive Decoding | Nov 24, 2024 | Hallucination, Language Modeling | Code Available | 1 |