| ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression | Dec 4, 2024 | 2kLogical Reasoning | CodeCode Available | 1 |
| SEED4D: A Synthetic Ego--Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark | Dec 1, 2024 | 2k4D reconstruction | CodeCode Available | 1 |
| Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Nov 18, 2024 | 2k4k | CodeCode Available | 0 |
| Phenome-wide causal proteomics enhance systemic lupus erythematosus flare prediction: A study in Asian populations | Nov 18, 2024 | 2kManagement | —Unverified | 0 |
| Fox-1 Technical Report | Nov 8, 2024 | 2k8k | —Unverified | 0 |
| STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing | Nov 1, 2024 | 2kIn-Context Learning | —Unverified | 0 |
| BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks | Oct 28, 2024 | 2k | —Unverified | 0 |
| How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs | Oct 24, 2024 | 2kMachine Translation | CodeCode Available | 1 |
| Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents | Oct 18, 2024 | 2kActive Learning | —Unverified | 0 |
| Integrating Artificial Intelligence Models and Synthetic Image Data for Enhanced Asset Inspection and Defect Identification | Oct 15, 2024 | 2kDefect Detection | —Unverified | 0 |
| TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | Oct 14, 2024 | 2kBenchmarking | CodeCode Available | 1 |
| I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow | Oct 10, 2024 | 2k | —Unverified | 0 |
| HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration | Oct 2, 2024 | 2kDenoising | CodeCode Available | 1 |
| Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning | Sep 30, 2024 | 2kComputational Efficiency | —Unverified | 0 |
| The Nature of NLP: Analyzing Contributions in NLP Papers | Sep 29, 2024 | 2k | CodeCode Available | 0 |
| Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Sep 26, 2024 | 2k4k | —Unverified | 0 |
| Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents | Sep 23, 2024 | 2k | —Unverified | 0 |
| PecSched: Preemptive and Efficient Cluster Scheduling for LLM Inference | Sep 23, 2024 | 2kBlocking | —Unverified | 0 |
| Scene-Text Grounding for Text-Based Video Question Answering | Sep 22, 2024 | 2kContrastive Learning | CodeCode Available | 1 |
| Clustering with Non-adaptive Subset Queries | Sep 17, 2024 | 2kClustering | —Unverified | 0 |
| TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces | Sep 5, 2024 | 2kFace Recognition | CodeCode Available | 0 |
| How Much Data is Enough Data? Fine-Tuning Large Language Models for In-House Translation: Performance Evaluation Across Multiple Dataset Sizes | Sep 5, 2024 | 2kTranslation | —Unverified | 0 |
| Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method | Aug 30, 2024 | 2kDepth Estimation | CodeCode Available | 0 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k4k | CodeCode Available | 1 |
| LogParser-LLM: Advancing Efficient Log Parsing with Large Language Models | Aug 25, 2024 | 2kLog Parsing | —Unverified | 0 |