| Test-Time Training Done Right | May 29, 2025 | 2k, Novel View Synthesis | —Unverified | 0 |
| PIIvot: A Lightweight NLP Anonymization Framework for Question-Anchored Tutoring Dialogues | May 22, 2025 | 2k | —Unverified | 0 |
| Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning | May 19, 2025 | 2k, Mathematical Reasoning | —Unverified | 0 |
| UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning | May 18, 2025 | 2k, Reinforcement Learning (RL) | —Unverified | 0 |
| ViMRHP: A Vietnamese Benchmark Dataset for Multimodal Review Helpfulness Prediction via Human-AI Collaborative Annotation | May 12, 2025 | 2k, Recommendation Systems | Code Available | 0 |
| Calibrating Translation Decoding with Quality Estimation on LLMs | Apr 26, 2025 | 2k, Machine Translation | Code Available | 0 |
| aiXamine: Simplified LLM Safety and Security | Apr 21, 2025 | 2k, Adversarial Robustness | —Unverified | 0 |
| Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis | Apr 20, 2025 | 2k, Knowledge Distillation | —Unverified | 0 |
| Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading | Apr 16, 2025 | 2k, Code Generation | —Unverified | 0 |
| On Linear Representations and Pretraining Data Frequency in Language Models | Apr 16, 2025 | 2k, In-Context Learning | —Unverified | 0 |
| Seedream 3.0 Technical Report | Apr 15, 2025 | 2k, Image Generation | —Unverified | 0 |
| ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration | Apr 11, 2025 | 2k, Image Restoration | —Unverified | 0 |
| DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers | Mar 28, 2025 | 2k, Image Generation | —Unverified | 0 |
| Nonparametric MLE for Gaussian Location Mixtures: Certified Computation and Generic Behavior | Mar 26, 2025 | 2k | —Unverified | 0 |
| REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities | Mar 17, 2025 | 2k, Text Generation | —Unverified | 0 |
| Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation | Feb 26, 2025 | 16k, 2k | —Unverified | 0 |
| Stackelberg Game Preference Optimization for Data-Efficient Alignment of Language Models | Feb 25, 2025 | 2k, Models Alignment | —Unverified | 0 |
| Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks | Feb 24, 2025 | 2k, ARC | —Unverified | 0 |
| Exact Recovery of Sparse Binary Vectors from Generalized Linear Measurements | Feb 21, 2025 | 2k, Quantization | —Unverified | 0 |
| Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning | Feb 18, 2025 | 2k, Long-Context Understanding | —Unverified | 0 |
| Improved Regret in Stochastic Decision-Theoretic Online Learning under Differential Privacy | Feb 16, 2025 | 2k | —Unverified | 0 |
| Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains | Jan 24, 2025 | 2k, Legal Reasoning | —Unverified | 0 |
| TimeLogic: A Temporal Logic Benchmark for Video QA | Jan 13, 2025 | 2k, Action Segmentation | —Unverified | 0 |
| LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Jan 9, 2025 | 2k, 8k | —Unverified | 0 |
| Toward Corpus Size Requirements for Training and Evaluating Depression Risk Models Using Spoken Language | Dec 31, 2024 | 2k | —Unverified | 0 |
| Social-LLaVA: Enhancing Robot Navigation through Human-Language Reasoning in Social Spaces | Dec 30, 2024 | 2k, Robot Navigation | —Unverified | 0 |
| Multimodal Preference Data Synthetic Alignment with Reward Model | Dec 23, 2024 | 2k, Caption Generation | Code Available | 0 |
| AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models | Dec 17, 2024 | 2k, Code Generation | —Unverified | 0 |
| Block-Based Multi-Scale Image Rescaling | Dec 16, 2024 | 2k, 4k | —Unverified | 0 |
| Do Large Language Models Show Biases in Causal Learning? | Dec 13, 2024 | 2k, Misinformation | —Unverified | 0 |
| MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects | Dec 6, 2024 | 2k, Anomaly Detection | —Unverified | 0 |
| Lightweight Multiplane Images Network for Real-Time Stereoscopic Conversion from Planar Video | Dec 4, 2024 | 2k | —Unverified | 0 |
| Phenome-wide causal proteomics enhance systemic lupus erythematosus flare prediction: A study in Asian populations | Nov 18, 2024 | 2k, Management | —Unverified | 0 |
| Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Nov 18, 2024 | 2k, 4k | Code Available | 0 |
| Fox-1 Technical Report | Nov 8, 2024 | 2k, 8k | —Unverified | 0 |
| STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing | Nov 1, 2024 | 2k, In-Context Learning | —Unverified | 0 |
| BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks | Oct 28, 2024 | 2k | —Unverified | 0 |
| Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents | Oct 18, 2024 | 2k, Active Learning | —Unverified | 0 |
| Integrating Artificial Intelligence Models and Synthetic Image Data for Enhanced Asset Inspection and Defect Identification | Oct 15, 2024 | 2k, Defect Detection | —Unverified | 0 |
| I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow | Oct 10, 2024 | 2k | —Unverified | 0 |
| Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning | Sep 30, 2024 | 2k, Computational Efficiency | —Unverified | 0 |
| The Nature of NLP: Analyzing Contributions in NLP Papers | Sep 29, 2024 | 2k | Code Available | 0 |
| Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Sep 26, 2024 | 2k, 4k | —Unverified | 0 |
| Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents | Sep 23, 2024 | 2k | —Unverified | 0 |
| PecSched: Preemptive and Efficient Cluster Scheduling for LLM Inference | Sep 23, 2024 | 2k, Blocking | —Unverified | 0 |
| Clustering with Non-adaptive Subset Queries | Sep 17, 2024 | 2k, Clustering | —Unverified | 0 |
| How Much Data is Enough Data? Fine-Tuning Large Language Models for In-House Translation: Performance Evaluation Across Multiple Dataset Sizes | Sep 5, 2024 | 2k, Translation | —Unverified | 0 |
| TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces | Sep 5, 2024 | 2k, Face Recognition | Code Available | 0 |
| Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method | Aug 30, 2024 | 2k, Depth Estimation | Code Available | 0 |
| LogParser-LLM: Advancing Efficient Log Parsing with Large Language Models | Aug 25, 2024 | 2k, Log Parsing | —Unverified | 0 |