| LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs | Jul 1, 2025 | Large Language Model | CodeCode Available | 1 |
| Dataset Distillation via Vision-Language Category Prototype | Jun 30, 2025 | Dataset DistillationDescriptive | CodeCode Available | 1 |
| Where, What, Why: Towards Explainable Driver Attention Prediction | Jun 29, 2025 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder | Jun 28, 2025 | Image SegmentationLarge Language Model | CodeCode Available | 1 |
| GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | DiagnosticLarge Language Model | CodeCode Available | 1 |
| Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective | Jun 22, 2025 | In-Context LearningLarge Language Model | CodeCode Available | 1 |
| DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Jun 21, 2025 | Autonomous DrivingDescriptive | CodeCode Available | 1 |
| The Condition Number as a Scale-Invariant Proxy for Information Encoding in Neural Units | Jun 19, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models | Jun 19, 2025 | Large Language ModelSafety Alignment | CodeCode Available | 1 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Jun 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Jun 17, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Jun 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Jun 12, 2025 | Large Language ModelStarcraft | CodeCode Available | 1 |
| Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning | Jun 10, 2025 | Large Language Modelreinforcement-learning | CodeCode Available | 1 |
| Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis | Jun 10, 2025 | Domain AdaptationLarge Language Model | CodeCode Available | 1 |
| EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Jun 10, 2025 | Binary ClassificationFinancial Analysis | CodeCode Available | 1 |
| Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias | Jun 6, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Jun 6, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data | Jun 5, 2025 | Drug DiscoveryLarge Language Model | CodeCode Available | 1 |
| OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model | Jun 5, 2025 | Instance SegmentationLanguage Modeling | CodeCode Available | 1 |
| POSS: Position Specialist Generates Better Draft for Speculative Decoding | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RewardAnything: Generalizable Principle-Following Reward Models | Jun 4, 2025 | Instruction FollowingLarge Language Model | CodeCode Available | 1 |
| DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | May 31, 2025 | Large Language Model | CodeCode Available | 1 |
| Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |