| CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Mar 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models | Mar 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Field-Mediated Semantic Organization in Large Language Models: Evidence for Quantum-Like Properties in Artificial Neural Systems | Mar 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities | Mar 20, 2025 | General KnowledgeLanguage Modeling | CodeCode Available | 0 |
| Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Comprehensive Survey on Long Context Language Modeling | Mar 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Entropy-based Exploration Conduction for Multi-step Reasoning | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Mar 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Using Language Models to Decipher the Motivation Behind Human Behaviors | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction | Mar 20, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Exploring the Reliability of Self-explanation and its Relationship with Classification in Language Model-driven Financial Analysis | Mar 20, 2025 | ClassificationFinancial Analysis | CodeCode Available | 0 |
| Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| ChatGPT and U(X): A Rapid Review on Measuring the User Experience | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions | Mar 19, 2025 | Action RecognitionComputational Efficiency | —Unverified | 0 |
| SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning | Mar 19, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 2 |
| Probing the topology of the space of tokens with structured prompts | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models | Mar 19, 2025 | Bayesian OptimizationCode Generation | —Unverified | 0 |
| Sig2text, a Vision-language model for Non-cooperative Radar Signal Parsing | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation | Mar 19, 2025 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| What Makes a Reward Model a Good Teacher? An Optimization Perspective | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Robust Transmission of Punctured Text with Large Language Model-based Recovery | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Shushing! Let's Imagine an Authentic Speech from the Silent Video | Mar 19, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |