| SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning | Mar 19, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 2 |
| Probing the topology of the space of tokens with structured prompts | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models | Mar 19, 2025 | Bayesian OptimizationCode Generation | —Unverified | 0 |
| Sig2text, a Vision-language model for Non-cooperative Radar Signal Parsing | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation | Mar 19, 2025 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| What Makes a Reward Model a Good Teacher? An Optimization Perspective | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Robust Transmission of Punctured Text with Large Language Model-based Recovery | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Shushing! Let's Imagine an Authentic Speech from the Silent Video | Mar 19, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |