| ZClip: Adaptive Spike Mitigation for LLM Pre-Training | Apr 3, 2025 | Anomaly Detection, Large Language Model | Code Available | 2 |
| CrackSQL: A Hybrid SQL Dialect Translation System Powered by Large Language Models | Apr 1, 2025 | Large Language Model, Translation | Code Available | 2 |
| TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection | Mar 31, 2025 | Fraud Detection, Large Language Model | Code Available | 2 |
| Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs | Mar 31, 2025 | Large Language Model, Video Chaptering | Code Available | 2 |
| Cross-Tokenizer Distillation via Approximate Likelihood Matching | Mar 25, 2025 | Large Language Model | Code Available | 2 |
| CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Mar 21, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Modifying Large Language Model Post-Training for Diverse Creative Writing | Mar 21, 2025 | Diversity, Language Modeling | Code Available | 2 |
| Generative Modeling for Mathematical Discovery | Mar 14, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Mar 13, 2025 | AI Agent, Language Modeling | Code Available | 2 |
| 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models | Mar 13, 2025 | Large Language Model, Object | Code Available | 2 |