| SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations | Feb 24, 2025 | CPUGPU | CodeCode Available | 0 |
| Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space | Feb 24, 2025 | GPU | —Unverified | 0 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8kGPU | CodeCode Available | 4 |
| SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition | Feb 23, 2025 | Deep HashingGPU | CodeCode Available | 3 |
| A Split-Window Transformer for Multi-Model Sequence Spammer Detection using Multi-Model Variational Autoencoder | Feb 23, 2025 | GPUmodel | —Unverified | 0 |
| Fine-Tuning Qwen 2.5 3B for Realistic Movie Dialogue Generation | Feb 22, 2025 | Dialogue GenerationGPU | —Unverified | 0 |
| A Universal Framework for Compressing Embeddings in CTR Prediction | Feb 21, 2025 | Click-Through Rate PredictionContrastive Learning | CodeCode Available | 0 |
| Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference | Feb 21, 2025 | GPU | —Unverified | 0 |
| KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation | Feb 21, 2025 | Audio GenerationFAD | CodeCode Available | 2 |
| Towards Efficient Automatic Self-Pruning of Large Language Models | Feb 20, 2025 | GPU | —Unverified | 0 |
| Dynamic Low-Rank Sparse Adaptation for Large Language Models | Feb 20, 2025 | CPUGPU | CodeCode Available | 1 |
| Distributed U-net model and Image Segmentation for Lung Cancer Detection | Feb 20, 2025 | CPUFederated Learning | —Unverified | 0 |
| Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective | Feb 20, 2025 | CPUGPU | —Unverified | 0 |
| TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Feb 20, 2025 | BenchmarkingCode Generation | CodeCode Available | 2 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Feb 20, 2025 | GPULanguage Modeling | CodeCode Available | 0 |
| Multiscale Byte Language Models -- A Hierarchical Architecture for Causal Million-Length Sequence Modeling | Feb 20, 2025 | DecoderGPU | CodeCode Available | 0 |
| Building reliable sim driving agents by scaling self-play | Feb 20, 2025 | Autonomous VehiclesBenchmarking | CodeCode Available | 4 |
| ParallelComp: Parallel Long-Context Compressor for Length Extrapolation | Feb 20, 2025 | 4k8k | —Unverified | 0 |
| Learning conformational ensembles of proteins based on backbone geometry | Feb 19, 2025 | GPU | —Unverified | 0 |
| FairKV: Balancing Per-Head KV Cache for Fast Multi-GPU Inference | Feb 19, 2025 | GPU | —Unverified | 0 |
| Slamming: Training a Speech Language Model on One GPU in a Day | Feb 19, 2025 | GPULanguage Modeling | CodeCode Available | 3 |
| RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression | Feb 19, 2025 | GPU | —Unverified | 0 |
| LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation | Feb 19, 2025 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin | Feb 19, 2025 | GPULogical Reasoning | —Unverified | 0 |
| Astra: Efficient and Money-saving Automatic Parallel Strategies Search on Heterogeneous GPUs | Feb 19, 2025 | GPU | —Unverified | 0 |