SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
GPU
GPU
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 931–940 of 5629 papers
Title
Date
Tasks
Status
Hype
CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation
Oct 23, 2024
GPU
Language Modeling
—
Unverified
0
Trajectory Optimization for Spatial Microstructure Control in Electron Beam Metal Additive Manufacturing
Oct 23, 2024
GPU
—
Unverified
0
Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs
Oct 23, 2024
GPU
Scheduling
—
Unverified
0
POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inference
Oct 23, 2024
GPU
Code
Code Available
0
ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference
Oct 23, 2024
Computational Efficiency
CPU
—
Unverified
0
AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower Cost
Oct 22, 2024
CPU
GPU
—
Unverified
0
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss
Oct 22, 2024
GPU
Representation Learning
Code
Code Available
3
Semantic-guided Search for Efficient Program Repair with Large Language Models
Oct 22, 2024
GPU
HumanEval
—
Unverified
0
FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs
Oct 22, 2024
CPU
GPU
—
Unverified
0
Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication Scheduling
Oct 22, 2024
All
GPU
—
Unverified
0
Show:
10
25
50
← Prev
Page 94 of 563
Next →
No leaderboard results yet.