SOTAVerified

CPU

Papers

Showing 51100 of 2231 papers

TitleStatusHype
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts ModelsCode3
Unlimiformer: Long-Range Transformers with Unlimited Length InputCode3
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden IntermediatesCode3
Take the aTrain. Introducing an Interface for the Accessible Transcription of InterviewsCode3
A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation ModelsCode3
SHViT: Single-Head Vision Transformer with Memory Efficient Macro DesignCode2
SCNet: Sparse Compression Network for Music Source SeparationCode2
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural NetworksCode2
SFSORT: Scene Features-based Simple Online Real-Time TrackerCode2
Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular StructuresCode2
Delivering Document Conversion as a Cloud Service with High Throughput and ResponsivenessCode2
Real-time and Continuous Turn-taking Prediction Using Voice Activity ProjectionCode2
Real Time Speech Enhancement in the Waveform DomainCode2
Rawsamble: Overlapping and Assembling Raw Nanopore Signals using a Hash-based Seeding MechanismCode2
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector RetrievalCode2
Datasets and Benchmarks for Offline Safe Reinforcement LearningCode2
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-DesignCode2
Deep Differentiable Logic Gate NetworksCode2
QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust ControlCode2
Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload AwarenessCode2
Cross-domain Neural Pitch and Periodicity EstimationCode2
On Efficient Reinforcement Learning for Full-length Game of StarCraft IICode2
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-TrainingCode2
RAVE: A variational autoencoder for fast and high-quality neural audio synthesisCode2
NAVIX: Scaling MiniGrid Environments with JAXCode2
MixFormerV2: Efficient Fully Transformer TrackingCode2
Low-latency Real-time Voice Conversion on CPUCode2
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier TransformCode2
MathOptAI.jl: Embed trained machine learning predictors into JuMP modelsCode2
Neural Network Compression Framework for fast model inferenceCode2
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot LearningCode2
Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless ObjectsCode2
Breaking of brightness consistency in optical flow with a lightweight CNN networkCode2
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUsCode2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAXCode2
ImMesh: An Immediate LiDAR Localization and Meshing FrameworkCode2
AdFlush: A Real-World Deployable Machine Learning Solution for Effective Advertisement and Web Tracker PreventionCode2
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level SynthesisCode2
BMInf: An Efficient Toolkit for Big Model Inference and TuningCode2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE InferenceCode2
JaxUED: A simple and useable UED library in JaxCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate ControlCode2
Musika! Fast Infinite Waveform Music GenerationCode2
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech SynthesisCode2
Godot Reinforcement Learning AgentsCode2
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and MemoryCode2
Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage ClusteringCode2
AudioDec: An Open-source Streaming High-fidelity Neural Audio CodecCode2
A Tensor Compiler for Unified Machine Learning Prediction ServingCode2
Show:102550
← PrevPage 2 of 45Next →

No leaderboard results yet.