SOTAVerified

CPU

Papers

Showing 51100 of 2231 papers

TitleStatusHype
A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation ModelsCode3
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object TrackingCode3
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden IntermediatesCode3
SoundStream: An End-to-End Neural Audio CodecCode3
Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded ModesCode3
MathOptAI.jl: Embed trained machine learning predictors into JuMP modelsCode2
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-DesignCode2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE InferenceCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response ScenariosCode2
Very fast Bayesian Additive Regression Trees on GPUCode2
Rawsamble: Overlapping and Assembling Raw Nanopore Signals using a Hash-based Seeding MechanismCode2
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector RetrievalCode2
Super Monotonic Alignment SearchCode2
Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare ApplicationsCode2
NAVIX: Scaling MiniGrid Environments with JAXCode2
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate ControlCode2
The future of cosmological likelihood-based inference: accelerated high-dimensional parameter estimation and model comparisonCode2
AdFlush: A Real-World Deployable Machine Learning Solution for Effective Advertisement and Web Tracker PreventionCode2
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level SynthesisCode2
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-TrainingCode2
SFSORT: Scene Features-based Simple Online Real-Time TrackerCode2
Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular StructuresCode2
FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic MatchingCode2
Fast-Poly: A Fast Polyhedral Framework For 3D Multi-Object TrackingCode2
JaxUED: A simple and useable UED library in JaxCode2
SHViT: Single-Head Vision Transformer with Memory Efficient Macro DesignCode2
SCNet: Sparse Compression Network for Music Source SeparationCode2
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase PredictionCode2
Real-time and Continuous Turn-taking Prediction Using Voice Activity ProjectionCode2
XLB: A differentiable massively parallel lattice Boltzmann library in PythonCode2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAXCode2
Exponentially Faster Language ModellingCode2
Low-latency Real-time Voice Conversion on CPUCode2
Breaking of brightness consistency in optical flow with a lightweight CNN networkCode2
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUsCode2
tsdownsample: high-performance time series downsampling for scalable visualizationCode2
Datasets and Benchmarks for Offline Safe Reinforcement LearningCode2
QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust ControlCode2
AudioDec: An Open-source Streaming High-fidelity Neural Audio CodecCode2
MixFormerV2: Efficient Fully Transformer TrackingCode2
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and MemoryCode2
Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload AwarenessCode2
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural NetworksCode2
Cross-domain Neural Pitch and Periodicity EstimationCode2
ImMesh: An Immediate LiDAR Localization and Meshing FrameworkCode2
evosax: JAX-based Evolution StrategiesCode2
TorchOpt: An Efficient Library for Differentiable OptimizationCode2
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier TransformCode2
Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage ClusteringCode2
Show:102550
← PrevPage 2 of 45Next →

No leaderboard results yet.