SOTAVerified

Collaborative Inference

In collaborative inference, a single inference task is performed by multiple models distributed on two or more (typically resource-constrained IoT) devices

Papers

Showing 150 of 68 papers

TitleStatusHype
Petals: Collaborative Inference and Fine-tuning of Large ModelsCode6
GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist CollaborationCode2
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance GroundingCode1
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary DetectionCode1
LiveMind: Low-latency Large Language Models with Simultaneous InferenceCode1
Attention-aware Semantic Communications for Collaborative InferenceCode1
MLink: Linking Black-Box Models from Multiple Domains for Collaborative InferenceCode1
Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge LearningCode1
SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation LearningCode1
Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework0
Smaller, Smarter, Closer: The Edge of Collaborative Generative AI0
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition0
Towards Intelligent Edge Sensing for ISCC Network: Joint Multi-Tier DNN Partitioning and Beamforming Design0
Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices0
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant0
G-Boost: Boosting Private SLMs with General LLMs0
Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference SystemsCode0
PhyloVAE: Unsupervised Learning of Phylogenetic Trees via Variational AutoencodersCode0
DistrEE: Distributed Early Exit of Deep Neural Network Inference on Edge Devices0
MoE^2: Optimizing Collaborative Inference for Edge Large Language Models0
Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation0
Distributed Mixture-of-Agents for Edge Inference with Large Language ModelsCode0
Distributed Collaborative Inference System in Next-Generation Networks and Communication0
CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge CollaborationCode0
Collaborative Inference over Wireless Channels with Feature Differential Privacy0
SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization0
Edge-device Collaborative Computing for Multi-view Classification0
Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models0
Private Collaborative Edge Inference via Over-the-Air ComputationCode0
Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference0
Federated Learning for Collaborative Inference Systems: The Case of Early Exit Networks0
Emergency Computing: An Adaptive Collaborative Inference Method Based on Hierarchical Reinforcement Learning0
Ensembler: Protect Collaborative Inference Privacy from Model Inversion Attack via Selective Ensemble0
Towards Efficient Object Re-Identification with A Novel Cloud-Edge Collaborative Framework0
Synergy: Towards On-Body AI via Tiny AI Accelerator Collaboration on Wearables0
Adaptive Early Exiting for Collaborative Inference over Noisy Wireless Channels0
Semantics-Driven Cloud-Edge Collaborative Inference0
DeepAdaIn-Net: Deep Adaptive Device-Edge Collaborative Inference for Augmented Reality0
Limited Communications Distributed Optimization via Deep Unfolded Distributed ADMM0
DeViT: Decomposing Vision Transformers for Collaborative Inference in Edge Devices0
Roulette: A Semantic Privacy-Preserving Device-Edge Collaborative Inference Framework for Deep Learning Classification Tasks0
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks0
Energy-efficient Wearable-to-Mobile Offload of ML Inference for PPG-based Heart-Rate Estimation0
DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference0
PrivaScissors: Enhance the Privacy of Collaborative Inference through the Lens of Mutual Information0
Architectural Vision for Quantum Computing in the Edge-Cloud ContinuumCode0
Deep Reinforcement Learning for Trajectory Path Planning and Distributed Inference in Resource-Constrained UAV Swarms0
Collaborative Inference for AI-Empowered IoT Devices0
DuetFace: Collaborative Privacy-Preserving Face Recognition via Channel Splitting in the Frequency DomainCode0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.