SOTAVerified

Collaborative Inference

In collaborative inference, a single inference task is performed by multiple models distributed across two or more devices, typically resource-constrained IoT devices.
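As a rough illustration of the definition above, the sketch below splits a tiny two-layer network between a device and a server, which is one common form of collaborative inference. Everything in it is an assumption made for illustration: the function names (`device_head`, `server_tail`, `collaborative_infer`), the weights, and the use of JSON as a stand-in for the network link. It is not taken from any of the listed papers.

```python
import json

def device_head(x, w1):
    """Hypothetical on-device sub-model: one linear layer followed by ReLU."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in w1]

def server_tail(h, w2):
    """Hypothetical server-side sub-model: the final linear layer."""
    return [sum(w * hi for w, hi in zip(row, h)) for row in w2]

def collaborative_infer(x, w1, w2):
    """Run one inference task across the two sub-models."""
    features = device_head(x, w1)      # computed on the device
    payload = json.dumps(features)     # stands in for sending features over a link
    received = json.loads(payload)     # decoded on the server
    return server_tail(received, w2)   # computed on the server

# Illustrative weights only; a real system would load trained parameters.
W1 = [[1.0, -1.0], [0.5, 0.5]]
W2 = [[1.0, 2.0]]
result = collaborative_infer([2.0, 1.0], W1, W2)
print(result)
```

Only the intermediate features cross the link, so the raw input stays on the device and the server never needs the first layer's weights.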

Papers

Showing 1–50 of 68 papers

| Title | Status | Hype |
| --- | --- | --- |
| Petals: Collaborative Inference and Fine-tuning of Large Models | Code | 6 |
| GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist Collaboration | Code | 2 |
| MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference | Code | 1 |
| Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Learning | Code | 1 |
| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Code | 1 |
| Attention-aware Semantic Communications for Collaborative Inference | Code | 1 |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Code | 1 |
| GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding | Code | 1 |
| SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning | Code | 1 |
| LiveMind: Low-latency Large Language Models with Simultaneous Inference | Code | 1 |
| Distributed Mixture-of-Agents for Edge Inference with Large Language Models | Code | 0 |
| Dual Attention Networks for Multimodal Reasoning and Matching | Code | 0 |
| Private Collaborative Edge Inference via Over-the-Air Computation | Code | 0 |
| Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems | Code | 0 |
| DuetFace: Collaborative Privacy-Preserving Face Recognition via Channel Splitting in the Frequency Domain | Code | 0 |
| PhyloVAE: Unsupervised Learning of Phylogenetic Trees via Variational Autoencoders | Code | 0 |
| PRICURE: Privacy-Preserving Collaborative Inference in a Multi-Party Setting | Code | 0 |
| Architectural Vision for Quantum Computing in the Edge-Cloud Continuum | Code | 0 |
| CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration | Code | 0 |
| Decentralized Low-Latency Collaborative Inference via Ensembles on the Edge | Code | 0 |
| EARLIN: Early Out-of-Distribution Detection for Resource-efficient Collaborative Inference | | 0 |
| Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing | | 0 |
| Edge-device Collaborative Computing for Multi-view Classification | | 0 |
| Towards Efficient Object Re-Identification with A Novel Cloud-Edge Collaborative Framework | | 0 |
| Emergency Computing: An Adaptive Collaborative Inference Method Based on Hierarchical Reinforcement Learning | | 0 |
| Energy-Efficient Model Compression and Splitting for Collaborative Inference Over Time-Varying Channels | | 0 |
| Energy-efficient Wearable-to-Mobile Offload of ML Inference for PPG-based Heart-Rate Estimation | | 0 |
| Ensembler: Protect Collaborative Inference Privacy from Model Inversion Attack via Selective Ensemble | | 0 |
| Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework | | 0 |
| Federated Learning for Collaborative Inference Systems: The Case of Early Exit Networks | | 0 |
| Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference | | 0 |
| G-Boost: Boosting Private SLMs with General LLMs | | 0 |
| Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | | 0 |
| Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices | | 0 |
| Limited Communications Distributed Optimization via Deep Unfolded Distributed ADMM | | 0 |
| MoE^2: Optimizing Collaborative Inference for Edge Large Language Models | | 0 |
| PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks | | 0 |
| PrivaScissors: Enhance the Privacy of Collaborative Inference through the Lens of Mutual Information | | 0 |
| Robust Multimodal Graph Matching: Sparse Coding Meets Graph Matching | | 0 |
| Roulette: A Semantic Privacy-Preserving Device-Edge Collaborative Inference Framework for Deep Learning Classification Tasks | | 0 |
| Security Analysis of Capsule Network Inference using Horizontal Collaboration | | 0 |
| Semantics-Driven Cloud-Edge Collaborative Inference | | 0 |
| Smaller, Smarter, Closer: The Edge of Collaborative Generative AI | | 0 |
| Speculative End-Turn Detector for Efficient Speech Chatbot Assistant | | 0 |
| SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud | | 0 |
| SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization | | 0 |
| Towards Intelligent Edge Sensing for ISCC Network: Joint Multi-Tier DNN Partitioning and Beamforming Design | | 0 |
| Towards Receiver-Agnostic and Collaborative Radio Frequency Fingerprint Identification | | 0 |
| Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks | | 0 |
| Adaptive Early Exiting for Collaborative Inference over Noisy Wireless Channels | | 0 |

No leaderboard results yet.