SOTAVerified

Collaborative Inference

In collaborative inference, a single inference task is performed by multiple models distributed on two or more (typically resource-constrained IoT) devices

Papers

Showing 150 of 68 papers

TitleStatusHype
Petals: Collaborative Inference and Fine-tuning of Large ModelsCode6
GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist CollaborationCode2
Attention-aware Semantic Communications for Collaborative InferenceCode1
LiveMind: Low-latency Large Language Models with Simultaneous InferenceCode1
SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation LearningCode1
MLink: Linking Black-Box Models from Multiple Domains for Collaborative InferenceCode1
Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge LearningCode1
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance GroundingCode1
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary DetectionCode1
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
Private Collaborative Edge Inference via Over-the-Air ComputationCode0
PhyloVAE: Unsupervised Learning of Phylogenetic Trees via Variational AutoencodersCode0
PRICURE: Privacy-Preserving Collaborative Inference in a Multi-Party SettingCode0
Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference SystemsCode0
Decentralized Low-Latency Collaborative Inference via Ensembles on the EdgeCode0
Distributed Mixture-of-Agents for Edge Inference with Large Language ModelsCode0
Dual Attention Networks for Multimodal Reasoning and MatchingCode0
Architectural Vision for Quantum Computing in the Edge-Cloud ContinuumCode0
CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge CollaborationCode0
DuetFace: Collaborative Privacy-Preserving Face Recognition via Channel Splitting in the Frequency Domain0
DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference0
EARLIN: Early Out-of-Distribution Detection for Resource-efficient Collaborative Inference0
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing0
Edge-device Collaborative Computing for Multi-view Classification0
Towards Efficient Object Re-Identification with A Novel Cloud-Edge Collaborative Framework0
Emergency Computing: An Adaptive Collaborative Inference Method Based on Hierarchical Reinforcement Learning0
Energy-Efficient Model Compression and Splitting for Collaborative Inference Over Time-Varying Channels0
Energy-efficient Wearable-to-Mobile Offload of ML Inference for PPG-based Heart-Rate Estimation0
Ensembler: Protect Collaborative Inference Privacy from Model Inversion Attack via Selective Ensemble0
Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework0
Federated Learning for Collaborative Inference Systems: The Case of Early Exit Networks0
Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference0
G-Boost: Boosting Private SLMs with General LLMs0
Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models0
Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices0
Limited Communications Distributed Optimization via Deep Unfolded Distributed ADMM0
MoE^2: Optimizing Collaborative Inference for Edge Large Language Models0
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks0
PrivaScissors: Enhance the Privacy of Collaborative Inference through the Lens of Mutual Information0
Robust Multimodal Graph Matching: Sparse Coding Meets Graph Matching0
Roulette: A Semantic Privacy-Preserving Device-Edge Collaborative Inference Framework for Deep Learning Classification Tasks0
Security Analysis of Capsule Network Inference using Horizontal Collaboration0
Semantics-Driven Cloud-Edge Collaborative Inference0
Smaller, Smarter, Closer: The Edge of Collaborative Generative AI0
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant0
SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud0
SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization0
Towards Intelligent Edge Sensing for ISCC Network: Joint Multi-Tier DNN Partitioning and Beamforming Design0
Towards Receiver-Agnostic and Collaborative Radio Frequency Fingerprint Identification0
Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.