SOTAVerified

Collaborative Inference

In collaborative inference, a single inference task is performed by multiple models distributed across two or more devices, typically resource-constrained IoT devices.

Papers

Showing 1-50 of 68 papers

Title | Status | Hype
Petals: Collaborative Inference and Fine-tuning of Large Models | Code | 6
GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist Collaboration | Code | 2
LiveMind: Low-latency Large Language Models with Simultaneous Inference | Code | 1
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding | Code | 1
Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Learning | Code | 1
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Code | 1
Attention-aware Semantic Communications for Collaborative Inference | Code | 1
SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning | Code | 1
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Code | 1
MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference | Code | 1
Collaborative Inference over Wireless Channels with Feature Differential Privacy | - | 0
Synergy: Towards On-Body AI via Tiny AI Accelerator Collaboration on Wearables | - | 0
Communication-Efficient Split Learning Based on Analog Communication and Over the Air Aggregation | - | 0
DeepAdaIn-Net: Deep Adaptive Device-Edge Collaborative Inference for Augmented Reality | - | 0
Deep Reinforcement Learning for Trajectory Path Planning and Distributed Inference in Resource-Constrained UAV Swarms | - | 0
DeViT: Decomposing Vision Transformers for Collaborative Inference in Edge Devices | - | 0
Dictionary Learning over Distributed Models | - | 0
DistrEE: Distributed Early Exit of Deep Neural Network Inference on Edge Devices | - | 0
Distributed Collaborative Inference System in Next-Generation Networks and Communication | - | 0
DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference | - | 0
EARLIN: Early Out-of-Distribution Detection for Resource-efficient Collaborative Inference | - | 0
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing | - | 0
Edge-device Collaborative Computing for Multi-view Classification | - | 0
Towards Efficient Object Re-Identification with A Novel Cloud-Edge Collaborative Framework | - | 0
Emergency Computing: An Adaptive Collaborative Inference Method Based on Hierarchical Reinforcement Learning | - | 0
Energy-Efficient Model Compression and Splitting for Collaborative Inference Over Time-Varying Channels | - | 0
Energy-efficient Wearable-to-Mobile Offload of ML Inference for PPG-based Heart-Rate Estimation | - | 0
Ensembler: Protect Collaborative Inference Privacy from Model Inversion Attack via Selective Ensemble | - | 0
Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework | - | 0
Federated Learning for Collaborative Inference Systems: The Case of Early Exit Networks | - | 0
Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference | - | 0
G-Boost: Boosting Private SLMs with General LLMs | - | 0
Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks | - | 0
Adaptive Early Exiting for Collaborative Inference over Noisy Wireless Channels | - | 0
Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework | - | 0
An Approach to Inference-Driven Dialogue Management within a Social Chatbot | - | 0
Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge | - | 0
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition | - | 0
C-NMT: A Collaborative Inference Framework for Neural Machine Translation | - | 0
Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation | - | 0
Collaborative Inference for AI-Empowered IoT Devices | - | 0
Collaborative Inference for Efficient Remote Monitoring | - | 0
Robust Multimodal Graph Matching: Sparse Coding Meets Graph Matching | - | 0
Roulette: A Semantic Privacy-Preserving Device-Edge Collaborative Inference Framework for Deep Learning Classification Tasks | - | 0
Security Analysis of Capsule Network Inference using Horizontal Collaboration | - | 0
Semantics-Driven Cloud-Edge Collaborative Inference | - | 0
Smaller, Smarter, Closer: The Edge of Collaborative Generative AI | - | 0
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant | - | 0
SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud | - | 0
SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization | - | 0

No leaderboard results yet.