Information Fusion
Decentralized Federated Learning with Multimodal Prototypes for Heterogeneous Data
Decentralized Federated Learning (DFL) enables collaborative machine learning across numerous devices while avoiding bottlenecks and reliance on a single trusted entity inherent to centralized architectures. However, its...
Heterogeneous Clients
Clients possess non-IID local class distributions and incomplete modalities (some have only image & audio, others text & audio, etc.).
Optimization Losses
Key Empirical Results
Heterogeneous Clients
Local Encoding & Imputation
Prototype Construction
P2P Prototype Exchange
Knowledge Fusion & Loss
Local Model Update
Generalizes the technical progression toward heterogeneous multimodal DFL: clients exchange compact modality-aware prototypes instead of complete model payloads, enabling collaboration with non-IID data and missing modalities.
+4.0%
relative F1 gain
High heterogeneity over the next-best method
82.1%
F1 image-only
AVMNIST unimodal clients
0.11 MB
client payload
Average message size per round
>40x
cost reduction
Compared with full-model exchange baselines
Key Scientific Contributions
- Decentralised Multimodal DFL: Enables serverless collaborative training on heterogeneous modalities.
- Prototype-Centric Protocol: Transmits compact class prototypes, reducing network overhead by 40×.
- Missing Modality Robustness: Contextual Null Embeddings (CNE) impute missing sensory signals.
Major Conclusions
- Prototype exchange reduces per-round uplink cost from ~4.75 MB (FedAvg) to ~0.11 MB, enabling bandwidth-constrained deployments.
- CNE combined with masked multi-way transformers maintains strong F1 performance even under 50% missing-modality rates.
- Modalis achieves 83.4% F1, outperforming FedAvg (69.5%) and FedProto (74.8%) under high heterogeneity on AVMNIST.
Empirical Results (AVMNIST)
| Method | F1 score | Uplink cost / round |
|---|---|---|
| FedAvg | 69.5% | ~4.75 MB |
| FedProto | 74.8% | ~0.03 MB |
| Modalis (ours) | 83.4% | ~0.11 MB |
Methodology phases
Encode
Local multimodal embeddings and contextual null embeddings
Prototype
Compact class prototypes exchanged across neighbors
Fuse
Adaptive modality weighting and representation alignment
Abstract
Decentralized Federated Learning (DFL) enables collaborative machine learning across numerous devices while avoiding bottlenecks and reliance on a single trusted entity inherent to centralized architectures. However, its practical application is challenged by modern scenarios where data is increasingly multimodal. The key obstacles in such settings are severe data heterogeneity, characterized by non-Independent and Identically Distributed (non-IID) class distributions, and incomplete data, where modalities are often missing across clients. Existing solutions struggle with these challenges, either incurring high communication costs or lacking effective mechanisms for fusing partial information. To overcome these limitations, this work introduces Modalis, a novel framework for multimodal DFL that achieves superior model performance under data heterogeneity while minimizing network consumption. It pioneers a communication-efficient, prototype-centric protocol in which clients exchange compact, modality-aware class representations rather than high-dimensional model parameters. This process is guided by a multi-objective loss function enforcing inter-modality coherence and representation alignment for effective knowledge fusion. The framework integrates sophisticated architectural innovations, including contextual null embeddings for intelligent data imputation and robust multimodal fusion using adaptive gating and multi-way transformers. The approach is validated through theoretical analysis, providing formal convergence guarantees, and extensive experiments on standard multimodal benchmarks. These results demonstrate that Modalis achieves superior performance, improving F1 scores by up to 4% under high heterogeneity and reducing communication costs by over 40 times compared to state-of-the-art baselines, establishing it as a highly effective solution for collaborative AI.
Authors
Keywords
Related publications
Works with stronger overlap in topic, type, and tags.
Expert Systems with Applications
Fedstellar: A Platform for Decentralized Federated Learning
Enrique Tomás Martínez Beltrán, Ángel Luis Perales Gómez, Chao Feng, Pedro Miguel Sánchez Sánchez, Sergio López Bernal, Gérôme Bovet, Manuel Gil Pérez, Gregorio Martínez Pérez, Alberto Huertas Celdrán
In 2016, Google proposed Federated Learning (FL) as a novel paradigm to train Machine Learning (ML) models across the participants of a federation while preserving data privacy. Since its birth, Centralized FL (CFL) has...
Computer Networks
Asynchronous Cache-based Aggregation with Fairness and Filtering for Decentralized Federated Learning
Enrique Tomás Martínez Beltrán, Eduard Gash, Gérôme Bovet, Alberto Huertas Celdrán, Burkhard Stiller
Decentralized Federated Learning (DFL) offers a scalable paradigm for collaborative intelligence at the edge, yet its practical efficacy is severely constrained by system heterogeneity. Traditional synchronous protocols...
Future Generation Computer Systems
FedEnD: Communication-efficient Federated Learning for non-IID data via decentralized ensemble distillation
Enrique Tomás Martínez Beltrán, Philip Giryes, Gérôme Bovet, Burkhard Stiller, Gregorio Martínez Pérez, Alberto Huertas Celdrán
Federated Learning (FL) offers a paradigm for collaborative AI that mitigates raw data exposure, yet the statistical heterogeneity of client data severely constrains its practical application. This non-independent and id...
Related Research

Apr 2023 — Nov 2023
DEFENDIS: Decentralized Federated Learning for IoT Device Identification and Security
DEFENDIS develops a framework for uniquely identifying IoT devices in a distributed manner while solving security threats through decentralized federated learning.

Nov 2025 — Jan 2026
Cybersecurity and Distributed Federated Learning
An international research exchange program investigating threat mitigation models for peer-to-peer federated learning.

Jan 2024 — Dec 2024
DATRIS: Decentralized AI for Trustworthy and Resource-efficient Intelligent Systems
Developed decentralized AI solutions addressing computational constraints and trust validation under armasuisse Cyber-Defence Campus sponsorship.