Information Fusion
Decentralized Federated Learning with Multimodal Prototypes for Heterogeneous Data
Decentralized Federated Learning (DFL) enables collaborative machine learning across numerous devices while avoiding bottlenecks and reliance on a single trusted entity inherent to centralized architectures. However, its...
Heterogeneous Clients
Clients possess non-IID local class distributions and incomplete modalities (some have only image & audio, others text & audio, etc.).
Optimization Losses
Key Empirical Results
Heterogeneous Clients
Local Encoding & Imputation
Prototype Construction
P2P Prototype Exchange
Knowledge Fusion & Loss
Local Model Update
Completes the technical progression toward heterogeneous multimodal DFL: clients exchange compact modality-aware prototypes instead of complete model parameters, linking missing-modality handling, representation alignment, modality weighting and communication efficiency.
+4.0%
relative F1 gain
High heterogeneity over the next-best method
82.1%
F1 image-only
AVMNIST unimodal clients
0.11 MB
client payload
Average message size per round
>40x
cost reduction
Compared with full-model exchange baselines
Key Scientific Contributions
- Decentralised Multimodal DFL: Studies decentralized collaboration when clients have non-IID class distributions and incomplete modality availability.
- Prototype-Centric Protocol: Exchanges compact, modality-aware class prototypes instead of complete model parameters or full updates.
- Missing Modality Robustness: Combines contextual null embeddings, adaptive gating, multimodal fusion and representation-alignment objectives.
Major Conclusions
- Prototype exchange reported approximately 0.11 MB per client message, more than 40x smaller than communication-heavy full-model exchange baselines.
- The evaluated mechanisms support collaboration under missing modalities, non-IID data and controlled heterogeneous configurations.
- The conclusions remain tied to the documented datasets, modality configurations, baselines and experimental assumptions.
Empirical Results (AVMNIST)
| Method | F1 score | Uplink cost / round |
|---|---|---|
| FedAvg | 69.5% | ~4.75 MB |
| FedProto | 74.8% | ~0.03 MB |
| Modalis | 83.4% | ~0.11 MB |
Methodology phases
Encode
Local multimodal embeddings and contextual null embeddings
Prototype
Compact class prototypes exchanged across neighbors
Fuse
Adaptive modality weighting and representation alignment
Abstract
Decentralized Federated Learning (DFL) enables collaborative machine learning across numerous devices while avoiding bottlenecks and reliance on a single trusted entity inherent to centralized architectures. However, its practical application is challenged by modern scenarios where data is increasingly multimodal. The key obstacles in such settings are severe data heterogeneity, characterized by non-Independent and Identically Distributed (non-IID) class distributions, and incomplete data, where modalities are often missing across clients. Existing solutions struggle with these challenges, either incurring high communication costs or lacking effective mechanisms for fusing partial information. To overcome these limitations, this work introduces Modalis, a novel framework for multimodal DFL that achieves superior model performance under data heterogeneity while minimizing network consumption. It pioneers a communication-efficient, prototype-centric protocol in which clients exchange compact, modality-aware class representations rather than high-dimensional model parameters. This process is guided by a multi-objective loss function enforcing inter-modality coherence and representation alignment for effective knowledge fusion. The framework integrates sophisticated architectural innovations, including contextual null embeddings for intelligent data imputation and robust multimodal fusion using adaptive gating and multi-way transformers. The approach is validated through theoretical analysis, providing formal convergence guarantees, and extensive experiments on standard multimodal benchmarks. These results demonstrate that Modalis achieves superior performance, improving F1 scores by up to 4% under high heterogeneity and reducing communication costs by over 40 times compared to state-of-the-art baselines, establishing it as a highly effective solution for collaborative AI.
Authors
Keywords
Related publications
Works with stronger overlap in topic, type, and tags.
Expert Systems with Applications
Fedstellar: A Platform for Decentralized Federated Learning
Enrique Tomás Martínez Beltrán, Ángel Luis Perales Gómez, Chao Feng, Pedro Miguel Sánchez Sánchez, Sergio López Bernal, Gérôme Bovet, Manuel Gil Pérez, Gregorio Martínez Pérez, Alberto Huertas Celdrán
In 2016, Google proposed Federated Learning (FL) as a novel paradigm to train Machine Learning (ML) models across the participants of a federation while preserving data privacy. Since its birth, Centralized FL (CFL) has...
Computer Networks
Asynchronous Cache-based Aggregation with Fairness and Filtering for Decentralized Federated Learning
Enrique Tomás Martínez Beltrán, Eduard Gash, Gérôme Bovet, Alberto Huertas Celdrán, Burkhard Stiller
Decentralized Federated Learning (DFL) offers a scalable paradigm for collaborative intelligence at the edge, yet its practical efficacy is severely constrained by system heterogeneity. Traditional synchronous protocols...
Future Generation Computer Systems
FedEnD: Communication-efficient Federated Learning for non-IID data via decentralized ensemble distillation
Enrique Tomás Martínez Beltrán, Philip Giryes, Gérôme Bovet, Burkhard Stiller, Gregorio Martínez Pérez, Alberto Huertas Celdrán
Federated Learning (FL) offers a paradigm for collaborative AI that mitigates raw data exposure, yet the statistical heterogeneity of client data severely constrains its practical application. This non-independent and id...
Related Research

Apr 2023 — Nov 2023
DEFENDIS: Decentralized Federated Learning for IoT Device Identification and Security
DEFENDIS develops a framework for uniquely identifying IoT devices in a distributed manner while solving security threats through decentralized federated learning.

Nov 2025 — Jan 2026
Cybersecurity and Distributed Federated Learning
An international research exchange program investigating threat mitigation models for peer-to-peer federated learning.

Jan 2024 — Dec 2024
DATRIS: Decentralized AI for Trustworthy and Resource-efficient Intelligent Systems
Developed decentralized AI solutions addressing computational constraints and trust validation under armasuisse Cyber-Defence Campus sponsorship.