Journal article2026

Information Fusion

Decentralized Federated Learning with Multimodal Prototypes for Heterogeneous Data

Decentralized Federated LearningMultimodal DataHeterogeneous DataNon-IID DataIncomplete DataPrototype-Centric CommunicationMulti-Objective LossContextual Null EmbeddingsMultimodal FusionConvergence Analysis

Scientific overview

Live • Tick 0

Modalis Lens • Step 1 of 6

Heterogeneous Clients

Clients possess non-IID local class distributions and incomplete modalities (some have only image & audio, others text & audio, etc.).

Optimization Losses

Federated Alignment LossL_FAL

Discriminative Contrastive LossL_DCL

Modality Coherence LossL_MCL

Prototype RegularizationL_PR

Key Empirical Results

F1 Score GainRelative gain over the next-best competing method under high heterogeneity.

+4.0%

Communication SavingsReported reduction by sharing compact class prototypes instead of full model parameters.

>40x

Missing ModalitiesEvaluated setting where 50% of modalities may be missing.

pm=0.5

Heterogeneous Clients
Local Encoding & Imputation
Prototype Construction
P2P Prototype Exchange
Knowledge Fusion & Loss
Local Model Update

Scientific overview

RQ3 · RQ4

Completes the technical progression toward heterogeneous multimodal DFL: clients exchange compact modality-aware prototypes instead of complete model parameters, linking missing-modality handling, representation alignment, modality weighting and communication efficiency.

View animation

+4.0%

relative F1 gain

High heterogeneity over the next-best method

82.1%

F1 image-only

AVMNIST unimodal clients

0.11 MB

client payload

Average message size per round

>40x

cost reduction

Compared with full-model exchange baselines

Key Scientific Contributions

Decentralised Multimodal DFL: Studies decentralized collaboration when clients have non-IID class distributions and incomplete modality availability.
Prototype-Centric Protocol: Exchanges compact, modality-aware class prototypes instead of complete model parameters or full updates.
Missing Modality Robustness: Combines contextual null embeddings, adaptive gating, multimodal fusion and representation-alignment objectives.

Major Conclusions

Prototype exchange reported approximately 0.11 MB per client message, more than 40x smaller than communication-heavy full-model exchange baselines.
The evaluated mechanisms support collaboration under missing modalities, non-IID data and controlled heterogeneous configurations.
The conclusions remain tied to the documented datasets, modality configurations, baselines and experimental assumptions.

Empirical Results (AVMNIST)

Method	F1 score	Uplink cost / round
FedAvg	69.5%	~4.75 MB
FedProto	74.8%	~0.03 MB
Modalis	83.4%	~0.11 MB

Methodology phases

Encode

Local multimodal embeddings and contextual null embeddings

Prototype

Compact class prototypes exchanged across neighbors

Fuse

Adaptive modality weighting and representation alignment

Abstract

Decentralized Federated Learning (DFL) enables collaborative machine learning across numerous devices while avoiding bottlenecks and reliance on a single trusted entity inherent to centralized architectures. However, its practical application is challenged by modern scenarios where data is increasingly multimodal. The key obstacles in such settings are severe data heterogeneity, characterized by non-Independent and Identically Distributed (non-IID) class distributions, and incomplete data, where modalities are often missing across clients. Existing solutions struggle with these challenges, either incurring high communication costs or lacking effective mechanisms for fusing partial information. To overcome these limitations, this work introduces Modalis, a novel framework for multimodal DFL that achieves superior model performance under data heterogeneity while minimizing network consumption. It pioneers a communication-efficient, prototype-centric protocol in which clients exchange compact, modality-aware class representations rather than high-dimensional model parameters. This process is guided by a multi-objective loss function enforcing inter-modality coherence and representation alignment for effective knowledge fusion. The framework integrates sophisticated architectural innovations, including contextual null embeddings for intelligent data imputation and robust multimodal fusion using adaptive gating and multi-way transformers. The approach is validated through theoretical analysis, providing formal convergence guarantees, and extensive experiments on standard multimodal benchmarks. These results demonstrate that Modalis achieves superior performance, improving F1 scores by up to 4% under high heterogeneity and reducing communication costs by over 40 times compared to state-of-the-art baselines, establishing it as a highly effective solution for collaborative AI.

Authors

Enrique Tomás Martínez BeltránGérôme BovetGregorio Martínez PérezAlberto Huertas Celdrán

Keywords

Related publications

Works with stronger overlap in topic, type, and tags.

Journal article2024

Expert Systems with Applications

Fedstellar: A Platform for Decentralized Federated Learning

Enrique Tomás Martínez Beltrán, Ángel Luis Perales Gómez, Chao Feng, Pedro Miguel Sánchez Sánchez, Sergio López Bernal, Gérôme Bovet, Manuel Gil Pérez, Gregorio Martínez Pérez, Alberto Huertas Celdrán

In 2016, Google proposed Federated Learning (FL) as a novel paradigm to train Machine Learning (ML) models across the participants of a federation while preserving data privacy. Since its birth, Centralized FL (CFL) has...

Publisher Page DOI

Journal article2026

Computer Networks

Asynchronous Cache-based Aggregation with Fairness and Filtering for Decentralized Federated Learning

Enrique Tomás Martínez Beltrán, Eduard Gash, Gérôme Bovet, Alberto Huertas Celdrán, Burkhard Stiller

Decentralized Federated Learning (DFL) offers a scalable paradigm for collaborative intelligence at the edge, yet its practical efficacy is severely constrained by system heterogeneity. Traditional synchronous protocols...

Publisher Page DOI

Journal article2026

Future Generation Computer Systems

FedEnD: Communication-efficient Federated Learning for non-IID data via decentralized ensemble distillation

Enrique Tomás Martínez Beltrán, Philip Giryes, Gérôme Bovet, Burkhard Stiller, Gregorio Martínez Pérez, Alberto Huertas Celdrán

Federated Learning (FL) offers a paradigm for collaborative AI that mitigates raw data exposure, yet the statistical heterogeneity of client data severely constrains its practical application. This non-independent and id...

Publisher Page DOI

Related Research

Apr 2023 — Nov 2023

Method

F1 score

Uplink cost / round

FedAvg

69.5%

~4.75 MB

FedProto

74.8%

~0.03 MB

Modalis

83.4%

~0.11 MB

Decentralized Federated Learning with Multimodal Prototypes for Heterogeneous Data

Heterogeneous Clients

Key Scientific Contributions

Major Conclusions

Empirical Results (AVMNIST)

Methodology phases

Abstract

Authors

Keywords

Related publications

Fedstellar: A Platform for Decentralized Federated Learning

Asynchronous Cache-based Aggregation with Fairness and Filtering for Decentralized Federated Learning

FedEnD: Communication-efficient Federated Learning for non-IID data via decentralized ensemble distillation

Related Research

DEFENDIS: Decentralized Federated Learning for IoT Device Identification and Security

Cybersecurity and Distributed Federated Learning

DATRIS: Decentralized AI for Trustworthy and Resource-efficient Intelligent Systems

Decentralized Federated Learning with Multimodal Prototypes for Heterogeneous Data

Heterogeneous Clients

Key Scientific Contributions

Major Conclusions

Empirical Results (AVMNIST)

Methodology phases

Abstract

Authors

Keywords

Related publications

Fedstellar: A Platform for Decentralized Federated Learning

Asynchronous Cache-based Aggregation with Fairness and Filtering for Decentralized Federated Learning

FedEnD: Communication-efficient Federated Learning for non-IID data via decentralized ensemble distillation

Related Research

DEFENDIS: Decentralized Federated Learning for IoT Device Identification and Security

Cybersecurity and Distributed Federated Learning

DATRIS: Decentralized AI for Trustworthy and Resource-efficient Intelligent Systems