skip to main content
10.1145/3580305.3599443acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open access

Navigating Alignment for Non-identical Client Class Sets: A Label Name-Anchored Federated Learning Framework

Published: 04 August 2023 Publication History

Abstract

Traditional federated classification methods, even those designed for non-IID clients, assume that each client annotates its local data with respect to the same universal class set. In this paper, we focus on a more general yet practical setting, non-identical client class sets, where clients focus on their own (different or even non-overlapping) class sets and seek a global model that works for the union of these classes. If one views classification as finding the best match between representations produced by data/label encoder, such heterogeneity in client class sets poses a new significant challenge-local encoders at different clients may operate in different and even independent latent spaces, making it hard to aggregate at the server. We propose a novel framework, FedAlign1, to align the latent spaces across clients from both label and data perspectives. From a label perspective, we leverage the expressive natural language class names as a common ground for label encoders to anchor class representations and guide the data encoder learning across clients. From a data perspective, during local training, we regard the global class representations as anchors and leverage the data points that are close/far enough to the anchors of locally-unaware classes to align the data encoders across clients. Our theoretical analysis of the generalization performance and extensive experiments on four real-world datasets of different tasks confirm that FedAlign outperforms various state-of-the-art (non-IID) federated classification methods.

Supplementary Material

Promotional video for KDD'23 paper "Navigating Alignment for Non-identical Client Class Sets: A Label Name-Anchored Federated Learning Framework". (rtfp1262-2min-promo.zip)
MP4 File (rtfp1262-2min-promo.mp4)
Promotional video for KDD'23 paper "Navigating Alignment for Non-identical Client Class Sets: A Label Name-Anchored Federated Learning Framework".

References

[1]
Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. 2020. Language Models are Few-Shot Learners. NeurIPS, Vol. 33 (2020), 1877--1901.
[2]
John A. Bullinaria and Joseph P. Levy. 2007. Extracting semantic representations from word co-occurrence statistics: A computational study. Behavior Research Methods, Vol. 39, 3 (2007), 510--526.
[3]
Ana Cardoso-Cachopo. 2007. Improving Methods for Single-label Text Categorization. PdD Thesis, Instituto Superior Tecnico, Universidade Tecnica de Lisboa.
[4]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv preprint arXiv:1810.04805 (2018).
[5]
Efstathios D. Gennatas, Jerome H. Friedman, Lyle H. Ungar, Romain Pirracchio, Eric Eaton, Lara G. Reichmann, Yannet Interian, José Marcio Luna, Charles B. Simone, Andrew Auerbach, et al. 2020. Expert-augmented machine learning. PNAS, Vol. 117, 9 (2020), 4571--4577.
[6]
Felix Hamborg, Norman Meuschke, Corinna Breitinger, and Bela Gipp. 2017. news-please: a Generic News Crawler and Extractor. In International Symposium of Information Science. 218--223.
[7]
Neil Houlsby, Andrei Giurgiu, Stanislaw Jastrzebski, Bruna Morrone, Quentin De Laroussilhe, Andrea Gesmundo, Mona Attariyan, and Sylvain Gelly. 2019. Parameter-Efficient Transfer Learning for NLP. In ICML. PMLR, 2790--2799.
[8]
Alistair EW Johnson, Tom J. Pollard, Lu Shen, Li-wei H. Lehman, Mengling Feng, Mohammad Ghassemi, Benjamin Moody, Peter Szolovits, Leo Anthony Celi, and Roger G Mark. 2016. MIMIC-III, a freely accessible critical care database. Scientific Data, Vol. 3, 1 (2016), 1--9.
[9]
Peter Kairouz, H. Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, et al. 2021. Advances and Open Problems in Federated Learning. Foundations and Trends® in Machine Learning, Vol. 14, 1--2 (2021), 1--210.
[10]
Sai Praneeth Karimireddy, Satyen Kale, Mehryar Mohri, Sashank Reddi, Sebastian Stich, and Ananda Theertha Suresh. 2020. SCAFFOLD: Stochastic Controlled Averaging for Federated Learning. In ICML. PMLR, 5132--5143.
[11]
Chung-Wei Lee, Wei Fang, Chih-Kuan Yeh, and Yu-Chiang Frank Wang. 2018. Multi-Label Zero-Shot Learning with Structured Knowledge Graphs. In CVPR. 1576--1585.
[12]
Dong-Hyun Lee et al. 2013. Pseudo-Label: The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks. In Workshop on Challenges in Representation Learning, ICML, Vol. 3. 896.
[13]
Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics, Vol. 36, 4 (2020), 1234--1240.
[14]
Jimmy Lei Ba, Kevin Swersky, Sanja Fidler, et al. 2015. Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions. In ICCV. 4247--4255.
[15]
Omer Levy and Yoav Goldberg. 2014. Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL. 171--180.
[16]
Ang Li, Jingwei Sun, Pengcheng Li, Yu Pu, Hai Li, and Yiran Chen. 2021d. Hermes: An Efficient Federated Learning Framework for Heterogeneous Mobile Clients. In MobiCom. 420--437.
[17]
Ang Li, Jingwei Sun, Xiao Zeng, Mi Zhang, Hai Li, and Yiran Chen. 2021 e. FedMask: Joint Computation and Communication-Efficient Personalized Federated Learning via Heterogeneous Masking. In SenSys. 42--55.
[18]
Chenglin Li, Di Niu, Bei Jiang, Xiao Zuo, and Jianming Yang. 2021b. Meta-HAR: Federated Representation Learning for Human Activity Recognition. In WWW. 912--922.
[19]
Liang Li, Dian Shi, Ronghui Hou, Hui Li, Miao Pan, and Zhu Han. 2021c. To talk or to work: Flexible communication compression for energy efficient federated learning over heterogeneous mobile edge devices. In INFOCOM. IEEE, 1--10.
[20]
Qinbin Li, Bingsheng He, and Dawn Song. 2021a. Model-Contrastive Federated Learning. In CVPR. 10713--10722.
[21]
Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet Talwalkar, and Virginia Smith. 2020. Federated Optimization in Heterogeneous Networks. MLSys, Vol. 2 (2020), 429--450.
[22]
Xiang Li, Kaixuan Huang, Wenhao Yang, Shusen Wang, and Zhihua Zhang. 2019. On the Convergence of FedAvg on Non-IID Data. In ICLR.
[23]
Xin-Chun Li and De-Chuan Zhan. 2021. FedRS: Federated Learning with Restricted Softmax for Label Distribution Non-IID Data. In KDD. 995--1005.
[24]
Feng Liang, Weike Pan, and Zhong Ming. 2021. FedRec: Lossless Federated Recommendation with Explicit Feedback. In AAAI, Vol. 35. 4224--4231.
[25]
Tao Lin, Lingjing Kong, Sebastian U. Stich, and Martin Jaggi. 2020. Ensemble Distillation for Robust Model Fusion in Federated Learning. NeurIPS, Vol. 33 (2020).
[26]
Xinyang Lin, Hanting Chen, Yixing Xu, Chao Xu, Xiaolin Gui, Yiping Deng, and Yunhe Wang. 2022. Federated Learning with Positive and Unlabeled Data. In ICML. PMLR, 13344--13355.
[27]
Shuchang Liu, Shuyuan Xu, Wenhui Yu, Zuohui Fu, Yongfeng Zhang, and Amelie Marian. 2021. FedCT: Federated Collaborative Transfer for Recommendation. In SIGIR. 716--725.
[28]
Mi Luo, Fei Chen, Dapeng Hu, Yifan Zhang, Jian Liang, and Jiashi Feng. 2021. No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data. NeurIPS, Vol. 34 (2021).
[29]
Moe Matsuki, Paula Lago, and Sozo Inoue. 2019. Characterizing Word Embeddings for Zero-Shot Sensor-Based Human Activity Recognition. Sensors, Vol. 19, 22 (2019), 5043.
[30]
Geoffrey J McLachlan. 1975. Iterative Reclassification Procedure for Constructing an Asymptotically Optimal Rule of Allocation in Discriminant Analysis. Journal of the American Statistical Association, Vol. 70, 350 (1975), 365--369.
[31]
Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Aguera y Arcas. 2017. Communication-Efficient Learning of Deep Networks from Decentralized Data. In AISTATS. PMLR, 1273--1282.
[32]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed Representations of Words and Phrases and their Compositionality. NeurIPS, Vol. 26 (2013).
[33]
James Mullenbach, Sarah Wiegreffe, Jon Duke, Jimeng Sun, and Jacob Eisenstein. 2018. Explainable Prediction of Medical Codes from Clinical Text. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1. 1101--1111.
[34]
Bethesda (MD): National Library of Medicine. 2003. PMC Open Access Subset [Internet] . https://www.ncbi.nlm.nih.gov/pmc/tools/openftlist/ [accessed 7-June-2022].
[35]
Xingchao Peng, Zijun Huang, Yizhe Zhu, and Kate Saenko. 2019. Federated Adversarial Domain Adaptation. arXiv preprint arXiv:1911.02054 (2019).
[36]
Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. GloVe: Global Vectors for Word Representation. In EMNLP. 1532--1543.
[37]
Yinhua Piao, Sangseon Lee, Dohoon Lee, and Sun Kim. 2022. Sparse Structure Learning via Graph Neural Networks for Inductive Document Classifcation. In AAAI, Vol. 36. 11165--11173.
[38]
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning Transferable Visual Models From Natural Language Supervision. In ICML. PMLR, 8748--8763.
[39]
Attila Reiss and Didier Stricker. 2012. Introducing a New Benchmarked Dataset for Activity Monitoring. In ISWC. IEEE, 108--109.
[40]
Linlin Tu, Xiaomin Ouyang, Jiayu Zhou, Yuze He, and Guoliang Xing. 2021. FedDL: Federated Learning via Dynamic Layer Sharing for Human Activity Recognition. In SenSys. 15--28.
[41]
Yonatan Vaizman, Katherine Ellis, and Gert Lanckriet. 2017. Recognizing Detailed Human Context in the Wild from Smartphones and Smartwatches. IEEE Pervasive Computing, Vol. 16, 4 (2017), 62--74.
[42]
Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing Data using t-SNE. Journal of Machine Learning Research, Vol. 9, 11 (2008).
[43]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention Is All You Need. NeurIPS, Vol. 30 (2017).
[44]
Laura Von Rueden, Sebastian Mayer, Katharina Beckh, Bogdan Georgiev, Sven Giesselbach, Raoul Heese, Birgit Kirsch, Julius Pfrommer, Annika Pick, Rajkumar Ramamurthy, et al. 2021. Informed Machine Learning-A taxonomy and survey of integrating prior knowledge into learning systems. TKDE, Vol. 35, 1 (2021), 614--633.
[45]
Hongyi Wang, Mikhail Yurochkin, Yuekai Sun, Dimitris Papailiopoulos, and Yasaman Khazaeni. 2020b. Federated Learning with Matched Averaging. In ICLR.
[46]
Jianyu Wang, Qinghua Liu, Hao Liang, Gauri Joshi, and H. Vincent Poor. 2020a. Tackling the Objective Inconsistency Problem in Heterogeneous Federated Optimization. NeurIPS, Vol. 33 (2020), 7611--7623.
[47]
Tong Wu, Yiqiang Chen, Yang Gu, Jiwei Wang, Siyu Zhang, and Zhanghu Zhechen. 2020. Multi-Layer Cross Loss Model for Zero-Shot Human Activity Recognition. In PAKDD. 210--221.
[48]
Chengxu Yang, Qipeng Wang, Mengwei Xu, Zhenpeng Chen, Kaigui Bian, Yunxin Liu, and Xuanzhe Liu. 2021. Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data. In WWW. 935--946.
[49]
Jingwei Yi, Fangzhao Wu, Chuhan Wu, Ruixuan Liu, Guangzhong Sun, and Xing Xie. 2021. Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation. In EMNLP. 2814--2824.
[50]
Stella X. Yu and Jianbo Shi. 2003. Multiclass Spectral Clustering. In ICCV. IEEE, 313--319.
[51]
Qiming Zhang, Jing Zhang, Wei Liu, and Dacheng Tao. 2019. Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation. NeurIPS, Vol. 32 (2019).
[52]
Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, and Sanja Fidler. 2015. Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books. In ICCV. 19--27.
[53]
Zhuangdi Zhu, Junyuan Hong, and Jiayu Zhou. 2021. Data-Free Knowledge Distillation for Heterogeneous Federated Learning. In ICML. PMLR, 12878--12889.

Cited By

View all
  • (2024)How Few Davids Improve One Goliath: Federated Learning in Resource-Skewed Edge Computing EnvironmentsProceedings of the ACM Web Conference 202410.1145/3589334.3645544(2976-2985)Online publication date: 13-May-2024
  • (2023)Unleashing the Power of Shared Label Structures for Human Activity RecognitionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615101(3340-3350)Online publication date: 21-Oct-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2023
5996 pages
ISBN:9798400701030
DOI:10.1145/3580305
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 August 2023

Check for updates

Author Tags

  1. federated learning
  2. label semantics modeling
  3. non-iid

Qualifiers

  • Research-article

Funding Sources

Conference

KDD '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)372
  • Downloads (Last 6 weeks)42
Reflects downloads up to 24 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)How Few Davids Improve One Goliath: Federated Learning in Resource-Skewed Edge Computing EnvironmentsProceedings of the ACM Web Conference 202410.1145/3589334.3645544(2976-2985)Online publication date: 13-May-2024
  • (2023)Unleashing the Power of Shared Label Structures for Human Activity RecognitionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615101(3340-3350)Online publication date: 21-Oct-2023

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media