Using Federated Learning to develop AI models on personally identifiable information

Sarbaree Mishra; Vineela Komandla; Srikanth Bandi; Sairamesh Konidala; Jeevan Manda

Authors

Sarbaree Mishra Program Manager at Molina Healthcare Inc., USA Author
Vineela Komandla Vice President - Product Manager, JP Morgan Author
Srikanth Bandi Software Engineer, JP Morgan Chase, USA Author
Sairamesh Konidala Vice President, JP Morgan & Chase, USA, Author
Jeevan Manda Project Manager, Metanoia Solutions Inc, USA Author

Keywords:

Federated Learning, Sensitive Data, Artificial Intelligence, Data Security

Abstract

Building AI models on sensitive data presents both opportunities & challenges as artificial intelligence becomes increasingly entwined into numerous sectors. Conventional approaches of AI model development rely on centralized systems, so large datasets are gathered & handled on a single server. Although this approach is doable, especially for the personally identifiable or sensitive data, it seriously compromises privacy & the security. By enabling the training of Artificial Intelligence models straight on the distributed data sources, hence removing the need to transport sensitive data to a central repository, Federated Learning (FL) offers an efficient answer to these problems. Because of the data privacy is kept confined to its source, this distributed approach protects it. By aggregating the model updates from many sources instead of utilizing the raw data, federated learning assures that data stays in its natural position, therefore lowering the risk of data breaches & ensuring the adherence to strict data security regulations like GDPR. The basic ideas of federated learning—including architecture, key components & the need of secure aggregation with methods in maintaining the anonymity—are discussed in this article. It also emphasizes the growing range of uses for federated learning—spanning healthcare, finance & the mobile devices—where data privacy is very vital. While acknowledging the difficulties of communications efficiency, model synchronizing & the complexity of huge-scale FL implementation, the paper investigates the advantages of federated learning (FL), including enhanced privacy, reduced bandwidth usage & the improved model performance via collaborative learning, underline.

References

1. Hao, M., Li, H., Luo, X., Xu, G., Yang, H., & Liu, S. (2019). Efficient and privacy-enhanced federated learning for industrial artificial intelligence. IEEE Transactions on Industrial Informatics, 16(10), 6532-6542.

2. Truex, S., Baracaldo, N., Anwar, A., Steinke, T., Ludwig, H., Zhang, R., & Zhou, Y. (2019, November). A hybrid approach to privacy-preserving federated learning. In Proceedings of the 12th ACM workshop on artificial intelligence and security (pp. 1-11).

3. Yang, Q., Liu, Y., Chen, T., & Tong, Y. (2019). Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST), 10(2), 1-19.

4. Bhagoji, A. N., Chakraborty, S., Mittal, P., & Calo, S. (2019, May). Analyzing federated learning through an adversarial lens. In International conference on machine learning (pp. 634-643). PMLR.

5. Wang, Z., Song, M., Zhang, Z., Song, Y., Wang, Q., & Qi, H. (2019, April). Beyond inferring class representatives: User-level privacy leakage from federated learning. In IEEE INFOCOM 2019-IEEE conference on computer communications (pp. 2512-2520). IEEE.

6. Li, D., & Wang, J. (2019). Fedmd: Heterogenous federated learning via model distillation. arXiv preprint arXiv:1910.03581.

7. Hard, A., Rao, K., Mathews, R., Ramaswamy, S., Beaufays, F., Augenstein, S., ... & Ramage, D. (2018). Federated learning for mobile keyboard prediction. arXiv preprint arXiv:1811.03604.

8. Brisimi, T. S., Chen, R., Mela, T., Olshevsky, A., Paschalidis, I. C., & Shi, W. (2018). Federated learning of predictive models from federated electronic health records. International journal of medical informatics, 112, 59-67.

9. Bonawitz, K. (2019). Towards federated learning at scale: Syste m design. arXiv preprint arXiv:1902.01046.

10. Nishio, T., & Yonetani, R. (2019, May). Client selection for federated learning with heterogeneous resources in mobile edge. In ICC 2019-2019 IEEE international conference on communications (ICC) (pp. 1-7). IEEE.

11. Yang, T., Andrew, G., Eichner, H., Sun, H., Li, W., Kong, N., ... & Beaufays, F. (2018). Applied federated learning: Improving google keyboard query suggestions. arXiv preprint arXiv:1812.02903.

12. Wang, X., Han, Y., Wang, C., Zhao, Q., Chen, X., & Chen, M. (2019). In-edge ai: Intelligentizing mobile edge computing, caching and communication by federated learning. Ieee Network, 33(5), 156-165.

13. Geyer, R. C., Klein, T., & Nabi, M. (2017). Differentially private federated learning: A client level perspective. arXiv preprint arXiv:1712.07557.

14. Jiang, Y., Konečný, J., Rush, K., & Kannan, S. (2019). Improving federated learning personalization via model agnostic meta learning. arXiv preprint arXiv:1909.12488.

15. Lu, Y., Huang, X., Dai, Y., Maharjan, S., & Zhang, Y. (2019). Blockchain and federated learning for privacy-preserved data sharing in industrial IoT. IEEE Transactions on Industrial Informatics, 16(6), 4177-4186.

16. Gade, K. R. (2017). Integrations: ETL vs. ELT: Comparative analysis and best practices. Innovative Computer Sciences Journal, 3(1).

17. Gade, K. R. (2017). Migrations: Challenges and Best Practices for Migrating Legacy Systems to Cloud-Based Platforms. Innovative Computer Sciences Journal, 3(1).

18. Komandla, V. Transforming Financial Interactions: Best Practices for Mobile Banking App Design and Functionality to Boost User Engagement and Satisfaction.

19. Komandla, V. Enhancing Security and Fraud Prevention in Fintech: Comprehensive Strategies for Secure Online Account Opening.

20. Gade, K. R. (2018). Real-Time Analytics: Challenges and Opportunities. Innovative Computer Sciences Journal, 4(1).

Using Federated Learning to develop AI models on personally identifiable information

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite