Federated learning and Continual learning
#
Federated Learning:
Link: https://federated.withgoogle.com/
Firms actively working on it : Google, Facebook , NVIDIA
Federated Learning (FL) is a distributed machine learning approach which enables training on a large corpus of decentralized data residing on devices like mobile phones. FL is one instance of the more general approach of “bringing the code to the data, instead of the data to the code” and addresses the fundamental problems of privacy, ownership, and locality of data.
- Read Paper here.
- Google Blog Image
- Nvidia Clara Federated Server Description
Where it can be used :
- Healthcare
- Analytics on Private Data
- Distributed Weight Shared learning use cases - ML Technical Point of View
Continual Learning
Most of the information collected from
Main Link: https://www.continualai.org/home/
Continual learning, also called lifelong learning or online machine learning, is a fundamental idea in machine learning in which models continuously learn and evolve based on the input of increasing amounts of data while retaining previously learned knowledge. In practice, this refers to supporting a model’s ability to autonomously learn and adapt in production as new data comes in. Just like us, this concept too, is based on mimicking humans’ ability to learn incrementally by acquiring, fine-tuning, and transferring knowledge and skills throughout their lifespan.
Basically it is Incremental Knowledge Learning with Continuity in Adapting to the Knowledge.
Use cases:
- Continual Bias addition to knowledge - recommender systems and Anomaly detection
- Ever changing data either partial or complete updates
CL Enables Multimodal- Multitask learning too .
Lukasz Kaiser & All from Google Brain this year [8] come up with a single model which has been able to learn very different tasks in very different domains and with many input modalities (with a huge static training set).
State-of-the-art & Future Challenges
Pros:
- Contrasting Catastrophic Forgetting is possible in many ways, and not only through careful hyper-parametrizations or regularization techniques.
- We have already proved that CL can be used in complex problems in Vision, Supervised or with Reinforce.
- Accuracy results (on some toy benchmarks) are impressively high, almost in line with other techniques which have access to previously encountered data.
Cons:
- It’s not completely clear how evaluate CL techniques, and a more formalized framework may be needed.
- We have pretty much focused on solving a rigid sequence of (simple) tasks (on the scale of dozens) and not on streaming perception data, neither on multimodal/multitask problems.
- It’s not clear how to behave after the saturation of the capability of the model, neither how to selectively forget.