Generative adversarial simulator

Authors

  • Jonathan Raiman1* 1 Paris-Saclay University, Paris, France.

DOI:

https://doi.org/10.51483/IJAIML.1.1.2021.31-46

Keywords:

Machine learning, Reinforcement learning, Student networks, Data-free learning

Abstract

Knowledge distillation between machine learning models has opened many new
avenues for parameter count reduction, performance improvements, or amortizing
training time when changing architectures between the teacher and student network.
In the case of reinforcement learning, this technique has also been applied to distill
teacher policies to students. Until now, policy distillation required access to a
simulator or real world trajectories. In this paper we introduce a simulator-free
approach to knowledge distillation in the context of reinforcement learning. A key
challenge is having the student learn the multiplicity of cases that correspond to a
given action. While prior work has shown that data-free knowledge distillation is
possible with supervised learning models by generating synthetic examples, these
approaches to are vulnerable to only producing a single prototype example for each
class. We propose an extension to explicitly handle multiple observations per output
class that seeks to find as many exemplars as possible for a given output class by
reinitializing our data generator and making use of an adversarial loss. To the best of
our knowledge, this is the first demonstration of simulator-free knowledge distillation
between a teacher and a student policy. This new approach improves over the state
of the art on data-free learning of student networks on benchmark datasets (MNIST,
Fashion-MNIST, CIFAR-10), and we also demonstrate that it specifically tackles
issues with multiple input modes. We also identify open problems when
distillingagents trained in high dimensional environments such as Pong, Breakout, or
Seaquest.

Downloads

Published

2021-07-05

How to Cite

Jonathan Raiman1*. (2021). Generative adversarial simulator. International Journal of Artificial Intelligence and Machine Learning, 1(01), 31–46. https://doi.org/10.51483/IJAIML.1.1.2021.31-46

Similar Articles

1 2 3 4 5 6 > >> 

You may also start an advanced similarity search for this article.