74 4 Learning-based Control
internal mechanism, the model is modified in [113]. Unfortunately, the models
obtained above are based on the assumption of rolling without slipping, that
is, the robot must be rolling perfectly on the ground. Therefore, these models
are not applicable for the static situation. In the static situation, the coupling
between the wheel and the flywheel becomes much more complicated, which
makes it difficult to derive an analytical model by traditional control methods.
On the other hand, humans are capable of mastering complex and highly
nonlinear control systems. A typical example is car driving. For Gyrover con-
trol, humans are able to control the robot well if enough practice is undertaken.
Thus, we intuitively come up with the idea of machine learning, a model-free
approach to model this kind of human control strategy. This approach is suit-
able for Gyrover control for the following reasons:
• Gyrover is a complex system, for which it is difficult for us to develop
a complete dynamic model to represent the robot’s behaviors by using
traditional control methods.
• From a practical point of view, it is equally difficult to model the system
precisely due to some unmodeled factors, such as friction. Friction is an
important issue when we are dealing with the coupling between the wheel
and the spinning flywheel.
• Although Gyrover is a complex system, humans can control the robot
through a radio transmitter to perform various kinds of task. They do
not need to explicitly model a system in order to control it. Through
interaction with the system and observation of the behaviors of the system,
humans are able to “learn” how to control a system.
• The learning process is in fact a direct input-output mapping between the
system sensory data and the actuation data. A controller is generated by
using the training data while a human “teacher” controls the system until
the synthesized controller can perform the same way as a human.
4.1.1 Cascade Neural Network with Kalman Filtering
The field of intelligent control has emerged from the field of classical control
theory to deal with applications which are too complex for classical control
approaches. In terms of complexity, human control strategy lies between low-
level feedback control and high-level reasoning, and encompasses a wide range
of useful physical tasks with a reasonably well-defined numeric input/output
representation.
Here, we introduce a continuous learning architecture for modeling hu-
man control strategies based on a neural network. Since most neural networks
used today rely on rigid, fixed architecture networks and/or with slow gra-
dient descent-based training algorithms, they may not be a suitable method
to model the complex, dynamic and nonlinear human control strategy. To
counter these problems, a new neural network learning architecture is pro-
posed in [78], which combines (1) flexible cascade neural networks, which