SE367 / HW4 / Group B

Motor Expertise

Debidatta Dwibedi

Q1. Which two instructions in the "programming language" of the 2011 HW would be the most difficult for robots to follow?
A. We thought the following steps in the suggested algorithm were the toughest:

Q2. The robot following the learning paradigm as in Kalakrishnan is clearly gaining some expertise. Which aspects of the execution may be called implicit or automatic, and which aspects may be more explicit? What could be the "chunks" in this structure?
(Here we assume that implicit refers to the instructions that the robot performs on its own and explicit refers to the part where it requires a human to specify certain parameters for him. The words explicit and implicit don't refer to implicit/tacit or explicit knowledge respectively.)
A. From [1]

Q3. Comment on whether human learning may also be following similar "reward" based processes? Consider the learning process for the fire-fighting expert who knows how to fight complex fires.
A. A lot of learning in humans is based on rewards and punishments, which usually are feedback of our actions in the environment. In fact a lot of the times people behave in a manner that increases rewards and hence become better at tasks that provide them with rewards. These rewards need not be provided by the environment always and may be a resultant of intrinsic motivation.

In the fire-fighting video, the novice wanted to perform the task of fighting the fire by opening the water in regular but small intervals and looking for its effects on the fire. He definitely wanted to use the abating fire to reward himself and keep following that line of action. However, because of his lack of expertise he was unable to contain the fire. But fire-fighters are given training in which they are given instructions on what to do. If they successfully follow an instruction and the fire abates they are able to reward themselves intrinsically and learn that instruction and tend to do that more. His fellow fire-fighters and seniors might also provide him with rewards in form of words of praise to ensure that they learn the correct method of fire-fighting. In this case, a learning fire-fighter can reward himself intrinsically because he knows he must somehow keep the fire in check and any action that helps him in doing this needs to be rewarded. Because of a lot of training and the rewards the fire fighter gives himself, he is able to perform the instructions as actions and anticipates its effects, intuitively.

References

[1] Kalakrishnan, Mrinal, et al. "Learning force control policies for compliant manipulation." Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on. IEEE, 2011.