|
|
|
ix | |
|
|
|
xvi | |
| Foreword |
|
xvii | |
| Preface |
|
xix | |
| Complex Systems Approach |
|
xx | |
| Towards ACS2 |
|
xxiii | |
| ACS2 |
|
xxiv | |
| Road Map |
|
xxv | |
|
|
|
xxvii | |
|
|
|
1 | (22) |
|
|
|
2 | (4) |
|
Psychology Discovers Anticipations |
|
|
2 | (1) |
|
Theory of Anticipatory Behavioral Control |
|
|
3 | (1) |
|
Importance of Anticipations |
|
|
4 | (2) |
|
|
|
6 | (5) |
|
|
|
6 | (2) |
|
|
|
8 | (2) |
|
|
|
10 | (1) |
|
Learning Classifier Systems |
|
|
11 | (12) |
|
Holland's Cognitive System |
|
|
13 | (1) |
|
|
|
14 | (1) |
|
Problems in Traditional LCSs |
|
|
15 | (1) |
|
|
|
16 | (7) |
|
|
|
23 | (28) |
|
|
|
25 | (4) |
|
Environmental Interaction |
|
|
25 | (1) |
|
|
|
26 | (1) |
|
|
|
27 | (2) |
|
|
|
29 | (1) |
|
The Anticipatory Learning Process |
|
|
30 | (7) |
|
|
|
30 | (3) |
|
The ALP in Action: A Simple Gripper Problem |
|
|
33 | (2) |
|
Causes for Over-Specialization |
|
|
35 | (2) |
|
Genetic Generalization in ACS2 |
|
|
37 | (6) |
|
Accurate, Maximally General Classifiers in ACS2 |
|
|
38 | (1) |
|
|
|
39 | (2) |
|
|
|
41 | (2) |
|
Interaction of ALP, GA, RL, and Behavior |
|
|
43 | (8) |
|
|
|
44 | (1) |
|
Evolutionary Pressures of ALP and GA |
|
|
45 | (2) |
|
|
|
47 | (4) |
|
|
|
51 | (30) |
|
Gripper Problem Revisited |
|
|
52 | (3) |
|
|
|
52 | (2) |
|
|
|
54 | (1) |
|
|
|
55 | (9) |
|
|
|
56 | (1) |
|
Evolution of a Multiplexer Model |
|
|
57 | (6) |
|
|
|
63 | (1) |
|
|
|
64 | (5) |
|
|
|
65 | (1) |
|
|
|
66 | (2) |
|
|
|
68 | (1) |
|
|
|
69 | (7) |
|
|
|
71 | (2) |
|
|
|
73 | (3) |
|
Hand-Eye Coordination Task |
|
|
76 | (3) |
|
|
|
76 | (2) |
|
|
|
78 | (1) |
|
|
|
79 | (2) |
|
|
|
81 | (18) |
|
|
|
81 | (6) |
|
|
|
82 | (3) |
|
Interfering Specificities |
|
|
85 | (2) |
|
Non-determinism and a First Approach |
|
|
87 | (6) |
|
ACS2 in a Non-determinism Task |
|
|
88 | (1) |
|
Probability-Enhanced Effects |
|
|
89 | (4) |
|
|
|
93 | (6) |
|
|
|
99 | (16) |
|
|
|
99 | (8) |
|
|
|
100 | (4) |
|
Combining Exploration with Action Planning |
|
|
104 | (3) |
|
Enhancing Reinforcement Learning |
|
|
107 | (6) |
|
Response-Effect Learning Task |
|
|
107 | (1) |
|
|
|
108 | (2) |
|
Lookahead Action Selection |
|
|
110 | (1) |
|
ACS2 in the Response-Effect Task |
|
|
111 | (1) |
|
Stimulus-Response-Effect Task |
|
|
112 | (1) |
|
Model Exploitation Recapitulation |
|
|
113 | (2) |
|
|
|
115 | (6) |
|
Estimated Learning Algorithm |
|
|
115 | (2) |
|
|
|
117 | (1) |
|
|
|
118 | (1) |
|
|
|
119 | (2) |
|
Summary, Conclusions, and Future Work |
|
|
121 | (18) |
|
|
|
121 | (2) |
|
Model Representation Enhancements |
|
|
123 | (4) |
|
|
|
123 | (3) |
|
|
|
126 | (1) |
|
Model Learning Modifications |
|
|
127 | (7) |
|
|
|
127 | (3) |
|
|
|
130 | (1) |
|
|
|
131 | (2) |
|
|
|
133 | (1) |
|
|
|
134 | (3) |
|
Reinforcement Learning Processes |
|
|
135 | (1) |
|
|
|
136 | (1) |
|
|
|
137 | (2) |
| Appendices |
|
139 | (26) |
|
Appendix A: Parameters in ACS2 |
|
|
139 | (2) |
|
Appendix B: Algorithmic Description of ACS2 |
|
|
141 | (12) |
|
|
|
141 | (1) |
|
2. The Main Execution Loop |
|
|
142 | (1) |
|
3. Formation of the Match Set |
|
|
143 | (1) |
|
|
|
143 | (1) |
|
5. Formation of the Action Set |
|
|
144 | (1) |
|
6. Application of the ALP |
|
|
144 | (5) |
|
7. Reinforcement Learning |
|
|
149 | (1) |
|
|
|
149 | (3) |
|
|
|
152 | (1) |
|
Appendix C: ACS2 C++ Code Documentation |
|
|
153 | (8) |
|
|
|
153 | (1) |
|
|
|
154 | (1) |
|
2.1. The Controller - ACSConstants.h |
|
|
154 | (2) |
|
2.2. The Executer - acs2++.cc |
|
|
156 | (1) |
|
|
|
157 | (2) |
|
|
|
159 | (1) |
|
|
|
160 | (1) |
|
|
|
161 | (4) |
| References |
|
165 | (6) |
| Index |
|
171 | |