Created by W.Langdon from gp-bibliography.bib Revision:1.8178
To appear in Proc. ICML`96, 86 K, 252 K uncompressed. Another spin-off paper of the TR (schmidhuber:1996:spm?) above. It uses ``Levin's universal search through program space (LS)''. LS is theoretically `optimal' for a wide variety of search problems including many partially observable Markov decision problems (POMDPs). Experiments show that LS can solve partially observable mazes (`POMS') involving many more states and obstacles than those solved by various previous authors. An adaptive extension of LS (ALS) is introduced. ALS uses experience to increase probabilities of instructions occurring in successful programs found by LS. To deal with cases where ALS does not lead to long term performance improvement, we use the above-mentioned, novel paradigm (EIRA) to guarantee lifelong histories of reward accelerations. We show: (a) ALS can dramatically reduce the search time consumed by successive calls of LS. (b) Additional significant speedups can be obtained by combining ALS with EIRA.
1996 FRIDAY July 5 11:30-13:00 Parallel Session A https://icml.cc/Conferences/1996/",
Genetic Programming entries for Marco Wiering Jurgen Schmidhuber