skip to main content
10.1145/301136.301195acmconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
Article
Free Access

Team-partitioned, opaque-transition reinforcement learning

Published:01 April 1999Publication History
First page image

References

  1. 1.Sachiyo Arai, Kazuteru Miyazaki, and Shigenobu Kobayashi. Generating cooperative behavior by multi-agent reinforcement learning. In Sixth European Workshop on Learning Robots, Brighton, UK, August 1997.Google ScholarGoogle Scholar
  2. 2.Minoru Asada, Shoichi Nods, Sukoya Tawaratumida, and Koh Hosoda. Purposive behavior acquisition for a real robot by vision-based reinforcement learning. Machine Learning, 23:279-303, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.J. A. Boyan and M. L. Littman. Packet routing in dynamically changing networks: A reinforcement learning approach. In J. D. Cowan, G. Tesauro, and J. Alspector, editors, Advances In Neural Information Processing Systems 6. Morgan Kaufmann Publishers, 1994.Google ScholarGoogle Scholar
  4. 4.Leslie Pack Kaelbling, Anthony R. Cassandra, and Michael L. Littman. Acting optimally in partially observable stochastic domains, in Proceedings of the Twelfth National Conference on Artificial Intelligence, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.Leslie Pack Kaelbling, Michael L. Littman, and Andrew W. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237-285, May 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.Hiroaki Kitano, Yasuo Kuniyoshi, Itsuki Nods, Minoru Asada, Hitoshi Matsubara, and Eiichi Osawa. RoboCup: A challenge problem for AI. AI Magazine, 18(1):73-85, Spring 1997.Google ScholarGoogle Scholar
  7. 7.Michael L. Littman. Markov games as a framework for multiagent reinforcement learning. In Proceedings of the Eleventh International Conference on Machine Learning, pages 157- 163, San Mateo, CA, 1994. Morgan Kaufman.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.Seen Luke, Charles Hohn, Jonathan Farris, Gary Jackson, and James Hendler. Co-evolving soccer softbot team coordination with genetic programming. In Hiroaki Kitano, editor, RoboCup-97: Robot Soccer World Cup I, pages398-411, Berlin, 1998. Springer Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9.Maja J. Mataric. Interaction and intelligent behavior. MIT EECS PhD Thesis AITR-1495, MIT AI Lab, August 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.Itsuki Nods, Hitoshi Matsubara, and Kazuo Hiraki. Learning cooperative behavior in multi-agent environment: a case study of choice of play-plans in soccer. In PRICAi'96: Topics in Artificial Intelligence (Proc. of dth Pacific Rim International Conference on Artificial Intelligence, Cairns, Australia), pages 570-579, Cairns,Australia, August 1996. Google ScholarGoogle Scholar
  11. 11.J. Ross Quinlan. C,t.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12.Rafal P. Salustowicz, Marco A. Wiering, and Jurgen Schmidhuber. Learning team strategies: Soccer case studies. Machine Learning, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. 13.Peter Stone and Manuela Veloso. Multiagent systems: A survey from a machine learning perspective. Technical Report CMU-CS-97-193, Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, December 1997.Google ScholarGoogle ScholarCross RefCross Ref
  14. 14.Peter Stone and Manuela Veloso. A layered approach to learning client behaviors in the RoboCup soccer server. Applied Artificial Intelligence, 12:165-188, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  15. 15.Peter Stone and Manuela Veloso. Towards collaborative and adversarial learning: A case study in robotic soccer. International Journal of Human-Computer Studies, 48(1):83-104, January 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.Peter Stone and Manuela Veloso. Using decision tree confidence factors for multiagent control. In Hiroaki Kitaao, editor, RoboCup-97: Robot Soccer World Cup I, pages 99-111. Springer Verlag, Berlin, 1998. Also in Proceedings of the Second International Conference on Autonomous Agents, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.Ming Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning, pages 330-337, 1993.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.Manuela Veloso, Peter Stone, Kwun Hen, and Sorin Achim. The CMUnited-97 small-robot team. In Hiroaki Kitano, editor, RoboCup-97: Robot Soccer World Cup I~ pages 242-256. Springer Verlag, Berlin, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.Jieyu Zhao and Jurgen Schmidhuber. Incremental selfimprovement for life-time multi-agent reinforcement learning. In Proceedings of the 4th International Conference o! Simulation of Adaptive Behavior, pages 363-372.MIT Press, 1996.Google ScholarGoogle Scholar

Index Terms

  1. Team-partitioned, opaque-transition reinforcement learning

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          AGENTS '99: Proceedings of the third annual conference on Autonomous Agents
          April 1999
          441 pages
          ISBN:158113066X
          DOI:10.1145/301136

          Copyright © 1999 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 April 1999

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate182of599submissions,30%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader