skip to main content
10.1145/1007352.1007367acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
Article

Adaptive routing with end-to-end feedback: distributed learning and geometric approaches

Published:13 June 2004Publication History

ABSTRACT

Minimal delay routing is a fundamental task in networks. Since delays depend on the (potentially unpredictable) traffic distribution, online delay optimization can be quite challenging. While uncertainty about the current network delays may make the current routing choices sub-optimal, the algorithm can nevertheless try to learn the traffic patterns and keep adapting its choice of routing paths so as to perform nearly as well as the best static path. This online shortest path problem is a special case of online linear optimization, a problem in which an online algorithm must choose, in each round, a strategy from some compact set S ⊆ Rd so as to try to minimize a linear cost function which is only revealed at the end of the round. Kalai and Vempala[4] gave an algorithm for such problems in the transparent feedback model, where the entire cost function is revealed at the end of the round. Here we present an algorithm for online linear optimization in the more challenging opaque feedback model, in which only the cost of the chosen strategy is revealed at the end of the round. In the special case of shortest paths, opaque feedback corresponds to the notion that in each round the algorithm learns only the end-to-end cost of the chosen path, not the cost of every edge in the network.We also present a second algorithm for online shortest paths, which solves the shortest-path problem using a chain of online decision oracles, one at each node of the graph. This has several advantages over the online linear optimization approach. First, it is effective against an adaptive adversary, whereas our linear optimization algorithm assumes an oblivious adversary. Second, even in the case of an oblivious adversary, the second algorithm performs better than the first, as measured by their additive regret.

References

  1. Peter Auer, Nicolo Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. Gambling in a rigged casino: the adversarial multi-armed bandit problem. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science, pages 322--331. IEEE Computer Society Press, Los Alamitos, CA, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Baruch Awerbuch and Yishay Mansour. Online learning of reliable network paths. In PODC, 2003. to appear.Google ScholarGoogle Scholar
  3. Avrim Blum, Geoff Gordon, and Brendan McMahan. Bandit version of the shortest paths problem. Unpublished manuscript, July 2003.Google ScholarGoogle Scholar
  4. Adam Kalai and Santosh Vempala. Geometric algorithms for online optimization, 2003. unpublished manuscript.Google ScholarGoogle Scholar
  5. N. Littlestone and M. Warmuth. The weighted majority algorithm. Information and Computation, 108(2):212--260, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Nick Littlestone and Manfred K. Warmuth. The weighted majority algorithm. In IEEE Symposium on Foundations of Computer Science, pages 256--261, 1989.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Nick Littlestone and Manfred K. Warmuth. The weighted majority algorithm. Information and Computation, 108:212--261, 1994. A preliminary version appeared in FOCS 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Eiji Takimoto and Manfred K. Warmuth. Path kernels and multiplicative updates. In COLT Proceedings, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Adaptive routing with end-to-end feedback: distributed learning and geometric approaches

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          STOC '04: Proceedings of the thirty-sixth annual ACM symposium on Theory of computing
          June 2004
          660 pages
          ISBN:1581138520
          DOI:10.1145/1007352

          Copyright © 2004 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 13 June 2004

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate1,469of4,586submissions,32%

          Upcoming Conference

          STOC '24
          56th Annual ACM Symposium on Theory of Computing (STOC 2024)
          June 24 - 28, 2024
          Vancouver , BC , Canada

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader