Article

Adaptive routing with end-to-end feedback: distributed learning and geometric approaches

Authors:
Baruch Awerbuch

Johns Hopkins University, Baltimore, MD

Johns Hopkins University, Baltimore, MD
View Profile

,
Robert D. Kleinberg

MIT, Cambridge, MA

MIT, Cambridge, MA
View Profile

STOC '04: Proceedings of the thirty-sixth annual ACM symposium on Theory of computingJune 2004Pages 45–53https://doi.org/10.1145/1007352.1007367

Published:13 June 2004Publication History

STOC '04: Proceedings of the thirty-sixth annual ACM symposium on Theory of computing

Pages 45–53

ABSTRACT

Minimal delay routing is a fundamental task in networks. Since delays depend on the (potentially unpredictable) traffic distribution, online delay optimization can be quite challenging. While uncertainty about the current network delays may make the current routing choices sub-optimal, the algorithm can nevertheless try to learn the traffic patterns and keep adapting its choice of routing paths so as to perform nearly as well as the best static path. This online shortest path problem is a special case of online linear optimization, a problem in which an online algorithm must choose, in each round, a strategy from some compact set S ⊆ R^d so as to try to minimize a linear cost function which is only revealed at the end of the round. Kalai and Vempala[4] gave an algorithm for such problems in the transparent feedback model, where the entire cost function is revealed at the end of the round. Here we present an algorithm for online linear optimization in the more challenging opaque feedback model, in which only the cost of the chosen strategy is revealed at the end of the round. In the special case of shortest paths, opaque feedback corresponds to the notion that in each round the algorithm learns only the end-to-end cost of the chosen path, not the cost of every edge in the network.We also present a second algorithm for online shortest paths, which solves the shortest-path problem using a chain of online decision oracles, one at each node of the graph. This has several advantages over the online linear optimization approach. First, it is effective against an adaptive adversary, whereas our linear optimization algorithm assumes an oblivious adversary. Second, even in the case of an oblivious adversary, the second algorithm performs better than the first, as measured by their additive regret.

References

Peter Auer, Nicolo Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. Gambling in a rigged casino: the adversarial multi-armed bandit problem. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science, pages 322--331. IEEE Computer Society Press, Los Alamitos, CA, 1995. Google ScholarDigital Library
Baruch Awerbuch and Yishay Mansour. Online learning of reliable network paths. In PODC, 2003. to appear.Google Scholar
Avrim Blum, Geoff Gordon, and Brendan McMahan. Bandit version of the shortest paths problem. Unpublished manuscript, July 2003.Google Scholar
Adam Kalai and Santosh Vempala. Geometric algorithms for online optimization, 2003. unpublished manuscript.Google Scholar
N. Littlestone and M. Warmuth. The weighted majority algorithm. Information and Computation, 108(2):212--260, 1994. Google ScholarDigital Library
Nick Littlestone and Manfred K. Warmuth. The weighted majority algorithm. In IEEE Symposium on Foundations of Computer Science, pages 256--261, 1989.Google ScholarDigital Library
Nick Littlestone and Manfred K. Warmuth. The weighted majority algorithm. Information and Computation, 108:212--261, 1994. A preliminary version appeared in FOCS 1989. Google ScholarDigital Library
Eiji Takimoto and Manfred K. Warmuth. Path kernels and multiplicative updates. In COLT Proceedings, 2002. Google ScholarDigital Library

Index Terms

Adaptive routing with end-to-end feedback: distributed learning and geometric approaches
1. Theory of computation

Recommendations

Multi-armed bandits in metric spaces
STOC '08: Proceedings of the fortieth annual ACM symposium on Theory of computing

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of $n$ trials so as to maximize the total payoff of the chosen strategies. While the performance of bandit algorithms with a small finite strategy set is ...
Read More
Playing games with approximation algorithms
STOC '07: Proceedings of the thirty-ninth annual ACM symposium on Theory of computing

In an online linear optimization problem, on each period t, an online algorithm chooses s_t ∈ S from a fixed (possibly infinite) set S of feasible decisions. Nature (who may be adversarial) chooses a weight vector w_t ∈ R, and the algorithm incurs cost c(...
Read More
Playing Games with Approximation Algorithms

In an online linear optimization problem, on each period $t$, an online algorithm chooses $s_t\in\mathcal{S}$ from a fixed (possibly infinite) set $\mathcal{S}$ of feasible decisions. Nature (who may be adversarial) chooses a weight vector $w_t\in\...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
STOC '04: Proceedings of the thirty-sixth annual ACM symposium on Theory of computing
June 2004
660 pages
ISBN:1581138520
DOI:10.1145/1007352
Program Chair:
László Babai
University of Chicago, Chicago, IL
Copyright © 2004 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 June 2004
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
multi-armed bandit problem
online decision problem
online linear optimization
routing
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate1,469of4,586submissions,32%
Upcoming Conference
STOC '24

Sponsor:

sigact

56th Annual ACM Symposium on Theory of Computing (STOC 2024)

June 24 - 28, 2024

Vancouver , BC , Canada
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 98
  Total Citations
  View Citations
- 913
  Total Downloads
- Downloads (Last 12 months)57
- Downloads (Last 6 weeks)6
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Adaptive routing with end-to-end feedback: distributed learning and geometric approaches

STOC '04: Proceedings of the thirty-sixth annual ACM symposium on Theory of computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

Multi-armed bandits in metric spaces

Playing games with approximation algorithms

Playing Games with Approximation Algorithms

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Adaptive routing with end-to-end feedback: distributed learning and geometric approaches

STOC '04: Proceedings of the thirty-sixth annual ACM symposium on Theory of computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

Multi-armed bandits in metric spaces

Playing games with approximation algorithms

Playing Games with Approximation Algorithms

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media