Abstract
Timing the purchase of airline tickets well is difficult even when historical ticket prices and some domain knowledge are available. To address this problem, we introduce an algorithm that optimizes purchase timing on behalf of customers and provides performance estimates for the action policy it computes. Given a desired flight route and travel date, the algorithm applies machine-learning methods to recent ticket price quotes from many competing airlines to predict the expected future minimum price across all available flights. The main novelty of our algorithm is a systematic feature-selection technique that captures time dependencies in the data through time-delayed features and reduces the number of features by imposing a class hierarchy on the raw features and pruning them based on in-situ performance. Our algorithm comes much closer to the optimal purchase policy than existing decision-theoretic approaches for this domain, and it meets or exceeds the performance of existing feature-selection methods from the literature. Applications of our feature-selection process to other domains are also discussed.
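The idea of time-delayed features can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `delay_embed`, the choice of lags, the `horizon` parameter, and the sample quotes are all hypothetical, and the target used here (the lowest quote over the next few days) is only one plausible reading of "future minimum price".

```python
from typing import List, Sequence, Tuple


def delay_embed(
    series: Sequence[float],
    lags: Sequence[int],
    horizon: int = 1,
) -> Tuple[List[List[float]], List[float]]:
    """Turn one daily price series into (features, targets).

    The feature row for day t holds series[t - lag] for each lag,
    so a lag of 0 is the current quote and a lag of 2 is the quote
    from two days earlier. The target for day t is the lowest quote
    observed over the following `horizon` days (an assumption made
    for this sketch).
    """
    max_lag = max(lags)
    rows: List[List[float]] = []
    targets: List[float] = []
    # Start once enough history exists; stop while a full horizon remains.
    for t in range(max_lag, len(series) - horizon):
        rows.append([series[t - lag] for lag in lags])
        targets.append(min(series[t + 1 : t + 1 + horizon]))
    return rows, targets


# Hypothetical daily minimum quotes for one route and travel date.
quotes = [300.0, 310.0, 305.0, 290.0, 295.0, 320.0, 315.0]
X, y = delay_embed(quotes, lags=(0, 1, 2), horizon=2)
```

Each row of `X` can then be passed to any regression model. Pruning lags or whole groups of features, as the abstract describes, would amount to choosing which columns to keep.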
Index Terms
- On Optimizing Airline Ticket Purchase Timing