Analysis of Competitor Intelligence in the Era of Big Data: An Integrated System Using Text Summarization Based on Global Optimization

Chakraborti, Swapnajit; Dey, Shubhamoy

doi:10.1007/s12599-018-0562-0

Analysis of Competitor Intelligence in the Era of Big Data: An Integrated System Using Text Summarization Based on Global Optimization

Research Paper
Published: 16 October 2018

Volume 61, pages 345–355, (2019)
Cite this article

Business & Information Systems Engineering Aims and scope Submit manuscript

Swapnajit Chakraborti¹ &
Shubhamoy Dey²

868 Accesses
4 Citations
Explore all metrics

Abstract

Automatic text summarization can be applied to extract summaries from competitor intelligence (CI) corpora that organizations create by gathering textual data from the Internet. Such a representation of CI text is easier for managers to interpret and use for making decisions. This research investigates design of an integrated system for CI analysis which comprises clustering and automatic text summarization and evaluates quality of extractive summaries generated automatically by various text-summarization techniques based on global optimization. This research is conducted using experimentation and empirical analysis of results. A survey of practicing managers is also carried out to understand the effectiveness of automatically generated summaries from CI perspective. Firstly, it shows that global optimization-based techniques generate good quality extractive summaries for CI analysis from topical clusters created by the clustering step of the integrated system. Secondly, it shows the usefulness of the generated summaries by having them evaluated by practicing managers from CI perspective. Finally, the implication of this research from the point of view of theory and practice is discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

How to design bibliometric research: an overview and a framework proposal

Article Open access 06 March 2024

Customer profiling, segmentation, and sales prediction using AI in direct marketing

Article Open access 23 December 2023

Balanced scorecard: trends, developments, and future directions

Article Open access 30 September 2023

References

Ackermann M, Soares C, Guidemann B (2006) Practical data mining: applications, experiences and challenges. In: SAS & PKDD workshop, Berlin
Alguliev RM, Aliguliyev RM, Mehdiyev CA (2011a) Sentence selection for generic document summarization using an adaptive differential evolution algorithm. Swarm Evol Comput 1(4):213–222
Article Google Scholar
Alguliev RM, Aliguliyev RM, Hajirahimova MS, Mehdiyev CA (2011b) MCMR: maximum coverage and minimum redundant text summarization model. Expert Syst Appl 38(12):14514–14522
Article Google Scholar
Alguliev RM, Aliguliyev RM, Isazade NR (2012) DESAMC + DocSum: differential evolution with self-adaptive mutation and crossover parameters for multi-document summarization. Knowl Based Syst 36:21–38
Article Google Scholar
Alguliev RM, Aliguliyev RM, Isazade NR (2013) CDDS: constraint-driven document summarization models. Expert Syst Appl 40(2):458–465
Article Google Scholar
Alguliev RM, Aliguliyev RM, Isazade NR (2014) Multiple documents summarization based on evolutionary optimization algorithm. Expert Syst Appl 40(5):1675–1689
Article Google Scholar
Allan J, Carbonell J, Doddington G, Yamron J, Yang Y (2000) Topic detection and tracking pilot study final report. DARPA, Arlington
Google Scholar
Arnott D, Pervan G (2008) Eight key issues for the decision support systems discipline. Decis Support Syst 44(3):657–672
Article Google Scholar
Baralis E, Cagliero L, Jabeen S, Fiori A, Shah S (2013) Multi-document summarization based on Yago ontology. Expert Syst Appl 40(17):6976–6984
Article Google Scholar
Barzilay R, Elhadad M (1997) Using lexical chains for text summarization. In: Proceedings ISTS, pp 10–17
Bellegarda J, Butzberger JW, Chow Y, Coccaro NB, Naik D (1996) A novel word clustering algorithm based on latent semantic analysis. In: ICASSP, vol 1. IEEE, pp 172–175
Bissantz N, Hagedorn J (2009) Data mining. Bus Inf Syst Eng 1(1):118–122
Article Google Scholar
Bitzer P, Söllner M, Leimeister JM (2015) Design principles for high-performance blended learning services delivery. Bus Inf Syst Eng 58(2):135–149
Article Google Scholar
Bland JM, Altman DG (1995) Multiple significance tests: the Bonferroni method. BMJ 310:170
Article Google Scholar
Blei DM (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022
Google Scholar
Browne GJ, Cheung C, Heinzl A, Riedl R (2017) Human information behavior. Bus Inf Syst Eng 59(1):1–2
Article Google Scholar
Carbonell J, Goldstein J (1998) The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of SIGIR, pp 335–336
Carullo MB, Binaghi E, Gallo I (2009) An online document clustering technique for short web contents. Pattern Recognit Lett 30(10):870–876
Article Google Scholar
Chakraborti S (2015) Multi-document text summarization for competitor intelligence: a methodology based on topic identification and artificial bee colony optimization. SAC, ACM Digital Library, Salamanca, pp 1110–1111
Google Scholar
Chakraborti S, Dey S (2014) Multi-document text summarization for competitor intelligence: a methodology. In: ISCBI-2014. IEEE Computer Society, New Delhi, pp 97–100
Chakraborti S, Dey S (2015) Product news summarization for competitor intelligence using topic identification and artificial bee colony optimization. ACM RACS, ACM Digital Library, Prague, pp 1–6
Google Scholar
Chakraborti S, Dey S (2016) Multi-level k-means text clustering technique for topic identification for competitor intelligence. In: Proceedings of RCIS. IEEE, Grenoble, pp 1–10
Codina-Filbà J, Bouayad-Agha N, Burga A, Casamayor G, Wanner L (2017) Using genre-specific features for patent summaries. Inf Process Manag 53(1):151–174
Article Google Scholar
Cohen W, Levinthal D (1990) Absorptive capacity: a new perspective on learning and innovation. Adm Sci Q 35(1):128–152
Article Google Scholar
Deb K (2000) An efficient constraint handling method for genetic algorithms. Comput Method Appl Mech Eng 186(2–4):311–338
Article Google Scholar
Deerwester S (1990) Indexing by latent semantic analysis. JOASIS 41(6):391–407
Article Google Scholar
DeJong GF (1978) Fast skimming of news stories: the FRUMP system. PhD thesis, Yale University
Donohue DP, Murphy PM (2016) Supporting competitive intelligence at DuPont by controlling information overload and cutting through the noise. J Inf Knowl Manag 15(1):1650004. https://doi.org/10.1142/S0219649216500040
Article Google Scholar
Dumais S, Chen H (2000) Hierarchical classification of web content. In: SIGIR Conference on research and development in information retrieval. ACM, pp 256–263
Erkan G, Radev DR (2004) LexRank: graph-based lexical centrality as salience in text summarization. J Artif Intell Res 22:457–479
Article Google Scholar
Fischer C, Winter R, Wortmann F (2010) Design theory. Bus Inf Syst Eng 2(6):387–390
Article Google Scholar
Flath C, Nicolay D, Conte T, Dinther C, Filipova-Neumann L (2012) Cluster analysis of smart metering data. Bus Inf Syst Eng 4(1):31–39
Article Google Scholar
Gilad B (2015) Companies collect competitive intelligence but don’t use it. Harv Bus Rev
Gilad B, Fuld L (2016) Only half of companies actually use the competitive intelligence they collect. Harv Bus Rev
Groom JR, David FR (2001) Competitive intelligence activity among small firms. SAM Adv Manag J 66(1):12–20
Google Scholar
Halko J, Martinsson P, Tropp J (2010) Finding structure with randomness: probabilistic algorithm for constructing approximate matrix decomposition. SIAM Rev 53(2):217–288
Article Google Scholar
Heinrich P, Schwabe G (2017) Facilitating informed decision-making in financial service encounters. Bus Inf Syst Eng 60(4):317–329
Article Google Scholar
Hevner AR (2007) A three cycle view of design science research. Scand J Inf Syst 19(2):87–92
Google Scholar
Hevner AR, March ST, Park J, Ram S (2004) Design science in information systems research. MIS Q 28(1):75–105
Article Google Scholar
Holland JH (1975) Adaptation in natural and artificial systems. University of Michigan Press, Ann Arbor
Google Scholar
Hornik K, Kober M, Feinerer I, Buchta C (2012) Spherical k-means clustering. J Stat Softw 50(10):1–22
Article Google Scholar
Hu Y, Chen Y, Chou H (2017) Opinion mining from online hotel reviews – a text summarization approach. Inf Process Manag 53(2):436–449
Article Google Scholar
Jain AM, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323
Article Google Scholar
Jishma Mohan M, Sunitha C, Ganesh A, Jaya A (2016) A study on ontology based abstractive summarization. Proc Comput Sci 87:32–37
Article Google Scholar
Karaboga D, Akay B (2011) A modified artificial bee colony (ABC) algorithm for constrained optimization problems. Appl Soft Comput 11(3):3021–3031
Article Google Scholar
Karaboga D, Basturk B (2007) Artificial bee colony (ABC) optimization algorithm for solving constrained optimization problems. In: Foundations of fuzzy logic and soft computing, IFSA, pp 789–798
Kotonya G, Sommerville I (1998) Requirements engineering processes and techniques. Wiley, Hoboken
Google Scholar
Kowalczyk M (2014) Big data and information processing in organizational decision processes. Bus Inf Syst Eng 6(5):267–278
Article Google Scholar
Lin C (2004) ROUGE: a package for automatic evaluation of summaries. In: Proceedings of the workshop in text summarization. ACL, pp 74–81
Lloret E, Romá-Ferri MT, Palomar M (2013) COMPENDIUM: a text summarization system for generating abstracts of research papers. Data Knowl Eng 88:164–175
Article Google Scholar
Lockshin LS, Spawton AL, Macintosh G (1997) Using product, brand and purchasing involvement for retail segmentation. J Retail Consum Serv 4(3):171–183
Article Google Scholar
Luhn HP (1958) The automatic creation of literature abstracts. IBM J Res Dev 2(2):159–165
Article Google Scholar
MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley symposium on mathematical statistics and probability. University of California Press, pp 287–297
Mani I, Maybury M (1999) Advances in automatic text summarization. MIT Press, Cambridge
Google Scholar
Marcu D (1998) Improving summarization through rhetorical parsing tuning. In: Proceedings of the sixth workshop on very large corpora, pp 206–215
Mendoza M, Bonilla S, Noguera C, Cobos C, León E (2014) Extractive single-document summarization based on genetic operators and guided local search. Expert Syst Appl 41(9):4158–4169
Article Google Scholar
Miller GA (1956) The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychol Rev 63(2):81–97
Article Google Scholar
Moens M (2007) Summarizing court decisions. Inf Process Manag 43(6):1748–1764
Article Google Scholar
Oberle D, Bhatti N, Brockmans S, Niemann M, Janiesch C (2009) Countering service information challenges in the internet of services. Bus Inf Syst Eng 1(5):370–390
Article Google Scholar
Okike C, Fernandes KJ (2012) Impact of information use architecture on load and usability. Inf Process Manag 48(5):995–1016
Article Google Scholar
O’Reilly CA (1983) The use of information in organizational decision making: a model and some propositions. Res Organ Behav 5:103–140
Google Scholar
Orenga-Roglá S, Chalmeta R (2017) Methodology for the implementation of knowledge management systems. Bus Inf Syst Eng 2:1–19
Google Scholar
Rackoff N, Wiseman C, Ulrich WA (1985) Information systems for competitive advantage: implementation of a planning process. MIS Q 9(4):285–294
Article Google Scholar
Radev DR (2004) MEAD – a platform for multidocument multilingual text summarization. In: Proceedings of LREC, Lisbon
Radev DR, Jing H, Budzikowska M (2000) Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies. In: NAACL-ANLP 2000 workshop on automatic summarization, pp 21–30
Radev DR, Hovy E, McKeown K (2002) Introduction to the special issue on summarization. Comput Linguist 28(4):399–408
Article Google Scholar
Ravi K, Ravi V (2015) A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl Based Syst 89:14–46
Article Google Scholar
Reeve LH, Han H, Brooks AD (2007) The use of domain-specific concepts in biomedical text summarization. Inf Process Manag 43(6):1765–1776
Article Google Scholar
Reuters (1987) Retrieved from Reuters-21578 text categorization data set. Retrieved from Reuters-21578: http://kdd.ics.uci.edu/databases/reuters21578/reuters21578.html. Accessed 7 Jan 2017
Sankarasubramaniam Y, Ramanathan K, Ghosh S (2014) Text summarization using Wikipedia. Inf Process Manag 50(3):443–461
Article Google Scholar
Simon HA (1996) The sciences of the artificial. MIT Press, Cambridge
Google Scholar
Simon B (2010) A discussion on competency management systems from a design theory perspective. Bus Inf Syst Eng 2(6):337–346
Article Google Scholar
Storn R, Price K (1996) Differential evolution – a simple and efficient adaptive scheme for global optimization over continuous spaces. University of California, Berkeley
Google Scholar
Stroh F, Winter R, Wortmann F (2011) Method support of information requirements analysis for analytical information systems. Bus Inf Syst Eng 3(1):33–43
Article Google Scholar
Tallon PP, Ramirez RV, Short JE (2013–2014) The information artifact in IT governance: toward a theory of information governance. J Manag Inf Syst 30(3):141–147
Tseng YH, Lin CJ, Lin Y (2007) Text mining techniques for patent analysis. Inf Process Manag 43(5):1216–1247
Article Google Scholar
Wang YD, Forgionne G (2006) A decision-theoretic approach to the evaluation of information retrieval systems. Inf Process Manag 42(4):863–874
Article Google Scholar
Wang W, Li S, Li J, Li W, Wei F (2013) Exploring hypergraph-based semi-supervised ranking for query-oriented summarization. Inf Sci 237:271–286
Article Google Scholar
Wang N, Sun S, OuYang D (2016) Business process modeling abstraction based on semi-supervised clustering analysis. Bus Inf Syst Eng 1–18
Wilson TD (1981) On user studies and information needs. J Doc 37(1):3–15
Article Google Scholar
Wright S, Pickton DW, Callow J (2002) Competitive intelligence in UK Firms: a typology. Mark Intell Plan 20(6):349–360
Article Google Scholar
Xu M, Ong V, Duan Y, Mathews B (2011) Intelligent agent systems for executive information scanning, filtering and interpretation: perceptions and challenges. Inf Process Manag 47(2):186–201
Article Google Scholar
Zaby C, Wilde KD (2017) Intelligent business processes in CRM. Bus Inf Syst Eng 1–16
Zhan J, Loh HT, Liu Y (2009) Gather customer concerns from online product reviews – a text summarization approach. Expert Syst Appl 36(2):2107–2115
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Management, S. P. Jain Institute of Management and Research (SPJIMR), Mumbai, India
Swapnajit Chakraborti
Department of Information Systems, Indian Institute of Management Indore, Indore, M.P., India
Shubhamoy Dey

Authors

Swapnajit Chakraborti
View author publications
You can also search for this author in PubMed Google Scholar
Shubhamoy Dey
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Swapnajit Chakraborti.

Additional information

Accepted after one revision by Natalia Kliewer.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chakraborti, S., Dey, S. Analysis of Competitor Intelligence in the Era of Big Data: An Integrated System Using Text Summarization Based on Global Optimization. Bus Inf Syst Eng 61, 345–355 (2019). https://doi.org/10.1007/s12599-018-0562-0

Download citation

Received: 10 April 2018
Accepted: 29 August 2018
Published: 16 October 2018
Issue Date: 01 June 2019
DOI: https://doi.org/10.1007/s12599-018-0562-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Analysis of Competitor Intelligence in the Era of Big Data: An Integrated System Using Text Summarization Based on Global Optimization

Abstract

Access this article

Similar content being viewed by others

How to design bibliometric research: an overview and a framework proposal

Customer profiling, segmentation, and sales prediction using AI in direct marketing

Balanced scorecard: trends, developments, and future directions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Analysis of Competitor Intelligence in the Era of Big Data: An Integrated System Using Text Summarization Based on Global Optimization

Abstract

Access this article

Similar content being viewed by others

How to design bibliometric research: an overview and a framework proposal

Customer profiling, segmentation, and sales prediction using AI in direct marketing

Balanced scorecard: trends, developments, and future directions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation