Skip to main content
Log in

Extraction and visualization of industrial service portfolios by text mining of 10-K annual reports

  • Published:
Flexible Services and Manufacturing Journal Aims and scope Submit manuscript

An Erratum to this article was published on 26 April 2016

Abstract

As more and more manufacturing companies accumulate profits from service provision, the ability to monitor the adoption of the industrial services of other companies grows more important. The purpose of this paper is to propose a data-driven methodology for extraction of the industrial service portfolio from a company’s annual report. In this approach, form 10-K, a special format of annual report regulated by the Security Exchange Commission in United States is utilized as the data source. Because this document type contains rich information on a company’s operating segments, industrial service information is easily retrieved. Given the sheer volume of such documents, however, manual inspection is impractical. In order to resolve this issue, a text-mining algorithm is applied to automatically examine word-usage patterns and to identify the service portfolio. Then, the service portfolio’s relative position in the market is visualized on a positioning map. Due to the multi-dimensionality of the data, self-organizing map (SOM) is used as an alternative visualization scheme. SOM enables easy identification of the major service clusters as well as niche areas in the market; these, in turn, provide valuable information pertinent to service development planning. Also, and not least, policy makers can utilize our methodology to detect the servitization trends of various industries.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Davies A (2004) Moving base into high-value integrated solutions: a value stream approach. Ind Corp Change 3(5):727–756

    Article  Google Scholar 

  • Delen D, Crossland MD (2008) Seeding the survey and analysis of research literature with text mining. Expert Syst Appl 34(3):1707–1720

    Article  Google Scholar 

  • Ding Y, Chowdhury GG, Foo S (2000) Incorporating the results of co-word analyses to increase search variety for information retrieval. J Inf Sci 26(6):429–451

    Article  Google Scholar 

  • Eklund T, Back B, Vanharanta H, Visa A (2003) Using the self-organizing map as a visualization tool in financial benchmarking. Inf Vis 2(3):171–181

    Article  Google Scholar 

  • Fang E, Palmatier RW, Steenkamp JBE (2008) Effect of service transition strategies on firm value. J Mark 72(5):1–14

    Article  Google Scholar 

  • García D, Norli Ø (2012) Crawling EDGAR. Span Rev Financ Econ 10(1):1–10

    Article  Google Scholar 

  • Gebauer H (2008) Identifying service strategies in product manufacturing companies by exploring environment-strategy configurations. Ind Mark Manag 37(3):278–291

    Article  MathSciNet  Google Scholar 

  • Griffin P (2003) Got information? Investor response to Form 10-K and Form 10-Q EDGAR filings. Rev Acc Stud 8(4):433–460

    Article  Google Scholar 

  • Guthrie JA, Guthrie, L, Wilks Y, Aidinejad H (1991) Subject-dependent co-occurrence and word sense disambiguation. In: Proceedings of the 29th annual meeting on Association for Computational Linguistics (Stroudsburg, Pennsylvania), pp. 146–152

  • Homburg C, Garbe B (1999) Towards an improved understanding of industrial services: quality dimensions and their impact on buyer–seller relationships. J Bus Bus Mark 6(2):39–71

    Article  Google Scholar 

  • Kiang MY, Hu MY, Fisher DM (2006) An extended self-organizing map network for market segmentation—a telecommunication example. Decis Support Syst 42(1):36–47

    Article  Google Scholar 

  • Kindström D, Kowalkowski C (2009) Development of industrial service offerings: a process framework. J Serv Manag 20(2):156–172

    Article  Google Scholar 

  • Kohonen T (2001) Self-organizing maps. Springer, Berlin

    Book  MATH  Google Scholar 

  • Kowalkowski C, Kindstrom D, Brehmer P (2011) Managing industrial service offerings in global business market. J Bus Ind Mark 26(3):181–192

    Article  Google Scholar 

  • Lewis DD, Ringuette M (1994) A comparison of two learning algorithms for text categorization. In: Proceedings of third annual symposium on document analysis and information retrieval (Las Vegas, Nevada), pp 81–93

  • Li F (2008) Annual report readability, current earnings, and earnings persistence. J Acc Econ 45(2):221–244

    Article  Google Scholar 

  • Lovelock C, Wirtz J (2007) Service marketing: people, technology, strategy, 6th edn. Pearson Prentice Hall, Upper Saddle River

    Google Scholar 

  • Matthyssens P, Vandenbempt K. (2008). Moving from basic offerings to value-added solutions: Strategies, barriers and alignment. Indl Mark Manag, 37(3):316–328.

  • Mazanec JA (1995) Positioning analysis with self-organizing maps: an exploratory study on luxury hotels. Cornell Hotel Restaur Adm Q 36(6):80–95

    Google Scholar 

  • Neely A (2008) Exploring the financial consequences of the servitization of manufacturing. Oper Manag Res 1(2):103–118

    Article  Google Scholar 

  • Oliva R, Kallenberg R (2003) Managing the transition from products to services. Int J Serv Ind Manag 14(2):160–172

    Article  Google Scholar 

  • Ramos J (2003) Using tf-idf to determine word relevance in document queries. In: Proceedings of the first instructional conference on machine learning (Piscataway, NJ, 2003)

  • Security Exchange Commission (2015) Rules, regulations and schedules. Retrieved September 17, from https://www.sec.gov/divisions/corpfin/ecfrlinks.shtml

  • Trappey AJ, Trappey CV (2008) An R&D knowledge management method for patent document summarization. Ind Manag Data Syst 180(2):245–257

    Article  Google Scholar 

  • Vandermerwe S, Rada J (1988) Servitization of business: adding value by adding services. Eur Manag J 6(4):314–324

    Article  Google Scholar 

  • Wiener E, Pedersen JO, Weigend AS (1995). A neural network approach to topic spotting. In: Proceedings of SDAIR-95, 4th annual symposium on document analysis and information retrieval (Las Vegas, Nevada), pp 332–347

  • Wise R, Baumgartner P (1999) Go downstream: the new profit imperative in manufacturing. Harv Bus Rev 77(5):133–141

    Google Scholar 

  • Wu HC, Luk RWP, Wong KF, Kwok KL (2008) Interpreting tf-idf term weights as making relevance decisions. ACM Trans Inf Syst (TOIS) 26(3):13:3–13:37

    Article  Google Scholar 

  • Yang Y, Pedersen JO (1997) A comparative study on feature selection in text categorization. In: Proceedings of ICML-97, 14th international conference on machine learning (Nashville, TN, 1997), pp 412–420

  • Yoon BU, Yoon CB, Park YT (2002) On the development and application of a self-organizing feature map-based patent map. R&D Manag 32(4):291–300

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by the “Core Technology Development Program for Knowledge-Based Service” funded by the Korean Government (MOTIE) (Project No. 10048090, Title: Manufacturing Servitization Support Framework). The authors deeply appreciate the administrative support during the project period from Institute for Industrial Systems Innovation of Seoul National University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yoo S. Hong.

Additional information

An erratum to this article is available at http://dx.doi.org/10.1007/s10696-016-9243-9.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lee, J., Hong, Y.S. Extraction and visualization of industrial service portfolios by text mining of 10-K annual reports. Flex Serv Manuf J 28, 551–574 (2016). https://doi.org/10.1007/s10696-015-9235-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10696-015-9235-1

Keywords

Navigation