skip to main content
10.1145/3197026.3203883acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
poster

Exploratory Investigation of Word Embedding in Song Lyric Topic Classification: Promising Preliminary Results

Authors Info & Claims
Published:23 May 2018Publication History

ABSTRACT

In this work we investigate a data-driven vector representation of word embedding for the task of classifying song lyrics into their semantic topics. Previous research on topic classification of song lyrics has used traditional frequency based text representation. On the other hand, empirically driven word embedding has shown sensible performance improvment of text classification tasks, because of its ability to capture semantic relationship between words from big data. As averaging the word vectors from a short text is known to work reasonably well compared to the other comprehensive models utilizing their order, we adopt the averaged word vectors from the lyrics and user's interpretations about them, which are short in general, as the feature for this classification task. This simple approach showed promising classification accuracy of 57%. From this, we envision the potential of the data-driven approaches to creating features, such as the sequence of word vectors and doc2vec models, to improve the performance of the system.

References

  1. Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2016. Enriching Word Vectors with Subword Information. arXiv preprint arXiv:1607.04606 (2016).Google ScholarGoogle Scholar
  2. Kahyun Choi, Jin Ha Lee, and J Stephen Downie. 2014. What is this song about anyway?: Automatic classification of subject using user interpretations and lyrics. Proceedings of the 14th ACM/IEEE-CS Joint Conference on Digital Libraries (2014), 453--454. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Kahyun Choi, Jin Ha Lee, Xiao Hu, and J Stephen Downie. 2016. Music Subject Classification Based on Lyrics and User Interpretations. Proceedings of the American Society for Information Science and Technology (2016), 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Cedric De Boom, Steven Van Canneyt, Thomas Demeester, and Bart Dhoedt. 2016. Representation learning for very short texts using weighted word embedding aggregation. Pattern Recognition Letters 80 (2016), 150--156. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Jin Ha Lee and J Stephen Downie. 2004. Survey Of Music Information Needs, Uses, And Seeking Behaviours: Preliminary Findings. Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR) 2004 (2004), 441--446.Google ScholarGoogle Scholar
  6. Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).Google ScholarGoogle Scholar

Index Terms

  1. Exploratory Investigation of Word Embedding in Song Lyric Topic Classification: Promising Preliminary Results

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      JCDL '18: Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries
      May 2018
      453 pages
      ISBN:9781450351782
      DOI:10.1145/3197026

      Copyright © 2018 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 23 May 2018

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      JCDL '18 Paper Acceptance Rate26of71submissions,37%Overall Acceptance Rate415of1,482submissions,28%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader