Enhancing Focus Topic Findings of Discussion Forum through Corpus Classifier Algorithm
Reina Setiawan1, Widodo Budiharto2, Iman Herwidiana Kartowisastro3, Harjanto Prabowo4 

1Reina Setiawan, Department of, Computer Science, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia.
2Widodo Budiharto Department of, Computer Science, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia.
3Iman Herwidiana Kartowisastro, Department of, Computer Science, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia.
4Harjanto Prabowo Department of, Computer Science, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia.

Manuscript received on 03 March 2019 | Revised Manuscript received on 08 March 2019 | Manuscript published on 30 July 2019 | PP: 1525-1530| Volume-8 Issue-2, July 2019 | Retrieval Number: B2166078219/19©BEIESP | DOI: 10.35940/ijrte.B2166.078219
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: In learning management system, a discussion forum, in which the students and lecturers are involved actively as part of the learning method, enriches the context of communication, thereby enhancing the students’ learning and performance. The aim of this paper was to determine the appropriate topics for a discussion forum for learning management systems through enhanced probabilistic latent semantic analysis (PLSA) with the corpus classifier algorithm. In preparing the paper, the methods used were PLSA and the classifying process, which classifies the documents to become a corpus based on the similarity word approach. The similarity word is influenced by the term-frequency of the word in the document. The novel concept in this paper is the corpus classifier algorithm. The experiment was conducted using three approaches to discover the topic, and it used 4,868 distinct words from 234 documents. The documents were contained in three threads subject. The post of the discussion forum is the text document. The performance of the result was measured by the f-measure, which was calculated for each thread subject. The corpus classifier algorithm was used in the second approach, and third approach increased the average f-measure values for the second and third thread subjects by approximately 24 and 17%, respectively.
Keywords: Corpus Classification, Discussion Forum, PLSA, Similarity Word, Topic Findings

Scope of the Article: Classification