• Journal of Internet Computing and Services
    ISSN 2287 - 1136 (Online) / ISSN 1598 - 0170 (Print)
    https://jics.or.kr/

Methodology for Issue-related R&D Keywords Packaging Using Text Mining


Yoonjin Hyun, William Wong Xiu Shun, Namgyu Kim, Journal of Internet Computing and Services, Vol. 16, No. 2, pp. 57-66, Apr. 2015
10.7472/jksii.2015.16.2.57, Full Text:
Keywords: Association Rules Mining, Keyword Matching, Social Network Analysis, Text Mining, Topic Analysis

Abstract

Considerable research efforts are being directed towards analyzing unstructured data such as text files and log files using commercial and noncommercial analytical tools. In particular, researchers are trying to extract meaningful knowledge through text mining in not only business but also many other areas such as politics, economics, and cultural studies. For instance, several studies have examined national pending issues by analyzing large volumes of text on various social issues. However, it is difficult to provide successful information services that can identify R&D documents on specific national pending issues. While users may specify certain keywords relating to national pending issues, they usually fail to retrieve appropriate R&D information primarily due to discrepancies between these terms and the corresponding terms actually used in the R&D documents. Thus, we need an intermediate logic to overcome these discrepancies, also to identify and package appropriate R&D information on specific national pending issues. To address this requirement, three methodologies are proposed in this study-a hybrid methodology for extracting and integrating keywords pertaining to national pending issues, a methodology for packaging R&D information that corresponds to national pending issues, and a methodology for constructing an associative issue network based on relevant R&D information. Data analysis techniques such as text mining, social network analysis, and association rules mining are utilized for establishing these methodologies. As the experiment result, the keyword enhancement rate by the proposed integration methodology reveals to be about 42.8%. For the second objective, three key analyses were conducted and a number of association rules between national pending issue keywords and R&D keywords were derived. The experiment regarding to the third objective, which is issue clustering based on R&D keywords is still in progress and expected to give tangible results in the future.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
Hyun, Y., Shun, W., & Kim, N. (2015). Methodology for Issue-related R&D Keywords Packaging Using Text Mining. Journal of Internet Computing and Services, 16(2), 57-66. DOI: 10.7472/jksii.2015.16.2.57.

[IEEE Style]
Y. Hyun, W. W. X. Shun, N. Kim, "Methodology for Issue-related R&D Keywords Packaging Using Text Mining," Journal of Internet Computing and Services, vol. 16, no. 2, pp. 57-66, 2015. DOI: 10.7472/jksii.2015.16.2.57.

[ACM Style]
Yoonjin Hyun, William Wong Xiu Shun, and Namgyu Kim. 2015. Methodology for Issue-related R&D Keywords Packaging Using Text Mining. Journal of Internet Computing and Services, 16, 2, (2015), 57-66. DOI: 10.7472/jksii.2015.16.2.57.