• Journal of Internet Computing and Services
    ISSN 2287 - 1136 (Online) / ISSN 1598 - 0170 (Print)
    https://jics.or.kr/

A Proposal of Methods for Extracting Temporal Information of History-related Web Document based on Historical Objects Using Machine Learning Techniques


Jun Lee, YongJin KWON, Journal of Internet Computing and Services, Vol. 16, No. 4, pp. 39-50, Aug. 2015
10.7472/jksii.2015.16.4.39, Full Text:
Keywords: Temporal information, Termporal Extraction, Machine Learning, Similarity filtering, Historical information, Historical Object

Abstract

In information retrieval process through search engine, some users want to retrieve several documents that are corresponding with specific time period situation. For example, if user wants to search a document that contains the situation before 'Japanese invasions of Korea era', he may use the keyword 'Japanese invasions of Korea' by using searching query. Then, search engine gives all of documents about 'Japanese invasions of Korea' disregarding time period in order. It makes user to do an additional work. In addition, a large percentage of cases which is related to historical documents have different time period between generation date of a document and record time of contents. If time period in document contents can be extracted, it may facilitate effective information for retrieval and various applications. Consequently, we pursue a research extracting time period of Joseon era's historical documents by using historic literature for Joseon era in order to deduct the time period corresponding with document content in this paper. We define historical objects based on historic literature that was collected from web and confirm a possibility of extracting time period of web document by machine learning techniques. In addition to the machine learning techniques, we propose and apply the similarity filtering based on the comparison between the historical objects. Finally, we'll evaluate the result of temporal indexing accuracy and improvement.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
Lee, J. & KWON, Y. (2015). A Proposal of Methods for Extracting Temporal Information of History-related Web Document based on Historical Objects Using Machine Learning Techniques. Journal of Internet Computing and Services, 16(4), 39-50. DOI: 10.7472/jksii.2015.16.4.39.

[IEEE Style]
J. Lee and Y. KWON, "A Proposal of Methods for Extracting Temporal Information of History-related Web Document based on Historical Objects Using Machine Learning Techniques," Journal of Internet Computing and Services, vol. 16, no. 4, pp. 39-50, 2015. DOI: 10.7472/jksii.2015.16.4.39.

[ACM Style]
Jun Lee and YongJin KWON. 2015. A Proposal of Methods for Extracting Temporal Information of History-related Web Document based on Historical Objects Using Machine Learning Techniques. Journal of Internet Computing and Services, 16, 4, (2015), 39-50. DOI: 10.7472/jksii.2015.16.4.39.