• Journal of Internet Computing and Services
    ISSN 2287 - 1136 (Online) / ISSN 1598 - 0170 (Print)
    https://jics.or.kr/

IPC Multi-label Classification based on Functional Characteristics of Fields in Patent Documents


Sora Lim, YongJin Kwon, Journal of Internet Computing and Services, Vol. 18, No. 1, pp. 77-88, Feb. 2017
10.7472/jksii.2017.18.1.77, Full Text:
Keywords: Patent classification, IPC Classification, Patent Document Fields, Field function, Multi-label classification

Abstract

Recently, with the advent of knowledge based society where information and knowledge make values, patents which are the representative form of intellectual property have become important, and the number of the patents follows growing trends. Thus, it needs to classify the patents depending on the technological topic of the invention appropriately in order to use a vast amount of the patent information effectively. IPC (International Patent Classification) is widely used for this situation. Researches about IPC automatic classification have been studied using data mining and machine learning algorithms to improve current IPC classification task which categorizes patent documents by hand. However, most of the previous researches have focused on applying various existing machine learning methods to the patent documents rather than considering on the characteristics of the data or the structure of patent documents. In this paper, therefore, we propose to use two structural fields, technical field and background, considered as having impacts on the patent classification, where the two field are selected by applying of the characteristics of patent documents and the role of the structural fields. We also construct multi-label classification model to reflect what a patent document could have multiple IPCs. Furthermore, we propose a method to classify patent documents at the IPC subclass level comprised of 630 categories so that we investigate the possibility of applying the IPC multi-label classification model into the real field. The effect of structural fields of patent documents are examined using 564,793 registered patents in Korea, and 87.2% precision is obtained in the case of using title, abstract, claims, technical field and background. From this sequence, we verify that the technical field and background have an important role in improving the precision of IPC multi-label classification in IPC subclass level.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
Lim, S. & Kwon, Y. (2017). IPC Multi-label Classification based on Functional Characteristics of Fields in Patent Documents. Journal of Internet Computing and Services, 18(1), 77-88. DOI: 10.7472/jksii.2017.18.1.77.

[IEEE Style]
S. Lim and Y. Kwon, "IPC Multi-label Classification based on Functional Characteristics of Fields in Patent Documents," Journal of Internet Computing and Services, vol. 18, no. 1, pp. 77-88, 2017. DOI: 10.7472/jksii.2017.18.1.77.

[ACM Style]
Sora Lim and YongJin Kwon. 2017. IPC Multi-label Classification based on Functional Characteristics of Fields in Patent Documents. Journal of Internet Computing and Services, 18, 1, (2017), 77-88. DOI: 10.7472/jksii.2017.18.1.77.