|
USING TEXT MINING TO CLASSIFY RESEARCH PAPERS
|
|
|
S. Sulova;L. Todoranova;B. Penchev;R. Nacheva
|
|
|
||
|
|
|
|
1314-2704
|
|
|
||
|
English
|
|
|
17
|
|
|
21
|
|
|
|
|
|
||
|
The volume of scientific literature has grown rapidly in recent years, raising pressing questions about its storage and organization. Many research papers are available only through the websites of the journals that publish them. Problems arise when journals use different classification codes to organize these papers, or when a scientific field lacks a specific categorization scheme. This complicates the work of researchers who want to quickly and easily find literature on a specific topic among the large number of scientific publications. At the same time, research interest in natural language processing is growing, because much of the available information is unstructured and in the form of plain text. To improve and automate the organization and classification of scientific papers, we propose an approach based on natural language processing. It applies supervised machine learning with two text categorization algorithms: Support Vector Machines (SVM) and Naive Bayes (NB). The proposed approach classifies scientific literature according to its content. In our experiments we used over 200 papers published during the last four years in the journal "Izvestiya", which is issued by the University of Economics - Varna. The articles cover different topic areas and are written in English. The experiments were conducted with the software product RapidMiner.
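The Naive Bayes half of the approach described above can be illustrated with a minimal sketch. This is not the authors' RapidMiner process: it is a plain Python multinomial Naive Bayes with add-one smoothing, written only to show how supervised text categorization assigns a topic label from word counts. The class name `NaiveBayesTextClassifier`, the `tokenize` helper, the two topic labels, and the toy training sentences are all assumptions made for illustration and do not come from the paper's corpus. SVM is not shown here.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTextClassifier:
    """Hypothetical multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, documents, labels):
        # Count documents per class and word occurrences per class.
        self.class_doc_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        self.vocabulary = set()
        for text, label in zip(documents, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocabulary.update(tokens)
        self.total_docs = len(labels)
        return self

    def log_scores(self, text):
        # Words never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocabulary]
        vocab_size = len(self.vocabulary)
        scores = {}
        for label, doc_count in self.class_doc_counts.items():
            # Log prior: share of training documents with this label.
            score = math.log(doc_count / self.total_docs)
            total_words = sum(self.word_counts[label].values())
            for token in tokens:
                count = self.word_counts[label][token]
                # Smoothed log likelihood of the word given the class.
                score += math.log((count + 1) / (total_words + vocab_size))
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Toy training data, invented for illustration only.
train_docs = [
    "inflation and monetary policy in the banking sector",
    "tax revenue and fiscal policy of the state budget",
    "software architecture for cloud information systems",
    "database design and data mining algorithms",
]
train_labels = ["economics", "economics", "informatics", "informatics"]

classifier = NaiveBayesTextClassifier().fit(train_docs, train_labels)
print(classifier.predict("monetary policy and inflation targets"))
print(classifier.predict("data mining algorithms for cloud software"))
```

In a realistic setting the training documents would be the full texts or abstracts of labeled papers, and accuracy would be estimated on held-out papers rather than on two hand-picked sentences.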
|
|
|
conference
|
|
|
||
|
||
|
17th International Multidisciplinary Scientific GeoConference SGEM 2017
|
|
|
17th International Multidisciplinary Scientific GeoConference SGEM 2017, 29 June - 5 July, 2017
|
|
|
Proceedings Paper
|
|
|
STEF92 Technology
|
|
|
International Multidisciplinary Scientific GeoConference-SGEM
|
|
|
Bulgarian Acad Sci; Acad Sci Czech Republ; Latvian Acad Sci; Polish Acad Sci; Russian Acad Sci; Serbian Acad Sci & Arts; Slovak Acad Sci; Natl Acad Sci Ukraine; Natl Acad Sci Armenia; Sci Council Japan; World Acad Sci; European Acad Sci, Arts & Letters; Ac
|
|
|
647-654
|
|
|
29 June - 5 July, 2017
|
|
|
website
|
|
|
cdrom
|
|
|
3011
|
|
|
Text Mining; Scientific Papers Classification; Supervised Machine Learning; Naive Bayes; RapidMiner
|
|