Automatic Summarizing the News from Inform.kz by Using Natural Language Processing Tools


연구 분야: Artificial Intelligence



학회: 2021 IEEE International Conference on Smart Information Systems and Technologies (SIST)


초록

The rapid rise of the information on the web brought up new problems of data access and processing. Therefore there is a need for tools that will help to overcome the problem of management and handling the Big Data in a quick manner. The primary goal of this work is to propose an efficient method for automatic text summarization by using Natural Language Processing (NLP) and Machine Learning (ML) techniques. This research introduces an abrupt, easily understandable and uncomplicated implementation of this method via overusing Python programming language. Efficient performance is necessary in web search tasks where an enormous of unstructured data need to be summarized very quickly. The novelty of the work is that text summarization is implemented on Kazakh texts. Extractive summarization uses new, keywords focused, approach. Contribution of the work is manually created stop words used for text summarization specifically for Kazakh language and dataset constructed by scraping news from country's largest international news portal www.inform.kz. State-of-the-art results of the work show that it is possible to implement automatic text summarization for Kazakh language.


Author Profile
Bakdaulet Kynabay

Suleyman Demirel University Kazakhstan

Kazakhstan
Author Profile
Aimoldir Aldabergen

Suleyman Demirel University Kazakhstan

Kazakhstan
Author Profile
Azamat Zhamanov

Suleyman Demirel University Kazakhstan

Kazakhstan

📄 논문 정보

발행 연도 2021년
인용수 2
출판 국가 Kazakhstan
사이트 IEEE
좋아요 수 0

연관 논문 목록 (298건)