TITLE:
The Exploration of the Approach to Data Preparation for Chinese Text Analysis Based on R Language
AUTHORS:
Jiang Li
KEYWORDS:
Data Preparation, Text Analysis, R Language, Chinese Text Segmentation
JOURNAL NAME:
Open Access Library Journal,
Vol.8 No.9,
September
3,
2021
ABSTRACT: This paper explores how to prepare data for analyzing the Chinese texts with R language based on the theory of Welbers, particularly comparing the R package Rwordseg with jiebaR to see the results of Chinese text segmentation at the step of preprocessing.