TY  - JOUR
AU  - Phu, Vo Ngoc
AU  - Ngoc Tran, Vo Thi
AU  - Max, Jack
PY  - 2018
TI  - A CURE Algorithm for Vietnamese Sentiment Classification in a Parallel Environment
JF  - Journal of Computer Science
VL  - 15
IS  - 10
DO  - 10.3844/jcssp.2019.1355.1377
UR  - https://thescipub.com/abstract/jcssp.2019.1355.1377
AB  - Solutions for processing big data are essential in many fields of research and in commercial applications. This paper proposes a new model for sentiment classification of big data sets in the Cloudera parallel network environment. Clustering Using Representatives (CURE), combined with Hadoop Map (M) / Reduce (R) in Cloudera, a parallel network system, was applied to a Vietnamese testing data set of 20,000 documents: 10,000 positive and 10,000 negative. On this data set, the model achieved a sentiment classification accuracy of 62.92%. Although the data set is small, the proposed model can process millions of Vietnamese documents, as well as data in other languages, and shortens execution time in the distributed environment.