Год выпуска: 2013 Автор: Mohammed M. Abu Tair Издательство: LAP Lambert Academic Publishing Страниц: 128 ISBN: 9783659347665
Описание
Text classification has become one of the most important techniques in text mining. A number of machine learning algorithms have been introduced to deal with automatic text classification. One of the common classification algorithms is the k-NN algorithm which is known to be one of the best classifiers applied for different languages including Arabic language. However, the k-NN algorithm is of low efficiency because it requires a large amount of computational power. Such a drawback makes it unsuitable to handle a large volume of text documents with high dimensionality and in particular in the Arabic language. This book, therefore, introduces a high performance parallel classifier for large-scale Arabic text that achieves the enhanced level of efficiency, scalability, and accuracy. The parallel classifier based on the sequential k-NN algorithm. We tested the classifier using the OSAC corpus. We studied the performance of the parallel classifier on a multicomputer cluster. The results...