文本分類

分類聚類 不同

  • Clustering determines how to put documents into groups.

  • Classification assigns documents to groups that have been defined.

文本分類基本概念

  • 類別及其數量要先訂好。

  • 可以用不同的機器學習演算法實驗。Popular supervised text categorization algorithms: Naive Bayes, k Nearest Neighbor (kNN) and Logistic Regression, Support Vector Machines and Decision Trees

R 套件 RTextTools 練習

Last updated