Multiclass classification for text with scikit-learn

I am trying to build a multiclass classifier for text using scikit-learn. I'm new to scikit-learn and I don't know whether it can classify text at all. My plan is to use a multiclass SVM, but I don't know whether I need to transform the texts before training the classifier or whether it can work with the raw texts directly. Does anyone know of any documentation on this?



1 answer


You can refer to this example: Classification of text documents using sparse features

It covers not only multiclass classification but also the basics of text mining:



  • Vectorization and hashing
  • Feature selection
  • Sparse data processing
  • Comparison of different base models
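
To address the question directly: the classifier cannot consume raw strings, so the texts have to be turned into numeric feature vectors first. Below is a minimal sketch of how that might look, assuming a TF-IDF vectorizer, chi-squared feature selection, and a linear SVM chained in one pipeline. The toy corpus, the labels, and the choice of `k=10` are made up for illustration and are not taken from the linked example.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy corpus with three classes (illustration only).
train_texts = [
    "the team won the football match",
    "the striker scored a late goal",
    "the new laptop has a fast processor",
    "this phone ships with a better camera",
    "the senate passed the budget bill",
    "the president signed the new law",
]
train_labels = ["sports", "sports", "tech", "tech", "politics", "politics"]

model = make_pipeline(
    TfidfVectorizer(),        # raw strings -> sparse TF-IDF matrix
    SelectKBest(chi2, k=10),  # keep the 10 highest-scoring terms (arbitrary choice)
    LinearSVC(),              # linear SVM; handles multiclass as one-vs-rest
)

# The pipeline accepts plain Python strings; vectorization happens inside it.
model.fit(train_texts, train_labels)

new_texts = ["a goal in the football match", "a laptop with a fast processor"]
predictions = model.predict(new_texts)
print(list(zip(new_texts, predictions)))
```

Because the vectorizer is part of the pipeline, the same transformation is applied at training and prediction time, so you never pass pre-vectorized data by hand. On a real dataset you would tune `k` (or drop the selection step) and evaluate on held-out data.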