Сравнение версий

Ключ

  • Эта строка добавлена.
  • Эта строка удалена.
  • Изменено форматирование.

...

More details on classification methods can be found here: Overview of classification methods in machine learning with Scikit-Learn

Action icon


Action parameters

...

Parameters and their settings

PropertyDescriptionTypeFilling exampleMandatory field
Parameters
The path to the source folder

...

The path to the data folder

...

for training the classification model. Inside the folder there are subfolders

...

whose names are the name of the class. Inside each subfolder there should be

...

txt files with

...

various texts that correspond to the class.

...

Robin.FolderPathC:\doc\imgYes
Path to the resulting folder

...

The path to the folder where the trained classification model will be saved

...

.Robin.FolderPathC:\doc\imgYes
MethodThe

...

method that will be used to train the classification model.

...

The default value is RandomForest. 

The following methods can be used to train the classification model: 

    • Choose the most suitable
    • SVC - support vector method 
    • RandomForest - decision tree classifier
    • GradientBoosting
    • AdaBoost
    • nTree - decision tree classifier
    • KNeighboors - k-nearest neighbor method
    • Naive Bayes - naive Bayesian method

...

  • Good morning!
  • Hello!
  • Sincerely,
  • tel:
  • email:

...

Output parameters

  1. Result - the result shows the percentage of accuracy of the trained model obtained by comparing the test and training sample as a percentage.

Settings

PropertyDescriptionTypeFilling exampleMandatory field
Parameters
The path to the source folderThe path to the data folder for training the classification model. Inside the folder there are subfolders whose names are the name of the class. Inside each subfolder there should be txt files with various texts that correspond to the class.Robin.FolderPathC:\doc\imgYes
Path to the resulting folderThe path to the folder where the trained classification model will be saved.Robin.FolderPathC:\doc\imgYes
MethodThe method that will be used to train the classification model. The default value is RandomForest.
Robin.String
Yes
OverwriteIf the value is «true», and a file with the same name and extension already exists in the folder with the result, then it will be overwritten. If «false», the file will not be overwritten, and the action will return an error.Robin.BooleantrueNo
Custom Stop WordsThe path to the txt file that contains stop words that will not be taken into account when training the classification model. Each stop word must be written on a new line.Robin.FilePath
No
Word combinationThe path to the txt file containing phrases, which, when training the model, it is important not to divide into separate words in order to preserve the meaning of the entire phrase. Each phrase must be written on a new line.Robin.FilePath
No
Results
ResultThe percentage of accuracy of the trained model.Robin.Numeric

...