The workflow reads textual data from a csv file and converts the strings into documents. The documents are then preprocessed, i.e. filtered and stemmed. The preprocessing magic takes place in the Preprocessing metanode. In the Feature Creation metanode two kinds of feature sets and document vectors are created. The top set of vectors contains only single word features the bottom set of vectors contains single word and 2-gram features.
After the document vectors have been created the sentiment class is extracted and two predictive models are built and scored. One model based only on single word features and the second model based on single word and 2-gram features. Bothe models are compared in the ROC curve node.
Workflow
Sentiment Analysis (Classification) of Documents with NGram Features
External resources
Used extensions & nodes
Created with KNIME Analytics Platform version 4.1.0
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
Loading deployments
Loading ad hoc jobs
Legal
By using or downloading the workflow, you agree to our terms and conditions.