An workflow intended to demonstrate how you can extract patterns of parts of speech (for example, verb followed by adverb followed by noun) and count/visualize those patterns.
Caveats:
* No usual text pre-processing steps are applied here to clean up the data
* Workflow assumes sequences of size 3
* Sequences extend across sentences, which may not make sense
* Sequences are counted across the entire corpus, instead of by document
The workflow uses the first 5 rows of the IMDB movie review dataset.
Workflow
Extraction of Part of Speech (POS) Tag Sequences
Used extensions & nodes
Created with KNIME Analytics Platform version 4.1.2
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
Loading deployments
Loading ad hoc jobs
Legal
By using or downloading the workflow, you agree to our terms and conditions.