-
Words are input from the text;
-
common/non-substantive words are deleted through table look-up;
-
content words are stored along with their position in the
text and any punctuation located immediately to the left
and/or right of the word;
-
content words are sorted alphabetically.
|
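The four steps listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the original implementation: the stop list `STOP_WORDS`, the function name `index_content_words`, whitespace tokenization, and the choice to count position as a word index (rather than a character offset) are all hypothetical.

```python
import re

# Hypothetical look-up table of common/non-substantive words.
# The source only says that a table is used; its contents are assumed here.
STOP_WORDS = {"a", "an", "and", "are", "in", "is", "of", "on", "the", "to"}

# One whitespace-delimited chunk: optional punctuation on the left,
# a word (allowing internal apostrophes or hyphens), optional punctuation on the right.
TOKEN = re.compile(
    r"(?P<left>[^\w\s]*)(?P<word>\w+(?:['-]\w+)*)(?P<right>[^\w\s]*)"
)


def index_content_words(text):
    """Return (word, position, left_punct, right_punct) tuples, sorted by word."""
    entries = []
    # Step 1: words are input from the text (assumed: split on whitespace).
    for position, chunk in enumerate(text.split()):
        match = TOKEN.fullmatch(chunk)
        if match is None:
            continue  # chunk contains no recognizable word
        word = match.group("word").lower()
        # Step 2: delete common words through table look-up.
        if word in STOP_WORDS:
            continue
        # Step 3: store the word, its position, and adjacent punctuation.
        entries.append((word, position, match.group("left"), match.group("right")))
    # Step 4: sort content words alphabetically (ties broken by position).
    entries.sort(key=lambda entry: (entry[0], entry[1]))
    return entries


sample = 'The cat sat on the mat, and "the dog" barked.'
for entry in index_content_words(sample):
    print(entry)
```

Positions here count every whitespace-separated word, including the deleted common words, so that each stored entry can still be traced back to its place in the original text.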