Download WordStat - Provalis Research
VOCABULARY FINDER PAGE

The Vocabulary Finder feature of WordStat extracts single words representing technical terms, company and product names, as well as abbreviations that are specific to the analyzed collection of documents. The feature also identifies common misspellings by comparing the list of word forms encountered in the entire text collection against a list of common words, retrieving words, or words with irregular capitalization, that are not found in this list.

By default, the extraction is performed in reference to common English words. For documents written in another language, or to exclude technical terms from a specific domain, set the Active Dictionaries option on the Speller/Thesaurus Option Page to the desired language.

The Include Irregular Capitalization option instructs the program to retrieve all words with irregular capitalization, such as proper names of persons, companies or products.

The Min Frequency option sets a minimum frequency criterion, eliminating from the list all words that appear only a few times.

Once the above two options have been set, click on the button to start searching for vocabulary words. The retrieved words are then listed in a frequency table on the left of the screen, in descending order of frequency. To sort this list in alphabetical order, click on the top of the first column.

Three types of operations are allowed on these words:

1) You can replace all instances of a selected word in the original document by another word or phrase;
2) You can add this word to a custom list of valid words, causing the program to ignore it the next time there is a search for vocabulary words; or

WordStat User’s Manual 53