Frequency lists Here we provide plain text versions of the frequency lists contained in WFWSE. These are raw unedited frequency lists produced by our software and do not contain the many additional notes supplied in the book itself. The lists are tab delimited plain text so can be imported into your prefered spreadsheet format. For the main lists we provide a key to the columns. More details on the process undertaken in the preparation of the lists can be found in the introduction to the book.
These lists show dispersion ranging between 0 and 1 rather than 0 and 100 as in the book. We multiplied the value by 100 and rounded to zero decimal places in the book for reasons of space. Log likelihood values are shown here to one decimal place rather than zero as in the book.
Please note, all frequencies are per million words. There are some extra notes explaining the dummy values (:, @, and %) in the lemmatised lists. CHAPTER 1: Frequencies in the Whole Corpus (Spoken and Written English)
List 1.1: Alphabetical frequency list of the whole corpus (lemmatized): list key complete lists without frequency cut-offs: unix compressed 5.3Mb or WinZip compressed 4.4Mb List 1.2: Rank frequency list for the whole corpus (not lemmatized): list key
CHAPTER 2: Spoken and Written English
List 2.1: Alphabetical frequency list: speech v. writing (lemmatized): list key List 2.2: Rank frequency order: spoken English (not lemmatized): list key List 2.3: Rank frequency order: written English (not lemmatized) list key List 2.4: Distinctiveness list: contrasting speech and writing (ordered by log likelihood): list key
CHAPTER 3: Two Main Varieties of Spoken English Compared
List 3.1: Alphabetical frequency list: conversational v. task-oriented speech (lemmatized): list key List 3.2: Distinctiveness list: contrasting conversational v. task-oriented speech (not lemmatized): list key
CHAPTER 4: Two Main Varieties of Written English Compared
List 4.1: Alphabetical frequency list: imaginative v. informative writing (lemmatized): list key List 4.2: Distinctiveness list: imaginative v. informative writing (not lemmatized): list key
CHAPTER 5: Rank Frequency Lists of Words within Word Classes (Parts of Speech) in the whole corpus
List 5.1: Frequency list of nouns (by lemma): list List 5.2: Frequency list of verbs (by lemma): list List 5.3: Frequency list of adjectives (by lemma): list List 5.4: Frequency list of adverbs (not lemmatized): list List 5.5: Frequency list of pronouns (not lemmatized): list List 5.6: Frequency list of determiners: list List 5.7: Frequency list of determiner/pronouns: list List 5.8: Frequency list of prepositions: list List 5.9: Frequency list of conjunctions: list List 5.10: Frequency list of interjections and discourse particles: list
CHAPTER 6: Frequency Lists of Grammatical Word Classes (based on the Sampler Corpus)
List 6.1.1: Alphabetical list: the whole sampler corpus (spoken and written English): list List 6.1.2: Rank frequency list: the whole sampler corpus: list List 6.2.1: Alphabetical list: spoken v. written English: list List 6.2.2: Rank frequency list: spoken English compared with written English: list List 6.2.3: Rank frequency list: written English compared with spoken English: list List 6.2.4: Distinctiveness list: spoken v. written English: list List 6.3.1: Alphabetical list: conversation v. task-oriented speech: list List 6.3.2: Distinctiveness list: conversation v. task-oriented speech: list List 6.4.1: Alphabetical list: imaginative v. informative writing: list List 6.4.2: Distinctiveness list: imaginative v. informative writing: list
Источник: http://www.comp.lancs.ac.uk/ucrel/bncfreq/flists.html |