When you connect to this website, you send your IP address and sometimes some cookies. You may also give us personal identifying information, such as your name and contact information. All this data is used to securely provide you with the services that you request. We encourage you to review our privacy policy to make sure that you understand how your data is managed, and to contact us if you have any questions. View Privacy Policy

Difference between revisions of "Obscure Collins words"

From NASPAWiki

You are viewing a condensed mobile version of this NASPA webpage.
Switch to full version.

(first draft)
 
m
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
This page discusses
+
This page discusses how many of the words in the Collins lexicon seem obscure to [[OTCWL]] players, and is part of our introduction to [[SOWPODS|Collins (SOWPODS) in North America]]. For brevity, we will refer to the North American and International lexica, OTCWL16 and CSW15, as TWL and CSW respectively.
how many of the '''words''' in the
 
'''[[SOWPODS|Collins]]''' lexicon seem
 
'''obscure''' to [[TWL]] players,
 
and is part of our introduction to
 
[[SOWPODS|Collins (SOWPODS) in North America]].
 
  
 
== Are All Collins Words Obscure, Obsolete, Foreign, etc.? ==
 
== Are All Collins Words Obscure, Obsolete, Foreign, etc.? ==
  
The short answer is mostly not, at least no more obscure than already
+
The short answer is mostly not, at least no more obscure than already makes up the majority of the TWL.
makes up the majority of the TWL.
 
  
One must be careful to avoid a biased viewpoint, because, of course,
+
One must be careful to avoid a biased viewpoint, because when viewing the words that are CSW-only, one is not viewing the CSW itself, but just that part of it that contains the words not in the TWL. These are bound to be somewhat obscure, because if they were common everyday words widely used in English they would already be in the TWL.
when viewing the words that are CSW-only, one is not viewing the
 
CSW itself, but just that part of it that contains the words not
 
in the TWL. These are bound to be somewhat obscure, because if they
 
were common everyday words widely used in English they would be in
 
the TWL already.
 
  
Breaking down the list of CSW-only words into categories by meaning
+
Breaking down the list of CSW-only words into categories by meaning shows that a significant majority of the new words to be added are further examples of the kinds of words that already make up the TWL, such as animals, currency, minerals, and so on. Because the Scrabble lexica (both TWL and CSW) are based on single-volume abridged dictionaries, and not the full unabridged versions, such as Webster’s Third New International Dictionary, or the Oxford English Dictionary, they in fact really contain only a small fraction of the totality of English words. It is thus not surprising they that do not entirely overlap.
shows that a significant majority of the new words to be added are
 
further examples of the kinds of words that make up the TWL, such
 
as animals, currency, minerals, and so on. Because the Scrabble
 
lexica (both TWL and CSW) are based on single-volume abridged
 
dictionaries, and not the full unabridged versions, such as  
 
[[Webster’s Third New International Dictionary]],
 
or the [[Oxford English Dictionary]], they in fact really contain only
 
a small fraction of the totality of English words. It is thus not
 
surprising they that do not entirely overlap.
 
  
There are some differences: CSW has more obsolete and archaic words.
+
There are some differences: CSW has more obsolete and archaic words. They are not unique to CSW (TWL has perhaps 1,000 already), but there are certainly more. There are also more dialect words, particularly Scots, a result of the word source prior to Collins, The Chambers Dictionary, being based in Scotland, and its words being incorporated into the Collins Corpus. There are also significant numbers of words from countries where English is common, such as Australia, South Africa, and India. Ultimately, the ideal content of a word list is somewhat subjective.
They are not unique to CSW (TWL has perhaps 1,000 already), but there
 
are certainly more. There are also more dialect words, particularly
 
Scots, a result of the previous word source, [[The Chambers Dictionary]],
 
being based in Scotland, and its words being incorporated into the
 
Collins Corpus. There are also significant numbers of words from
 
countries where English is common, such as Australia, South Africa,
 
and India.
 
  
Ultimately, the ideal content of a word list is somewhat subjective.
+
The following list shows CSW-only words broken down into categories to show that a significant fraction are parts of categories that would not seem unreasonable to include in a word list, such as animals, food, and so on.
Personally, I would like to see the contents of the Oxford Dictionary
 
added (the Shorter, not the full), and words explictly indicated
 
as obsolete deleted. But I know others do not agree. So here is the
 
list.
 
  
Note: Words do not appear in more than one category, so those not
+
Note:
under ‘obsolete’, ‘archaic’, etc. are not
 
obsolete or archaic. The list is also not guaranteed complete.
 
  
 +
* Words do not appear in more than one category, so those not under ‘obsolete’, ‘archaic’, etc. are not obsolete or archaic
 +
* The list was compiled manually, and so is not guaranteed complete
 +
* The list was compiled from an older edition of CSW than CSW15, but the numbers will not have changed by large amounts so the list remains illustrative
  
 
{|
 
{|
Line 195: Line 165:
 
|}
 
|}
  
Besides this list, another excellent source of CSW browsing is to
+
Aside from this list, an excellent resource for further lists of words by categories is at the [http://www.absp.org.uk Association of British Scrabble Players] website under their [http://www.absp.org.uk/words Words] section. These lists, compiled by David Sutton, cover the full range of CSW words from 2-15 letters, with useful one-line definitions throughout.
view Albert Hahn’s postings of David Sutton’s wordlists on the
+
 
crossword-games-pro ([[CGP]]) email list. Membership is only open to
+
The lists mark words new to CSW15 versus the previous edition, CSW12, but there is no differentiation between CSW and TWL words.
tournament players, but if you are a member, go to <a
 
href="http://games.groups.yahoo.com/group/crossword-games-pro/msearch_adv">the
 
advanced search</a> page, and type in &lsquo;David Sutton&rsquo;s Word List&rsquo;
 
into the &lsquo;Subject: contains&rsquo; box, and click search. The CGP messages
 
are versions of <a href="http://www.absp.org.uk/words/words.html">David&rsquo;s
 
lists</a> at the British Association (<a
 
href="http://www.absp.org.uk">ABSP</a>&rsquo;s) site, but have been marked
 
up to show which words are CSW only. (Note that the &lsquo;New Collins
 
Words&rsquo; there are not referring to words not in TWL, but to the last
 
update to the International list. Only the CGP posts show which
 
ones are CSW-only from the TWL point of view.)
 
  
 
Please direct comments about this page to its author, [[Nick Ball]].
 
Please direct comments about this page to its author, [[Nick Ball]].

Latest revision as of 17:44, 16 April 2017

This page discusses how many of the words in the Collins lexicon seem obscure to OTCWL players, and is part of our introduction to Collins (SOWPODS) in North America. For brevity, we will refer to the North American and International lexica, OTCWL16 and CSW15, as TWL and CSW respectively.

Are All Collins Words Obscure, Obsolete, Foreign, etc.?

The short answer is mostly not, at least no more obscure than already makes up the majority of the TWL.

One must be careful to avoid a biased viewpoint, because when viewing the words that are CSW-only, one is not viewing the CSW itself, but just that part of it that contains the words not in the TWL. These are bound to be somewhat obscure, because if they were common everyday words widely used in English they would already be in the TWL.

Breaking down the list of CSW-only words into categories by meaning shows that a significant majority of the new words to be added are further examples of the kinds of words that already make up the TWL, such as animals, currency, minerals, and so on. Because the Scrabble lexica (both TWL and CSW) are based on single-volume abridged dictionaries, and not the full unabridged versions, such as Webster’s Third New International Dictionary, or the Oxford English Dictionary, they in fact really contain only a small fraction of the totality of English words. It is thus not surprising they that do not entirely overlap.

There are some differences: CSW has more obsolete and archaic words. They are not unique to CSW (TWL has perhaps 1,000 already), but there are certainly more. There are also more dialect words, particularly Scots, a result of the word source prior to Collins, The Chambers Dictionary, being based in Scotland, and its words being incorporated into the Collins Corpus. There are also significant numbers of words from countries where English is common, such as Australia, South Africa, and India. Ultimately, the ideal content of a word list is somewhat subjective.

The following list shows CSW-only words broken down into categories to show that a significant fraction are parts of categories that would not seem unreasonable to include in a word list, such as animals, food, and so on.

Note:

  • Words do not appear in more than one category, so those not under ‘obsolete’, ‘archaic’, etc. are not obsolete or archaic
  • The list was compiled manually, and so is not guaranteed complete
  • The list was compiled from an older edition of CSW than CSW15, but the numbers will not have changed by large amounts so the list remains illustrative
Number of Words
Abbreviations 86
Animals 523
Archaic 164
Australia 112
Chemicals/minerals 141
Clothing/fashion 95
Computing 32
Currency 36
Dialect 177
‘er’ nouns 11
Food/Drink 200
France 112
Geography/geology 87
Germany 15
Heraldry 44
India 132
Interjections 43
Irregular inflections 345
Ireland 36
Italy 28
Japan 21
Language 51
Latin 13
Law 52
Mathematical 22
Medical/anatomical 153
Milton 26
Music 120
New Zealand 122
Obsolete 388
Other 50
Other adjective 383
Other -ier -iest adjective 154
Other noun 1269
Other verb 573
Plants 321
Religious/spiritual 160
South Africa 66
Scots 573
Shakespeare 207
Slang 48
Spenser 307
Sports/games 77
Units 41
USA 12
Welsh 7
Total 635

Aside from this list, an excellent resource for further lists of words by categories is at the Association of British Scrabble Players website under their Words section. These lists, compiled by David Sutton, cover the full range of CSW words from 2-15 letters, with useful one-line definitions throughout.

The lists mark words new to CSW15 versus the previous edition, CSW12, but there is no differentiation between CSW and TWL words.

Please direct comments about this page to its author, Nick Ball.