Re: Words

From: Philip L. Graber (pgraber@emory.edu)
Date: Wed Sep 27 1995 - 09:08:20 EDT


On Wed, 27 Sep 1995, James K. Tauber wrote:

> What puzzles me about these word counts is that the texts (both plain and
> tagged) I have from CCAT have 138019 words.
>
> What's going on here?

I suspect the programs being used to do the counts are handling the accent
characters (which are punctuation marks, and therefore word delimiters, in
English) in different ways. If only spaces and LF and/or RET characters are
counted as word delimiters, the results will differ from those obtained
when / \ etc. are read as punctuation rather than as accents.
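
To illustrate, here is a minimal sketch in Python. The sample text and the
function name count_words are made up for the illustration; they are not
taken from the CCAT files or from any of the counting programs in question.
It counts the same transliterated text twice, once with only space, LF and
RET as delimiters, and once with / and \ treated as delimiters too.

```python
import re

# Hypothetical sample: two short lines of transliterated Greek in which
# / and \ mark accents inside a word. Not taken from the CCAT texts.
SAMPLE = "kai\\ lo/gos\nkai\\ qeo/s"

# Space, LF and RET (carriage return) only.
WHITESPACE = " \n\r"


def count_words(text, delimiters):
    """Count the non-empty pieces left after splitting on any delimiter."""
    pattern = "[" + re.escape(delimiters) + "]+"
    return len([piece for piece in re.split(pattern, text) if piece])


# Accents kept inside words: kai\ | lo/gos | kai\ | qeo/s
whitespace_only = count_words(SAMPLE, WHITESPACE)

# Accents read as punctuation: kai | lo | gos | kai | qeo | s
accents_as_punctuation = count_words(SAMPLE, WHITESPACE + "/\\")

print(whitespace_only, accents_as_punctuation)
```

The second count is larger because every accent that falls inside a word
splits that word in two, which is the kind of discrepancy I have in mind.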

Philip Graber                        Graduate Division of Religion
Graduate Student in New Testament    211 Bishops Hall, Emory University
pgraber@emory.edu                    Atlanta, GA 30322 USA


