The Project Gutenberg eBook of Moby Multiple Language Lists of Common Words

This ebook is for the use of anyone anywhere in the United States and most other parts of the world at no cost and with almost no restrictions whatsoever. You may copy it, give it away or re-use it under the terms of the Project Gutenberg License included with this ebook or online at www.gutenberg.org. If you are not located in the United States, you will have to check the laws of the country where you are located before using this eBook.

Title: Moby Multiple Language Lists of Common Words

Author: Grady Ward

Release date: May 1, 2002 [eBook #3206]
Most recently updated: September 28, 2019

Language: English

*** START OF THE PROJECT GUTENBERG EBOOK MOBY MULTIPLE LANGUAGE LISTS OF COMMON WORDS ***

Produced by Mike Pullen

**** PLEASE SEE AT THE BOTTOM OF THIS PAGE INSTRUCTIONS FOR DOWNLOADING THIS FILE****

MOBY (tm) LANGUAGE II DOCUMENTATION NOTES

This documentation, the software and/or database are:

Public Domain material by grant from the author, January, 2001.

HISTORICAL NOTE:

The Ward word lists were some of the largest public domain word lists in the world, at the time they were added to the Project Gutenberg collection in 2007. These word lists do not contain 8-bit accented characters or Unicode, as would be found in a more recent Project Gutenberg eBook. Instead, the lists include phonetic spelling, utilizing backslashes and other characters to indicate where accents would normally occur. There is no detailed guide on how these extra characters were used, and therefore it is likely infeasible to map from the word lists back to a correct representation of the word (i.e., to map from a word list entry with slashes or other characters, back to the actual non-English word with accents or other non-ASCII characters).

These lists may still be useful, but they are no longer the state-of-the-art in word lists. In the time since the lists were created, it has become much easier for anyone with interests to make their own lists of unique words from the Project Gutenberg collection or other sources.

Moby (tm) Language II for MSDOS operating systems is compressed and distributed as a single zip file. After decompression the language files included with this product is in ordinary ASCII format with CRLF (ASCII 13/10) delimiters.

MOBY Language II CONTENTS

Read Me First File (aaREADME.txt)
French Language list (french.txt)
German Language list (german.txt)
Italian Language list (italian.txt)
Japanese Language list (japanese.txt)
Spanish Language list (spanish.txt)

Quick Start

1) Insure you have at least 3Mb of free disk space to hold the contents of this zip file.

2) Create a destination directory to hold the files listed above.

3) On the PG Catalog page click on the selection "More Files". You will see a "files.zip" folder in the list. Move this zipped folder to your computer. On your computer open "files.zip", double click on its "files" subdirectory and copy the contents into the destination directory on your computer.

Word lists in five of the world's great languages:

FRENCH number of words 138257 size in bytes 1524757
GERMAN number of words 159809 size in bytes 2055986
ITALIAN number of words 60453 size in bytes 561981
JAPANESE number of words 115523 size in bytes 934783
SPANISH number of words 86059 size in bytes 850523

Total number of words 560101 size in bytes 5928030

Once decompressed, the vocabulary files may be viewed and used just as any TEXT-type file might.