lished in 1968 by ANSI
(American National Standards Institute), with an update in 1977 and
1986. The 7-bit plain ASCII, also called Plain Vanilla ASCII, is a set
of 128 characters with 95 printable unaccented characters (A-Z, a-z,
numbers, punctuation and basic symbols), i.e. the ones that are
available on the English/American keyboard. With the use of other
European languages, extensions of ASCII (also called ISO-8859 or ISO-
Latin) were created as sets of 256 characters to add accented
characters as found in French, Spanish and German, for example ISO
8859-1 (ISO-Latin-1) for French.
Created by Michael Hart in July 1971, Project Gutenberg was the first
information provider on the internet. Michael's purpose was to digitize
as many literary texts as possible, and to offer them for free in a
digital library open to anyone. Michael explained in August 1998: "We
consider etext to be a new medium, with no real relationship to paper,
other than presenting the same material, but I don't see how paper can
possibly compete once people each find their own comfortable way to
etexts, especially in schools."
Whether digitized years ago or now, all Project Gutenberg books are
created in 7-bit plain ASCII, called Plain Vanilla ASCII. When 8-bit
ASCII is used for books with accented characters like French or German,
Project Gutenberg also produces a 7-bit ASCII version with the accents
stripped. (This doesn't apply for languages that are not "convertible"
in ASCII, like Chinese, encoded in Big-5.)
Project Gutenberg sees Plain Vanilla ASCII as the best format by far,
and calls it "the lowest common denominator". It can be read, written,
copied and printed by any simple text editor or word processor on any
electronic device. It is the only format compatible with 99% of
hardware and software. It can be used as it is or to create versions in
many other formats. It will still be used while other formats will be
obsolete, or are already obsolete, like formats of a few short-lived
reading devices launched since 1999. It is the assurance collections
will never be obsolete, and will survive future technological changes.
The goal is to preserve the texts not only over decades but over
centuries.
Project Gutenberg also publishes ebooks in well-known formats like
HTML, XML or RTF. There are Unicode files too. Any other format
provided by volunteers (PDF, LIT, TeX and many others) is usually
accepted, as long as they also supply an ASCII
|