FREE BOOKS

Author's List




PREV.   NEXT  
|<   10   11   12   13   14   15   16   17   18   19   20   21   22   23   24   25   26   27   28   29   30   31   32   33   34  
35   >>  
there were books in Esperanto (45 books), Swedish (40 books), Danish (20 books), Catalan (19 books), Welsh (10 books), Norwegian (10 books), Russian (7 books), Icelandic (7 books), Hungarian (7 books), Middle English (6 books), Greek (6 books) and Bulgarian (6 books). 3. THE METHOD Whether digitized years ago or now, all the books are digitized in 7-bit plain ASCII (American Standard Code for Information Interchange), called Plain Vanilla ASCII. Used since the beginnings of computing, it is the set of unaccented characters present on a standard English-language keyboard (A-Z, a-z, numbers, punctuation and other basic symbols). When 8-bit ASCII (also called ISO-8859 or ISO-Latin) is used for books with accented characters like French or German, Project Gutenberg also produces a 7-bit ASCII version with the accents stripped. (This doesn't apply for languages that are not "convertible" in ASCII, like Chinese, encoded in Big-5.) Plain Vanilla ASCII is the best format by far. It is "the lowest common denominator". It can be read, written, copied and printed by any simple text editor or word processor on any electronic device. It is the only format compatible with 99% of hardware and software. It can be used as it is or to create versions in many other formats. It will still be used while other formats will be obsolete (or are already obsolete, like formats of a few short-lived reading devices launched since 1999). It is the assurance collections will never be obsolete, and will survive future technological changes. The goal is to preserve the texts not only over decades but over centuries. There is no other standard as widely used as ASCII right now, even Unicode, a "universal" encoding system created in 1991. Project Gutenberg also publishes books in well-known formats like HTML, XML or RTF. There are Unicode files too. Any other format provided by volunteers (PDF, LIT, TeX and many others) is usually accepted, as long as they also supply an ASCII version where possible. But a large scale conversion into other formats is handed over to other organizations. For example Blackmask Online, which uses Project Gutenberg's collections to offer thousands of free books in eight different formats based on the Open eBook (OeB) format. Or Manybooks.net, which converts Project Gutenberg's books into formats readable on PDAs. Or Mobilebooks, with 5,000 books in Java (.jar) format that can be downloaded from the website to be
PREV.   NEXT  
|<   10   11   12   13   14   15   16   17   18   19   20   21   22   23   24   25   26   27   28   29   30   31   32   33   34  
35   >>  



Top keywords:
formats
 
format
 

Project

 

Gutenberg

 

obsolete

 

English

 

characters

 

collections

 

Unicode

 
standard

digitized
 

Vanilla

 

version

 

called

 

system

 
encoding
 

publishes

 

devices

 
created
 

reading


launched

 

universal

 

widely

 

technological

 
future
 

decades

 

preserve

 

assurance

 

centuries

 

survive


thousands
 
Blackmask
 
Online
 

Manybooks

 

downloaded

 
website
 

Mobilebooks

 

converts

 

readable

 
volunteers

provided

 
accepted
 

conversion

 

handed

 

organizations

 
supply
 
copied
 
Information
 

Interchange

 
beginnings