Google Books - CPRR Documents
From: KyleKWyatt@gmail.com
A whole bunch of CPRR original documents are up on the web from Stanford University.
—Kyle
Central Pacific Railroad Photographic History Museum
A whole bunch of CPRR original documents are up on the web from Stanford University.
—Kyle
3 Comments:
The Google Book project, announced in December 2004, has been ongoing for several years, and the CPRR and UPRR books available online have been linked on the CPRR Museum's home page. When the books first became available online, they did not provide search capability within the pdf files, so we reprocessed many of the railroad books with optical character recognition to add that search capability.
These documents are stored up neatly in a content-driven DMS.
Sounds like expensive software that needs lots of fast hardware to meet peak loads.
What we did instead just added OCR text to pdf's that previously only contained images of book pages, to make the books searchable. So when someone accesses a book from our server there is no processing of the file needed so zero overhead. Search is provided by Google, not our server. So we can serve books with inexpensive hardware, high load is not compute limited, and having the books has zero ongoing additional cost.
Post a Comment
<< Recent Messages