
Re: MCM - Digest V1 #287



At 09:35 AM 5/2/2000 -0700, Erik wrote:

>OK, method #2 is that we do all of step 1 above, but THEN take the scanned
>images (such as the example above), and reconstruct the text of the
>original article.  This is more complicated, taking sophisticated OCR
>software and lots of massaging so the text doesn't look like a ransom
>note.  As I mentioned yesterday, this takes about two hours per issue of
>TAG, whereas just scanning the article in takes about 1/2 hour (and can be
>done while watching TV or talking on the phone). It generally does not
>preserve the original layout (which to me is actually not a problem,
>especially for archiving on the web).  The sparse illustrations are added
>back in as graphics, but the resultant file is TINY compared to the other,
>and can be put on the web, edited, searched, archived as a PDF, whatever.
>
>
>I will definitely do the first part, because we'll then have a true
>archive of TAG in its original layout.  Last night I got volume 4 scanned
>in in 3 hours.  If I work steadily on this, I can quickly get all 9 and a
>half years done.  But the OCR phase I may either just do very very slowly,
>or save only for articles to be archived on the website.

As I've said before, Erik, you never cease to amaze me!  Thanks for your
effort AND your initiative.

Karen
  ------------------
  To unsubscribe from this list, e-mail majordomo@thekrib.com
  with "unsubscribe aga-mcm" in the body of the message.
  To subscribe to the digest version, add "subscribe aga-mcm-digest"
  in the same message.
  Old messages are available at http://lists.thekrib.com/aga-mcm
  When asked, log in with username "aga-mcm" and password "incorporate".