A third post about proofreading sifrei Torah

Following on from last week…

The other form of computer checking involves much more sophisticated software, and further reduces the chance of human error. In the process we’ve just been talking about, the letters were fed to me automatically, but I still had to use my brain to identify them and see that they were kosher. In this process, there’s barely any brain involved at all.

In this process, the operator uses a hand-held scanner to get the columns of text into the computer. Then it is run through OCR software – very clever software, which not only recognises letter glyphs but can also be taught to handle variations in glyphs caused by its being hand-written. Because it is a computer, it can also be taught some of the laws of whether a letter is kosher or not, so it can apply those mechanically to each glyph and flag up any doubtful cases.

Finally, the OCR output is compared to a Torah text, and any discrepancies are flagged up along with the doubtfully-kosher ones. A report with all problems is generated and given back with the scroll to the sofer, who then goes through the list and fixes everything on it.

Scan report
Scan report

Like this. Column 003, says the first entry on this report, which starts “Vayomer Adonai Elohim” – one comment. Line 21 (Bereshit 3:5), problem, thus: extra letter vav in the word “mimenu,” where it should say “…yodea Elohim ki b’yom akhalkhem mimenu v’nifkedu eineikhem…” and then in the picture you can see it’s got “v’mimenu,” for some reason or other.

I think I probably started writing the mem, got distracted mid-stroke, forgot I’d already started it, and started it over, but I don’t remember now.

Anyway.

Even this process, though, isn’t completely foolproof. Humans run the software, and as soon as humans come on the scene, there’s potential for human error.

If the various software operations aren’t applied properly – like forgetting to run the spellcheck on a document – the software won’t flag up problems because it won’t have looked for them. Perhaps the “is it there?” process on each letter of a column was run but the “is it kosher?” process accidentally wasn’t.

The computer needs human help to learn the writing, and perhaps the human isn’t paying attention. Perhaps the computer says “hey, human, what’s this?” and the human is half-asleep and says “vav” when he means “yud,” and a spelling mistake consequently goes unspotted.

Sometimes the software just can’t pick up on things. Very fine lines – the scanner might not pick them up; sometimes the presence or absence of a very fine line can be the difference between kosher and pasul. But we can’t (at present) scan to so high a resolution as to pick up on all these; the processing time would be prohibitive.

Finally, the letters are very slightly three-dimensional; a human, with stereo vision, can tell the difference between ink and shadow, and a scanner can’t always. Sometimes it’ll interpret a shadow as a crucial fine line, and report a letter kosher when it really isn’t.

So a scan is an excellent tool – I think it’s one of the finer syntheses of technological development and ancient ritual – but it does not replace all the other proofreading tools we use, and it is not a substitute for hard work and knowing your stuff. Few things are, really.

Filed under Torah

5 Responses to “A third post about proofreading sifrei Torah”

  1. that is rad.


    eli · February 17th, 2010 at 7:58 pm
  2. totally amazing! do you know if all haredi communities approve of such tools? If a tv treifs up food, do computer scanners treif of sifrei torah? or do they not m’kabel electro-teumah either…


    Justin · February 17th, 2010 at 8:13 pm
  3. You started the Mem, and then wrote another whole one. You might have gone to get some coffee in between.


    Amit · February 18th, 2010 at 7:48 am
  4. I’d like to know more about this scanning software. Who developed it and is it well maintained (is it in active development?) Open or closed source, etc.


    Aharon Varady · February 21st, 2010 at 5:26 pm
  5. [...] Following on from the third such post… [...]


    A fourth post about proofreading sifrei Torah | Jewschool · February 24th, 2010 at 7:38 pm

Leave a Reply

If your comment does not immediately appear, do not freak out and repost your message a dozen times. Please note that all new visitors must have their first comment approved by the editor, and you must provide a legitimate e-mail address and use the same username for the system to "remember" you. The editor maintains the right to refuse comments deemed inappropriate or unhelpful. Users who repeatedly delve into ad hominem attacks or other troll-like behavior will be banned.

Trackback (Right-click & 'Copy Link...') | Comments RSS

"I may attack a certain point of view which I consider false, but I will never attack a person who preaches it. I have always a high regard for the individual who is honest and moral, even when I am not in agreement with him. Such a relation is in accord with the concept of kavod habriyot, for beloved is man for he is created in the image of God." —Rav Joseph Soloveitchik