In a recent post, I said I would write a program to transcribe the Rohonc Codex.
Tonight I did the first part. I wrote some code to identify lines of text and graphemes. The image below shows a page of text, with the first-pass graphemes marked by green rectangles.
This is just a first pass. Some of these rectangles enclose multiple graphemes, and they will need to be split apart.
Next, I think I'll build a database of all of the grapheme images, then compare them to each other to identify image families.