As an example of how the context similarity algorithm works, if I use the King James version of the book of Matthew as a source and I look for words similar to "Jesus", I get the following top five words:
Similarity | Word |
0.37 | HE |
0.25 | THEY |
0.22 | PILATE |
0.22 | PETER |
0.18 | SHE |
These are all nouns and pronouns referring to human beings.
What do we get if we perform the same steps on the Rohonc codex? I have previously identified the word below as possibly meaning "Jesus":
N.IGHA
Here are the words my algorithm identifies as most similar in the RC:
Similarity |
Word
| Transcription | Notes |
0.27 | CUT.C | tentatively "Pilate" | |
0.24 | CQ.B1CU | unknown | |
0.24 | N | first glyph of "Jesus" | |
0.22 | IGHA | second glyph of "Jesus" | |
0.22 | RT.A.CO.D | tentatively a saint's name |
It is rewarding to see that the word I previously identified as "Pilate" appears at the top of the list, and a saint's name appears at the bottom of the list.
The wholly new information is the second word, CQ.B1CU, which now looks like a noun referring to a human being.
The appearance of N and IGHA as similar to N.IGHA is also interesting and deserves investigation. Indeed, in the titulus crucis only the second glyph is used to represent the name Jesus. Presumably this is really a two-word phrase, and only the second glyph actually represents the name "Jesus," with the first glyph N representing the title "Lord" or something similar.
No comments:
Post a Comment