The data structure for each collation unit should be a JSON object with the following keys:
null
depending on the unit, this and the data structure expected is explained below.Other keys can be included if they are needed for other functions in the platform.
For transcriptions which have extant text in this unit, the witness array should contain an entry for each hand present in the unit. When dealing with corrected text the collation editor treats each hand as a completely separate witness to the text. For this reason it is advisable to provide a full representation of the corrector reading and not just the corrected words. If you do not do this, then the shared words will appear as om. in the corrector hand. Each hand should be represented by a JSON object with two keys.
Each token object must include the following keys
Any number of additional keys can be included in this list. If you are going to customise the settings then you may need to encode extra data in the token such as punctuation for example. You may also want to encode information about gaps in the text which is explained in the next section.
Within a collation unit the collation editor assumes text is omitted unless your witnesses data tells it otherwise.
To encode lacunose text in addition to the required keys in the token object you will need to add additional keys and details about the lacunose section. When the gap follows a word (as in is not before the first word of the context unit). This is done by adding two extra keys to the token object.
If this is a gap before the very first extant word in the given unit then you must add the following two keys to the first token.
Document siglum: 01
Text: A simple example sentence
[
{
"id": "01",
"tokens": [
{
"index": 2,
"reading": "01",
"original": "A",
"t": "a",
"rule_match": ["a"]
},
{
"index": 4,
"reading": "01",
"original": "simple",
"t": "simple",
"rule_match": ["simple"]
},
{
"index": 6,
"reading": "01",
"original": "example",
"t": "example",
"rule_match": ["example"]
},
{
"index": 8,
"reading": "01",
"original": "sentence",
"t": "sentence",
"rule_match": ["sentence"]
}
]
}
]
Document siglum: 02
Text: A complex corrected example [lac 7-8 char] with damage
02* will be used for the first hand and 02C for the correction
[
{
"id": "02*",
"tokens": [
{
"index": 2,
"reading": "02*",
"original": "A",
"t": "a",
"rule_match": ["a"]
},
{
"index": 4,
"reading": "02*",
"original": "complex",
"t": "complex",
"rule_match": ["complex"]
},
{
"index": 6,
"reading": "02*",
"original": "example",
"t": "example",
"rule_match": ["example"],
"gap_after": true,
"gap_details": "lac 7-8 char"
},
{
"index": 8,
"reading": "02*",
"original": "with",
"t": "with",
"rule_match": ["with"]
},
{
"index": 10,
"reading": "02*",
"original": "damage",
"t": "damage",
"rule_match": ["damage"]
}
]
},
{
"id": "02C",
"tokens": [
{
"index": 2,
"reading": "02C",
"original": "A",
"t": "a",
"rule_match": ["a"]
},
{
"index": 4,
"reading": "02C",
"original": "corrected",
"t": "corrected",
"rule_match": ["corrected"]
},
{
"index": 6,
"reading": "02C",
"original": "example",
"t": "example",
"rule_match": ["example"],
"gap_after": true,
"gap_details": "lac 7-8 char"
},
{
"index": 8,
"reading": "02C",
"original": "with",
"t": "with",
"rule_match": ["with"]
},
{
"index": 10,
"reading": "02C",
"original": "damage",
"t": "damage",
"rule_match": ["damage"]
}
]
}
]