An Extensive Explanation of “The Diacritic Test of āḍḥīḷḹṃṇñṅṛṝṣśṭūṁ ĀḌḤĪḶḸṂṆÑṄṚṜṢŚṬŪṀ”
Referencing a Specific String
Toh 00c
Degé Tengyur, vol. 93 (sher phyin, pha), folios 1.b–292.b
- Surendrabodhi
- zhu chen gyi lo tsA ba ban de ye shes sde
Imprint
First published 2022
Current version v 1.0.1 (2023)
Generated by 84000 Reading Room v2.26.1
84000: Translating the Words of the Buddha is a global non-profit initiative to translate all the Buddha’s words into modern languages, and to make them available to everyone.
Warning: Readers are reminded that according to Vajrayāna Buddhist tradition there are restrictions and commitments concerning tantra. Practitioners who are not sure if they should read this translation are advised to consult the authorities of their lineage. The responsibility for reading this text or sharing it with others who may or may not fulfill the requirements lies in the hands of readers.
This work is provided under the protection of a Creative Commons CC BY-NC-ND (Attribution - Non-commercial - No-derivatives) 3.0 copyright. It may be copied or printed for fair use, but only with full attribution, and not for commercial advantage or personal compensation. For full details, see the Creative Commons license.
Table of Contents
Summary
This file contains test cases for root text references in 84000 TEI.
References are made to the test TEI file Toh 00 (The Diacritic Test of āḍḥīḷḹṃṇñṅṛṝṣśṭūṁ ĀḌḤĪḶḸṂṆÑṄṚṜṢŚṬŪṀ), and all references should also function as back references to this text e.g. a reference in this text to passage A should also function in passage A of the root text as a reference back to this commentary.
References are only made in the commentary text, not in the root.
Note, that the commentary text requires the a <link> element in the <sourceDesc> which declares it is a commentary of another text. E.g.: <link type="isCommentaryOf" target="toh00"/>
.
The root text does not need such a declaration, but it does need a short code in it's <titleStmt>. (See 1.2)
Text Body
The Main Title
Referencing a Specific String
Substring Quotes
Quotes can link to, and highlight, specific phrases in the root text by specifing the @rend
attribute of the pointer as "substring". <ptr type="quote-ref" target="#MILESTONE-ID" rend="substring"/>
.
"Direct references to this text," T00
This will then parse the referenced passage to find the quoted text. Note that the trailing and leading punctuation, and any notes, are ignored.
Subsequent quotes should also link to the substring in the quoted passage.
will be automatically indicated and back-referenced T00
through markers in the right hand margin T00
The same text can be referenced more than once, and references should be case insensitive, as demonstrated in the following quote which quotes a substring of the previous quote.
right hand margin T00
We also need to be sure that quotes in long passages are scrolled ito view:
A quote at the end of a long passage should scroll into view T00
Non-matching Substring Quotes
However as the texts we are dealing with are contemporary translations of ancient texts it is likely that the quoted substring in the commentary is different from the root text. Hence if you want to link directly to the phrase, but it doesn't match, you can put the exact phrase in in the root text in <orig/>
.
An example of this would be: <q><p>The quote in the commentary text</p><orig><ptr type="quote-ref" target="#MILESTONE-ID" rend="substring"/>The quote in the root text</orig></q>
.
The (very different) ways to mark-up the quotations T00
Note the <orig/>
element must be a direct child element of the <q/>
element and the pointer has been placed in <orig/>
. This ensures that this alternative text is used when referencing this particular root text.
Quotes can include text in square brackets that will then be merged into the text to be highlighted.
This is only in the quote itself. Text in the <orig/>
will be considered literal text in the original.
cor[r]ectly linking T00
When Quoted Texts Contain Mark-up
If the source text contains mark-up elements these will disrupt the substring matching. There are a couple of ways to handle this. One is to simply copy the TEI from the source text into an <orig/>
element.
In this example the source text contains a <emph/>
element. We can either simply include the same markup in the quote.
This passage should be highlighted T00
Or, if the root and quote don't match, the markup can be copied into <orig/>
.
despite the
<title/>
element. T00
This quote tests references that include a <term/>
element.
…glossary entry “Element-B (Marked Mode)” T00
The following quote is shorter than the previous quote, but the text nodes are longer. Hence the previous quote will only fully highlight if the nodes are processed in the correct order.
Now testing the default mode="marked" with glossary entry “Element-B” T00
NOTE: Avoid copying <note/>
, <ref/>
or any other elements that do not belong in the text you are pasting into. Such elements can be replaced by empty <span/>
or <space/>
elements if necessary.
Ellipsis Substring Quotes
If a quote contains an ellipsis (…) this will be operative in matching the substring text. The content quote will link to the text that begins with the text before the ellipsis and ends with the text following it.
NOTE: this should be an ellipsis character …
, not just three full-stops.
It should be… correctly linking to that point. T00
Ellipses can also be used in combination with the <orig/>
. This means the whole quote does not have to be included as the origin, just enough to provide a unique reference and a scope (the start and end points of the quote).
For instance the ellipsis character can simply subsitute the non-matching part of the string. <q><p>The quote in the commentary text</p><orig><ptr/>The quote in the…text</orig></q>
Once loaded the commentary is supposed to scroll with the root text accordingly. T00
NOTE: As a general rule add as much of the target quote as possible as this will increase the likelihood of it being flagged if the link is broken.
Numbered Occurrences
Problems can arise when words that occurr multiple times in a passage are to be highlighted, or are used in conjunction with ellipses. Then it may be necessary to specify which of the terms is intended.
In the following passage the second instance of the word Tārā is specified by adding a number in square brackets after the word in the <orig/>
element <q><p>Tārā</p><orig>tārā[2]</orig></q>
:
Tārā (second) T00
The values refers to the number of occurrence of the term or phrase since the previous milestone.
This can be used in conjunction with ellipses to resolve further ambiguities about the span of the quote.
"Red Tārā", T00
"Red Tārā, white Tārā, green Tārā", T00
"white Tārā", T00
"white Tārā, green Tārā", T00
"green Tārā", T00
"white Buddha", T00
Linking a Quote to Multiple Source Texts
It is also possible that a single quote can refer to multiple source texts, in which case you can simply add multiple pointers.
In this example the commentary quotes two source texts. Both have the same translation.
The order of quote links in the TEI should be maintained in the output. The following quote should be a duplicate of the preceding one but with the link order switched.
It is also possible that one or more of the target translations varies from the commentary text. In the following substring quote the first quote matches the root text, but the second one uses <orig/>
to specifiy alternative content.
In the first reference the <ptr/>
has been added to the <p/>
element as the root text matches the quote. In the second reference the <ptr/>
has been added to the <orig/>
element to indicate that this quote reference uses the varying text.
Other Tests
“form itself is emptiness, and emptiness itself is form,” T00