As part of that effort, and recognizing the inevitable instability of encoding languages over time, we provide here an overview of the project contents as expressed in TEI-XML. This documentation combines (in the present file, documentation.xml) the following:
It is hoped that this documentation will help to ensure the project’s long term viability—especially its translatability across the fullest possible range of future text technologies and platforms. To that end, the discussion below, which assumes minimal knowledge of TEI-XML, is aimed principally at the project’s future custodians.
The editorial policy is simple: to capture the intellectual content of the primary sources in TEI-conformant XML. The only exceptions to this focus are the inclusion of tags recording the expanded forms of abbreviations; tags for names, places, and dates; and additional markup for metre and rhyme in the files containing Herbert’s devotional masterpiece, The Temple. The capture and markup otherwise serve our main objective: to present diplomatic transcriptions of all witnesses alongside digital images of the corresponding manuscripts and print exemplars.
Still, this focus is not without its challenges, especially with respect to differences in content and layout among the many source documents. The following policy statement, though not exhaustive in scope, accounts for most of the TEI-XML markup used to capture that variety.
1. Markup of Transcriptions: Parallel Segmentation
A discrete XML file is provided for each sub-division of the major works—whether poem, chapter, letter, oration, or note (on
Valdesso’s Considerations)—and for each of the works not subject to sub-division, i.e.
A Treatise of Temperance and Sobrietie,
Outlandish Proverbs / Jacula Prudentum, and Herbert’s
Will. Each file contains diplomatic transcriptions of all witnesses, expressed in parallel using the TEI Critical Apparatus tag set:
<l n="10">
<app>
<rdg wit="#W">Quâ, Fabio quondā sub duce nata salus.</rdg>
<rdg wit="#NroW">quæ Fabio quondam ſub duce tuta fuit.</rdg>
<rdg wit="#Bla38">Quæ Fabio quondam ſub duce tuta fuit.</rdg>
</app>
</l>
This simplified rendering of l. 10 from Herbert’s Latin epigram, ‘Roma Anagr.’ (
Lucus 25), includes three of the poem’s nine witnesses. Each reading element (
<rdg>) is accompanied by a witness attribute (
wit) whose value is the siglum of the corresponding source. The hashtag indicates that the attribute points to a corresponding element elsewhere in the file, in this case a single item on a list of all nine witnesses:
<listWit>
<witness xml:id="W">
<name type="siglum" ref="source:W">W</name>
</witness>
<witness xml:id="Fj1">
<name type="siglum" ref="source:Fj1">Fj1</name>
</witness>
<witness xml:id="D">
<name type="siglum" ref="source:D">D</name>
</witness>
<witness xml:id="Bla41">
<name type="siglum" ref="source:Bla41">Bla41</name>
</witness>
<witness xml:id="Blh21">
<name type="siglum" ref="source:Blh21">Blh21</name>
</witness>
<witness xml:id="Fv45">
<name type="siglum" ref="source:Fv45">Fv45</name>
</witness>
<witness xml:id="NroW">
<name type="siglum" ref="source:NroW">NroW</name>
</witness>
<witness xml:id="Bla38">
<name type="siglum" ref="source:Bla38">Bla38</name>
</witness>
<witness xml:id="Bla75">
<name type="siglum" ref="source:Bla75">Bla75</name>
</witness>
</listWit>
The witness attribute (
wit) on the reading element (
<rdg>), in other words, points to a witness element (
<witness>) with the matching siglum, expressed here as the value of an
xml:id attribute, which is always an identifier unique to the file (i.e. a URI or ‘Uniform Resource Identifier’). These
<witness> elements, in addition to providing information for output processing, point to still other elements by way of the
ref attribute, in this case the essential bibliographical information associated with the nine sigla, four of which are provided below:
<listBibl>
<head>Lucus</head>
<msDesc>
<msIdentifier xml:id="W">
<country>United Kingdom</country>
<settlement>London</settlement>
<repository>Dr Williams’s Library</repository>
<msName>MS. Jones B62</msName>
</msIdentifier>
</msDesc>
<bibl xml:id="D">
<title>
<hi rend="italics">Ecclesiastes Solomonis</hi>.</title>
<editor>James Duport</editor>
<placeName>Cambridge</placeName>
<date>1662</date>
</bibl>
<msDesc>
<msIdentifier xml:id="NroW">
<country>United Kingdom</country>
<settlement>Northamptonshire</settlement>
<repository>Northamptonshire Record Office</repository>
<msName>W(A)/6/VI/1</msName>
</msIdentifier>
</msDesc>
<msDesc>
<msIdentifier xml:id="Bla38">
<country>United Kingdom</country>
<settlement type="city">London</settlement>
<repository>British Library</repository>
<msName>Add. MS. 6038</msName>
</msIdentifier>
</msDesc>
</listBibl>
This
<listBibl> feature and its child elements are nested within the
<teiHeader> element in the file documentation.xml.
The foregoing describes in a general way the markup or ‘tagging’ protocol of the CWDA. Combined with images corresponding to all pertinent witness pages, the features documented here provide all the information necessary for tracing the origins of any given reading as captured by the transcriptions.
The first example above presents three witnesses to one line of verse in parallel
<rdg> elements, each element containing the entirety of the line as it appears in the witness corresponding to its siglum on the
wit attribute. This simple parallel rendering is representative of many of the verse lines, proverbs, and prose paragraphs as captured throughout the TEI-XML files. However, for practical reasons having to do with the handling of multiple witnesses, the introduction of new witnesses to files already extant, and to the archive’s historically varied approach to markup, other poems and prose works are presented in a manner that avoids repetition of the content. Recall the first example:
<l n="10">
<app>
<rdg wit="#W">Quâ, Fabio quondā sub duce nata salus.</rdg>
<rdg wit="#NroW">quæ Fabio quondam ſub duce tuta fuit.</rdg>
<rdg wit="#Bla38">Quæ Fabio quondam ſub duce tuta fuit.</rdg>
</app>
</l>
Another way to capture the same witness data is as follows:
<l n="10">
<app>
<rdg wit="#W #NroW #Bla38"> <app>
<rdg wit="#W">Quâ,</rdg>
<rdg wit="#NroW">quæ</rdg>
<rdg wit="#Bla38">Quæ</rdg>
</app> Fabio quond<app>
<rdg wit="#W">ā</rdg>
<rdg wit="#NroW #Bla38">am</rdg>
</app>
<app>
<rdg wit="#W">s</rdg>
<rdg wit="#NroW #Bla38">ſ</rdg>
</app>ub duce <app>
<rdg wit="#W">nata salus</rdg>
<rdg wit="#NroW #Bla38">tuta fuit</rdg>
</app>.</rdg>
</app>
</l>
Here, content common to all witnesses, rather than repeated, is bounded by the opening and closing tags of a single
<rdg> element with multiple sigla on its
wit attribute. Additional
<app>/
<rdg> instances, nested inside the one that contains the line as a whole, capture at a more granular level all differences among the witnesses. Both versions of this example produce identical output when submitted to one and the same processing instruction—for example, one that extracts the witness data and arranges it in parallel:
[W] Quâ, Fabio quondā sub duce nata salus.
[NroW] quæ Fabio quondam ſub duce tuta fuit.
[Bla38] Quæ Fabio quondam ſub duce tuta fuit.
Note that it would also be possible, using another simple processing instruction, to produce as output from the second version of markup that of the first. This might be desirable in the interest of making the markup as humanly readable as possible, though no effort has been made at this time to apply such a transformation across the XML. Indeed, the actual markup of l. 10 from
Lucus 25, including all nine witnesses, is as follows:
<l n="10">
<app>
<rdg wit="#Bla41 #W #Bla75 #Fj1 #D #Blh21 #Fv45 #Bla38"> <app>
<rdg wit="#W">Quâ,</rdg>
<rdg wit="#Bla41 #Fj1">Quâ</rdg>
<rdg wit="#D">Qua</rdg>
<rdg wit="#Bla75 #Bla38 #Fv45">Quæ</rdg>
<rdg wit="#Blh21">quæ</rdg>
</app>
<app>
<rdg wit="#D #W #Bla41 #Bla38 #Bla75 #Fj1">F</rdg>
<rdg wit="#Blh21">
<g ref="#ffF">ff</g>
</rdg>
<rdg wit="#Fv45">f</rdg>
</app>abio <app>
<rdg wit="#D #Bla75 #Bla38 #Blh21 #Fv45">quondam</rdg>
<rdg wit="#W">quond<choice>
<abbr>ā</abbr>
<expan>am</expan>
</choice>
</rdg>
<rdg wit="#Bla41 #Fj1">quond<choice>
<abbr>ā</abbr>
<expan>am</expan>
</choice>
</rdg>
</app>
<app>
<rdg wit="#W #Blh21 #Fv45 #Bla75 #Bla41">sub</rdg>
<rdg wit="#D #Bla38 #Fj1">ſub</rdg>
</app>
<app>
<rdg wit="#D #W #Blh21 #Fv45 #Bla75 #Bla41 #Bla38">d</rdg>
<rdg wit="#Fj1">D</rdg>
</app>uce <app>
<rdg wit="#W">nata salus.</rdg>
<rdg wit="#Bla41">nata salus</rdg>
<rdg wit="#D #Fj1">nata ſalus.</rdg>
<rdg wit="#Bla75 #Bla38">tuta fuit.</rdg>
<rdg wit="#Blh21">tuta fuit:</rdg>
<rdg wit="#Fv45">tuta fuit</rdg>
</app>
</rdg>
<rdg wit="#NroW"> quæ Fabio quondam ſub duce tuta
fuit.</rdg>
</app>
</l>
This example also introduces several additional features of markup, the details of which are included in the following discussion.
2. Additional Markup Features (General)
Markup of
abbreviations records both the abbreviated form and the expansion by way of
<abbr> and
<expan> elements nested inside a
<choice> element. This feature allows a parser, whether for display purposes or a search-and-retrieval operation, to ‘choose’ one or the other form of the word (or both). The Latin adjective/adverb
quondam, for example, though displayed in its diplomatic form,
quondā (i.e. abbreviated, as it appears in witnesses
Bla41 and
Fj1), is captured also as an editorial expansion in the underlying codebase:
<rdg wit="#Bla41 #Fj1">quond<choice>
<abbr>ā</abbr>
<expan>am</expan>
</choice>
</rdg>
As with the verse-line example above, the same data can be captured with alternative markup:
<rdg wit="#Bla41 #Fj1">
<choice>
<abbr>quondā</abbr>
<expan>quondam</expan>
</choice>
</rdg>
And as is true of the verse-line example, the markup method—in this case whole- versus partial-word—varies throughout.
Persons and
places are tagged in a manner similar to the handling of source witnesses. In this case,
ref attributes on
<persName> and
<placeName> elements point to URIs in the documentation.xml file, each of which provides information about the person or place marked by the
<persName> or
<placeName> element. All references to a person or place, whatever the orthographical variation or form of abbreviation, are resolved in the markup to one and the same identifier, thereby facilitating robust data search and retrieval. For example, both of the following—
<persName ref="person:herbertGeorge">
<choice>
<abbr>G: H:</abbr>
<expan>George Herbert</expan>
</choice>
</persName>
and
<persName ref="person:herbertGeorge">Georg Herb<hi rend="superscript">t</hi>
</persName>
—point to the same entry in the documentation:
<person xml:id="herbertGeorge">
<persName>George Herbert</persName>
<birth when="1593-04-03">
<date>3 April 1593</date>
<placeName>
<name>Montgomery Castle</name>
<settlement>Montgomeryshire</settlement>
<country>Wales</country>
</placeName>
</birth>
<death when="1633-03-01">
<date>1 March 1633</date>
<placeName>
<name>Bemerton Rectory</name>
<district>Bemerton</district>
<settlement>Montgomeryshire</settlement>
<country>Wales</country>
</placeName>
</death>
</person>
This example does not exhaust the kinds of information that might be associated with the
xml:id attribute value ‘herbertGeorge’. Nor is any of the information provided in this example required. It should also be noted that the tagging of references to persons and places is incomplete at this time, though sufficiently established as to be completed by anyone with access to the documentation provided here.
Dates in the source transcriptions are handled through simple standard-form references on the
when attribute in the
<date> element:
<date when="1633-03-12"
calendar="Gregorian">
<choice>
<abbr>A° Dō:</abbr>
<expan>Anno Domini</expan>
</choice> 1632
</date>
The transcriptions, being diplomatic, rely heavily on the Unicode Standard to capture original orthography and special characters. These include Latin long-s and all ligatures, as well as scribal (and sometimes print) abbreviations that employ macrons on vowels signifying nasal consonants, and double macrons for instances signifying three or more characters. Every effort has been made to expand such abbreviations in the codebase through tags parallel to those capturing the original (i.e. the <abbr>/<expan> pairs cited in examples above).
Here, to illustrate, is a complete transcription of the opener to Herbert’s
Will (copy-text version), followed by the TEI-XML from which it is generated (both versions—the copy-text original, signed by Herbert, and the registered copy):
A° Dō: 1632 T: Georgii Herbert cli͞ci def͞.
<opener>
<date when="1633-03-12"
calendar="Gregorian"> <app>
<rdg wit="#NAp95"> </rdg>
</app>
<choice>
<abbr>A° Dō:</abbr>
<expan>Anno Domini</expan>
</choice> 1632</date>
<lb/>
<app>
<rdg wit="#NAp36">
<choice>
<abbr>T:</abbr>
<expan>Testamentum</expan>
</choice>
</rdg>
</app>
<persName ref="person:herbertGeorge">
<app>
<rdg wit="#NAp36">Georgii Herbert</rdg>
</app>
</persName>
<app>
<rdg wit="#NAp36">
<choice>
<abbr>cli͞ci</abbr>
<expan>clerici</expan>
</choice>
<choice>
<abbr>def͞.</abbr>
<expan>defuncti</expan>
</choice>
</rdg>
</app>
</opener>
This example, as it happens, illustrates many of the features described thus far: discrete witnesses in parallel (including whitespace); dates and names (the latter including the
ref attribute referencing additional information in a separate file); Unicode characters capturing nasal consonants and other scribal abbreviations; and editorial expansions of abbreviations. As elsewhere, the
<choice> element facilitates the suppression of one alternant in the rendering (here, the content of the
<expan> element) and expression of the other (that of the
<abbr> element). Again, one might choose the reverse scenario through a small adjustment in the stylesheet instruction:
Anno Domini 1632 Testamentum Georgii Herbert clerici defuncti
[‘In the Year of Our Lord 163[3], George Herbert, priest, dead’]
Other special characters in early modern books and manuscripts are retained in the diplomatic transcriptions. The Unicode Standard in its current iteration provides codepoints for some of these, including Latin long-s (
ſ), the
ſt and
st ligatures, and the various
f ligatures (i.e.
ff,
fl,
ffi, etc.). These are included with no additional markup. Characters lacking such standardization are captured in the transcriptions by way of the
gaiji provision, the TEI module whose name is drawn from the Japanese for ‘external characters’ (i.e. 外字). For these, we invoke the
<g> and
<glyph> elements, where the former is embedded in the transcription and points to the latter, stored under the
<teiHeader> in the documentation.xml file:
<g ref="gaiji:sllig">ſl</g>
<g ref="gaiji:ssilig">ſſi</g>
<g ref="gaiji:οςlig">ος</g>
<g ref="gaiji:usAbbrev">9</g>
<glyph xml:id="sllig">
<localProp name="NAME"
value="LATIN SMALL LIGATURE LONG S L"/>
<mapping>ſl</mapping>
</glyph>
<glyph xml:id="ssilig">
<localProp name="NAME"
value="LATIN SMALL LIGATURE LONG S LONG S I"/>
<mapping>ſſi</mapping>
</glyph>
<glyph xml:id="οςlig">
<localProp name="NAME"
value="GREEK SMALL LIGATURE OMICRON SIGMA"/>
<mapping>ος</mapping>
</glyph>
<glyph xml:id="usAbbrev">
<localProp name="NAME"
value="COMBINING US"/>
<desc>Scribal abbreviation for Latin -us. Unicode U+1DD2 is provided for
‘COMBINING US ABOVE’, but the character/glyph intended here always
follows the characters with which it is combined, and its vertical
alignment is more often linear or even sublinear than
supralinear.</desc>
<mapping>9</mapping>
</glyph>
Note that the
<mapping> element nested within each
<glyph> element corresponds to its counterpart in the transcription’s
<g> element. The
gaiji module thus serves our purpose of capturing the source contents diplomatically while also providing a placeholder for future Unicode entities that correspond to the glyphs described here.
3. Additional Markup for Major Divisions of the Text
Below are brief descriptions of the contents of the seventeen major divisions of the CWDA text, along with any features of the tagging protocol or file arrangement particular to a division and not already discussed above.
3.1. Volume I: English Prose
3.1.1. The Countrey Parson
CP consists of 40 discrete XML files: 37 chapters,
The Authour to the Reader,
The Authour’s Prayers, and the title pages from the 1652 and 1671 editions (including
Herbert’s Remains). Each line break (
<lb>) indicator (an empty element belonging to the TEI milestoneLike model class) is assigned a URI using the
xml:id attribute whose value consists of the
CP chapter number, witness, page number, and line number. Here, for example, is the tag for line 25 on page 2 of chapter 1 in the 1652 edition:
<lb xml:id="cp1-52p2-25"/>
3.1.2. A Treatise of Temperance and Sobrietie
TTS consists of a single XML file documenting four witnesses to Herbert’s translation of Cornaro’s dietary autobiography:
Ts34,
Ts34a,
Ts36, and
Tm78. The scheme for line breaks is similar to the one for
CP (see above), though here there are many instances in which multiple witnesses share the same break—
<lb xml:id="Ts34.Ts34a.Ts36-4-8"/>
—i.e. line 8 on page 4 in three of the four witnesses.
Common to all witnesses are two footnotes corresponding to two asterisks in the text. These are handled using the
<anchor> and
<note> elements, e.g.
*<anchor xml:id="anchor1"/>
<note type="footnote" target="#anchor1"
xml:lang="it" anchored="true">*[content of note 1]</note>,
where the value of the
target attribute on the
<note> element points to that of the
xml:id attribute on the
<anchor> element.
3.1.3. Notes on Valdesso’s Considerations
Herbert’s commentaries on a selection of the
Hundred and Ten Considerations of Juán de Valdés (‘Valdesso’) were published in two seventeenth-century English editions, here
Vc (1638) and
Dc (1646). In the first, Herbert’s notes appear together in the front of the book, followed by Nicholas Ferrar’s English edition of Valdés’s
Considerations. In the later edition of 1646, they are printed as marginal commentaries alongside the passages to which they correspond. Each of the twenty
Considerations, together with its corresponding note(s), is handled in a discrete XML file, with a separate
<div> (i.e. division) element for each of four parts (two for each of two witnesses). Here, for example, is the organizational structure of the file for ‘Consideration 10’ (i.e. nv10.xml):
<div type="NV" xml:id="nv10">
<div type="ConsiderationVc">
<app>
<rdg wit="#Vc">
</rdg>
</app>
</div>
<div type="NoteVc">
<app>
<rdg wit="#Vc">
</rdg>
</app>
</div>
<div type="ConsiderationDc">
<app>
<rdg wit="#Dc">
</rdg>
</app>
</div>
<div type="NoteDc">
<app>
<rdg wit="#Dc">
</rdg>
</app>
</div>
</div>
3.1.4. Outlandish Proverbs and Jacula Prudentum
The collections of proverbs attributed to Herbert were printed in the seventeenth century under three titles: Outlandish Proverbs and Witt’s Recreations (1640), and Jacula Prudentum (1652). For a detailed account of the complex publication history of these collections, see the Textual Introduction for OP/JP in volume one of the Oxford edition. The 1184 proverbs are presented here as a continuous sequence spread across twenty-three files in segments of fifty items each. Twenty-one of these files cover items 1-1032 (op1-50.xml, op51-100.xml, etc.), originally published as Outlandish Proverbs; items 1033-1184, first published as part of the expanded edition Jacula Prudentum, are spread across the remaining two files (i.e. jp1033-1100.xml, jp1101-1184.xml). For reader convenience, items unnumbered in witnesses Jp52 and NLW are numbered here with surrounding square brackets; this numbering is understood to have been supplied by the CWDA editors (i.e. the tagging does not recognize its supplemental status). The result, at any rate, is a consistent cross-numbering of all three witnesses. An additional manuscript witness, Story Books of Little Gidding vol. IIa (LG), is currently untraced and not included here (though its variants, as recorded in Hutchinson, are included in the Oxford edition).
3.1.5. Letters
Five of Herbert’s nineteen English Letters, three of them autograph, come to us in seventeenth-century manuscripts, while the earliest extant witnesses to the remainder are found in print publications. Our diplomatic transcriptions in several instances resort to the use of the <table> element, together with its XML descendants, <row> and <col>, to indicate approximately the layout of information otherwise captured by the <closer> and <signed> elements alone. We do so recognizing the TEI provision that ‘it may often be difficult to make a clear distinction between details relating purely to the rendition of information and those relating to the information itself’ (chapter 15, Tables). Complementing the <closer> element and its child-element <signed> in the Letters files are the <opener> element and its child, <salute>. The <head> element is reserved for a letter’s title-like heading, whatever its contents, and if it appears at the beginning, prior to the opening <salute> (where present) or first paragraph. One exception is Letters 19, where this information appears alone on the verso of a two-page manuscript and is captured therefore using the <byline> element.
3.1.6. Herbert’s Will
A single file combines both the original (signed by the testator) and registered copies. To balance accommodation of differences in layout between the witnesses against the desire for elegant rendering in the HTML (the latter by way of the same set of stylesheets currently applied to all other members of the CWDA’s XML corpus), the
Will tagging is structured as follows:
<body>
<div n="1" xml:id="mainBody"/>
<div n="2" xml:id="discharges">
<ab n="1" xml:id="dischargesPreamble"/>
<ab n="2" xml:id="dischargesList">
<app>
<rdg wit="#NAp36"/>
<rdg wit="#NAp95"/>
</app>
</ab>
</div>
<div n="3" xml:id="memoriaeSua"/>
<postscript xml:id="probate"
xml:lang="Latin"/>
<closer/>
</body>
Division two contains an itemized list of what the will refers to as ‘legacyes’, identified in the encoding as ‘discharges’ (i.e. the value of an
xml:id attribute). This division is sub-divided into two anonymous-block (
<ab>) elements, one containing a brief preamble, the other containing the list itself. Unlike all but one other division of the text (
<postscript>, discussed below) and their child elements in which the two witnesses are interwoven on a granular level, the list of discharges (
<ab> #2) is divided into two broad
<rdg> elements, each containing the whole of one witness or the other. This difference results from a need to facilitate the capture of complex divergences in the witnesses’ presentation of
<ab> #2’s content.
For similar reasons, rather than using the
<p> element, paragraphs throughout are marked at their starts by the empty
<milestone> element, which includes in all cases the
unit attribute, whose value is ‘paragraph’, together with the
ed attribute, whose value is the siglum of the witness to which the paragraph break corresponds, e.g.
<milestone unit="paragraph" ed="#NAp36"/>
Division three, which begins with the Latin abbreviation ‘MS’ (i.e. memoriæ sua or ‘[for] his memory’), pertains to additional directions. This is followed by the <postscript> element containing the Latin text of probate, a legal requirement for entering the will into the official registry. This registered witness is presented under the siglum NAp95; the original-witness siglum is NAp36. (Additional information pertaining to these witnesses—and all others from across the edition—is provided under the <sourceDesc> element in the documentation.xml file, and more extensively in the Oxford edition.) A concluding <closer> element captures information pertaining to the notary public who examined and approved the will.
3.2. Volume II: Latin and Greek Verse and Prose
3.2.1. Musæ Responsoriæ
MR consists of one print and two manuscript witnesses captured across forty-three poem files, three of which are dedicatory (for James I, Prince Charles, and the bishop of Winchester, Lancelot Andrewes). A third manuscript witness is included for poem files 11, 14, and 40.
Also included here is a file capturing five witnesses to Andrew Melville’s
Anti-Tami-Cami-Categoria, the 204-line poem to which Herbert’s polemical sequence is a ‘Reply’. These include the same three witnesses containing the whole of
MR as well as the David Calderwood editions of
Parasynagma Perthense (1620) and
Altare Damascenum (1623). Rather than interwoven on a line-by-line basis, the five witnesses are captured in their entirety as separate sub-divisions of the main poem, with the
corresp attribute on each
<div> element pointing to its witness:
<text>
<front>
<head type="textDivision">Anti-tami-cami-categoria</head>
<div>
<listWit>
<witness xml:id="Fj1">
<name type="siglum" ref="source:Fj1">Fj1</name>
</witness>
<witness xml:id="Bla41">
<name type="siglum" ref="source:Bla41">Bla41</name>
</witness>
<witness xml:id="Atc20">
<name type="siglum" ref="source:Atc20">Atc20</name>
</witness>
<witness xml:id="Atc23">
<name type="siglum" ref="source:Atc23">Atc23</name>
</witness>
<witness xml:id="D">
<name type="siglum" ref="source:D">D</name>
</witness>
</listWit>
</div>
</front>
<body>
<div type="poem">
<head>
<app>
<rdg wit="#Fj1"/>
<rdg wit="#Bla41"/>
<rdg wit="#Atc20"/>
<rdg wit="#Atc23"/>
<rdg wit="#D"/>
</app>
</head>
<epigraph/>
<div corresp="#Fj1"/>
<div corresp="#Bla41"/>
<div corresp="#Atc20"/>
<div corresp="#Atc23"/>
<div corresp="#D"/>
</div>
</body>
</text>
ATCC witness
Atc23, presented in two columns in the source, is captured by way of the
<table> element: one-hundred rows in two columns for a total of two-hundred lines of verse (lines 5-8 are omitted in this witness):
<div corresp="#Atc23">
<app>
<rdg wit="#Atc23">
<table rows="100" cols="2">
<row n="1">
<cell>
<l n="1"/>
</cell>
<cell>
<l n="31"/>
</cell>
</row>
<row n="2">
<cell>
<l n="2"/>
</cell>
<cell>
<l n="32"/>
</cell>
</row>
</table>
</rdg>
</app>
</div>
Several poems in the
MR sequence include special features as follows. The Duport edition (witness
D) includes marginal notes—i.e.
<note> elements whose content refers to specific lines in
ATCC. Each of these is handled by way of an
<anchor> element bearing an
xml:id attribute whose value corresponds to the
<note> element containing the Melville reference. The note in turn points to its corresponding anchor by way of the
target attribute, e.g.
<anchor xml:id="mr4ref-2"/>
<note n="2" target="#mr4ref-2"
anchored="true">2 <hi rend="italics">Inde ad <lb/>verſ</hi>. 128.</note>
The
<anchor> element is located at the precise place in the text to which the note refers. This arrangement allows for maximum encoding and processing flexibility—accommodating, for example, disparate locations in the file for the
<note> element and its corresponding
<anchor>; or, say, the creation of a list of all Melville references and their precise locations in the
MR sequence. In many (but not all) such cases the
<anchor> element is located immediately adjacent to an actual superscript footnote anchor in the source transcription, captured here by way of the
<hi> element, which includes a
corresp attribute that points to the
<anchor>. The note itself in this case is located elsewhere in the file:
<l n="4">
<app>
<rdg wit="#D">Prædicat autores; <anchor xml:id="mr4ref-3"/>
<hi rend="superscript"
corresp="#mr4ref-3">3</hi>tertia plena Deo eſt.</rdg>
</app>
</l>
<note n="3" target="#mr4ref-3"
anchored="true">3 <hi rend="italics">Inde</hi>
<lb/>176.</note>
The content of the
<note> element is a reference to Melville’s
ATCC that in this way is tied directly to its corresponding
<anchor> and the adjacent footnote anchor in the source. (We hasten to note that several of the
D witness’s references to
ATCC are inaccurate, errors first identified in Hutchinson; and that while it is possible to add a
<choice>/
<sic>/
<corr> feature that would preserve the errors and provide corrections, we have not done so at this time—our objective for CWDA, as distinct from the Oxford critical edition, being limited to accurate capture of the primary-source content.)
Similar to the handling of Melville references in witness
D is that of single-word glosses in the left and right margins of
MR poem 33, witness
Fj1, e.g.
<anchor xml:id="marginNote-7"/>
<note target="#marginNote-7">Ordo</note>
The word ‘Ordo’ in the seventh of such notes corresponds to its appearance in l. 29 of the poem. The
<anchor> element is located at the end of the line, while its corresponding
<note> is located elsewhere in the file. Where a poem’s
<note> and corresponding
<anchor> elements are located in disparate parts of the file, this is usually due to the need to capture the layout of the source using the
<table> element. In all such cases the poem and its accompanying sets of marginal references/glosses are separated into discrete
<cell> elements, one for the poem itself and one for each marginal grouping, whether left, right, or both. These
<cell> elements in addition are numbered 1-3 using the
n attribute:
<table rows="1" cols="3">
<row>
<cell n="1" rend="marginLeft">
<app>
<rdg wit="#Fj1">
<note target="#marginNote-7">Ordo</note>
</rdg>
</app>
</cell>
<cell n="2">
<lg>
<l n="14">
<app>
<rdg wit="#Fj1">Nempe Novatores quis Veteranus
amat?</rdg>
</app>
<app>
<rdg wit="#Fj1">
<anchor xml:id="marginNote-1"/>
</rdg>
</app>
</l>
<l n="29">
<app>
<rdg wit="#Fj1">Militat et nobis, quem vos
contemnitis, ordo:</rdg>
</app>
<app>
<rdg wit="#Fj1">
<anchor xml:id="marginNote-7"/>
</rdg>
</app>
</l>
</lg>
</cell>
<cell n="3" rend="marginRight">
<app>
<rdg wit="#Fj1">
<note target="#marginNote-1">Asſecla</note>
</rdg>
</app>
</cell>
</row>
</table>
We note, finally, that all three of the complete witnesses to MR include, between numbered poems 30 and 31, an unnumbered copy of Herbert’s ‘Roma. Anag.’ (see Lucus, below).
3.2.2. Passio Discerpta
One of two autograph Latin verse sequences in W, its sole witness, PD consists of twenty-one poems in as many files.
3.2.3. Lucus
The second of two autograph verse sequences in witness W, the thirty-five poems in Lucus include three with additional witnesses: poems 10, 25, and 32. The file for Lucus 32 (variously titled ‘Triumphus Mortis’ and ‘Inventa Bellica’) includes, in addition to W, three seventeenth-century manuscript witnesses and a copy of the nineteenth-century Pickering edition of Herbert’s Works. Lucus 25, the notorious Rome anagram (‘Roma. Anagr.’), includes copies of the poem as it appears in the Duport edition of 1662 (D) and in seven manuscript witnesses additional to W. In two of these manuscripts (Fj1 and Bla41), and in D, the poem is situated, unnumbered, between numbered epigrams 30 and 31 of the Musæ Responsoriæ.
3.2.4. Memoriæ Matris Sacrum
The nineteen poems of MMS (fourteen in Latin, five in Greek) are transcribed from a single print witness published in 1627, with one exception, an incomplete manuscript copy of MMS2 (ll. 1-51). The XML does not include markup either acknowledging or correcting the many errors in the application of Greek diacritics. Here, and across the archive, the transcriptions are strictly diplomatic.
3.2.5. Alia Poemata
CWDA’s handling of Herbert’s miscellaneous Latin verse, eleven poems in all, is mostly unremarkable. We combine as a single item (AP1) two untitled poems ascribed to Herbert in Epicedium Cantabrigiense (1612), published on the occasion of the death of Henry, Prince of Wales. The title, given here in square brackets, is from the volume’s title page (i.e. ‘In obitum immaturum … Principis Walliæ’). The same title treatment is applied to AP4 (‘In obitum Serenissimæ Reginæ Annæ’), from the title page of the poem’s single print witness, Lacrymæ Cantabrigienses (1619).
The four poems for Francis Bacon (AP5-7 and 10) are captured here from a variety of early witnesses, manuscript and print. The most complex of these XML files contains the eleven-witness AP7, ‘In honorem Illustriss. D. D. Verulamii, Vicecomitis Sti Albani’ (four print volumes and seven manuscripts). With one exception, these witnesses are interwoven on a line-by-line basis, the granular encoding resembling the sample provided above. The exception, because it presents the poem in two columns, is captured using the <table> element under a single instance of the <rdg> element whose wit value is the siglum Dd72a.
Somewhat complex too is AP11, whose seven print witnesses present a variety of layouts, particularly with respect to stanza groupings and line breaks. For this reason, the file forgoes the <lg> element, capturing stanza divisions instead with the <milestone> element, the value of whose unit attribute is ‘stanza’, while its ed attribute bears the siglum/sigla to which the stanza break applies.
3.2.6. Dubia
Each of the five poems here is based on a single known manuscript witness (which is part of the reason why it is deemed to be among poemata incerti auctoris). The five are numbered relative to their candidacy for inclusion in the Herbert canon (Dubia 1 having the strongest claim, Dubia 5 the weakest—see the Oxford edition for details).
3.2.7. Orationes
As with the English prose, the markup for Herbert’s Latin
Orationes includes
<lb> elements, each with its own URI, e.g.
<lb xml:id="orat1-C4v-33"/>,
here the thirty-third line on f. C4v of the sole witness to
Oratione 1. For
Oratione 3 only, the medial number on the
xml:id value is a page rather than folio indicator, e.g. the fifth line on p. 11:
<lb xml:id="orat3-11-5"/>
We also provide here the English version of Oratione 1 that was published alongside its counterpart in Latine Orations (1623)—though it is uncertain whether Herbert was responsible for the translation.
3.2.8. Epistolæ
Of Herbert’s Latin correspondence, nineteen letters survive, sixteen of them recorded in the official ‘Orator’s Book’ of the Epistolæ Academiæ (in a hand not Herbert’s, though the headings to all sixteen and the sign-offs to three are in fact autograph). All but two of the sixteen appear in one or more additional witnesses (sigla GBE, GBZ, and/or Bla73). Witnesses to three of the nineteen letters include Herbert’s autograph originals. Though <lb> elements mark all line breaks in the sources, they have yet to be assigned URIs. And, as is sometimes the case elsewhere in CWDA, the source witnesses here are transcribed in their entirety as separate divisions of the single XML file containing them.
Epistolæ 17’s sole witness (Wj57) includes notes and note anchors, as well as (perhaps oddly, for a manuscript) catchwords. The notes have target attributes pointing to corresponding <anchor> elements placed next to superscript symbols that appear in the text. These latter are captured by <hi> elements, each with a corresp attribute whose value points to that of the xml:id attribute on its adjacent <anchor>. Notes 1 and 2 are footnotes located at the bottom of the page, while note 3 is located in the left margin next to the line of text in which its anchor occurs. The place attribute on <note> captures this distinction. Wj57’s three catchwords are captured using the <fw> element—even though catchwords in manuscripts, strictly speaking, are not forme-work features, and the third catchword here (‘ornatu’) lacks its counterpart at the top of the following page.
Epistolæ 18 is one of the few surviving Herbert autographs. Neither here nor elsewhere in this resource, however, do we include the <handShift> element and scribe or scribeRef attribute to identify hands. Our transcription otherwise is diplomatic throughout, capturing all Latin abbreviations and inserting their expanded forms as <expan> elements. To accommodate the capture of layout, the <table> element, child to a single <closer> element, contains two <cell> elements, one for a postscript to the letter, the other for Herbert’s signature. These are followed by a salutation (<salute>) addressed to the letter’s recipient, the bishop of Winchester, Lancelot Andrewes.
3.3. Volume III: English Verse
3.3.1. The Temple
The three essential witnesses—Williams MS. Jones B62 (
W), Bodleian MS. Tanner 307 (
B), and the first edition of 1633 (
T)—are captured here as diplomatic transcriptions in line-by-line parallel segmentation, with one exception: ‘Easter-wings’. This most famous of Herbert’s shape poems is encoded as two parallel
<div> elements nested within a comprehensive third
<div>. The first of these subdivisions captures the two facing-page stanzas of witnesses
W and
B as the two columns of a single
<table> element. The second subdivision is devoted exclusively to witness
T, where the poem’s lines are printed vertically—top to bottom and right to left. All twenty lines for this witness are nested within a single
<lg> element, the value of whose
type attribute is ‘doubleDecet’ and whose
rend attribute carries the additional values ‘center’ and ‘vertical-text’:
<text xml:id="easterWings">
<body>
<div>
<div xml:id="easterwingsWB">
<table rows="11" cols="2">
<row>
<cell>
<milestone unit="stanza" n="1"
ed="#W #B"/>
<l n="1" real="D|S|I|I/">
<app>
<rdg wit="#W">
<lb/>Lord who createdst man in
wealth & store</rdg>
<rdg wit="#B">Lord, who createdst Man in wealth
& store,</rdg>
</app>
</l>
</cell>
<cell>
<milestone unit="stanza" n="2"
ed="#W #B"/>
<l n="11">
<app>
<rdg wit="#W">
<lb/>My tender age in sorrow did
beginn</rdg>
<rdg wit="#B">My tender age in sorrow did
beginne:</rdg>
</app>
</l>
</cell>
</row>
</table>
</div>
<div xml:id="easterwingsT">
<lg type="doubleDecet"
rend="center vertical-text">
<l n="11">
<app>
<rdg wit="#T">My tender age in ſorrow did
beginne:</rdg>
</app>
</l>
</lg>
</div>
</div>
</body>
</text>
These specifications facilitate stylesheet processing whose display output presents the witnesses in parallel while retaining their distinctive layouts: the two stanzas or ‘wings’ of W and B as horizontal lines pressing rightward, those of T as ‘ascending’ vertical lines. The twenty lines are numbered in the encoding according to their traditional verso-recto order, even if their order for witness T in the XML is, first, lines 11-20 (the recto side of the two-page opening), followed by lines 1-10 (the verso).
Here, to illustrate, are renderings using a customized iteration of the Versioning Machine:
3.3.2. Poems in Williams MS. Jones B62 (W)
In addition to roughly half of the poems in what would come to be known as The Temple, together with the Latin sequences discussed above, W includes six English poems that are omitted in B and T. (It is perhaps notable that the title The Temple appears nowhere in W, and that while it does appear in B, the subtitle Sacred Poems and Private Ejaculations occurs in neither manuscript, appearing for the first time in the edition of 1633.) Included here in the folder that contains the Temple poems, the six additional W files are distinguished from the others by virtue of the <text> element, the value of whose type attribute is ‘williamsMS’.
3.3.3. Other English Poems
Ten of the eleven titles included here were known to Hutchinson and are now widely accepted as canonical. Two notable additions to the textual record are a copy of the English poem to Francis Bacon (siglum Bla41), accompanied, as in the other three witnesses, by a copy of ‘Æthiopissa ambit Cestum diversi coloris virum’; and three witnesses to a poem omitted in all modern Herbert editions, ‘On the death of Mr Barker of Hammon, and his wife who dyed both together’.
4. Rhyme and Metrical Analysis
Herbert’s lyrical and formal sophistication is well-known. CWDA aims to enhance the study of this aspect of his verse by invoking the TEI provisions for rhyme and metrical analysis across the entirety of The Temple, namely the met, real, and rhyme attributes, as well as the <rhyme> element. The met attribute specifies the conventional metre within which the poet is working; the real attribute specifies the ‘realized’ pattern of the line (often a subjective value); and the rhyme attribute specifies the rhyming pattern as based on the final words or syllables of successive lines. This latter feature is complemented by the <rhyme> element. Not to be confused with the rhyme attribute, the <rhyme> element marks the actual rhyming word or syllable, and includes a label attribute specifying its role in the rhyme pattern. (This aspect of the tagging, provided in the samples here, has yet to be applied across all Temple poems.)
Here is a typical example from ‘The Church-porch’ (with the text of
W only, the witness that serves as copy-text for this poem in the Oxford edition):
<div type="poem" met="-+|-+|-+|-+|-+/"
rhyme="ababcc">
<head/>
<lg type="sestet" n="1">
<l n="1" real="T|S|I|I|I/">
<app>
<rdg wit="#W">Thou whose sweet youth & early hopes
in<rhyme label="a">hance</rhyme>
</rdg>
</app>
</l>
<l n="2" real="I|I|I|I|H/">
<app>
<rdg wit="#W">The price of thee, & mark thee for a
<rhyme label="b">treasure</rhyme>:</rdg>
</app>
</l>
<l n="3" real="T|I|H|A/">
<app>
<rdg wit="#W">Harken vnto a Verser, who may <rhyme label="a">chance</rhyme>
</rdg>
</app>
</l>
<l n="4" real="T|I|I|I|H/">
<app>
<rdg wit="#W">Ryme thee to good & make a bait of <rhyme label="a">pleasure</rhyme>.</rdg>
</app>
</l>
<l n="5" real="H|S|A|I/"> <app>
<rdg wit="#W">A Verſe may find him, who a sermon <rhyme label="c">flies</rhyme>
</rdg>
</app>
</l>
<l n="6"> <app>
<rdg wit="#W">And turne delight into a sacri<rhyme label="c">fice</rhyme>.</rdg>
</app>
</l>
</lg>
</div>
The value of the
met attribute on the
<div> element is a series of symbols whose meanings are documented under the
<metDecl> element in the documentation.xml file: the unstressed and stressed syllables (- and +), the metrical foot boundary (|), and the metrical line boundary (/). The metrical pattern, when specified on the
<div> element, as here, is understood to recurse through the entire poem, indicating in this case that iambic pentameter is the conventional metre for all of its 462 lines. The
real attribute, applied to each line (i.e.
<l>) individually, specifies the actual pattern ‘realized’ within the conventional one, indicated here by alphabetical symbols also documented under the
<metDecl> element—i.e. iamb (I), trochee (T), spondee (S), amphibrach (H), anapest (A), pyrrhus (P), etc. These are interchangeable (and documented as such) with combinative strings of -, +, and | symbols (-+|, +-|, ++|, -+-|, etc.) representing these metrical-foot units:
<metDecl pattern="[ITAHSP]+">
<metSym n="iamb" value="I" terminal="false">-+|</metSym>
<metSym n="trochee" value="T"
terminal="false">+-|</metSym>
<metSym n="anapest" value="A"
terminal="false">--+|</metSym>
<metSym n="dactyl" value="D"
terminal="false">+--|</metSym>
<metSym n="amphibrach" value="H"
terminal="false">-+-|</metSym>
<metSym n="spondee" value="S"
terminal="false">++|</metSym>
<metSym n="pyrrhus" value="P"
terminal="false">--|</metSym>
<metSym value="-">unstressed syllable</metSym>
<metSym value="+">stressed syllable</metSym>
<metSym value="|">foot boundary</metSym>
<metSym value="/">metrical line boundary</metSym>
</metDecl>
Where a
<l> element omits the
real attribute, its realized pattern is understood to be identical with the conventional one specified by the
met attribute on the
<div> element.
Just as the value of met on <div> is understood recursively, so too the value of the rhyme attribute; the ababcc pattern of the present example, in other words, describes all 77 stanzas or <lg>s —with one exception, namely, any <lg> that includes its own rhyme attribute, understood in such cases to supercede the rhyme attribute on the higher-level <div> element.
Software programmed with these parametres in mind should be able to retrieve and process all data pertinent to metrical analysis and rhyme. In order to ensure robust processing of such data across the Temple corpus, however, other poems require a more complex treatment.
For example, ‘Antiphon’ II (titled ‘Ode’ in
W) consists of three sestets with identical
met and
rhyme schemes, followed by a cinquain that differs in both metre and rhyme. This structure is further complicated by the presence throughout of
<sp> (i.e. speech) and
<speaker> elements. To accommodate this (and similar structures elsewhere in the files), the poem’s stanzas are marked not by
<lg>s, but rather by special
<milestone> elements, the value of whose
unit attribute is always ‘stanza’, while the value of its
type attribute is either ‘sestet’ or ‘cinquain’. The
met and
rhyme attributes in this case are assigned to neither the
<div> nor the
<lg> element, but rather to this special
<milestone> element whose
unit value is ‘stanza’ (and which functions effectively as a replacement for the
<lg> element). This project’s TEI schema (the works.rng file) has been modified to accommodate these specifications by adding
met and
rhyme to the list of permissible attributes included under the
<element> element, the value of whose
name attribute is ‘milestone’. Here is an excerpt from ‘Antiphon’ II:
<div type="poem">
<head>Ode. [Antiphon II]</head>
<milestone unit="stanza" type="sestet"
n="1"
met="-+|-+|-+|-+/-+|-+/-+|-+/-+|-+|-+|-+/-+|-+/-+|-+/" rhyme="ababcb"/>
<sp who="#chorus">
<speaker> Chorus. </speaker>
<l n="1" real="T|I|I/"> Praised be the God of <rhyme label="a">Love</rhyme>,</l>
</sp>
<sp who="#men">
<speaker> Men </speaker>
<l n="2" real="+|I/"> Heere <rhyme label="b">below</rhyme>:</l>
</sp>
<sp who="#angels">
<speaker> Angels. </speaker>
<l n="3" real="I|I/"> And heere <rhyme label="a">above</rhyme>
</l>
</sp>
<sp who="#chorus">
<speaker> Cho. </speaker>
<l n="4" real="+|I|I|I/"> Who hath dealt his mercies <rhyme label="b">so</rhyme>
</l>
</sp>
<sp who="#angels">
<speaker> Ang. </speaker>
<l n="5" real="+|I/"> To his <rhyme label="c">friend</rhyme>
</l>
</sp>
<sp who="#men">
<speaker> Men. </speaker>
<l n="6" real="I|I/"> And to his <rhyme label="b">foe</rhyme>.</l>
</sp>
<milestone unit="stanza" type="cinquain"
n="4"
met="-+|-+|-+|-+/-+|-+/-+|-+/-+|-+|-+|-+/-+|-+|-+|-+/" rhyme="ababb"/>
<sp who="#chorus">
<speaker> Cho. </speaker>
<l n="19" real="+|I|I|I/"> Lord thou dost deserve much <rhyme label="a">more</rhyme>
</l>
</sp>
<sp who="#men">
<speaker> Ang. </speaker>
<l n="20" real="+|I/"> Wee have <rhyme label="b">none</rhyme>,</l>
</sp>
<sp who="#angels #men">
<speaker> Men. </speaker>
<l n="21" real="I|I/"> Wee have no <rhyme label="a">store</rhyme>.</l>
</sp>
<sp who="#chorus">
<speaker> Cho. </speaker>
<l n="22" real="+|I|I|I/"> Praised be the God <rhyme label="b">alone</rhyme>
</l>
<l n="23" real="+|I|I|I/"> Which hath made of two folds <rhyme label="b">one</rhyme>.</l>
</sp>
</div>
Note that the
real attribute, whose value expresses the realized metre in counterpoint to the conventional metre expressed by the
met attribute, assumes its usual position on the
<l> (i.e. line) element.
Still other poems are distnguished by overlapping patterns of metre and rhyme. ‘The Church-floore’, for example, consists broadly of two sections of twelve and eight lines respectively. The octave, though innovative, is fairly straightforward in terms of the encoding (six tetrameter punctuated by two pentameter lines, with a rhyme scheme resembling the sestet of an Italian sonnet to which has been added a closing couplet):
<lg type="octave"
met="-+|-+|-+|-+/-+|-+|-+|-+/-+|-+|-+|-+|-+/-+|-+|-+|-+/-+|-+|-+|-+/-+|-+|-+|-+|-+/-+|-+|-+|-+/-+|-+|-+|-+/" rhyme="aabccbdd">
<l n="13" real="T|T|T|I/"> Hither sometimes sinne steales &
staines</l>
<l n="14"> The Marbles neat, & curious vaines:</l>
<l n="15" real="I|H|A|I/"> But all is cleansed, when the Marble
weeps.</l>
<l n="16" real="T|+|T|A/"> Sometimes death, puffing at ye doore,</l>
<l n="17"> Blows all the dust about the floore;</l>
<l n="18"> But while he thinkes to spoile the roome, he sweeps.</l>
<l n="19" real="D|D|I/"> Blest be the Architect, whose art</l>
<l n="20" real="I|I|P|S/"> Could build so strong in a weake
heart.</l>
</lg>
The twelve-line first half, however, is slightly more complex: four metrically identical tercets, across which stanzaic units plays a more extended rhyme scheme. To accommodate this overlapping structure, those twelve lines are tagged as a single line group element (i.e.
<lg>) within which are nested four tercet
<lg>s, a scheme rare for this project but wholly in accordance with TEI-conformant XML:
<lg rhyme="abcabdefcefd">
<lg type="tercet" n="1"
met="-+|-+|-+|-+|-+/-+|-+|-+/-+|-+/">
<l n="1" real="T|I|I|I|I/"> Mark you the floore? that square &
speckel’d stone,</l>
<l n="2"> Which lookes so firme & strong,</l>
<l n="3"> Is Patience.</l>
</lg>
<lg type="tercet" n="2"
met="-+|-+|-+|-+|-+/-+|-+|-+/-+|-+/">
<l n="4">
</l>
<l n="5">
</l>
<l n="6">
</l>
</lg>
<lg type="tercet" n="3"
met="-+|-+|-+|-+|-+/-+|-+|-+/-+|-+/">
<l n="7" real="I|H|A|I/">
</l>
<l n="8" real="T|I|I/">
</l>
<l n="9">
</l>
</lg>
<lg type="tercet" n="4"
met="-+|-+|-+|-+|-+/-+|-+|-+/-+|-+/">
<l n="10" real="A|I|A|S/">
</l>
<l n="11" real="T|I|I/">
</l>
<l n="12">
</l>
</lg>
</lg>
5. CWDA Infrastructure and User Interface
All preceding documentation pertains to the capture and expression of the Herbert corpus as TEI-XML. These XML files are stored in folders whose titles correspond to those listed above. (A copy of the
Temple XML files, with tags for rhyme and metrical analysis as described above, is stored in a separate folder titled TempleProsody.) Nested within each of these titled folders is an
images folder that contains images of all the source pages referenced in the XML by way of attributes on the
<graphic> and
<pb> (i.e. pagebreak) elements:
<facsimile>
<graphic url="images/ec65" xml:id="EC65"/>
<graphic url="images/ec66" xml:id="EC66"/>
<graphic url="images/ec67" xml:id="EC67"/>
</facsimile>
<pb ed="#EC" facs="#EC65"/>
<pb ed="#EC" facs="#EC66"/>
<pb ed="#EC" facs="#EC67"/>
The
facs (i.e. facsimile) attribute on the
<pb> element points to its corresponding
xml:id attribute on a
<graphic> element whose
url attribute (i.e. unique record locator) references the location of the folder in which the image is stored, named for its corresponding siglum and folio/page number (e.g.
ec65). Each of these images is based on an uncompressed TIFF file that is processed and arranged as a series of nested raster tiles, together with an XML file specifying zoom levels and related parametres.
A separate src (i.e. source) folder contains the following additional files:
works.rng
works.xsl
works.css
works.js
worksOptionlists.xsl
The first of these src files is the project schema, which expresses the TEI-specific constraints on the structure and content of the XML. It was created using the TEI’s Roma editor for meta-schema documentation and local encoding guidelines. Linked to the XML files by way of XML editing software (e.g. Oxygen), the RNG schema is used to validate the XML content, ensuring that it follows a set of pre-determined ‘rules’ (i.e. the schema specifications); violations are flagged by the software.
The remaining four files are the components of a customized iteration of the Versioning Machine, open-source software used to display the project’s XML-encoded content. The works.xsl file, combined with the additional CSS and JS stylesheets it invokes, transforms the XML content into Versioning Machine output. An additional XSL file invoked by works.xsl is the worksOptionlists.xsl file, which provides the project’s various tables of contents.
A Saxon XSLT processor in the Oxygen XML editor runs the stylesheets and their instructions against the TEI-XML files to create the end-user output: source transcriptions, images, and tables of contents (i.e. drop-down menus) combined in a browser-based (i.e. HTML) interface that facilitates intuitive navigation of the contents.