TEI Lite: Encoding for Interchange: an introduction to the TEI
Final revised edition for TEI P5
Lou Burnard
C. M. Sperberg-McQueen
August 2012

Prefatory note

TEI Lite was the name adopted for what the TEI editors originally conceived of as a simple demonstration of how the TEI (Text Encoding Initiative) encoding scheme might be adopted to meet 90% of the needs of 90% of the TEI user community. In retrospect, it was predictable that many people should imagine TEI Lite to be all there is to TEI, or find TEI Lite to be far too heavy for their needs.

The original TEI Lite (1996) was based largely on observations of existing and previous practice in the encoding of texts, particularly as manifest in the collections of the Oxford Text Archive and other collections of the period. It is therefore unsurprising that it seems to have become, if not a de facto standard, at least a common point of departure for electronic text centres and encoding projects world wide. Maybe the fact that we actually produced this shortish, readable, manual for it also helped.

Early adopters of TEI Lite included a number of ‘Electronic Text Centers’ and digital library initiatives. It was also adopted as the basis for some early TEI-conformant authoring systems, and as the basis for introductory tutorials, many of them in languages other than English (see further the list of legacy versions at http://www.tei-c.org/Vault/P4/Lite/).

In 2002, following the publication of TEI P4, the XML version of the TEI Guidelines, which uses the generation of TEI Lite as an example of the TEI modification mechanism, the opportunity was taken to produce a lightly revised XML-conformant version. In 2006, a more substantially revised version based on TEI P5 was produced; this reflected the many changes between TEI P4 and TEI P5, but was not otherwise significantly different. In 2012, the TEI Technical Council, decided that a final revision should be undertaken to ensure that the documentation remained consistent with the latest (2.1) release of TEI P5. This version uses a recently added mechanism in the TEI customization architecture, which permits a customization to define only the TEI elements to be included in a schema, rather than the elements to be excluded from it. As such it is probably more resilient to change than earlier versions.

Lou Burnard, August 2012

Table of contents

This document provides an introduction to the recommendations of the Text Encoding Initiative (TEI), by describing a specific subset of the full TEI encoding scheme. The scheme documented here can be used to encode a wide variety of commonly encountered textual features, in such a way as to maximize the usability of electronic transcriptions and to facilitate their interchange among scholars using different computer systems. It is fully compatible with the full TEI scheme, as defined by TEI document P5, Guidelines for Electronic Text Encoding and Interchange, as of February 2006, and available from the TEI Consortium website at http://www.tei-c.org/.

1 Introduction

The Text Encoding Initiative (TEI) Guidelines are addressed to anyone who wants to interchange information stored in an electronic form. They emphasize the interchange of textual information, but other forms of information such as images and sound are also addressed. The Guidelines are equally applicable in the creation of new resources and in the interchange of existing ones.

The Guidelines provide a means of making explicit certain features of a text in such a way as to aid the processing of that text by computer software running on different machines. This process of making explicit we call markup or encoding. Any textual representation on a computer uses some form of markup; the TEI came into being partly because of the enormous variety of mutually incomprehensible encoding schemes currently besetting scholarship, and partly because of the expanding range of scholarly uses now being identified for texts in electronic form.

The TEI Guidelines describe an encoding scheme which can be expressed using a number of different formal languages. The first editions of the Guidelines used the Standard Generalized Markup Language (SGML); since 2002, this has been replaced by the use of the Extensible Markup Language (XML). These markup languages have in common the definition of text in terms of elements and attributes, and rules governing their appearance within a text. The TEI's use of XML is ambitious in its complexity and generality, but it is fundamentally no different from that of any other XML markup scheme, and so any general-purpose XML-aware software is able to process TEI-conformant texts.

Since 2001, the TEI has been a community initiative supported by an international membership consortium. It was originally an international research project sponsored by the Association for Computers and the Humanities, the Association for Computational Linguistics, and the Association for Literary and Linguistic Computing, with substantial funding over its first five years from the U.S. National Endowment for the Humanities, Directorate General XIII of the Commission of the European Communities, the Andrew W. Mellon Foundation, the Social Science and Humanities Research Council of Canada and others. The Guidelines were first published in May 1994, after six years of development involving many hundreds of scholars from different academic disciplines worldwide. During the years that followed, the Guidelines became increasingly influential in the development of the digital library, in the language industries, and even in the development of the World Wide Web itself. The TEI Consortium was set up in January 2001, and a year later produced an edition of the Guidelines entirely revised for XML compatibility. In 2004, it set about a major revision of the Guidelines to take full advantage of new schema languages, the first release of which appeared in 2005. This revision of the TEI Lite document conforms to version 2.1 of this most recent edition of the Guidelines, TEI P5, released in June 2012.

At the outset of its work, the overall goals of the TEI were defined by the closing statement of a planning conference held at Vassar College, N.Y., in November, 1987; these ‘Poughkeepsie Principles’ were further elaborated in a series of design documents. The Guidelines, say these design documents, should:

The world of scholarship is large and diverse. For the Guidelines to have wide acceptability, it was important to ensure that:

  1. the common core of textual features be easily shared;
  2. additional specialist features be easy to add to (or remove from) a text;
  3. multiple parallel encodings of the same feature should be possible;
  4. the richness of markup should be user-defined, with a very small minimal requirement;
  5. adequate documentation of the text and its encoding should be provided.

The present document describes a manageable selection from the extensive set of elements and recommendations resulting from those design goals, which is called TEI Lite.

In selecting from the several hundred elements defined by the full TEI scheme, we have tried to identify a useful ‘starter set’, comprising the elements which almost every user should know about. Experience working with TEI Lite will be invaluable in understanding the full TEI scheme and in knowing how to integrate specialized parts of it into the general TEI framework.

Our goals in defining this subset may be summarized as follows:

The reader may judge our success in meeting these goals for him or herself.

Although we have tried to make this document self-contained, as suits a tutorial text, the reader should be aware that it does not cover every detail of the TEI encoding scheme. All of the elements described here are fully documented in the TEI Guidelines themselves, which should be consulted for authoritative reference information on these, and on the many others which are not described here. Some basic knowledge of XML is assumed.

2 A Short Example

We begin with a short example, intended to show what happens when a passage of prose is typed into a computer by someone with little sense of the purpose of mark-up, or the potential of electronic texts. In an ideal world, such output might be generated by a very accurate optical scanner. It attempts to be faithful to the appearance of the printed text, by retaining the original line breaks, by introducing blanks to represent the layout of the original headings and page breaks, and so forth. Where characters not available on the keyboard are needed (such as the accented letter a in faàl or the long dash), it attempts to mimic their appearance.

            CHAPTER 38
            
            READER, I married him. A quiet wedding we had: he and I, the par-
            son and clerk, were alone present. When we got back from church, I
            went into the kitchen of the manor-house, where Mary was cooking
            the dinner, and John cleaning the knives, and I said --
            'Mary, I have been married to Mr Rochester this morning.' The
            housekeeper and her husband were of that decent, phlegmatic
            order of people, to whom one may at any time safely communicate a
            remarkable piece of news without incurring the danger of having
            one's ears pierced by some shrill ejaculation and subsequently stunned
            by a torrent of wordy wonderment. Mary did look up, and she did
            stare at me; the ladle with which she was basting a pair of chickens
            roasting at the fire, did for some three minutes hang suspended in air,
            and for the same space of time John's knives also had rest from the
            polishing process; but Mary, bending again over the roast, said only --
            'Have you, miss? Well, for sure!'
            A short time after she pursued, 'I seed you go out with the master,
            but I didn't know you were gone to church to be wed'; and she
            basted away. John, when I turned to him, was grinning from ear to
            ear.
            'I telled Mary how it would be,' he said: 'I knew what Mr Ed-
            ward' (John was an old servant, and had known his master when he
            was the cadet of the house, therefore he often gave him his Christian
            name) -- 'I knew what Mr Edward would do; and I was certain he
            would not wait long either: and he's done right, for aught I know. I
            wish you joy, miss!' and he politely pulled his forelock.
            'Thank you, John. Mr Rochester told me to give you and Mary
            this.'
            I put into his hand a five-pound note.  Without waiting to hear
            more, I left the kitchen. In passing the door of that sanctum some time
            after, I caught the words --
            'She'll happen do better for him nor ony o' t' grand ladies.' And
            again, 'If she ben't one o' th' handsomest, she's noan faa\l, and varry
            good-natured; and i' his een she's fair beautiful, onybody may see
            that.'
            I wrote to Moor House and to Cambridge immediately, to say what
            I had done: fully explaining also why I had thus acted. Diana and
            
            474
            
            JANE EYRE                      475
            
            Mary approved the step unreservedly. Diana announced that she
            would just give me time to get over the honeymoon, and then she
            would come and see me.
            'She had better not wait till then, Jane,' said Mr Rochester, when I
            read her letter to him; 'if she does, she will be too late, for our honey-
            moon will shine our life long: its beams will only fade over your
            grave or mine.'
            How St John received the news I don't know: he never answered
            the letter in which I communicated it: yet six months after he wrote
            to me, without, however, mentioning Mr Rochester's name or allud-
            ing to my marriage. His letter was then calm, and though very serious,
            kind. He has maintained a regular, though not very frequent correspond-
            ence ever since: he hopes I am happy, and trusts I am not of those who
            live without God in the world, and only mind earthly things.
            
          

This transcription suffers from a number of shortcomings:

We now present the same passage, as it might be encoded using the TEI Guidelines. As we shall see, there are many ways in which this encoding could be extended, but as a minimum, the TEI approach allows us to represent the following distinctions:
  • Paragraph and chapter divisions are now marked explicitly.
  • Apostrophes are distinguished from quotation marks; direct speech is explicitly marked.
  • The accented letter and the long dash are correctly represented.
  • Page divisions have been marked with an empty pb element alone.
  • The lineation of the original has not been retained and words broken by typographic accident at the end of a line have been re-assembled without comment.
  • For convenience of proof reading, a new line has been introduced at the start of each paragraph, but the indentation is removed.
<pb n="474"/>
<div n="38type="chapter">
 <p>Reader, I married him. A quiet wedding we had: he and I, the parson and clerk, were alone
   present. When we got back from church, I went into the kitchen of the manor-house, where
   Mary was cooking the dinner, and John cleaning the knives, and I said —</p>
 <p>
  <q>Mary, I have been married to Mr Rochester this morning.</q> The housekeeper and her
   husband were of that decent, phlegmatic order of people, to whom one may at any time safely
   communicate a remarkable piece of news without incurring the danger of having one's ears
   pierced by some shrill ejaculation and subsequently stunned by a torrent of wordy
   wonderment. Mary did look up, and she did stare at me; the ladle with which she was basting
   a pair of chickens roasting at the fire, did for some three minutes hang suspended in air,
   and for the same space of time John's knives also had rest from the polishing process; but
   Mary, bending again over the roast, said only —</p>
 <p>
  <q>Have you, miss? Well, for sure!</q>
 </p>
 <p>A short time after she pursued, <q>I seed you go out with the master, but I didn't know
     you were gone to church to be wed</q>; and she basted away. John, when I turned to him, was
   grinning from ear to ear. <q>I telled Mary how it would be,</q> he said: <q>I knew what Mr
     Edward</q> (John was an old servant, and had known his master when he was the cadet of the
   house, therefore he often gave him his Christian name) — <q>I knew what Mr Edward would do;
     and I was certain he would not wait long either: and he's done right, for aught I know. I
     wish you joy, miss!</q> and he politely pulled his forelock.</p>
 <p>
  <q>Thank you, John. Mr Rochester told me to give you and Mary this.</q>
 </p>
 <p>I put into his hand a five-pound note. Without waiting to hear more, I left the kitchen.
   In passing the door of that sanctum some time after, I caught the words —</p>
 <p>
  <q>She'll happen do better for him nor ony o' t' grand ladies.</q> And again, <q>If she
     ben't one o' th' handsomest, she's noan faàl, and varry good-natured; and i' his een she's
     fair beautiful, onybody may see that.</q>
 </p>
 <p>I wrote to Moor House and to Cambridge immediately, to say what I had done: fully
   explaining also why I had thus acted. Diana and <pb n="475"/> Mary approved the step
   unreservedly. Diana announced that she would just give me time to get over the honeymoon,
   and then she would come and see me.</p>
 <p>
  <q>She had better not wait till then, Jane,</q> said Mr Rochester, when I read her letter
   to him; <q>if she does, she will be too late, for our honeymoon will shine our life long:
     its beams will only fade over your grave or mine.</q>
 </p>
 <p>How St John received the news I don't know: he never answered the letter in which I
   communicated it: yet six months after he wrote to me, without, however, mentioning Mr
   Rochester's name or alluding to my marriage. His letter was then calm, and though very
   serious, kind. He has maintained a regular, though not very frequent correspondence ever
   since: he hopes I am happy, and trusts I am not of those who live without God in the world,
   and only mind earthly things.</p>
</div>

This particular encoding represents a set of choices or priorities. As a trivial example, note that in the second example, end-of-line hyphenation has been silently removed. Conceivably Brontë (or her printer) intended the word ‘honeymoon’ to appear as ‘honey-moon’ on its second appearance, though this seems unlikely: our decision to focus on Brontë's text, rather than on the printing of it in this particular edition, makes it impossible to be certain. This is an instance of the fundamental selectivity of any encoding. An encoding makes explicit only those textual features of importance to the encoder. It is not difficult to think of ways in which the encoding of even this short passage might readily be extended. For example:

TEI-recommended ways of carrying out most of these are described in the remainder of this document. The TEI scheme as a whole also provides for an enormous range of other possibilities, of which we cite only a few:

For recommendations on these and many other possibilities, the full Guidelines should be consulted.

3 The Structure of a TEI Text

All TEI-conformant texts contain (a) a TEI header (marked up as a teiHeader element) and (b) the transcription of the text proper (marked up as a text element). These two elements are combined together to form a single TEI element, which must be declared within the TEI namespace1.

The TEI header provides information analogous to that provided by the title page of a printed text. It has up to four parts: a bibliographic description of the machine-readable text, a description of the way it has been encoded, a non-bibliographic description of the text (a text profile), and a revision history. The header is described in more detail in section 19 The Electronic Title Page.

A TEI text may be unitary (a single work) or composite (a collection of single works, such as an anthology). In either case, the text may have an optional front or back. In between is the body of the text, which, in the case of a composite text, may consist of groups, each containing more groups or texts.

A unitary text will be encoded using an overall structure like this:
<TEI xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
<!-- [ TEI Header information ] -->
 </teiHeader>
 <text>
  <front>
<!-- [ front matter ... ] -->
  </front>
  <body>
<!-- [ body of text ... ] -->
  </body>
  <back>
<!-- [ back matter ... ] -->
  </back>
 </text>
</TEI>
A composite text also has an optional front and back. In between occur one or more groups of texts, each with its own optional front and back matter. A composite text will thus be encoded using an overall structure like this:
<TEI xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
<!--[ header information for the composite ]-->
 </teiHeader>
 <text>
  <front>
<!--[ front matter for the composite ]-->
  </front>
  <group>
   <text>
    <front>
<!--[ front matter of first text ]-->
    </front>
    <body>
<!--[ body of first text ]-->
    </body>
    <back>
<!--[ back matter of first text ]-->
    </back>
   </text>
   <text>
    <front>
<!--[ front matter of second text]-->
    </front>
    <body>
<!--[ body of second text ]-->
    </body>
    <back>
<!--[ back matter of second text ]-->
    </back>
   </text>
<!--[ more texts or groups of texts here ]-->
  </group>
  <back>
<!--[ back matter for the composite ]-->
  </back>
 </text>
</TEI>
It is also possible to define a composite of complete TEI texts, each with its own header. Such a collection is known as a TEI corpus, and may itself have a header:
<teiCorpus xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
<!--[header information for the corpus]-->
 </teiHeader>
 <TEI>
  <teiHeader>
<!--[header information for first text]-->
  </teiHeader>
  <text>
<!--[first text in corpus]-->
  </text>
 </TEI>
 <TEI>
  <teiHeader>
<!--[header information for second text]-->
  </teiHeader>
  <text>
<!--[second text in corpus]-->
  </text>
 </TEI>
</teiCorpus>
It is also possible to create a composite of corpora -- that is, one <teiCorpus> element may contain many nested <teiCorpus> elements rather than many nested TEI elements, to any depth considered necessary.

In the remainder of this document, we discuss chiefly simple text structures. The discussion in each case consists of a short list of relevant TEI elements with a brief definition of each, followed by definitions for any attributes specific to that element, and a reference to any classes of which the element is a member. These references are linked to full specifications for each object, as given in the TEI Guidelines. In most cases, short examples are also given.

For example, here are the elements discussed so far:

4 Encoding the Body

As indicated above, a simple TEI document at the textual level consists of the following elements:

Elements specific to front and back matter are described below in section 18 Front and Back Matter. In this section we discuss the elements making up the body of a text.

4.1 Text Division Elements

The body of a prose text may be just a series of paragraphs, or these paragraphs may be grouped together into chapters, sections, subsections, etc. Each paragraph is tagged using the p tag. The div element is used to represent any such grouping of paragraphs.

  • p (paragraph) marks paragraphs in prose.
  • div (text division) contains a subdivision of the front, body, or back of a text.

The type attribute on the div element may be used to supply a conventional name for this category of text division, or otherwise distinguish them. Typical values might be ‘book’, ‘chapter’, ‘section’, ‘part’, ‘poem’, ‘song’, etc. For a given project, it will usually be advisable to define and adhere to a specific list of such values.

A div element may itself contain further, nested, divs, thus mimicking the traditional structure of a book, which can be decomposed hierarchically into units such as parts, containing chapters, containing sections, and so on. TEI texts in general conform to this simple hierarchic model.

The xml:id attribute may be used to supply a unique identifier for the division, which may be used for cross references or other links to it, such as a commentary, as further discussed in section 8 Cross References and Links. It is often useful to provide an xml:id attribute for every major structural unit in a text, and to derive its values in some systematic way, for example by appending a section number to a short code for the title of the work in question, as in the examples below. It is particularly useful to supply such identifiers if the resource concerned is to be made available over the web, since they make it much easier for other web-based applications to link directly to the corresponding parts of your text.

The n attribute may be used to supply (additionally or alternatively) a short mnemonic name or number for a division, or any other element. If a conventional form of reference or abbreviation for the parts of a work already exists (such as the book/chapter/verse pattern of Biblical citations), the n attribute is the place to record it; unlike the identifier supplied by xml:id, it does not need to be unique.

The xml:lang attribute may be used to specify the language of the division. Languages are identified by an internationally defined code, as further discussed in section 6.3 Foreign Words or Expressions below.

The rend attribute may be used to supply information about the rendition (appearance) of a division, or any other element, as further discussed in section 6 Marking Highlighted Phrases below. As with the type attribute, a project will often find it useful to predefine the possible values for this attribute, but TEI Lite does not constrain it in anyway.

These four attributes, xml:id, n, xml:lang, and rend are so widely useful that they are allowed on any element in any TEI schema: they are global attributes. Other global attributes defined in the TEI Lite scheme are discussed in section 8.3 Special kinds of Linking.

The value of every xml:id attribute should be unique within a document. One simple way of ensuring that this is so is to make it reflect the hierarchic structure of the document. For example, Smith's Wealth of Nations as first published consists of five books, each of which is divided into chapters, while some chapters are further subdivided into parts. We might define xml:id values for this structure as follows:
<body>
 <div n="Itype="bookxml:id="WN1">
  <div n="I.1type="chapterxml:id="WN101">
<!-- ... -->
  </div>
  <div n="I.2type="chapterxml:id="WN102">
<!-- ... -->
  </div>
<!-- ... -->
  <div n="I.10type="chapter"
   xml:id="WN110">

   <div n="I.10.1type="part"
    xml:id="WN1101">

<!-- ... -->
   </div>
   <div n="I.10.2type="part"
    xml:id="WN1102">

<!-- ... -->
   </div>
  </div>
<!-- ... -->
 </div>
 <div n="IItype="bookxml:id="WN2">
<!-- ... -->
 </div>
</body>
A different numbering scheme may be used for xml:id and n attributes: this is often useful where a canonical reference scheme is used which does not tally with the structure of the work. For example, in a novel divided into books each containing chapters, where the chapters are numbered sequentially through the whole work, rather than within each book, one might use a scheme such as the following:
<body>
 <div n="1type="volumexml:id="TS01">
  <div n="1type="chapterxml:id="TS011">
<!-- ... -->
  </div>
  <div n="2type="chapterxml:id="TS012">
<!-- ... -->
  </div>
 </div>
 <div n="2type="volumexml:id="TS02">
  <div n="3type="chapterxml:id="TS021">
<!-- ... -->
  </div>
  <div n="4type="chapterxml:id="TS022">
<!-- ... -->
  </div>
 </div>
</body>
Here the work has two volumes, each containing two chapters. The chapters are numbered conventionally 1 to 4, but the xml:id values specified allow them to be regarded additionally as if they were numbered 1.1, 1.2, 2.1, 2.2.

4.2 Headings and Closings

Every div may have a title or heading at its start, and (less commonly) a trailer such as ‘End of Chapter 1’ at its end. The following elements may be used to transcribe them:

  • head (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc.
  • UNKNOWN ELEMENT trailer

Some other elements which may be necessary at the beginning or ending of text divisions are discussed below in section 18.1.2 Prefatory Matter.

Whether or not headings and trailers are included in a transcription is a matter for the individual transcriber to decide. Where a heading is completely regular (for example ‘Chapter 1’) or may be automatically constructed from attribute values (e.g. <div type="chapter" n="1">), it may be omitted; where it contains otherwise unrecoverable text it should always be included. For example, the start of Hardy's Under the Greenwood Tree might be encoded as follows:
<div n="Wintertype="Partxml:id="UGT1">
 <div n="1type="Chapterxml:id="UGT11">
  <head>Mellstock-Lane</head>
  <p>To dwellers in a wood almost every species of tree ... </p>
 </div>
</div>

4.3 Prose, Verse and Drama

As in the Bronte example above, the paragraphs making up a textual division are tagged with the p tag. In poetic or dramatic texts different tags are needed, to represent verse lines and stanzas in the first case, or individual speeches and stage directions in the second. :

  • l (verse line) contains a single, possibly incomplete, line of verse.
  • lg (line group) contains one or more verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc.
  • UNKNOWN ELEMENT sp
  • UNKNOWN ELEMENT speaker
  • UNKNOWN ELEMENT stage
Here, for example, is the start of a poetic text in which verse lines and stanzas are tagged:
<lg n="I">
 <l>I Sing the progresse of a
   deathlesse soule,</l>
 <l>Whom Fate, with God made, but doth not controule,</l>
 <l>Plac'd in
   most shapes; all times before the law</l>
 <l>Yoak'd us, and when, and since, in this I
   sing.</l>
 <l>And the great world to his aged evening;</l>
 <l>From infant morne, through manly
   noone I draw.</l>
 <l>What the gold Chaldee, of silver Persian saw,</l>
 <l>Greeke brass, or
   Roman iron, is in this one;</l>
 <l>A worke t'out weare Seths pillars, bricke and
   stone,</l>
 <l>And (holy writs excepted) made to yeeld to none,</l>
</lg>

Note that the l element marks verse lines, not typographic lines: the original lineation of the first few lines above has not therefore been made explicit by this encoding, and may be lost. The lb element described in section 5 Page and Line Numbers might additionally be used to mark typographic lines if so desired.

Here is the end of a famous dramatic text, in which speeches and stage directions are marked:
<sp>
 <speaker>Vladimir</speaker>
 <p>Pull on your trousers.</p>
</sp>
<sp>
 <speaker>Estragon</speaker>
 <p>You want me to pull off my trousers?</p>
</sp>
<sp>
 <speaker>Vladimir</speaker>
 <p>Pull <emph>on</emph> your trousers.</p>
</sp>
<sp>
 <speaker>Vladimir</speaker>
 <p>
  <stage>(realizing his trousers are down)</stage>.
   True</p>
</sp>
<stage>He pulls up his trousers</stage>
<sp>
 <speaker>Vladimir</speaker>
 <p>Well? Shall we go?</p>
</sp>
<sp>
 <speaker>Estragon</speaker>
 <p>Yes, let's go.</p>
</sp>
<stage>They do not move.</stage>
Note that the <stage> (stage direction) element can appear either within a speech or between speeches. The <sp> ("speech") element contains, following an optional <speaker> element indicating who is speaking, either paragraphs (if the speech is in prose) or verse lines or stanzas as in the next example. In this case, it is quite common to find that verse lines are split between speakers. The easiest way of encoding this is to use the part attribute to indicate that the lines so fragmented are incomplete :
<div n="Itype="Act">
 <head>ACT I</head>
 <div n="1type="Scene">
  <head>SCENE I</head>
  <stage rend="italic"> Enter Barnardo and Francisco, two Sentinels, at several doors</stage>
  <sp>
   <speaker>Barn</speaker>
   <l part="Y">Who's there?</l>
  </sp>
  <sp>
   <speaker>Fran</speaker>
   <l>Nay, answer me. Stand and unfold yourself.</l>
  </sp>
  <sp>
   <speaker>Barn</speaker>
   <l part="I">Long live the King!</l>
  </sp>
  <sp>
   <speaker>Fran</speaker>
   <l part="M">Barnardo?</l>
  </sp>
  <sp>
   <speaker>Barn</speaker>
   <l part="F">He.</l>
  </sp>
  <sp>
   <speaker>Fran</speaker>
   <l>You come most carefully upon your hour.</l>
  </sp>
<!-- ... -->
 </div>
</div>
The same mechanism may be applied to stanzas which are divided between two speakers:
<div>
 <sp>
  <speaker>First voice</speaker>
  <lg part="Itype="stanza">
   <l>But why drives on that ship so fast</l>
   <l>Withouten wave or wind?</l>
  </lg>
 </sp>
 <sp>
  <speaker>Second Voice</speaker>
  <lg part="F">
   <l>The air is cut away before.</l>
   <l>And closes from behind.</l>
  </lg>
 </sp>
<!-- ... -->
</div>
The <sp> element can also be used for dialogue presented in a prose work as if it were drama, as in the next example, which also demonstrates the use of the who attribute to bear a code identifying the speaker of the piece of dialogue concerned:
<div>
 <sp who="#OPI">
  <speaker>The reverend Doctor Opimian</speaker>
  <p>I do not think I have named a single unpresentable fish.</p>
 </sp>
 <sp who="#GRM">
  <speaker>Mr Gryll</speaker>
  <p>Bream, Doctor: there is not much to be said for bream.</p>
 </sp>
 <sp who="#OPI">
  <speaker>The Reverend Doctor Opimian</speaker>
  <p>On the contrary, sir, I think there is much to be said for him. In the first
     place....</p>
  <p>Fish, Miss Gryll -- I could discourse to you on fish by the hour: but for the present I
     will forbear.</p>
 </sp>
</div>
Here the who attribute values (#OPI etc.) are links, pointing to a list of the characters in the novel, each of which has an identifier:
<list>
 <head>Characters in the novel</head>
 <item xml:id="OPI">
  <name>Dr Opimian</name> : named for the famous Roman fine wine</item>
 <item xml:id="GRM">
  <name>Mr Gryll</name> : named for the mythical Gryllus, one of Ulysses'
   sailors transformed by Circe into a pig, who argues that he was happier in that state than
   as a man</item>
</list>

5 Page and Line Numbers

Page and line breaks etc. may be marked with the following elements.

These elements mark a single point in the text, not a span of text. The global n attribute should be used to supply the number of the page or line beginning at the tag.

When working from a paginated original, it is often useful to record its pagination, if only to simplify later proof-reading. It is also useful for synchronizing an encoded text with a set of page images. Recording the line breaks may be useful for similar reasons.

If features such as pagination or lineation are marked for more than one edition, specify the edition in question using the ed attribute, and supply as many tags are necessary. For example, in the following passage we indicate where the page breaks occur in two different editions (ED1 and ED2)
<p>I wrote to Moor House and to Cambridge immediately, to say what I had done: fully
explaining also why I had thus acted. Diana and <pb ed="ED1n="475"/> Mary approved the step
unreservedly. Diana announced that she would <pb ed="ED2n="485"/>just give me time to get
over the honeymoon, and then she would come and see me.</p>

A special attribute break may be used to indicate whether or not this empty element is considered as a word-breaking, irrespective of any adjacent whitespace. For example, in the following encoded sample:

The pb and lb elements are special cases of the general class of milestone elements which mark reference points within a text. The generic milestone element can mark any kind of reference point: for example, a column break, the start of a new kind of section not otherwise tagged, or in general any significant change in the text not marked by an XML element. The names used for types of unit and for editions referred to by the ed and unit attributes may be chosen freely, but should be documented in the header <refsDecl> element (see 19.2.3 Reference and Classification Declarations). The milestone element may be used to replace the others, or the others may be used as a set; they should not be mixed arbitrarily.

6 Marking Highlighted Phrases

6.1 Changes of Typeface, etc.

Highlighted words or phrases are those made visibly different from the rest of the text, typically by a change of type font, handwriting style, ink colour etc., which is intended to draw the reader's attention to some associated change.

The global rend attribute can be attached to any element, and used wherever necessary to specify details of the highlighting used for it in the source. For example, a heading rendered in bold might be tagged <head rend="bold">, and one in italic <head rend="italic">.

The values to be used for the rend attribute are not specified by the TEI Guidelines, since they will depend entirely on the needs of the particular project. Some typical values might include italic, bold etc. for font variations; center, right etc. for alignment; large, small etc. for size; smallcaps, allcaps etc. for type variants and so on. Several such words may be used in combination as necessary, but no formal syntax is proposed. The full TEI Guidelines provide more rigorous mechanisms, using other W3C standards such as CSS, as an alternative to the use of rend.

It is not always possible or desirable to interpret the reasons for such changes of rendering in a text. In such cases, the element hi may be used to mark a sequence of highlighted text without making any claim as to its status.

  • hi (highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made.
In the following example, the use of a distinct typeface for the subheading and for the included name are recorded but not interpreted:
<p>
 <hi rend="gothic">And this Indenture further
   witnesseth</hi> that the said <hi rend="italic">Walter Shandy</hi>, merchant, in
consideration of the said intended marriage ...
</p>

Alternatively, where the cause for the highlighting can be identified with confidence, a number of other, more specific, elements are available.

  • emph (emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect.
  • foreign (foreign) identifies a word or phrase as belonging to some language other than that of the surrounding text.
  • UNKNOWN ELEMENT gloss
  • UNKNOWN ELEMENT label
  • UNKNOWN ELEMENT mentioned
  • term (term) contains a single-word, multi-word, or symbolic designation which is regarded as a technical term.
  • title (title) contains a title for any kind of work.

Some features (notably quotations and glosses) may be found in a text either marked by highlighting, or with quotation marks. In either case, the elements q and <gloss> (as discussed in the following section) should be used. If the highlighting is to be recorded, use the global rend attribute.

As an example of the elements defined here, consider the following sentence:
On the one hand the Nibelungenlied is associated with the new rise of romance of twelfth-century France, the romans d'antiquité, the romances of Chrétien de Troyes, and the German adaptations of these works by Heinrich van Veldeke, Hartmann von Aue, and Wolfram von Eschenbach.
Interpreting the role of the highlighting, the sentence might look like this:
<p>On the one hand the <title>Nibelungenlied</title>
is associated with the new rise of romance of twelfth-century France, the <foreign>romans
   d'antiquité</foreign>, the romances of Chrétien de Troyes, ...</p>
Describing only the appearance of the original, it might look like this:
<p>On the one hand the <hi rend="italic">Nibelungenlied</hi> is associated with the new rise of romance of twelfth-century France,
the <hi rend="italic">romans d'antiquité</hi>, the romances of Chrétien de Troyes,
...</p>

6.2 Quotations and Related Features

Like changes of typeface, quotation marks are conventionally used to denote several different features within a text, of which the most frequent is quotation. When possible, we recommend that the underlying feature be tagged, rather than the simple fact that quotation marks appear in the text, using the following elements:

  • q (quoted) contains material which is distinguished from the surrounding text using quotation marks or a similar method, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used.
  • UNKNOWN ELEMENT mentioned
  • soCalled (so called) contains a word or phrase for which the author or narrator indicates a disclaiming of responsibility, for example by the use of scare quotes or italics.
  • UNKNOWN ELEMENT gloss
Here is a simple example of a quotation:
<p>Few dictionary makers are likely to forget Dr. Johnson's description of the
lexicographer as <q>a harmless drudge.</q>
</p>

To record how a quotation was printed (for example, in-line or set off as a display or block quotation), the rend attribute should be used. This may also be used to indicate the kind of quotation marks used.

Direct speech interrupted by a narrator can be represented simply by ending the quotation and beginning it again after the interruption, as in the following example:
<p>
 <q>Who-e debel you?</q> — he at last said —
<q>you no speak-e, damme, I kill-e.</q> And so saying, the lighted tomahawk began
flourishing about me in the dark.
</p>
If it is important to convey the idea that the two q elements together make up a single speech, the linking attributes next and prev may be used, as described in section 8.3 Special kinds of Linking.
Quotations may be accompanied by a reference to the source or speaker, using the who attribute, whether or not this is explicit in the text, as in the following example:
<q who="#Wilson">Spaulding, he came
down into the office just this day eight weeks with this very paper in his hand, and he
says:—<q who="#Spaulding">I wish to the Lord, Mr. Wilson, that I was a red-headed
   man.</q>
</q>
This example also demonstrates how quotations may be embedded within other quotations: one speaker (Wilson) quotes another speaker (Spaulding).

The creator of the electronic text must decide whether quotation marks are replaced by the tags or whether the tags are added and the quotation marks kept. If the quotation marks are removed from the text, the rend attribute may be used to record the way in which they were rendered in the copy text.

The full TEI Guidelines provide additional elements to distinguish direct speech, quotation, and other typical uses of quotation mark although it is not always possible and may not be considered desirable to interpret the function of quotation marks in a text. For simplicity, only q (which may be used for any such case) has been included in TEI Lite.

6.3 Foreign Words or Expressions

Words or phrases which are not in the main language of the texts may be tagged as such in one of two ways. If the word or phrase is already tagged for some reason, the element indicated should bear a value for the global xml:lang attribute indicating the language used. Where there is no applicable element, the element foreign may be used, again using the xml:lang attribute. For example:
<p>John has real <foreign xml:lang="fr">savoir-faire</foreign>.</p>
<p>Have you read <title xml:lang="de">Die
   Dreigroschenoper</title>?</p>
<p>
 <mentioned xml:lang="fr">Savoir-faire</mentioned> is French
for know-how.
</p>
<p>The court issued a writ of <term xml:lang="la">mandamus</term>.</p>

As these examples show, the foreign element should not be used to tag foreign words if some other more specific element such as title, <mentioned>, or term applies. The global xml:lang attribute may be attached to any element to show that it uses some other language than that of the surrounding text.

The codes used to identify languages, supplied on the xml:lang attribute, must be constructed in a particular way, and must conform to common Internet standards2, as further explained in the relevant section of the TEI Guidelines. Some simple example codes for a few languages are given here:

zhChinesegrcAncient Greek
enEnglishelGreek
enmMiddle EnglishjaJapanese
frFrenchlaLatin
deGermansaSanskrit

7 Notes

All notes, whether printed as footnotes, endnotes, marginalia, or elsewhere, should be marked using the same element:

Where possible, the body of a note should be inserted in the text at the point at which its identifier or mark first appears. This may not be possible for example with marginalia, which may not be anchored to an exact location. For simplicity, it may be adequate to position marginal notes before the relevant paragraph or other element. Notes may also be placed in a separate division of the text (as end-notes are, in printed books) and linked to the relevant portion of the text using their target attribute.

The n attribute may be used to supply the number or identifier of a note if this is required. The resp attribute should be used consistently to distinguish between authorial and editorial notes, if the work has both kinds.

Examples:
<p>Collections are ensembles of
distinct entities or objects of any sort. <note n="1place="foot"> We explain below why we
   use the uncommon term <mentioned>collection</mentioned> instead of the expected
 <mentioned>set</mentioned>. Our usage corresponds to the <mentioned>aggregate</mentioned>
   of many mathematical writings and to the sense of <mentioned>class</mentioned> found in
   older logical writings. </note> The elements ...</p>
<lg xml:id="RAM609">
 <note place="margin">The
   curse is finally expiated</note>
 <l>And now this spell was snapt: once more</l>
 <l>I viewed
   the ocean green,</l>
 <l>And looked far forth, yet little saw</l>
 <l>Of what had else been seen
   —</l>
</lg>

8 Cross References and Links

Explicit cross references or links from one point in a text to another in the same or another document may be encoded using the elements described in this section. Implicit links (such as the association between two parallel texts, or that between a text and its interpretation) may be encoded using the linking attributes discussed in section 8.3 Special kinds of Linking.

8.1 Simple Cross References

A cross reference from one point within a single document to another can be encoded using either of the following elements:

  • ref (reference) defines a reference to another location, possibly modified by additional text or comment.
  • ptr (pointer) defines a pointer to another location.

The difference between these two elements is that ptr is an empty element, simply marking a point from which a link is to be made, whereas ref may contain some text as well, typically identifying the target of the cross reference. The ptr element would be used for a cross reference which is to be indicated by some non-verbal means such as a symbol or icon, or in an electronic text by a button. It is also useful in document production systems, where the formatter can generate the correct verbal form of the cross reference.

The following two forms, for example, are logically equivalent :
See especially <ref target="#SEC12">section 12 on
page 34</ref>.
See especially <ptr target="#SEC12"/>.
The value of the target attribute on either element may be the identifier of some other element within the current document. The passage or phrase being pointed at must bear an identifier, and must therefore be tagged as an element of some kind. In the following example, the cross reference is to a div element:
... see especially <ptr target="#SEC12"/>. ...
<div xml:id="SEC12">
 <head>Concerning Identifiers</head>
<!-- ... -->
</div>
Because the xml:id attribute is global, any element in a TEI document may be pointed to in this way. In the following example, a paragraph has been given an identifier so that it may be pointed at:
... this is
discussed in <ref target="#pspec">the paragraph on links</ref> ...
<p xml:id="pspec">Links
may be made to any kind of element ...</p>

Sometimes the target of a cross reference does not correspond with any particular feature of a text, and so may not be tagged as an element of some kind. If the desired target is simply a point in the current document, the easiest way to mark it is by introducing an anchor element at the appropriate spot. If the target is some sequence of words not otherwise tagged, the seg element may be introduced to mark them. These two elements are described as follows:

  • anchor (anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element.
  • seg (arbitrary segment) represents any segmentation of text below the ‘chunk’ level.
In the following (imaginary) example, ref elements have been used to represent points in this text which are to be linked in some way to other parts of it; in the first case to a point, and in the second, to a sequence of words:
Returning to <ref target="#ABCD">the point where I
dozed off</ref>, I noticed that <ref target="#EFGH">three words</ref> had been circled in
red by a previous reader
This encoding requires that elements with the specified identifiers (ABCD and EFGH in this example) are to be found somewhere else in the current document. Assuming that no element already exists to carry these identifiers, the anchor and seg elements may be used:
....
<anchor type="bookmarkxml:id="ABCD"/> .... ....<seg type="targetxml:id="EFGH"> ...
</seg> ...

The type attribute should be used (as above) to distinguish amongst different purposes for which these general purpose elements might be used in a text. Some other uses are discussed in section 8.3 Special kinds of Linking below.

8.2 Pointing to other documents

So far, we have shown how the elements ptr and ref may be used for cross-references or links whose targets occur within the same document as their source. However, the same elements may also be used to refer to elements in any other XML document or resource, such as a document on the web, or a database component. This is possible because the value of the target attribute may be any valid universal resource indicator (URI)Note: A full definition of this term, defined by the W3C (the consortium which manages the development and maintenance of the World Wide Web), is beyond the scope of this tutorial: however, the most frequently encountered version of a URI is the familiar ‘URL’ used to indicate a web page, such as http://www.tei-c.org/index.xml.

A URI may reference a web page or just a part of one, for example http://www.tei-c.org/index.xml#SEC2. The sharp sign indicates that what follows it is the identifier of an element to be located within the XML document identified by what precedes it: this example will therefore locate an element which has an xml:id attribute value of SEC2 within the document retrieved from http://www.tei-c.org/index.xml. In the examples we have discussed so far, the part to the left of the sharp sign has been omitted: this is understood to mean that the referenced element is to be located within the current document.

Parts of an XML document can be specified by means of other more sophisticated mechanisms using a special language called Xpath, also defined by the W3C. This is particularly useful where the elements to be linked to do not bear identifiers and must therefore be located by some other means.

8.3 Special kinds of Linking

The following special purpose linking attributes are defined for every element in the TEI Lite scheme:

ana
links an element with its interpretation.
corresp
links an element with one or more other corresponding elements.
next
links an element to the next element in an aggregate.
prev
links an element to the previous element in an aggregate.
The ana (analysis) attribute is intended for use where a set of abstract analyses or interpretations have been defined somewhere within a document, as further discussed in section 15 Interpretation and Analysis. For example, a linguistic analysis of the sentence ‘John loves Nancy’ might be encoded as follows:
<seg ana="SVOtype="sentence">
 <seg ana="#NP1type="lex">John</seg>
 <seg ana="#VVItype="lex">loves</seg>
 <seg ana="#NP1type="lex">Nancy</seg>
</seg>
This encoding implies the existence elsewhere in the document of elements with identifiers SVO, NP1, and VV1 where the significance of these particular codes is explained. Note the use of the seg element to mark particular components of the analysis, distinguished by the type attribute.
The corresp (corresponding) attribute provides a simple way of representing some form of correspondence between two elements in a text. For example, in a multilingual text, it may be used to link translation equivalents, as in the following example
<seg corresp="#EN1xml:id="FR1"
 xml:lang="fr">
Jean
aime Nancy</seg>
<seg corresp="#FR1xml:id="EN1"
 xml:lang="en">
John loves
Nancy</seg>
The same mechanism may be used for a variety of purposes. In the following example, it has been used to represent the correspondences between ‘the show’ and ‘Shirley’, and between ‘NBC’ and ‘the network’:
<p>
 <title xml:id="shirley">Shirley</title>, which
made its Friday night debut only a month ago, was not listed on <name xml:id="nbc">NBC</name>'s new schedule, although <seg corresp="#nbcxml:id="network">the network</seg>
says <seg corresp="#shirleyxml:id="show">the show</seg> still is being
considered.
</p>
The next and prev attributes provide a simple way of linking together the components of a discontinuous element, as in the following example:
<q next="#Q1bxml:id="Q1a">Who-e debel you?</q>
he at last said — <q prev="#Q1axml:id="Q1b">you no speak-e, damme, I kill-e.</q> And so
saying, the lighted tomahawk began flourishing about me in the dark.

9 Editorial Interventions

The process of encoding an electronic text has much in common with the process of editing a manuscript or other text for printed publication. In either case a conscientious editor may wish to record both the original state of the source and any editorial correction or other change made in it. The elements discussed in this and the next section provide some facilities for meeting these needs.

9.1 Correction and Normalization

The following elements may be used to mark correction, that is editorial changes introduced where the editor believes the original to be erroneous:

  • corr (correction) contains the correct form of a passage apparently erroneous in the copy text.
  • sic (Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate.

The following elements may be used to mark normalization, that is editorial changes introduced for the sake of consistency or modernization of a text:

  • orig (original form) contains a reading which is marked as following the original, rather than being normalized or corrected.
  • reg (regularization) contains a reading which has been regularized or normalized in some sense.

As an example, consider this extract from the quarto printing of Shakespeare's Henry V.

... for his nose was as sharp as a pen and a table of green feelds
A modern editor might wish to make a number of interventions here, specifically to modernize (or normalise) the Elizabethan spellings of a' and feelds for he and fields respectively. He or she might also want to emend table to babbl'd, following an editorial tradition that goes back to the 18th century Shakespearian scholar Lewis Theobald. The following encoding would then be appropriate:
... for his nose was as sharp as
a pen and <reg>he</reg>
<corr resp="#Theobald">babbl'd</corr> of green
<reg>fields</reg>
A more conservative or source-oriented editor, however, might want to retain the original, but at the same time signal that some of the readings it contains are in some sense anomalous:
... for his nose was as sharp as a pen and
<orig>a</orig>
<sic>table</sic> of green
<orig>feelds</orig>
Finally, a modern digital editor may decide to combine both possibilities in a single composite text, using the choice element.
  • choice (choice) groups a number of alternative encodings for the same point in a text.
This allows an editor to mark where alternative readings are possible:
... for his nose was as sharp as a pen and
<choice>
 <orig>a</orig>
 <reg>he</reg>
</choice>
<choice>
 <corr resp="#Theobald">babbl'd</corr>
 <sic>table</sic>
</choice> of green

<choice>
 <orig>feelds</orig>
 <reg>fields</reg>
</choice>

9.2 Omissions, Deletions, and Additions

In addition to correcting or normalizing words and phrases, editors and transcribers may also supply missing material, omit material, or transcribe material deleted or crossed out in the source. In addition, some material may be particularly hard to transcribe because it is hard to make out on the page. The following elements may be used to record such phenomena:

  • add (addition) contains letters, words, or phrases inserted in the source text by an author, scribe, or a previous annotator or corrector.
  • gap (gap) indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible.
  • del (deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, or a previous annotator or corrector.
  • unclear (unclear) contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source.
These elements may be used to record changes made by an editor, by the transcriber, or (in manuscript material) by the author or scribe. For example, if the source for an electronic text read ‘The following elements are provided for for simple editorial interventions.’ then it might be felt desirable to correct the obvious error, but at the same time to record the deletion of the superfluous second for, thus:
The following elements are provided for <del resp="#LB">for</del> simple editorial interventions.
The attribute value #LB on the resp attribute is used to point to a fuller definition (typically in a <respStmt> element) for the agency responsible for correcting the duplication of for.
If the source read ‘The following elements provided for simple editorial interventions.’ (i.e. if the verb had been inadvertently dropped) then the corrected text might read:
The following elements <add resp="#LB">are</add> provided for simple editorial interventions.
These elements are also used to record authorial changes in manuscripts. A manuscript in which the author has first written ‘How it galls me, what a galling shadow’, then crossed out the word galls and inserted dogs might be encoded thus:
How it <del hand="#DHLtype="overstrike">galls</del>
<add hand="#DHLplace="supralinear">dogs</add> me, what a galling shadow
Again, the code #DHL points to another location where more information about the hand concerned is to be found3.
Similarly, the unclear and gap elements may be used together to indicate the omission of illegible material; the following example also shows the use of add for a conjectural emendation:
One hundred
&amp; twenty good regulars joined to me <unclear>
 <gap reason="indecipherable"/>
</unclear>
&amp; instantly, would aid me signally <add hand="#ed">in?</add> an enterprise against
Wilmington.
The del element marks material which has been transcribed as part of the electronic text despite being marked as deleted, while gap marks the location of material which is omitted from the electronic text, whether it is legible or not. A language corpus, for example, might omit long quotations in foreign languages:
<p> ... An example of a list appearing in a fief
ledger of <name type="place">Koldinghus</name>
 <date>1611/12</date> is given below. It shows cash income from a sale of
honey.</p>
<gap>
 <desc>quotation from ledger (in Danish)</desc>
</gap>
<p>A description of the
overall structure of the account is once again ... </p>
Other corpora (particular those constructed before the widespread use of scanners) systematically omit figures and mathematics:
<p>At the bottom of your screen below the mode line is the <term>minibuffer</term>. This is
the area where Emacs echoes the commands you enter and where you specify filenames for Emacs
to find, values for search and replace, and so on. <gap reason="graphic">
  <desc>diagram of
     Emacs screen</desc>
 </gap>
</p>

The full TEI scheme provides more precise ways of capturing different aspects of a transcription, distinguishing for example between text added or supplied by the encoder and text indicated as supplied or deleted in the source. TEI Lite does not provide different tags for these purposes.

9.3 Abbreviations and their Expansion

Like names, dates, and numbers, abbreviations may be transcribed as they stand or expanded; they may be left unmarked, or encoded using the following elements:

  • abbr (abbreviation) contains an abbreviation of any sort.
  • expan (expansion) contains the expansion of an abbreviation.
The abbr element is useful as a means of distinguishing semi-lexical items such as acronyms or jargon:
We can sum up the above
discussion as follows: the identity of a <abbr>CC</abbr> is defined by that calibration of
values which motivates the elements of its <abbr>GSP</abbr>;
Every manufacturer of <abbr>3GL</abbr> or
<abbr>4GL</abbr> languages is currently nailing on <abbr>OOP</abbr> extensions

The type attribute may be used to distinguish types of abbreviation by their function.

The expan element is used to mark an expansion supplied by an encoder. This element is particularly useful in the transcription of manuscript materials. For example, the character p with a bar through its descender as a conventional representation for the word per is commonly encountered in Medieval European manuscripts. An encoder may choose to expand this as follows:
<expan>per</expan>
The expansion corresponding with an abbreviated form may not always contain the same letters as the abbreviation. Where it does, however, common editorial practice is to italicize or otherwise signal which letters have been supplied. The expan element should not be used for this purpose since its function is to indicate an expanded form, not a part of one. For example, consider the common abbreviation wt (for with) found in medieval texts. In a modern edition, an editor might wish to represent this as ‘with’, italicising the letters not found in the source. One simple means of achieving that would be an encoding such as the follow
<expan>w<hi rend="it">i</hi>t<hi rend="it">h</hi>
</expan>
The full TEI also provides elements ex and am for use in this situation, but these are not included in the TEI Lite schema.
To record both an abbreviation and its expansion, the choice element mentioned above may be used to group the abbreviated form with its proposed expansion:
<choice>
 <abbr>wt</abbr>
 <expan>with</expan>
</choice>

10 Names, Dates, and Numbers

The TEI scheme defines elements for a large number of ‘data-like’ features which may appear almost anywhere within almost any kind of text. These features may be of particular interest in a range of disciplines; they all relate to objects external to the text itself, such as the names of persons and places, numbers and dates. They also pose particular problems for many natural language processing (NLP) applications because of the variety of ways in which they may be presented within a text. The elements described here, by making such features explicit, reduce the complexity of processing texts containing them.

10.1 Names and Referring Strings

A referring string is a phrase which refers to some person, place, object, etc. Two elements are provided to mark such strings:

  • rs (referencing string) contains a general purpose name or referring string.
  • UNKNOWN ELEMENT name
The type attribute is used to distinguish amongst (for example) names of persons, places and organizations, where this is possible:
<q>My dear <rs type="person">Mr. Bennet</rs>, </q>
said his lady to him one day,
<q>have you heard that <rs type="place">Netherfield Park</rs>
is let at last?</q>
It being one of the principles of the <rs type="organization">Circumlocution Office</rs> never, on any account whatsoever, to give a
straightforward answer, <rs type="person">Mr Barnacle</rs> said,
<q>Possibly.</q>
As the following example shows, the rs element may be used for any reference to a person, place, etc, not necessarily one in the form of a proper noun or noun phrase.
<q>My dear <rs type="person">Mr. Bennet</rs>,</q>
said <rs type="person">his lady</rs> to him one day...

The <name> element by contrast is provided for the special case of referencing strings which consist only of proper nouns; it may be used synonymously with the rs element, or nested within it if a referring string contains a mixture of common and proper nouns.

Simply tagging something as a name is rarely enough to enable automatic processing of personal names into the canonical forms usually required for reference purposes. The name as it appears in the text may be inconsistently spelled, partial, or vague. Moreover, name prefixes such as van or de la, may or may not be included as part of the reference form of a name, depending on the language and country of origin of the bearer.

The key attribute provides an alternative normalized identifier for the object being named, like a database record key. It may thus be useful as a means of gathering together all references to the same individual or location scattered throughout a document:
<q>My dear <rs key="BENM1type="person">Mr.
   Bennet</rs>, </q> said <rs key="BENM2type="person">his lady</rs> to him one day,
<q>have
you heard that <rs key="NETP1type="place">Netherfield Park</rs> is let at
last?</q>
This use should be distinguished from the case of the reg (regularization) element, which provides a means of marking the standard form of a referencing string as demonstrated below:
<name key="WADLM1type="person">
 <choice>
  <sic>Walter de la Mare</sic>
  <reg>de la Mare, Walter</reg>
 </choice>
</name> was
born at <name key="Ch1type="place">Charlton</name>, in <name key="KT1type="county">Kent</name>, in 1873.
The index element discussed in indexing may be more appropriate if the function of the regularization is to provide a consistent index:
<p>
 <name type="place">Montaillou</name> is not a
large parish. At the time of the events which led to <name type="person">Fournier</name>'s
<index>
  <term>Benedict XII, Pope of Avignon (Jacques Fournier)</term>
 </index>
investigations, the local population consisted of between 200 and 250
inhabitants.
</p>
Although adequate for many simple applications, these methods have two inconveniences: if the name occurs many times, then its regularised form must be repeated many times; and the burden of additional XML markup in the body of the text may be inconvenient to maintain and complex to process. For applications such as onomastics, relating to persons or places named rather than the name itself, or wherever a detailed analysis of the component parts of a name is needed, the full TEI Guidelines provide a range of other solutions.

10.2 Dates and Times

Tags for the more detailed encoding of times and dates include the following:

  • date (date) contains a date in any format.
  • UNKNOWN ELEMENT time
These elements have a number of attributes which can be used to provide normalised versions of their values.
  • att.datable provides attributes for normalization of elements that contain dates, times, or datable events.
    periodsupplies pointers to one or more definitions of named periods of time (typically <category>s, dates or <event>s) within which the datable item is understood to have occurred.
    when [att.datable.w3c]supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
The when attribute specifies a normalized form for the date or time, using one of the standard formats defined by ISO 8601. Partial dates or times (e.g. ‘1990’, ‘September 1990’, ‘twelvish’) can be expressed by omitting a part of the value supplied, as in the following examples:
<date when="1980-02-21">21
Feb 1980</date>
<date when="1990">1990</date>
<date when="1990-09">September 1990</date>
<date when="--09">September</date>
<date when="2001-09-11T12:48:00">Sept 11th, 12 minutes before 9
am</date>
Note in the last example the use of a normalized representation for the date string which includes a time: this example could thus equally well be tagged using the <time> element.
Given on the <date when="1977-06-12">Twelfth
Day of June in the Year of Our Lord One Thousand Nine Hundred and Seventy-seven of the
Republic the Two Hundredth and first and of the University the Eighty-Sixth.</date>
<l>specially when it's nine below zero</l>
<l>and <time when="15:00:00">three o'clock in the afternoon</time>
</l>

10.3 Numbers

Numbers can be written with either letters or digits (twenty-one, xxi, and 21) and their presentation is language-dependent (e.g. English 5th becomes Greek 5.; English 123,456.78 equals French 123.456,78). In natural-language processing or machine-translation applications, it is often helpful to distinguish them from other, more ‘lexical’ parts of the text. In other applications, the ability to record a number's value in standard notation is important. The num element provides this possibility:

  • num (number) contains a number, written in any form.
For example:
<num value="33">xxxiii</num>
<num type="cardinalvalue="21">twenty-one</num>
<num type="percentagevalue="10">ten percent</num>
<num type="percentagevalue="10">10%</num>
<num type="ordinalvalue="5">5th</num>

11 Lists

The element <list> is used to mark any kind of list. A list is a sequence of text items, which may be numbered, bulleted, or arranged as a glossary list. Each item may be preceded by an item label (in a glossary list, this label is the term being defined):

Individual list items are tagged with item. The first item may optionally be preceded by a head, which gives a heading for the list. The numbering of a list may be omitted, indicated using the n attribute on each item, or (rarely) tagged as content using the <label> element. The following are all thus equivalent:
<list>
 <head>A short list</head>
 <item>First item in list.</item>
 <item>Second item in list.</item>
 <item>Third item in list.</item>
</list>
<list>
 <head>A short list</head>
 <item n="1">First item in list.</item>
 <item n="2">Second item in list.</item>
 <item n="3">Third item in list.</item>
</list>
<list>
 <head>A short list</head>
 <label>1</label>
 <item>First item in list.</item>
 <label>2</label>
 <item>Second item in list.</item>
 <label>3</label>
 <item>Third item in list.</item>
</list>
The styles should not be mixed in the same list.
A simple two-column table may be treated as a glossary list, tagged <list type="gloss">. Here, each item comprises a term and a gloss, marked with <label> and item respectively. These correspond to the elements term and <gloss>, which can occur anywhere in prose text.
<list type="gloss">
 <head>Vocabulary</head>
 <label xml:lang="enm">nu</label>
 <item>now</item>
 <label xml:lang="enm">lhude</label>
 <item>loudly</item>
 <label xml:lang="enm">bloweth</label>
 <item>blooms</item>
 <label xml:lang="enm">med</label>
 <item>meadow</item>
 <label xml:lang="enm">wude</label>
 <item>wood</item>
 <label xml:lang="enm">awe</label>
 <item>ewe</item>
 <label xml:lang="enm">lhouth</label>
 <item>lows</item>
 <label xml:lang="enm">sterteth</label>
 <item>bounds, frisks</item>
 <label xml:lang="enm">verteth</label>
 <item xml:lang="la">pedit</item>
 <label xml:lang="enm">murie</label>
 <item>merrily</item>
 <label xml:lang="enm">swik</label>
 <item>cease</item>
 <label xml:lang="enm">naver</label>
 <item>never</item>
</list>

Where the internal structure of a list item is more complex, it may be preferable to regard the list as a table, for which special-purpose tagging is defined below (13 Tables).

Lists of whatever kind can, of course, nest within list items to any depth required. Here, for example, a glossary list contains two items, each of which is itself a simple list:
<list type="gloss">
 <label>EVIL</label>
 <item>
  <list type="simple">
   <item>I am cast upon a horrible desolate island, void of all hope of recovery.</item>
   <item>I am singled out and separated as it were from all the world to be miserable.</item>
   <item>I am divided from mankind — a solitaire; one banished from human society.</item>
  </list>
 </item>
 <label>GOOD</label>
 <item>
  <list type="simple">
   <item>But I am alive; and not drowned, as all my ship's company were.</item>
   <item>But I am singled out, too, from all the ship's crew, to be spared from
       death...</item>
   <item>But I am not starved, and perishing on a barren place, affording no
       sustenances....</item>
  </list>
 </item>
</list>
A list need not necessarily be displayed in list format. For example,
<p>On those remote pages it is written that animals
are divided into <list rend="run-on">
  <item n="a">those that belong to the Emperor,</item>
  <item n="b"> embalmed ones, </item>
  <item n="c"> those that are trained, </item>
  <item n="d"> suckling pigs, </item>
  <item n="e"> mermaids, </item>
  <item n="f"> fabulous ones, </item>
  <item n="g"> stray dogs, </item>
  <item n="h"> those that are included in this classification, </item>
  <item n="i"> those that tremble as if they were mad, </item>
  <item n="j"> innumerable ones, </item>
  <item n="k"> those drawn with a very fine camel's-hair brush, </item>
  <item n="l"> others, </item>
  <item n="m"> those that have just broken a flower vase, </item>
  <item n="n"> those that resemble flies from a distance.</item>
 </list>
</p>

Lists of bibliographic items should be tagged using the <listBibl> element, described in the next section.

12 Bibliographic Citations

It is often useful to distinguish bibliographic citations where they occur within texts being transcribed for research, if only so that they will be properly formatted when the text is printed out. The element bibl is provided for this purpose. Where the components of a bibliographic reference are to be distinguished, the following elements may be used as appropriate. It is generally useful to mark at least those parts (such as the titles of articles, books, and journals) which will need special formatting. The other elements are provided for cases where particular interest attaches to such details.

For example, the following editorial note might be transcribed as shown:
He was a member of Parliament for Warwickshire in 1445, and died March 14, 1470 (according to Kittredge, Harvard Studies 5. 88ff).
He was a member of Parliament for Warwickshire
in 1445, and died March 14, 1470 (according to <bibl>
 <author>Kittredge</author>,
<title>Harvard Studies</title>
 <biblScope>5. 88ff</biblScope>
</bibl>).

For lists of bibliographic citations, the <listBibl> element should be used; it may contain a series of bibl elements.

13 Tables

Tables represent a challenge for any text processing system, but simple tables, at least, appear in so many texts that even in the simplified TEI tag set presented here, markup for tables is necessary. The following elements are provided for this purpose:

For example, Defoe uses mortality tables like the following in the Journal of the Plague Year to show the rise and ebb of the epidemic:
<p>It was indeed coming on amain, for the burials
that same week were in the next adjoining parishes thus:— <table cols="4rows="5">
  <row role="data">
   <cell role="label">St. Leonard's, Shoreditch</cell>
   <cell>64</cell>
   <cell>84</cell>
   <cell>119</cell>
  </row>
  <row role="data">
   <cell role="label">St. Botolph's, Bishopsgate</cell>
   <cell>65</cell>
   <cell>105</cell>
   <cell>116</cell>
  </row>
  <row role="data">
   <cell role="label">St. Giles's, Cripplegate</cell>
   <cell>213</cell>
   <cell>421</cell>
   <cell>554</cell>
  </row>
 </table>
</p>
<p>This shutting up of houses was at first counted a very cruel and unchristian
method, and the poor people so confined made bitter lamentations. ... </p>

14 Figures and Graphics

Not all the components of a document are necessarily textual. The most straightforward text will often contain diagrams or illustrations, to say nothing of documents in which image and text are inextricably intertwined, or electronic resources in which the two are complementary.

The encoder may simply record the presence of a graphic within the text, possibly with a brief description of its content, and may also provide a link to a digitized version of the graphic, using the following elements:

Any textual information accompanying the graphic, such as a heading and/or caption, may be included within the <figure> element itself, in a head and one or more p elements, as also may any text appearing within the graphic itself. It is strongly recommended that a prose description of the image be supplied, as the content of a <figDesc> element, for the use of applications which are not able to render the graphic, and to render the document accessible to vision-impaired readers. (Such text is not normally considered part of the document proper.)

The simplest use for these elements is to mark the position of a graphic and provide a link to it, as in this example;
<pb n="412"/>
<figure>
 <graphic url="p412fig.png"/>
</figure>
<pb n="413"/>
This indicates that the graphic contained by the file p412fig.png appears between pages 412 and 413.
The graphic element can appear anywhere that textual content is permitted, within but not between paragraphs or headings. In the following example, the encoder has decided to treat a specific printer's ornament as a heading:
<head>
 <graphic url="http://www.iath.virginia.edu/gants/Ornaments/Heads/hp-ral02.gif"/>
</head>
More usually, a graphic will have at the least an identifying title, which may be encoded using the head element, or a number of figures may be grouped together in a particular structure. It is also often convenient to include a brief description of the image. The <figure> element provides a means of wrapping one or more such elements together as a kind of graphic ‘block’:
<figure>
 <graphic url="fessipic.png"/>
 <head>Mr Fezziwig's Ball</head>
 <figDesc>A Cruikshank
   engraving showing Mr Fezziwig leading a group of revellers.</figDesc>
</figure>
These cases should be carefully distinguished from the case where an encoded text is complemented by a collection of digital images, maintained as a distinct resource. The facs attribute may be used to associate any element in an encoded text with a digital facsimile of it. In the simple case where only page images are available, the facs attribute on the pb element may be used to associate each image with an appropriate point in the text:
<text>
 <pb facs="page1.pngn="1"/>
<!-- text contained on page 1 is encoded here -->
 <pb facs="page2.pngn="2"/>
<!-- text contained on page 2 is encoded here -->
</text>
This method is only appropriate in the simple case where each digital image file page1.png etc. corresponds with a single transcribed and encoded page. If more detailed alignment of image and transcription is required, for example because the image files actually represent double page spreads, more sophisticated mechanisms are provided in the full TEI Guidelines.

15 Interpretation and Analysis

It is often said that all markup is a form of interpretation or analysis. While it is certainly difficult, and may be impossible, to distinguish firmly between ‘objective’ and ‘subjective’ information in any universal way, it remains true that judgments concerning the latter are typically regarded as more likely to provide controversy than those concerning the former. Many scholars therefore prefer to record such interpretations only if it is possible to alert the reader that they are considered more open to dispute, than the rest of the markup. This section describes some of the elements provided by the TEI scheme to meet this need.

15.1 Orthographic Sentences

Interpretation typically ranges across the whole of a text, with no particular respect to other structural units. A useful preliminary to intensive interpretation is therefore to segment the text into discrete and identifiable units, each of which can then bear a label for use as a sort of ‘canonical reference’. To facilitate such uses, these units may not cross each other, nor nest within each other. They may conveniently be represented using the following element:

  • s (s-unit) contains a sentence-like division of a text.
As the name suggests, the s element is most commonly used (in linguistic applications at least) for marking orthographic sentences, that is, units defined by orthographic features such as punctuation. For example, the passage from Jane Eyre discussed earlier might be divided into s-units as follows:
<pb n="474"/>
<div n="38type="chapter">
 <p>
  <s n="001">Reader, I married him.</s>
  <s n="002">A quiet wedding we had:</s>
  <s n="003">he
     and I, the parson and clerk, were alone present.</s>
  <s n="004">When we got back from
     church, I went into the kitchen of the manor-house, where Mary was cooking the dinner, and
     John cleaning the knives, and I said —</s>
 </p>
 <p>
  <q>
   <s n="005">Mary, I have been married to Mr Rochester this morning.</s>
  </q> ... </p>
</div>
Note that s elements cannot nest: the beginning of one s element implies that the previous one has finished. When s-units are tagged as shown above, it is advisable to tag the entire text end-to-end, so that every word in the text being analysed will be contained by exactly one s element, whose identifier can then be used to specify a unique reference for it. If the identifiers used are unique within the document, then the xml:id attribute might be used in preference to the n used in the above example.

15.2 Words and punctuation

Tokenization, that is, the identification of lexical or non-lexical tokens within a text, is a very common requirement for all kinds of textual analysis, and not an entirely trivial one. The decision as to whether, for example, ‘can't’ in English or ‘du’ in French should be treated as one word or two is not simple. Consequently it is often useful to make explicit the preferred tokenization in a marked up text. The following elements are available for this purpose:

  • w (word) represents a grammatical (not necessarily orthographic) word.
  • pc (punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark.
For example, the output from a part of speech tagger might be recorded in TEI Lite as follows:
<s n="1">
 <w ana="#NP0">Marley</w>
 <w ana="#VBD">was</w>
 <w ana="#AJ0">dead</w>
 <pc>:</pc>
 <w ana="#TO0">to</w>
 <w ana="#VBB">begin</w>
 <w ana="#PRP">with</w>
 <pc>. </pc>
</s>
In this example, each word has been decorated with an automatically generated part of speech code, using the ana attribute discussed in section 8.3 Special kinds of Linking above. The w also provides for each word to be associated with a root form or lemma, either explicitly using the lemma attribute, or by reference, using the lemmaRef attribute, as in this example:
...<w ana="#VBDlemma="be"
 lemmaRef="http://www.myLexicon.com/be">
was</w> ...

15.3 General-Purpose Interpretation Elements

The w element is a specialisation of the seg element which has already been introduced for use in identifying otherwise unmarked targets of cross references and hypertext links (see section 8 Cross References and Links); it identifies some phrase-level portion of text to which the encoder may assign a user-specified type, as well as a unique identifier; it may thus be used to tag textual features for which there is no other provision in the published TEI Guidelines.

For example, the Guidelines provide no ‘apostrophe’ element to mark parts of a literary text in which the narrator addresses the reader (or hearer) directly. One approach might be to regard these as instances of the q element, distinguished from others by an appropriate value for the who attribute. A possibly simpler, and certainly more general, solution would however be to use the seg element as follows:
<div n="38type="chapter">
 <p>
  <seg type="apostrophe">Reader, I married him.</seg> A quiet wedding we had: ...</p>
</div>
The type attribute on the seg element can take any value, and so can be used to record phrase-level phenomena of any kind; it is good practice to record the values used and their significance in the header.

A seg element of one type (unlike the s element which it superficially resembles) can be nested within a seg element of the same or another type. This enables quite complex structures to be represented; some examples were given in section 8.3 Special kinds of Linking above. However, because it must respect the requirement that elements be properly nested and may not cut across each other, it cannot cope with the common requirement to associate an interpretation with arbitrary segments of a text which may completely ignore the document hierarchy. It also requires that the interpretation itself be represented by a single coded value in the type attribute.

Neither restriction applies to the <interp> element, which provides powerful features for the encoding of quite complex interpretive information in a relatively straightforward manner.

  • UNKNOWN ELEMENT interp
  • UNKNOWN ELEMENT interpGrp

These elements allow the encoder to specify both the class of an interpretation, and the particular instance of that class which the interpretation involves. Thus, whereas with seg one can say simply that something is an apostrophe, with <interp> one can say that it is an instance (apostrophe) of a larger class (rhetorical figures).

Moreover, <interp> is a ‘stand off’ element: it does not surround the segments of text which it describes, but instead is linked to the passage in question either by means of the ana attribute discussed in section 8.3 Special kinds of Linking above, or by means of its own inst attribute. This means that any kind of analysis can be represented, independently of the document hierarchy, as well as facilitating the grouping of analyses of a particular type together. A special purpose <interpGrp> element is provided for the latter purpose.

For example, suppose that you wish to mark such diverse aspects of a text as themes or subject matter, rhetorical figures, and the locations of individual scenes of the narrative. Different portions of our sample passage from Jane Eyre for example, might be associated with the rhetorical figures of apostrophe, hyperbole, and metaphor; with subject-matter references to churches, servants, cooking, postal service, and honeymoons; and with scenes located in the church, in the kitchen, and in an unspecified location (drawing room?).

These interpretations could be placed anywhere within the text element; it is however good practice to put them all in the same place (e.g. a separate section of the front or back matter), as in the following example:
<back>
 <div type="Interpretations">
  <p>
   <interp resp="#LB-MSM"
    type="figureOfSpeechxml:id="fig-apos-1">
apostrophe</interp>
   <interp resp="#LB-MSM"
    type="figureOfSpeechxml:id="fig-hyp-1">
hyperbole</interp>
   <interp resp="#LB-MSMtype="setting"
    xml:id="set-church-1">
church</interp>
   <interp resp="#LB-MSMtype="reference"
    xml:id="ref-church-1">
church</interp>
   <interp resp="#LB-MSMtype="reference"
    xml:id="ref-serv-1">
servants</interp>
  </p>
 </div>
</back>
The evident redundancy of this encoding can be considerably reduced by using the <interpGrp> element to group together all those <interp> elements which share common attribute values, as follows:
<back>
 <div type="Interpretations">
  <p>
   <interpGrp resp="#LB-MSM"
    type="figureOfSpeech">

    <interp xml:id="fig-apos">apostrophe</interp>
    <interp xml:id="fig-hyp">hyperbole</interp>
    <interp xml:id="fig-meta">metaphor</interp>
   </interpGrp>
   <interpGrp resp="#LB-MSM"
    type="scene-setting">

    <interp xml:id="set-church">church</interp>
    <interp xml:id="set-kitch">kitchen</interp>
    <interp xml:id="set-unspec">unspecified</interp>
   </interpGrp>
   <interpGrp resp="#LB-MSM"
    type="reference">

    <interp xml:id="ref-church">church</interp>
    <interp xml:id="ref-serv">servants</interp>
    <interp xml:id="ref-cook">cooking</interp>
   </interpGrp>
  </p>
 </div>
</back>
Once these interpretation elements have been defined, they can be linked with the parts of the text to which they apply in either or both of two ways. The ana attribute can be used on whichever element is appropriate:
<div n="38type="chapter">
 <p ana="#set-church #set-kitch"
  xml:id="P38.1">

  <s ana="#fig-aposxml:id="P38.1.1">Reader, I
     married him.</s>
 </p>
</div>
Note in this example that since the paragraph has two settings (in the church and in the kitchen), the identifiers of both have been supplied.
Alternatively, the <interp> elements can point to all the parts of the text to which they apply, using their inst attribute:
<interp inst="#P38.1.1resp="#LB-MSM"
 type="figureOfSpeechxml:id="fig-apos-2">
apostrophe</interp>
<interp inst="#P38.1resp="#LB-MSM"
 type="scene-settingxml:id="set-church-2">
church</interp>
<interp inst="#P38.1resp="#LB-MSM"
 type="scene-settingxml:id="set-kitchen-2">
kitchen</interp>
The <interp> element is not limited to any particular type of analysis. The literary analysis shown above is but one possibility; one could equally well use <interp> to capture a linguistic part-of-speech analysis. For example, the example sentence given in section 8.3 Special kinds of Linking assumes a linguistic analysis which might be represented as follows:
<interp type="posxml:id="NP1">noun
phrase, singular</interp>
<interp type="posxml:id="VV1">inflected verb, present-tense
singular</interp> ...

16 Technical Documentation

Although the focus of this document is on the use of the TEI scheme for the encoding of existing ‘pre-electronic’ documents, the same scheme may also be used for the encoding of new documents. In the preparation of new documents (such as this one), XML has much to recommend it: the document's structure can be clearly represented, and the same electronic text can be re-used for many purposes — to provide both online hypertext or browsable versions and well-formatted typeset versions from a common source for example.

To facilitate this, the TEI Lite schema includes some elements for marking features of technical documents in general, and of XML-related documents in particular.

16.1 Additional Elements for Technical Documents

The following elements may be used to mark particular features of technical documents:

  • UNKNOWN ELEMENT eg
  • UNKNOWN ELEMENT code
  • UNKNOWN ELEMENT ident
  • UNKNOWN ELEMENT gi
  • UNKNOWN ELEMENT att
  • UNKNOWN ELEMENT formula
  • UNKNOWN ELEMENT val
The following example shows how these elements might be used to encode a passage from a tutorial introducing the Fortran programming language:
<p>It is traditional to introduce a language with a
program like the following: <eg xml:space="preserve"> CHAR*12 GRTG           GRTG = 'HELLO WORLD'            PRINT *, GRTG            END         </eg>
</p>
<p>This simple example first declares a variable <ident>GRTG</ident>, in the line
<code>CHAR*12 GRTG</code>, which identifies <ident>GRTG</ident> as consisting of 12 bytes
of type <ident>CHAR</ident>. To this variable, the value <val>HELLO WORLD</val> is then
assigned.</p>

A formatting application, given a text like that above, can be instructed to format examples appropriately (e.g. to preserve line breaks, or to use a distinctive font). Similarly, the use of tags such as <ident> greatly facilitates the construction of a useful index.

The <formula> element should be used to enclose a mathematical or chemical formula presented within the text as a distinct item. Since formulae generally include a large variety of special typographic features not otherwise present in ordinary text, it will usually be necessary to present the body of the formula in a specialized notation. The notation used should be specified by the notation attribute, as in the following example:
<formula notation="tex"> \begin{math}E =
mc^{2}\end{math} </formula>

A particular problem arises when XML encoding is the subject of discussion within a technical document, itself encoded in XML. In such a document, it is clearly essential to distinguish clearly the markup occurring within examples from that marking up the document itself, and end-tags are highly likely to occur. One simple solution is to use the predefined entity reference &lt; to represent each < character which marks the start of an XML tag within the examples. A more general solution is to mark off the whole body of each example as containing data which is not to be scanned for XML mark-up by the parser. This is achieved by enclosing it within a special XML construct called a CDATA marked section, as in the following example:

<p>A list should be encoded as
          follows: <eg><![ CDATA [ <list> <item>First item in the
          list</item> <item>Second item</item> </list> ]]> </eg> The
          <gi>list</gi> element consists of a series of <gi>item</gi>
          elements.

The <list> element used within the example above will not be regarded as forming part of the document proper, because it is embedded within a marked section (beginning with the special markup declaration <![CDATA[ , and ending with ]]>).

Note also the use of the <gi> element to tag references to element names (or generic identifiers) within the body of the text.

16.2 Generated Divisions

Most modern document production systems have the ability to generate automatically whole sections such as a table of contents or an index. The TEI Lite scheme provides an element to mark the location at which such a generated section should be placed.

  • UNKNOWN ELEMENT divGen
The <divGen> element can be placed anywhere that a division element would be legal, as in the following example:
<front>
 <titlePage>
<!-- ... -->
 </titlePage>
 <divGen type="toc"/>
 <div>
  <head>Preface</head>
<!-- ... -->
 </div>
</front>
<body>
<!-- ... -->
</body>
<back>
 <div>
  <head>Appendix</head>
<!-- ... -->
 </div>
 <divGen n="Indextype="index"/>
</back>

This example also demonstrates the use of the type attribute to distinguish the different kinds of division to be generated: in the first case a table of contents (a toc) and in the second an index.

When an existing index or table of contents is to be encoded (rather than one being generated) for some reason, the <list> element discussed in section 11 Lists should be used.

16.3 Index Generation

While production of a table of contents from a properly tagged document is generally unproblematic for an automatic processor, the production of a good quality index will often require more careful tagging. It may not be enough simply to produce a list of all parts tagged in some particular way, although extracting (for example) all occurrences of elements such as term or <name> will often be a good departure point for an index.

The TEI schema provides a special purpose index tag which may be used to mark both the parts of the document which should be indexed, and how the indexing should be done.

  • index (index entry) marks a location to be indexed for whatever purpose.
For example, the second paragraph of this section might include the following:
... TEI lite also provides a special purpose
<gi>index</gi> tag
<index>
 <term>indexing</term>
</index>
<index>
 <term>index (tag)</term>
 <index>
  <term>use in index generation</term>
 </index>
</index>
which may be used ...
The index element can also be used to provide a form of interpretive or analytic information. For example, in a study of Ovid, it might be desired to record all the poet's references to different figures, for comparative stylistic study. In the following lines of the Metamorphoses, such a study would record the poet's references to Jupiter (as deus, se, and as the subject of confiteor [in inflectional form number 227]), to Jupiter-in-the-guise-of-a-bull (as imago tauri fallacis and the subject of teneo), and so on.4
<l n="3.001">iamque deus posita fallacis
imagine tauri</l>
<l n="3.002">se confessus erat Dictaeaque rura tenebat</l>
This need might be met using the note element discussed in section in 7 Notes, or with the <interp> element discussed in section 15 Interpretation and Analysis. Here we demonstrate how it might also be satisfied by using the index element.
We assume that the object is to generate more than one index: one for names of deities (called dn), another for onomastic references (called on), a third for pronominal references (called pr) and so forth. One way of achieving this might be as follows:
<l n="3.001">iamque deus posita
fallacis imagine tauri <index indexName="dn">
  <term>Iuppiter</term>
  <index>
   <term>deus</term>
  </index>
 </index>
 <index indexName="on">
  <term>Iuppiter (taurus)</term>
  <index>
   <term>imago tauri
       fallacis</term>
  </index>
 </index>
</l>
<l n="3.002">se confessus erat Dictaeaque rura tenebat
<index indexName="pr">
  <term>Iuppiter</term>
  <index>
   <term>se</term>
  </index>
 </index>
 <index indexName="v">
  <term>Iuppiter</term>
  <index>
   <term>confiteor
       (v227)</term>
  </index>
 </index>
</l>
For each index element above, an entry will be generated in the appropriate index, using as headword the content of the term element it contains; the term elements nested within the secondary index element in each case provide a secondary keyword. The actual reference will be taken from the context in which the index element appears, i.e. in this case the identifier of the l element containing it.

16.4 Addresses

The address element is used to mark a postal address of any kind. It contains one or more addrLine elements, one for each line of the address.

  • address (address) contains a postal address, for example of a publisher, an organization, or an individual.
  • addrLine (address line) contains one line of a postal address.
Here is a simple example:
<address>
 <addrLine>Computer Center (M/C 135)</addrLine>
 <addrLine>1940 W. Taylor, Room 124</addrLine>
 <addrLine>Chicago, IL 60612-7352</addrLine>
 <addrLine>U.S.A.</addrLine>
</address>
The individual parts of an address may be further distinguished by using the <name> element discussed above (section 10.1 Names and Referring Strings).
<address>
 <addrLine>Computer Center (M/C 135)</addrLine>
 <addrLine>1940 W. Taylor, Room 124</addrLine>
 <addrLine>
  <name type="city">Chicago</name>, IL 60612-7352</addrLine>
 <addrLine>
  <name type="country">USA</name>
 </addrLine>
</address>

17 Character Sets, Diacritics, etc.

With the advent of XML and its adoption of Unicode as the required character set for all documents, most problems previously associated with the representation of the divers languages and writing systems of the world are greatly reduced. For those working with standard forms of the European languages in particular, almost no special action is needed: any XML editor should enable you to input accented letters or other ‘non-ASCII’ characters directly, and they should be stored in the resulting file in a way which is transferable directly between different systems.

There are two important exceptions: the characters & and < may not be entered directly in an XML document, since they have a special significance as initiating markup. They must always be represented as entity references, like this: &amp; or &lt;. Other characters may also be represented by means of entity reference where necessary, for example to retain compatibility with a pre-Unicode processing system.

18 Front and Back Matter

18.1 Front Matter

For many purposes, particularly in older texts, the preliminary material such as title pages, prefatory epistles, etc., may provide very useful additional linguistic or social information. P5 provides a set of recommendations for distinguishing the textual elements most commonly encountered in front matter, which are summarized here.

18.1.1 Title Page

The start of a title page should be marked with the element titlePage. All text contained on the page should be transcribed and tagged with the appropriate element from the following list:

  • titlePage (title page) contains the title page of a text, appearing within the front or back matter.
  • docTitle (document title) contains the title of a document, including all its constituents, as given on a title page.
  • titlePart (title part) contains a subsection or division of the title of a work, as indicated on a title page.
  • byline (byline) contains the primary statement of responsibility given for a work on its title page or at the head or end of the work.
  • docAuthor (document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline).
  • UNKNOWN ELEMENT docDate
  • UNKNOWN ELEMENT docEdition
  • docImprint (document imprint) contains the imprint statement (place and date of publication, publisher name), as given (usually) at the foot of a title page.
  • epigraph (epigraph) contains a quotation, anonymous or attributed, appearing at the start or end of a section or on a title page.

Typeface distinctions should be marked with the rend attribute when necessary, as described above. Very detailed description of the letter spacing and sizing used in ornamental titles is not as yet provided for by the Guidelines. Changes of language should be marked by appropriate use of the xml:lang attribute or the foreign element, as necessary. Names of people, places, or organizations, may be tagged using the <name> element wherever they appear if no other more specific element is available.

Two example title pages follow:
<titlePage rend="Roman">
 <docTitle>
  <titlePart type="main"> PARADISE REGAIN'D. A POEM In IV <hi>BOOKS</hi>. </titlePart>
  <titlePart> To which is added <title>SAMSON AGONISTES</title>. </titlePart>
 </docTitle>
 <byline>The Author <docAuthor>JOHN MILTON</docAuthor>
 </byline>
 <docImprint>
  <name>LONDON</name>, Printed by <name>J.M.</name> for <name>John Starkey</name>
   at the <name>Mitre</name> in <name>Fleetstreet</name>, near
 <name>Temple-Bar.</name>
 </docImprint>
 <docDate>MDCLXXI</docDate>
</titlePage>
<titlePage>
 <docTitle>
  <titlePart type="main"> Lives of the Queens of England, from the Norman
     Conquest;</titlePart>
  <titlePart type="sub">with anecdotes of their courts. </titlePart>
 </docTitle>
 <titlePart>Now first published from Official Records and other authentic documents private
   as well as public.</titlePart>
 <docEdition>New edition, with corrections and additions</docEdition>
 <byline>By <docAuthor>Agnes Strickland</docAuthor>
 </byline>
 <epigraph>
  <q>The treasures of antiquity laid up in old historic rolls, I opened.</q>
  <bibl>BEAUMONT</bibl>
 </epigraph>
 <docImprint>Philadelphia: Blanchard and Lea</docImprint>
 <docDate>1860.</docDate>
</titlePage>
As elsewhere, the ref attribute may be used to link a name with a canonical definition of the entity being named. For example:
<byline>By <docAuthor>
  <name ref="http://en.wikipedia.org/wiki/Agnes_Strickland">Agnes
     Strickland</name>
 </docAuthor>
</byline>

18.1.2 Prefatory Matter

Major blocks of text within the front matter should be marked using div elements; the following suggested values for the type attribute may be used to distinguish various common types of prefatory matter:

preface
A foreword or preface addressed to the reader in which the author or publisher explains the content, purpose, or origin of the text
dedication
A formal offering or dedication of a text to one or more persons or institutions by the author.
abstract
A summary of the content of a text as continuous prose
ack
A formal declaration of acknowledgment by the author in which persons and institutions are thanked for their part in the creation of a text
contents
A table of contents, specifying the structure of a work and listing its constituents. The <list> element should be used to mark its structure.
frontispiece
A pictorial frontispiece, possibly including some text.

Where other kinds of prefatory matter are encountered, the encoder is at liberty to invent other values for the type attribute.

Like any text division, those in front matter may contain low level structural or non-structural elements as described elsewhere. They will generally begin with a heading or title of some kind which should be tagged using the head element. Epistles will contain the following additional elements:

  • salute (salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc.
  • signed (signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text.
  • byline (byline) contains the primary statement of responsibility given for a work on its title page or at the head or end of the work.
  • dateline (dateline) contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer.
  • UNKNOWN ELEMENT argument
  • cit (cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example.
  • UNKNOWN ELEMENT imprimatur
  • opener (opener) groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter.
  • closer (closer) groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter.

Epistles which appear elsewhere in a text will, of course, contain these same elements.

As an example, the dedication at the start of Milton's Comus should be marked up as follows:
<div type="dedication">
 <head>To the Right Honourable <name>JOHN Lord Viscount BRACLY</name>, Son and Heir apparent
   to the Earl of Bridgewater, &amp;c.</head>
 <salute>MY LORD,</salute>
 <p>THis <hi>Poem</hi>, which receiv'd its first occasion of Birth from your Self, and
   others of your Noble Family .... and as in this representation your attendant
 <name>Thyrsis</name>, so now in all reall expression</p>
 <closer>
  <salute>Your faithfull, and most humble servant</salute>
  <signed>
   <name>H. LAWES.</name>
  </signed>
 </closer>
</div>

18.2 Back Matter

18.2.1 Structural Divisions of Back Matter

Because of variations in publishing practice, back matter can contain virtually any of the elements listed above for front matter, and the same elements should be used where this is so. Additionally, back matter may contain the following types of matter within the back element. Like the structural divisions of the body, these should be marked as div elements, and distinguished by the following suggested values of the type attribute:

appendix
An ancillary self-contained section of a work, often providing additional but in some sense extra-canonical text.
glossary
A list of terms associated with definition texts (‘glosses’): this should be encoded as a <<list type="gloss">> element
notes
A section in which textual or other kinds of notes are gathered together.
bibliogr
A list of bibliographic citations: this should be encoded as a <listBibl>
index
Any form of pre-existing index to the work (An index may also be generated for a document by using the index element described above).
colophon
A statement appearing at the end of a book describing the conditions of its physical production.

19 The Electronic Title Page

Every TEI text has a header which provides information analogous to that provided by the title page of printed text. The header is introduced by the element teiHeader and has four major parts:

A corpus or collection of texts with many shared characteristics may have one header for the corpus and individual headers for each component of the corpus. In this case the type attribute indicates the type of header. <teiHeader type="corpus"> introduces the header for corpus-level information.

Some of the header elements contain running prose which consists of one or more ps. Others are grouped:

19.1 The File Description

The fileDesc element is mandatory. It contains a full bibliographic description of the file with the following elements:

  • titleStmt (title statement) groups information about the title of a work and those responsible for its content.
  • UNKNOWN ELEMENT editionStmt
  • extent (extent) describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units.
  • publicationStmt (publication statement) groups information concerning the publication or distribution of an electronic or other text.
  • UNKNOWN ELEMENT seriesStmt
  • notesStmt (notes statement) collects together any notes providing information about a text additional to that recorded in other parts of the bibliographic description.
  • sourceDesc (source description) describes the source(s) from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence.
A minimal header has the following structure:
<teiHeader>
 <fileDesc>
  <titleStmt>
<!-- bibliographic description of the digital resource -->
  </titleStmt>
  <publicationStmt>
<!-- information about how the resource is distributed -->
  </publicationStmt>
  <sourceDesc>
<!-- information about the sources from which the digital resource is derived -->
  </sourceDesc>
 </fileDesc>
</teiHeader>

19.1.1 The Title Statement

The following elements can be used in the titleStmt:

  • title (title) contains a title for any kind of work.
  • author (author) in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority.
  • UNKNOWN ELEMENT sponsor
  • UNKNOWN ELEMENT funder
  • UNKNOWN ELEMENT principal
  • UNKNOWN ELEMENT respStmt
The title of a digital resource derived from a non-digital one will obviously be similar. However, it is important to distinguish the title of the computer file from that of the source text, for example:
[title of source]: a machine readable transcription [title of source]: electronic edition A machine readable version of: [title of source]
The <respStmt> element contains the following subcomponents:
  • UNKNOWN ELEMENT resp
  • UNKNOWN ELEMENT name
Example:
<titleStmt>
 <title>Two stories by Edgar Allen Poe: a machine readable transcription</title>
 <author>Poe, Edgar Allen (1809-1849)</author>
 <respStmt>
  <resp>compiled by</resp>
  <name>James D. Benson</name>
 </respStmt>
</titleStmt>

19.1.2 The Edition Statement

The <editionStmt> groups information relating to one edition of the digital resource (where edition is used as elsewhere in bibliography), and may include the following elements:

  • UNKNOWN ELEMENT edition
  • UNKNOWN ELEMENT respStmt
Example:
<editionStmt>
 <edition n="U2">Third
   draft, substantially revised <date>1987</date>
 </edition>
</editionStmt>

Determining exactly what constitutes a new edition of an electronic text is left to the encoder.

19.1.3 The Extent Statement

The extent statement describes the approximate size of the digital resource.

Example:
<extent>4532
bytes</extent>

19.1.4 The Publication Statement

The publicationStmt is mandatory. It may contain a simple prose description or groups of the elements described below:

  • publisher (publisher) provides the name of the organization responsible for the publication or distribution of a bibliographic item.
  • distributor (distributor) supplies the name of a person or other agency responsible for the distribution of a text.
  • authority (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor.

At least one of these three elements must be present, unless the entire publication statement is in prose. The following elements may occur within them:

  • pubPlace (publication place) contains the name of the place where a bibliographic item was published.
  • address (address) contains a postal address, for example of a publisher, an organization, or an individual.
  • idno (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way.
  • availability (availability) supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc.
  • licence contains information about a licence or other legal agreement applicable to the text.
  • date (date) contains a date in any format.
Example:
<publicationStmt>
 <publisher>University of Victoria Humanities Computing and Media Centre</publisher>
 <pubPlace>Victoria, BC</pubPlace>
 <date>2011</date>
 <availability status="restricted">
  <licence target="http://creativecommons.org/licenses/by-sa/3.0/"> Distributed under a
     Creative Commons Attribution-ShareAlike 3.0 Unported License </licence>
 </availability>
</publicationStmt>

19.1.5 Series and Notes Statements

The <seriesStmt> element groups information about the series, if any, to which a publication belongs. It may contain title, idno, or <respStmt> elements.

The notesStmt, if used, contains one or more note elements which contain a note or annotation. Some information found in the notes area in conventional bibliography has been assigned specific elements in the TEI scheme.

19.1.6 The Source Description

The sourceDesc is a mandatory element which records details of the source or sources from which the computer file is derived. It may contain simple prose or a bibliographic citation, using one or more of the following elements:

  • bibl (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged.
  • UNKNOWN ELEMENT listBibl
Examples:
<sourceDesc>
 <bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile,
   1968)</bibl>
</sourceDesc>
<sourceDesc>
 <bibl>
  <author>CNN Network News</author>
  <title>News headlines</title>
  <date>12 Jun
     1989</date>
 </bibl>
</sourceDesc>

19.2 The Encoding Description

The encodingDesc element specifies the methods and editorial principles which governed the transcription of the text. Its use is highly recommended. It may be prose description or may contain elements from the following list:

  • projectDesc (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected.
  • UNKNOWN ELEMENT samplingDecl
  • editorialDecl (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text.
  • UNKNOWN ELEMENT refsDecl
  • UNKNOWN ELEMENT classDecl

19.2.1 Project and Sampling Descriptions

Examples of projectDesc and <samplingDesc>:
<encodingDesc>
 <projectDesc>
  <p>Texts collected for
     use in the Claremont Shakespeare Clinic, June 1990.
  </p>
 </projectDesc>
</encodingDesc>
<encodingDesc>
 <samplingDecl>
  <p>Samples of
     2000 words taken from the beginning of the text</p>
 </samplingDecl>
</encodingDesc>

19.2.2 Editorial Declarations

The editorialDecl contains a prose description of the practices used when encoding the text. Typically this description should cover such topics as the following, each of which may conveniently be given as a separate paragraph.

correction
how and under what circumstances corrections have been made in the text.
normalization
the extent to which the original source has been regularized or normalized.
quotation
what has been done with quotation marks in the original -- have they been retained or replaced by entity references, are opening and closing quotes distinguished, etc.
hyphenation
what has been done with hyphens (especially end-of-line hyphens) in the original -- have they been retained, replaced by entity references, etc.
segmentation
how has the text has been segmented, for example into sentences, tone-units, graphemic strata, etc.
interpretation
what analytic or interpretive information has been added to the text.
Example:
<editorialDecl>
 <p>The part of
   speech analysis applied throughout section 4 was added by hand and has not been
   checked.</p>
 <p>Errors in transcription controlled by using the WordPerfect spelling
   checker.</p>
 <p>All words converted to Modern American spelling using Webster's 9th
   Collegiate dictionary.</p>
</editorialDecl>

19.2.3 Reference and Classification Declarations

The <refsDecl> element is used to document the way in which any standard referencing scheme built into the encoding works. In its simplest form, it consists of prose description.

Example:
<refsDecl>
 <p>The <att>n</att>
   attribute on each <gi>div</gi> contains the canonical reference for each division in the
   form XX.yyy where XX is the book number in roman numeral and yyy is the section number in
   arabic.</p>
 <p>Milestone tags refer to the edition of 1830 as E30 and that of 1850 as E50.
 </p>
</refsDecl>

The <classDecl> element groups together definitions or sources for any descriptive classification schemes used by other parts of the header. At least one such scheme must be provided, encoded using the following elements:

  • UNKNOWN ELEMENT taxonomy
  • bibl (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged.
  • UNKNOWN ELEMENT category
  • UNKNOWN ELEMENT catDesc
In the simplest case, the taxonomy may be defined by a bibliographic reference, as in the following example:
<classDecl>
 <taxonomy xml:id="LC-SH">
  <bibl>Library of Congress Subject Headings
  </bibl>
 </taxonomy>
</classDecl>
Alternatively, or in addition, the encoder may define a special purpose classification scheme, as in the following example:
<taxonomy xml:id="B">
 <bibl>Brown Corpus</bibl>
 <category xml:id="B.A">
  <catDesc>Press
     Reportage</catDesc>
  <category xml:id="B.A1">
   <catDesc>Daily</catDesc>
  </category>
  <category xml:id="B.A2">
   <catDesc>Sunday</catDesc>
  </category>
  <category xml:id="B.A3">
   <catDesc>National</catDesc>
  </category>
  <category xml:id="B.A4">
   <catDesc>Provincial</catDesc>
  </category>
  <category xml:id="B.A5">
   <catDesc>Political</catDesc>
  </category>
  <category xml:id="B.A6">
   <catDesc>Sports</catDesc>
  </category>
 </category>
 <category xml:id="B.D">
  <catDesc>Religion</catDesc>
  <category xml:id="B.D1">
   <catDesc>Books</catDesc>
  </category>
  <category xml:id="B.D2">
   <catDesc>Periodicals and
       tracts</catDesc>
  </category>
 </category>
</taxonomy>

Linkage between a particular text and a category within such a taxonomy is made by means of the <catRef> element within the textClass element, as described in the next section below.

19.3 The Profile Description

The profileDesc element enables information characterizing various descriptive aspects of a text to be recorded within a single framework. It has three optional components:

  • creation (creation) contains information about the creation of a text.
  • UNKNOWN ELEMENT langUsage
  • textClass (text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc.

The creation element is useful for documenting where a work was created, even though it may not have been published or recorded there.

Example:
<creation>
 <date when="1992-08">August 1992</date>
 <name type="place">Taos, New Mexico</name>
</creation>
The <langUsage> element is useful where a text contains many different languages. It may contain <language> elements to document each particular language used:
  • UNKNOWN ELEMENT language
For example, a text containing predominantly text in French as spoken in Quebec, but also smaller amounts of British and Canadian English might be documented as follows:
<langUsage>
 <language ident="fr-CAusage="60">Québecois</language>
 <language ident="en-CAusage="20">Canadian business English</language>
 <language ident="en-GBusage="20">British English</language>
</langUsage>

The textClass element classifies a text. This may be done with reference to a classification system locally defined by means of the <classDecl> element, or by reference to some externally defined established scheme such as the Universal Decimal Classification. Texts may also be classified using lists of keywords, which may themselves be drawn from locally or externally defined control lists. The following elements are used to supply such classifications:

  • UNKNOWN ELEMENT classCode
  • UNKNOWN ELEMENT catRef
  • keywords (keywords) contains a list of keywords or phrases identifying the topic or nature of a text.
The simplest way of classifying a text is by means of the <classCode> element. For example, a text with classification 410 in the Universal Decimal Classification might be documented as follows:
<classCode scheme="http://www.udc.org">410</classCode>
When a classification scheme has been locally defined using the <taxonomy> element discussed in the preceding subsection, the <catRef> element should be used to reference it. To continue the earlier example, a work classified in the Brown Corpus as Press reportage - Sunday and also as Religion might be documented as follows:
<catRef target="#B.A3 #B.D"/>
The element keywords contains a list of keywords or phrases identifying the topic or nature of a text. As usual, the attribute scheme identifies the source from which these terms are taken. For example, if the LC Subject Headings are used, following declaration of that classification system in a <taxonomy> element as above :
<textClass>
 <keywords scheme="#LCSH">
  <list>
   <item>English literature -- History and criticism -- Data processing.</item>
   <item>English literature -- History and criticism -- Theory etc.</item>
   <item>English language -- Style -- Data processing.</item>
  </list>
 </keywords>
</textClass>

Multiple classifications may be supplied using any of the mechanisms described in this section.

19.4 The Revision Description

The revisionDesc element provides a change log in which each change made to a text may be recorded. The log may be recorded as a sequence of change elements each of which contains a brief description of the change. The attributes when and who may be used to identify when the change was carried out and the agency responsible for it.

Example:
<revisionDesc>
 <change when="1991-03-06who="#EMB">File format updated</change>
 <change when="1990-05-25who="#EMB">Stuart's corrections entered</change>
</revisionDesc>

In a production environment it will usually be found preferable to use some kind of automated system to track and record changes. Many such version control systems, as they are known, can also be configured to update the TEI Header of a file automatically.

Appendix A List of Elements Described

The TEI Lite schema is a pure subset of TEI P5. In the following list of elements and classes used, some information, notably the examples, derives from the canonical definition for the element in TEI P5 and may therefore refer to elements or attributes not provided by TEI Lite. Note however that only the elements listed here are available within the TEI Lite schema. These specifications also refer to many attributes which although available in TEI Lite are not discussed in this tutorial for lack of space.

Schema tei_lite: Elements

<ab>

<ab> (anonymous block) contains any component-level unit of text, acting as a container for phrase or inter level elements analogous to, but without the same constraints as, a paragraph. [16.3. Blocks, Segments, and Anchors]
Modulelinking
Attributes
Member of
Contained by
May contain
Note

The ab element may be used at the encoder's discretion to mark any component-level elements in a text for which no other more specific appropriate markup is defined. Unlike paragraphs, ab may nest and may use the type and subtype attributes.

Example
<div n="Genesistype="book">
 <div n="1type="chapter">
  <ab>In the beginning God created the heaven and the earth.</ab>
  <ab>And the earth was without form, and void; and
     darkness was upon the face of the deep. And the
     spirit of God moved upon the face of the waters.</ab>
  <ab>And God said, Let there be light: and there was light.</ab>
<!-- ...-->
 </div>
</div>
Schematron

<sch:report test="(ancestor::tei:l or ancestor::tei:lg) and not( ancestor::tei:floatingText |parent::tei:figure |parent::tei:note )"> Abstract model violation: Lines may not contain higher-level divisions such as p or ab, unless ab is a child of figure or note, or is a descendant of floatingText.
</sch:report>
Content model
<content>
 <macroRef key="macro.abContent"/>
</content>
Schema Declaration
element ab
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_att.declaring.attributes,
   tei_att.fragmentable.attributes,
   tei_att.written.attributes,
   tei_macro.abContent
}

<abbr>

<abbr> (abbreviation) contains an abbreviation of any sort. [3.6.5. Abbreviations and Their Expansions]
Modulecore
Attributes
type(type) allows the encoder to classify the abbreviation according to some convenient typology.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Sample values include:
suspension
(suspension) the abbreviation provides the first letter(s) of the word or phrase, omitting the remainder.
contraction
(contraction) the abbreviation omits some letter(s) in the middle.
brevigraph
the abbreviation comprises a special symbol or mark.
superscription
(superscription) the abbreviation includes writing above the line.
acronym
(acronym) the abbreviation comprises the initial letters of the words of a phrase.
title
(title) the abbreviation is for a title of address (Dr, Ms, Mr, …)
organization
(organization) the abbreviation is for the name of an organization.
geographic
(geographic) the abbreviation is for a geographic name.
Note

The type attribute is provided for the sake of those who wish to classify abbreviations at their point of occurrence; this may be useful in some circumstances, though usually the same abbreviation will have the same type in all occurrences. As the sample values make clear, abbreviations may be classified by the method used to construct them, the method of writing them, or the referent of the term abbreviated; the typology used is up to the encoder and should be carefully planned to meet the needs of the expected use. For a typology of Middle English abbreviations, see 6.2.

Member of
Contained by
May contain
Note

If abbreviations are expanded silently, this practice should be documented in the editorialDecl, either with a normalization element or a p.

Example
<choice>
 <expan>North Atlantic Treaty Organization</expan>
 <abbr cert="low">NorATO</abbr>
 <abbr cert="high">NATO</abbr>
 <abbr cert="highxml:lang="fr">OTAN</abbr>
</choice>
Example
<choice>
 <abbr>SPQR</abbr>
 <expan>senatus populusque romanorum</expan>
</choice>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element abbr
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   attribute type { teidata.enumerated }?,
   tei_macro.phraseSeq
}

<accMat>

<accMat> (accompanying material) contains details of any significant additional material which may be closely associated with the manuscript or object being described, such as non-contemporaneous documents or fragments bound in with it at some earlier historical period. [10.7.3.3. Accompanying Material]
Modulemsdescription
Attributes
Member of
Contained by
msdescription: physDesc
May contain
Example
<accMat>A copy of a tax form from 1947 is included in the envelope
with the letter. It is not catalogued separately.</accMat>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element accMat
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_macro.specialPara
}

<acquisition>

<acquisition> (acquisition) contains any descriptive or other information concerning the process by which a manuscript or manuscript part or other object entered the holding institution. [10.8. History]
Modulemsdescription
Attributes
calendarindicates one or more systems or calendars to which the date represented by the content of this element belongs.
Deprecatedwill be removed on 2024-11-11
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Contained by
msdescription: history
May contain
Example
<acquisition>Left to the <name type="place">Bodleian</name> by
<name type="person">Richard Rawlinson</name> in 1755.
</acquisition>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element acquisition
{
   tei_att.global.attributes,
   tei_att.datable.attributes,
   attribute calendar { list { teidata.pointer+ } }?,
   tei_macro.specialPara
}

<add>

<add> (addition) contains letters, words, or phrases inserted in the source text by an author, scribe, or a previous annotator or corrector. [3.5.3. Additions, Deletions, and Omissions]
Modulecore
Attributes
Member of
Contained by
May contain
Note

In a diplomatic edition attempting to represent an original source, the add element should not be used for additions to the current TEI electronic edition made by editors or encoders. In these cases, either the corr or supplied element are recommended.

In a TEI edition of a historical text with previous editorial emendations in which such additions or reconstructions are considered part of the source text, the use of add may be appropriate, dependent on the editorial philosophy of the project.

Example
The story I am
going to relate is true as to its main facts, and as to the
consequences <add place="above">of these facts</add> from which
this tale takes its title.
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element add
{
   tei_att.global.attributes,
   tei_att.transcriptional.attributes,
   tei_att.placement.attributes,
   tei_att.typed.attributes,
   tei_att.dimensions.attributes,
   tei_macro.paraContent
}

<addName>

<addName> (additional name) contains an additional name component, such as a nickname, epithet, or alias, or any other descriptive phrase used within a personal name. [13.2.1. Personal Names]
Modulenamesdates
Attributes
Member of
Contained by
May contain
Example
<persName>
 <forename>Frederick</forename>
 <addName type="epithet">the Great</addName>
 <roleName>Emperor of Prussia</roleName>
</persName>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element addName
{
   tei_att.global.attributes,
   tei_att.personal.attributes,
   tei_att.typed.attributes,
   tei_macro.phraseSeq
}

<address>

<address> (address) contains a postal address, for example of a publisher, an organization, or an individual. [3.6.2. Addresses 2.2.4. Publication, Distribution, Licensing, etc. 3.12.2.4. Imprint, Size of a Document, and Reprint Information]
Modulecore
Attributes
Member of
Contained by
May contain
Note

This element should be used for postal addresses only. Within it, the generic element addrLine may be used as an alternative to any of the more specialized elements available from the model.addrPart class, such as <street>, <postCode> etc.

Example

Using just the elements defined by the core module, an address could be represented as follows:

<address>
 <street>via Marsala 24</street>
 <postCode>40126</postCode>
 <name>Bologna</name>
 <name>Italy</name>
</address>
Example

When a schema includes the names and dates module more specific elements such as country or settlement would be preferable over generic <name>:

<address>
 <street>via Marsala 24</street>
 <postCode>40126</postCode>
 <settlement>Bologna</settlement>
 <country>Italy</country>
</address>
Example
<address>
 <addrLine>Computing Center, MC 135</addrLine>
 <addrLine>P.O. Box 6998</addrLine>
 <addrLine>Chicago, IL 60680</addrLine>
 <addrLine>USA</addrLine>
</address>
Example
<address>
 <country key="FR"/>
 <settlement type="city">Lyon</settlement>
 <postCode>69002</postCode>
 <district type="arrondissement">IIème</district>
 <district type="quartier">Perrache</district>
 <street>
  <num>30</num>, Cours de Verdun</street>
</address>
Content model
<content>
 <sequence>
  <classRef key="model.global"
   maxOccurs="unboundedminOccurs="0"/>

  <sequence maxOccurs="unbounded"
   minOccurs="1">

   <classRef key="model.addrPart"/>
   <classRef key="model.global"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
 </sequence>
</content>
Schema Declaration
element address
{
   tei_att.global.attributes,
   ( tei_model.global*, ( tei_model.addrPart, tei_model.global* )+ )
}

<addrLine>

<addrLine> (address line) contains one line of a postal address. [3.6.2. Addresses 2.2.4. Publication, Distribution, Licensing, etc. 3.12.2.4. Imprint, Size of a Document, and Reprint Information]
Modulecore
Attributes
Member of
Contained by
core: address
May contain
Note

Addresses may be encoded either as a sequence of lines, or using any sequence of component elements from the model.addrPart class. Other non-postal forms of address, such as telephone numbers or email, should not be included within an address element directly but may be wrapped within an addrLine if they form part of the printed address in some source text.

Example
<address>
 <addrLine>Computing Center, MC 135</addrLine>
 <addrLine>P.O. Box 6998</addrLine>
 <addrLine>Chicago, IL</addrLine>
 <addrLine>60680 USA</addrLine>
</address>
Example
<addrLine>
 <ref target="tel:+1-201-555-0123">(201) 555 0123</ref>
</addrLine>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element addrLine { tei_att.global.attributes, tei_macro.phraseSeq }

<addSpan>

<addSpan> (added span of text) marks the beginning of a longer sequence of text added by an author, scribe, annotator or corrector (see also add). [11.3.1.4. Additions and Deletions]
Moduletranscr
Attributes
Member of
Contained by
May containEmpty element
Note

Both the beginning and the end of the added material must be marked; the beginning by the addSpan element itself, the end by the spanTo attribute.

Example
<handNote scribe="HelgiÓlafsson"
 xml:id="HEOL"/>

<!-- ... -->
<body>
 <div>
<!-- text here -->
 </div>
 <addSpan hand="#HEOLn="added_gathering"
  spanTo="#P025"/>

 <div>
<!-- text of first added poem here -->
 </div>
 <div>
<!-- text of second added poem here -->
 </div>
 <div>
<!-- text of third added poem here -->
 </div>
 <div>
<!-- text of fourth added poem here -->
 </div>
 <anchor xml:id="P025"/>
 <div>
<!-- more text here -->
 </div>
</body>
Schematron

<sch:assert test="@spanTo">The @spanTo attribute of <sch:name/> is required.</sch:assert>
Schematron

<sch:assert test="@spanTo">L'attribut spanTo est requis.</sch:assert>
Content model
<content>
 <empty/>
</content>
Schema Declaration
element addSpan
{
   tei_att.global.attributes,
   tei_att.transcriptional.attributes,
   tei_att.placement.attributes,
   tei_att.typed.attributes,
   tei_att.spanning.attributes,
   tei_att.dimensions.attributes,
   empty
}

<am>

<am> (abbreviation marker) contains a sequence of letters or signs present in an abbreviation which are omitted or replaced in the expanded form of the abbreviation. [11.3.1.2. Abbreviation and Expansion]
Moduletranscr
Attributes
Member of
Contained by
May contain
Example
do you <abbr>Mr<am>.</am>
</abbr> Jones?
Example
<choice>
 <abbr>Aug<am>g</am>
 </abbr>
 <expan>Aug<ex>ustorum duo</ex>
 </expan>
</choice>
Example
<abbr>eu<am>
  <g ref="#b-er"/>
 </am>y</abbr>
<abbr>
 <am>
  <g ref="#b-per"/>
 </am>sone

</abbr> ...
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.pPart.transcriptional"/>
 </alternate>
</content>
Schema Declaration
element am
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_att.editLike.attributes,
   ( text | tei_model.gLike | tei_model.pPart.transcriptional )*
}

<anchor>

<anchor> (anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element. [8.4.2. Synchronization and Overlap 16.5. Correspondence and Alignment]
Modulelinking
Attributes
Member of
Contained by
May containEmpty element
Note

On this element, the global xml:id attribute must be supplied to specify an identifier for the point at which this element occurs within a document. The value used may be chosen freely provided that it is unique within the document and is a syntactically valid name. There is no requirement for values containing numbers to be in sequence.

Example
<s>The anchor is he<anchor xml:id="A234"/>re somewhere.</s>
<s>Help me find it.<ptr target="#A234"/>
</s>
Content model
<content>
 <empty/>
</content>
Schema Declaration
element anchor { tei_att.global.attributes, tei_att.typed.attributes, empty }

<app>

<app> (apparatus entry) contains one entry in a critical apparatus, with an optional lemma and usually one or more readings or notes on the relevant passage. [12.1.1. The Apparatus Entry]
Moduletextcrit
Attributes
typeclassifies the variation contained in this element according to some convenient typology.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
fromidentifies the beginning of the lemma in the base text.
Status Optional
Datatype teidata.pointer
Note

This attribute should be used when either the double-end point method of apparatus markup, or the location-referenced method with a URL rather than canonical reference, are used.

toidentifies the endpoint of the lemma in the base text.
Status Optional
Datatype teidata.pointer
Note

This attribute is only used when the double-end point method of apparatus markup is used, when the encoded apparatus is not embedded in-line in the base-text.

loc(location) indicates the location of the variation, when the location-referenced method of apparatus markup is used.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
Note

This attribute is used only when the location-referenced encoding method is used. It supplies a string containing a canonical reference for the passage to which the variation applies.

Member of
Contained by
May contain
core: note
Example
<app>
 <lem wit="#El #Hg">Experience</lem>
 <rdg type="substantivewit="#La">Experiment</rdg>
 <rdg type="substantivewit="#Ra2">Eryment</rdg>
</app>
Example
<app type="substantive">
 <rdgGrp type="subvariants">
  <lem wit="#El #Hg">Experience</lem>
  <rdg wit="#Ha4">Experiens</rdg>
 </rdgGrp>
 <rdgGrp type="subvariants">
  <lem wit="#Cp #Ld1">Experiment</lem>
  <rdg wit="#La">Ex<g ref="#per"/>iment</rdg>
 </rdgGrp>
 <rdgGrp type="subvariants">
  <lem resp="#ed2013">Eriment</lem>
  <rdg wit="#Ra2">Eryment</rdg>
 </rdgGrp>
</app>
Example
<app loc="1">
 <rdg resp="#SEG">TIMΩΔA</rdg>
</app>
Example
<app loc="1-6">
 <note>Too badly worn to yield a text</note>
</app>
Example
<choice xml:id="choice3">
 <reg>σύμπαντα</reg>
 <orig>ΣΙΝΠΑΤΑΝ</orig>
</choice>
<!-- ... -->
<app from="#choice3">
 <note>Mommsen's fanciful normalization, reproduced here, has not been accepted by all recent editions</note>
</app>
Content model
<content>
 <sequence>
  <elementRef key="lemminOccurs="0"/>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.rdgLike"/>
   <classRef key="model.noteLike"/>
   <elementRef key="witDetail"/>
   <elementRef key="wit"/>
   <elementRef key="rdgGrp"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element app
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   attribute type { teidata.enumerated }?,
   attribute from { teidata.pointer }?,
   attribute to { teidata.pointer }?,
   attribute loc { list { teidata.word+ } }?,
   (
      lem?,
      (
         tei_model.rdgLike
       | tei_model.noteLike
       | tei_witDetail
       | tei_wit
       | tei_rdgGrp
      )*
   )
}

<author>

<author> (author) in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement]
Modulecore
Attributes
Member of
Contained by
core: bibl monogr
header: titleStmt
msdescription: msItemStruct
May contain
Note

Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use a generally recognized name authority file to supply the content for this element. The attributes key or ref may also be used to reference canonical information about the author(s) intended from any appropriate authority, such as a library catalogue or online resource.

In the case of a broadcast, use this element for the name of the company or network responsible for making the broadcast.

Where an author is unknown or unspecified, this element may contain text such as Unknown or Anonymous. When the appropriate TEI modules are in use, it may also contain detailed tagging of the names used for people, organizations or places, in particular where multiple names are given.

Example
<author>British Broadcasting Corporation</author>
<author>La Fayette, Marie Madeleine Pioche de la Vergne, comtesse de (1634–1693)</author>
<author>Anonymous</author>
<author>Bill and Melinda Gates Foundation</author>
<author>
 <persName>Beaumont, Francis</persName> and
<persName>John Fletcher</persName>
</author>
<author>
 <orgName key="BBC">British Broadcasting
   Corporation</orgName>: Radio 3 Network
</author>
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element author
{
   tei_att.global.attributes,
   tei_att.naming.attributes,
   tei_att.datable.attributes,
   tei_macro.phraseSeq
}

<authority>

<authority> (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor. [2.2.4. Publication, Distribution, Licensing, etc.]
Moduleheader
Attributes
Member of
Contained by
core: monogr
May contain
Example
<authority>John Smith</authority>
Content model
<content>
 <macroRef key="macro.phraseSeq.limited"/>
</content>
Schema Declaration
element authority
{
   tei_att.global.attributes,
   tei_att.canonical.attributes,
   tei_macro.phraseSeq.limited
}

<availability>

<availability> (availability) supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc. [2.2.4. Publication, Distribution, Licensing, etc.]
Moduleheader
Attributes
status(status) supplies a code identifying the current availability of the text.
Status Optional
Datatype teidata.enumerated
Legal values are:
free
(free) the text is freely available.
unknown
(unknown) the status of the text is unknown.
restricted
(restricted) the text is not freely available.
Member of
Contained by
May contain
core: p
header: licence
linking: ab
Note

A consistent format should be adopted

Example
<availability status="restricted">
 <p>Available for academic research purposes only.</p>
</availability>
<availability status="free">
 <p>In the public domain</p>
</availability>
<availability status="restricted">
 <p>Available under licence from the publishers.</p>
</availability>
Example
<availability>
 <licence target="http://opensource.org/licenses/MIT">
  <p>The MIT License
     applies to this document.</p>
  <p>Copyright (C) 2011 by The University of Victoria</p>
  <p>Permission is hereby granted, free of charge, to any person obtaining a copy
     of this software and associated documentation files (the "Software"), to deal
     in the Software without restriction, including without limitation the rights
     to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
     copies of the Software, and to permit persons to whom the Software is
     furnished to do so, subject to the following conditions:</p>
  <p>The above copyright notice and this permission notice shall be included in
     all copies or substantial portions of the Software.</p>
  <p>THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
     IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
     FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
     AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
     LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
     OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
     THE SOFTWARE.</p>
 </licence>
</availability>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="1">

  <classRef key="model.availabilityPart"/>
  <classRef key="model.pLike"/>
 </alternate>
</content>
Schema Declaration
element availability
{
   tei_att.global.attributes,
   tei_att.declarable.attributes,
   attribute status { "free" | "unknown" | "restricted" }?,
   ( tei_model.availabilityPart | tei_model.pLike )+
}

<back>

<back> (back matter) contains any appendixes, etc. following the main part of a text. [4.7. Back Matter 4. Default Text Structure]
Moduletextstructure
Attributes
Contained by
textstructure: text
transcr: facsimile
May contain
Note

Because cultural conventions differ as to which elements are grouped as back matter and which as front matter, the content models for the back and front elements are identical.

Example
<back>
 <div type="appendix">
  <head>The Golden Dream or, the Ingenuous Confession</head>
  <p>TO shew the Depravity of human Nature, and how apt the Mind is to be misled by Trinkets
     and false Appearances, Mrs. Two-Shoes does acknowledge, that after she became rich, she
     had like to have been, too fond of Money
<!-- .... -->
  </p>
 </div>
<!-- ... -->
 <div type="epistle">
  <head>A letter from the Printer, which he desires may be inserted</head>
  <salute>Sir.</salute>
  <p>I have done with your Copy, so you may return it to the Vatican, if you please;
  
<!-- ... -->
  </p>
 </div>
 <div type="advert">
  <head>The Books usually read by the Scholars of Mrs Two-Shoes are these and are sold at Mr
     Newbery's at the Bible and Sun in St Paul's Church-yard.</head>
  <list>
   <item n="1">The Christmas Box, Price 1d.</item>
   <item n="2">The History of Giles Gingerbread, 1d.</item>
<!-- ... -->
   <item n="42">A Curious Collection of Travels, selected from the Writers of all Nations,
       10 Vol, Pr. bound 1l.</item>
  </list>
 </div>
 <div type="advert">
  <head>By the KING's Royal Patent, Are sold by J. NEWBERY, at the Bible and Sun in St.
     Paul's Church-Yard.</head>
  <list>
   <item n="1">Dr. James's Powders for Fevers, the Small-Pox, Measles, Colds, &amp;c. 2s.
       6d</item>
   <item n="2">Dr. Hooper's Female Pills, 1s.</item>
<!-- ... -->
  </list>
 </div>
</back>
Content model
<content>
 <sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.frontPart"/>
   <classRef key="model.pLike.front"/>
   <classRef key="model.pLike"/>
   <classRef key="model.listLike"/>
   <classRef key="model.global"/>
  </alternate>
  <alternate minOccurs="0">
   <sequence>
    <classRef key="model.div1Like"/>
    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <classRef key="model.frontPart"/>
     <classRef key="model.div1Like"/>
     <classRef key="model.global"/>
    </alternate>
   </sequence>
   <sequence>
    <classRef key="model.divLike"/>
    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <classRef key="model.frontPart"/>
     <classRef key="model.divLike"/>
     <classRef key="model.global"/>
    </alternate>
   </sequence>
  </alternate>
  <sequence minOccurs="0">
   <classRef key="model.divBottomPart"/>
   <alternate maxOccurs="unbounded"
    minOccurs="0">

    <classRef key="model.divBottomPart"/>
    <classRef key="model.global"/>
   </alternate>
  </sequence>
 </sequence>
</content>
Schema Declaration
element back
{
   tei_att.global.attributes,
   tei_att.declaring.attributes,
   (
      (
         tei_model.frontPart
       | tei_model.pLike.front
       | tei_model.pLike
       | tei_model.listLike
       | tei_model.global
      )*,
      (
         (
            tei_model.div1Like,
            ( tei_model.frontPart | tei_model.div1Like | tei_model.global )*
         )
       | (
            tei_model.divLike,
            ( tei_model.frontPart | tei_model.divLike | tei_model.global )*
         )
      )?,
      (
         tei_model.divBottomPart,
         ( tei_model.divBottomPart | tei_model.global )*
      )?
   )
}

<bibl>

<bibl> (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged. [3.12.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 15.3.2. Declarable Elements]
Modulecore
Attributes
Member of
Contained by
May contain
Note

Contains phrase-level elements, together with any combination of elements from the model.biblPart class

Example
<bibl>Blain, Clements and Grundy: Feminist Companion to Literature in English (Yale,
1990)</bibl>
Example
<bibl>
 <title level="a">The Interesting story of the Children in the Wood</title>. In
<author>Victor E Neuberg</author>, <title>The Penny Histories</title>.
<publisher>OUP</publisher>
 <date>1968</date>.
</bibl>
Example
<bibl subtype="book_chaptertype="article"
 xml:id="carlin_2003">

 <author>
  <name>
   <surname>Carlin</surname>
     (<forename>Claire</forename>)</name>
 </author>,
<title level="a">The Staging of Impotence : France’s last
   congrès</title> dans
<bibl type="monogr">
  <title level="m">Theatrum mundi : studies in honor of Ronald W.
     Tobin</title>, éd.
 <editor>
   <name>
    <forename>Claire</forename>
    <surname>Carlin</surname>
   </name>
  </editor> et
 <editor>
   <name>
    <forename>Kathleen</forename>
    <surname>Wine</surname>
   </name>
  </editor>,
 <pubPlace>Charlottesville, Va.</pubPlace>,
 <publisher>Rookwood Press</publisher>,
 <date when="2003">2003</date>.
 </bibl>
</bibl>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.highlighted"/>
  <classRef key="model.pPart.data"/>
  <classRef key="model.pPart.edit"/>
  <classRef key="model.segLike"/>
  <classRef key="model.ptrLike"/>
  <classRef key="model.biblPart"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element bibl
{
   tei_att.global.attributes,
   tei_att.declarable.attributes,
   tei_att.typed.attributes,
   tei_att.sortable.attributes,
   tei_att.docStatus.attributes,
   (
      text
    | tei_model.gLike
    | tei_model.highlighted
    | tei_model.pPart.data
    | tei_model.pPart.edit
    | tei_model.segLike
    | tei_model.ptrLike
    | tei_model.biblPart
    | tei_model.global
   )*
}

<biblScope>

<biblScope> (scope of bibliographic reference) defines the scope of a bibliographic reference, for example as a list of page numbers, or a named subdivision of a larger work. [3.12.2.5. Scopes and Ranges in Bibliographic Citations]
Modulecore
Attributes
Member of
Contained by
May contain
Note

When a single page is being cited, use the from and to attributes with an identical value. When no clear endpoint is provided, the from attribute may be used without to; for example a citation such as ‘p. 3ff’ might be encoded <biblScope from="3">p. 3ff</biblScope>.

It is now considered good practice to supply this element as a sibling (rather than a child) of imprint, since it supplies information which does not constitute part of the imprint.

Example
<biblScope>pp 12–34</biblScope>
<biblScope from="12to="34unit="page"/>
<biblScope unit="volume">II</biblScope>
<biblScope unit="page">12</biblScope>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element biblScope
{
   tei_att.global.attributes,
   tei_att.citing.attributes,
   tei_macro.phraseSeq
}

<biblStruct>

<biblStruct> (structured bibliographic citation) contains a structured bibliographic citation, in which only bibliographic sub-elements appear and in a specified order. [3.12.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 15.3.2. Declarable Elements]
Modulecore
Attributes
Member of
Contained by
May contain
Example
<biblStruct>
 <monogr>
  <author>Blain, Virginia</author>
  <author>Clements, Patricia</author>
  <author>Grundy, Isobel</author>
  <title>The Feminist Companion to Literature in English: women writers from the middle ages
     to the present</title>
  <edition>first edition</edition>
  <imprint>
   <publisher>Yale University Press</publisher>
   <pubPlace>New Haven and London</pubPlace>
   <date>1990</date>
  </imprint>
 </monogr>
</biblStruct>
Example
<biblStruct type="newspaper">
 <analytic>
  <author>
   <forename>David</forename>
   <surname>Barstow</surname>
  </author>
  <author>
   <forename>Susanne</forename>
   <surname>Craig</surname>
  </author>
  <author>
   <forename>Russ</forename>
   <surname>Buettner</surname>
  </author>
  <title type="main">Trump Took Part in Suspect Schemes to Evade Tax Bills</title>
  <title type="sub">Behind the Myth of a Self-Made Billionaire, a Vast Inheritance From His Father</title>
 </analytic>
 <monogr>
  <title level="j">The New York Times</title>
  <imprint>
   <pubPlace>New York</pubPlace>
   <publisher>A. G. Sulzberger</publisher>
   <date when="2018-10-03">Wednesday, October 3, 2018</date>
  </imprint>
  <biblScope unit="volume">CLXVIII</biblScope>
  <biblScope unit="issue">58,104</biblScope>
  <biblScope unit="page">1</biblScope>
 </monogr>
</biblStruct>
Content model
<content>
 <sequence>
  <elementRef key="analytic"
   maxOccurs="unboundedminOccurs="0"/>

  <sequence maxOccurs="unbounded"
   minOccurs="1">

   <elementRef key="monogr"/>
   <elementRef key="series"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.noteLike"/>
   <classRef key="model.ptrLike"/>
   <elementRef key="relatedItem"/>
   <elementRef key="citedRange"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element biblStruct
{
   tei_att.global.attributes,
   tei_att.declarable.attributes,
   tei_att.typed.attributes,
   tei_att.sortable.attributes,
   tei_att.docStatus.attributes,
   (
      analytic*,
      ( tei_monogr, series* )+,
      ( tei_model.noteLike | tei_model.ptrLike | tei_relatedItem | citedRange )*
   )
}

<binding>

<binding> (binding) contains a description of one binding, i.e. type of covering, boards, etc. applied to a manuscript or other object. [10.7.3.1. Binding Descriptions]
Modulemsdescription
Attributes
calendarindicates one or more systems or calendars to which the date represented by the content of this element belongs.
Deprecatedwill be removed on 2024-11-11
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
contemporary(contemporary) specifies whether or not the binding is contemporary with the majority of its contents
Status Optional
Datatype teidata.xTruthValue
Note

The value true indicates that the binding is contemporaneous with its contents; the value false that it is not. The value unknown should be used when the date of either binding or manuscript is unknown

Contained by
msdescription: bindingDesc
May contain
core: p
linking: ab
msdescription: decoNote
Example
<binding contemporary="true">
 <p>Contemporary blind stamped leather over wooden boards with evidence of a fore edge clasp
   closing to the back cover.</p>
</binding>
Example
<bindingDesc>
 <binding contemporary="false">
  <p>Quarter bound by the Phillipps' binder, Bretherton, with his sticker on the front
     pastedown.</p>
 </binding>
 <binding contemporary="false">
  <p>Rebound by an unknown 19th c. company; edges cropped and gilt.</p>
 </binding>
</bindingDesc>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="1">

  <classRef key="model.pLike"/>
  <elementRef key="condition"/>
  <elementRef key="decoNote"/>
 </alternate>
</content>
Schema Declaration
element binding
{
   tei_att.global.attributes,
   tei_att.datable.attributes,
   attribute calendar { list { teidata.pointer+ } }?,
   attribute contemporary { teidata.xTruthValue }?,
   ( tei_model.pLike | condition | tei_decoNote )+
}

<bindingDesc>

<bindingDesc> (binding description) describes the present and former bindings of a manuscript or other object, either as a series of paragraphs or as a series of distinct binding elements, one for each binding of the manuscript. [10.7.3.1. Binding Descriptions]
Modulemsdescription
Attributes
Member of
Contained by
msdescription: physDesc
May contain
core: p
linking: ab
msdescription: binding decoNote
Example
<bindingDesc>
 <p>Sewing not visible; tightly rebound over
   19th-cent. pasteboards, reusing panels of 16th-cent. brown leather with
   gilt tooling à la fanfare, Paris c. 1580-90, the centre of each
   cover inlaid with a 17th-cent. oval medallion of red morocco tooled in
   gilt (perhaps replacing the identifying mark of a previous owner); the
   spine similarly tooled, without raised bands or title-piece; coloured
   endbands; the edges of the leaves and boards gilt.Boxed.</p>
</bindingDesc>
Content model
<content>
 <alternate>
  <alternate maxOccurs="unbounded"
   minOccurs="1">

   <classRef key="model.pLike"/>
   <elementRef key="decoNote"/>
   <elementRef key="condition"/>
  </alternate>
  <elementRef key="binding"
   maxOccurs="unboundedminOccurs="1"/>

 </alternate>
</content>
Schema Declaration
element bindingDesc
{
   tei_att.global.attributes,
   ( ( tei_model.pLike | tei_decoNote | condition )+ | tei_binding+ )
}

<body>

<body> (text body) contains the whole body of a single unitary text, excluding any front or back matter. [4. Default Text Structure]
Moduletextstructure
Attributes
Contained by
textstructure: text
May contain
Example
<body>
 <l>Nu scylun hergan hefaenricaes uard</l>
 <l>metudæs maecti end his modgidanc</l>
 <l>uerc uuldurfadur sue he uundra gihuaes</l>
 <l>eci dryctin or astelidæ</l>
 <l>he aerist scop aelda barnum</l>
 <l>heben til hrofe haleg scepen.</l>
 <l>tha middungeard moncynnæs uard</l>
 <l>eci dryctin æfter tiadæ</l>
 <l>firum foldu frea allmectig</l>
 <trailer>primo cantauit Cædmon istud carmen.</trailer>
</body>
Content model
<content>
 <sequence>
  <classRef key="model.global"
   maxOccurs="unboundedminOccurs="0"/>

  <sequence minOccurs="0">
   <classRef key="model.divTop"/>
   <alternate maxOccurs="unbounded"
    minOccurs="0">

    <classRef key="model.global"/>
    <classRef key="model.divTop"/>
   </alternate>
  </sequence>
  <sequence minOccurs="0">
   <classRef key="model.divGenLike"/>
   <alternate maxOccurs="unbounded"
    minOccurs="0">

    <classRef key="model.global"/>
    <classRef key="model.divGenLike"/>
   </alternate>
  </sequence>
  <alternate>
   <sequence maxOccurs="unbounded"
    minOccurs="1">

    <classRef key="model.divLike"/>
    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <classRef key="model.global"/>
     <classRef key="model.divGenLike"/>
    </alternate>
   </sequence>
   <sequence maxOccurs="unbounded"
    minOccurs="1">

    <classRef key="model.div1Like"/>
    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <classRef key="model.global"/>
     <classRef key="model.divGenLike"/>
    </alternate>
   </sequence>
   <sequence>
    <sequence maxOccurs="unbounded"
     minOccurs="1">

     <alternate maxOccurs="1minOccurs="1">
      <elementRef key="schemaSpec"/>
      <classRef key="model.common"/>
     </alternate>
     <classRef key="model.global"
      maxOccurs="unboundedminOccurs="0"/>

    </sequence>
    <alternate minOccurs="0">
     <sequence maxOccurs="unbounded"
      minOccurs="1">

      <classRef key="model.divLike"/>
      <alternate maxOccurs="unbounded"
       minOccurs="0">

       <classRef key="model.global"/>
       <classRef key="model.divGenLike"/>
      </alternate>
     </sequence>
     <sequence maxOccurs="unbounded"
      minOccurs="1">

      <classRef key="model.div1Like"/>
      <alternate maxOccurs="unbounded"
       minOccurs="0">

       <classRef key="model.global"/>
       <classRef key="model.divGenLike"/>
      </alternate>
     </sequence>
    </alternate>
   </sequence>
  </alternate>
  <sequence maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.divBottom"/>
   <classRef key="model.global"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
 </sequence>
</content>
Schema Declaration
element body
{
   tei_att.global.attributes,
   tei_att.declaring.attributes,
   (
      tei_model.global*,
      ( tei_model.divTop, ( tei_model.global | tei_model.divTop )* )?,
      ( tei_model.divGenLike, ( tei_model.global | tei_model.divGenLike )* )?,
      (
         ( tei_model.divLike, ( tei_model.global | tei_model.divGenLike )* )+
       | ( tei_model.div1Like, ( tei_model.global | tei_model.divGenLike )* )+
       | (
            ( ( schemaSpec | tei_model.common ), tei_model.global* )+,
            (
               (
                  tei_model.divLike,
                  ( tei_model.global | tei_model.divGenLike )*
               )+
             | (
                  tei_model.div1Like,
                  ( tei_model.global | tei_model.divGenLike )*
               )+
            )?
         )
      ),
      ( tei_model.divBottom, tei_model.global* )*
   )
}

<byline>

<byline> (byline) contains the primary statement of responsibility given for a work on its title page or at the head or end of the work. [4.2.2. Openers and Closers 4.5. Front Matter]
Moduletextstructure
Attributes
Member of
Contained by
core: lg
msdescription: msItem
textcrit: rdg
May contain
Note

The byline on a title page may include either the name or a description for the document's author. Where the name is included, it may optionally be tagged using the docAuthor element.

Example
<byline>Written by a CITIZEN who continued all the
while in London. Never made publick before.</byline>
Example
<byline>Written from her own MEMORANDUMS</byline>
Example
<byline>By George Jones, Political Editor, in Washington</byline>
Example
<byline>BY
<docAuthor>THOMAS PHILIPOTT,</docAuthor>
Master of Arts,
(Somtimes)
Of Clare-Hall in Cambridge.</byline>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <elementRef key="docAuthor"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element byline
{
   tei_att.global.attributes,
   (
      text
    | tei_model.gLike
    | tei_model.phrase
    | tei_docAuthor
    | tei_model.global
   )*
}

<c>

<c> (character) represents a character. [17.1. Linguistic Segment Categories]
Moduleanalysis
Attributes
Member of
Contained by
May containCharacter data only
Note

Contains a single character, a <g> element, or a sequence of graphemes to be treated as a single character. The type attribute is used to indicate the function of this segmentation, taking values such as letter, punctuation, or digit etc.

Example
<phr>
 <c>M</c>
 <c>O</c>
 <c>A</c>
 <c>I</c>
 <w>doth</w>
 <w>sway</w>
 <w>my</w>
 <w>life</w>
</phr>
Content model
<content>
 <macroRef key="macro.xtext"/>
</content>
Schema Declaration
element c
{
   tei_att.global.attributes,
   tei_att.segLike.attributes,
   tei_att.typed.attributes,
   tei_att.notated.attributes,
   tei_macro.xtext
}

<change>

<change> (change) documents a change or set of changes made during the production of a source document, or during the revision of an electronic file. [2.6. The Revision Description 2.4.1. Creation 11.7. Identifying Changes and Revisions]
Moduleheader
Attributes
target(target) points to one or more elements that belong to this change.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Contained by
May contain
Note

The who attribute may be used to point to any other element, but will typically specify a <respStmt> or <person> element elsewhere in the header, identifying the person responsible for the change and their role in making it.

It is recommended that changes be recorded with the most recent first. The status attribute may be used to indicate the status of a document following the change documented.

Example
<titleStmt>
 <title> ... </title>
 <editor xml:id="LDB">Lou Burnard</editor>
 <respStmt xml:id="BZ">
  <resp>copy editing</resp>
  <name>Brett Zamir</name>
 </respStmt>
</titleStmt>
<!-- ... -->
<revisionDesc status="published">
 <change status="publicwhen="2008-02-02"
  who="#BZ">
Finished chapter 23</change>
 <change status="draftwhen="2008-01-02"
  who="#BZ">
Finished chapter 2</change>
 <change n="P2.2when="1991-12-21"
  who="#LDB">
Added examples to section 3</change>
 <change when="1991-11-11who="#MSM">Deleted chapter 10</change>
</revisionDesc>
Example
<profileDesc>
 <creation>
  <listChange>
   <change xml:id="DRAFT1">First draft in pencil</change>
   <change notBefore="1880-12-09"
    xml:id="DRAFT2">
First revision, mostly
       using green ink</change>
   <change notBefore="1881-02-13"
    xml:id="DRAFT3">
Final corrections as
       supplied to printer.</change>
  </listChange>
 </creation>
</profileDesc>
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element change
{
   tei_att.ascribed.attributes,
   tei_att.datable.attributes,
   tei_att.docStatus.attributes,
   tei_att.global.attributes,
   tei_att.typed.attributes,
   attribute target { list { teidata.pointer+ } }?,
   tei_macro.specialPara
}

<choice>

<choice> (choice) groups a number of alternative encodings for the same point in a text. [3.5. Simple Editorial Changes]
Modulecore
Attributes
Member of
Contained by
May contain
Note

Because the children of a choice element all represent alternative ways of encoding the same sequence, it is natural to think of them as mutually exclusive. However, there may be cases where a full representation of a text requires the alternative encodings to be considered as parallel.

Note also that choice elements may self-nest.

Where the purpose of an encoding is to record multiple witnesses of a single work, rather than to identify multiple possible encoding decisions at a given point, the app element and associated elements discussed in section 12.1. The Apparatus Entry, Readings, and Witnesses should be preferred.

Example

An American encoding of Gulliver's Travels which retains the British spelling but also provides a version regularized to American spelling might be encoded as follows.

<p>Lastly, That, upon his solemn oath to observe all the above
articles, the said man-mountain shall have a daily allowance of
meat and drink sufficient for the support of <choice>
  <sic>1724</sic>
  <corr>1728</corr>
 </choice> of our subjects,
with free access to our royal person, and other marks of our
<choice>
  <orig>favour</orig>
  <reg>favor</reg>
 </choice>.</p>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="2">

  <classRef key="model.choicePart"/>
  <elementRef key="choice"/>
 </alternate>
</content>
Schema Declaration
element choice
{
   tei_att.global.attributes,
   ( tei_model.choicePart | tei_choice )+
}

<cit>

<cit> (cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example. [3.3.3. Quotation 4.3.1. Grouped Texts 9.3.5.1. Examples]
Modulecore
Attributes
Member of
Contained by
May contain
analysis: pc span
dictionaries: xr
linking: anchor
msdescription: msDesc
textcrit: app witDetail
Example
<cit>
 <q>and the breath of the whale is frequently attended with such an insupportable smell,
   as to bring on disorder of the brain.</q>
 <bibl>Ulloa's South America</bibl>
</cit>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="1">

  <classRef key="model.biblLike"/>
  <classRef key="model.egLike"/>
  <classRef key="model.entryPart"/>
  <classRef key="model.global"/>
  <classRef key="model.graphicLike"/>
  <classRef key="model.ptrLike"/>
  <classRef key="model.attributable"/>
  <elementRef key="pc"/>
  <elementRef key="q"/>
 </alternate>
</content>
Schema Declaration
element cit
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   (
      tei_model.biblLike
    | tei_model.egLike
    | tei_model.entryPart
    | tei_model.global
    | tei_model.graphicLike
    | tei_model.ptrLike
    | tei_model.attributable
    | tei_pc
    | tei_q
   )+
}

<closer>

<closer> (closer) groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter. [4.2.2. Openers and Closers 4.2. Elements Common to All Divisions]
Moduletextstructure
Attributes
Member of
Contained by
core: lg
textstructure: back body div front
May contain
Example
<div type="letter">
 <p> perhaps you will favour me with a sight of it when convenient.</p>
 <closer>
  <salute>I remain, &amp;c. &amp;c.</salute>
  <signed>H. Colburn</signed>
 </closer>
</div>
Example
<div type="chapter">
 <p>
<!-- ... --> and his heart was going like mad and yes I said yes I will Yes.</p>
 <closer>
  <dateline>
   <name type="place">Trieste-Zürich-Paris,</name>
   <date>1914–1921</date>
  </dateline>
 </closer>
</div>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <elementRef key="signed"/>
  <elementRef key="dateline"/>
  <elementRef key="salute"/>
  <classRef key="model.phrase"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element closer
{
   tei_att.global.attributes,
   tei_att.written.attributes,
   (
      text
    | tei_model.gLike
    | tei_signed
    | tei_dateline
    | tei_salute
    | tei_model.phrase
    | tei_model.global
   )*
}

<collation>

<collation> (collation) contains a description of how the leaves, bifolia, or similar objects are physically arranged. [10.7.1. Object Description]
Modulemsdescription
Attributes
Contained by
msdescription: supportDesc
May contain
Example
<collation>The written leaves preceded by an original flyleaf,
conjoint with the pastedown.</collation>
Example
<collation>
 <p>
  <formula>1-5.8 6.6 (catchword, f. 46, does not match following text)
     7-8.8 9.10, 11.2 (through f. 82) 12-14.8 15.8(-7)</formula>
  <catchwords>Catchwords are written horizontally in center
     or towards the right lower margin in various manners:
     in red ink for quires 1-6 (which are also signed in red
     ink with letters of the alphabet and arabic numerals);
     quires 7-9 in ink of text within yellow decorated frames;
     quire 10 in red decorated frame; quire 12 in ink of text;
     quire 13 with red decorative slashes; quire 14 added in
     cursive hand.</catchwords>
 </p>
</collation>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element collation { tei_att.global.attributes, tei_macro.specialPara }

<corr>

<corr> (correction) contains the correct form of a passage apparently erroneous in the copy text. [3.5.1. Apparent Errors]
Modulecore
Attributes
Member of
Contained by
May contain
Example

If all that is desired is to call attention to the fact that the copy text has been corrected, corr may be used alone:

I don't know,
Juan. It's so far in the past now — how <corr>can we</corr> prove
or disprove anyone's theories?
Example

It is also possible, using the choice and sic elements, to provide an uncorrected reading:

I don't know, Juan. It's so far in the past now —
how <choice>
 <sic>we can</sic>
 <corr>can we</corr>
</choice> prove or
disprove anyone's theories?
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element corr
{
   tei_att.global.attributes,
   tei_att.editLike.attributes,
   tei_att.typed.attributes,
   tei_macro.paraContent
}

<correspAction>

<correspAction> (correspondence action) contains a structured description of the place, the name of a person/organization and the date related to the sending/receiving of a message or any other action related to the correspondence. [2.4.6. Correspondence Description]
Moduleheader
Attributes
typedescribes the nature of the action.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Suggested values include:
sent
information concerning the sending or dispatch of a message.
received
information concerning the receipt of a message.
transmitted
information concerning the transmission of a message, i.e. between the dispatch and the next receipt, redirect or forwarding.
redirected
information concerning the redirection of an unread message.
forwarded
information concerning the forwarding of a message.
Member of
Contained by
header: correspDesc
May contain
Example
<correspAction type="sent">
 <persName>Adelbert von Chamisso</persName>
 <settlement>Vertus</settlement>
 <date when="1807-01-29"/>
</correspAction>
Content model
<content>
 <alternate>
  <classRef key="model.correspActionPart"
   maxOccurs="unboundedminOccurs="1"/>

  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

 </alternate>
</content>
Schema Declaration
element correspAction
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   tei_att.sortable.attributes,
   attribute type
   {
      "sent"
    | "received"
    | "transmitted"
    | "redirected"
    | "forwarded"
    | teidata.enumerated
   }?,
   ( tei_model.correspActionPart+ | tei_model.pLike+ )
}

<correspDesc>

<correspDesc> (correspondence description) contains a description of the actions related to one act of correspondence. [2.4.6. Correspondence Description]
Moduleheader
Attributes
Member of
Contained by
header: profileDesc
May contain
core: note p
linking: ab
Example
<correspDesc>
 <correspAction type="sent">
  <persName>Carl Maria von Weber</persName>
  <settlement>Dresden</settlement>
  <date when="1817-06-23">23 June 1817</date>
 </correspAction>
 <correspAction type="received">
  <persName>Caroline Brandt</persName>
  <settlement>Prag</settlement>
 </correspAction>
 <correspContext>
  <ref target="http://www.weber-gesamtausgabe.de/A041209"
   type="prev">
Previous letter of
  <persName>Carl Maria von Weber</persName>
     to <persName>Caroline Brandt</persName>:
  <date from="1817-06-19to="1817-06-20">June 19/20, 1817</date>
  </ref>
  <ref target="http://www.weber-gesamtausgabe.de/A041217"
   type="next">
Next letter of
  <persName>Carl Maria von Weber</persName> to
  <persName>Caroline Brandt</persName>:
  <date when="1817-06-27">June 27, 1817</date>
  </ref>
 </correspContext>
</correspDesc>
Content model
<content>
 <alternate>
  <classRef key="model.correspDescPart"
   maxOccurs="unboundedminOccurs="1"/>

  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

 </alternate>
</content>
Schema Declaration
element correspDesc
{
   tei_att.declarable.attributes,
   tei_att.canonical.attributes,
   tei_att.global.attributes,
   tei_att.typed.attributes,
   ( tei_model.correspDescPart+ | tei_model.pLike+ )
}

<creation>

<creation> (creation) contains information about the creation of a text. [2.4.1. Creation 2.4. The Profile Description]
Moduleheader
Attributes
Member of
Contained by
header: profileDesc
May contain
Note

The creation element may be used to record details of a text's creation, e.g. the date and place it was composed, if these are of interest.

It may also contain a more structured account of the various stages or revisions associated with the evolution of a text; this should be encoded using the listChange element. It should not be confused with the publicationStmt element, which records date and place of publication.

Example
<creation>
 <date>Before 1987</date>
</creation>
Example
<creation>
 <date when="1988-07-10">10 July 1988</date>
</creation>
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.limitedPhrase"/>
  <elementRef key="listChange"/>
 </alternate>
</content>
Schema Declaration
element creation
{
   tei_att.global.attributes,
   tei_att.datable.attributes,
   ( text | tei_model.limitedPhrase | tei_listChange )*
}

<date>

<date> (date) contains a date in any format. [3.6.4. Dates and Times 2.2.4. Publication, Distribution, Licensing, etc. 2.6. The Revision Description 3.12.2.4. Imprint, Size of a Document, and Reprint Information 15.2.3. The Setting Description 13.4. Dates]
Modulecore
Attributes
Member of
Contained by
May contain
Example
<date when="1980-02">early February 1980</date>
Example
Given on the <date when="1977-06-12">Twelfth Day
of June in the Year of Our Lord One Thousand Nine Hundred and Seventy-seven of the Republic
the Two Hundredth and first and of the University the Eighty-Sixth.</date>
Example
<date when="1990-09">September 1990</date>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element date
{
   tei_att.global.attributes,
   tei_att.canonical.attributes,
   tei_att.datable.attributes,
   tei_att.editLike.attributes,
   tei_att.dimensions.attributes,
   tei_att.typed.attributes,
   ( text | tei_model.gLike | tei_model.phrase | tei_model.global )*
}

<dateline>

<dateline> (dateline) contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer. [4.2.2. Openers and Closers]
Moduletextstructure
Attributes
Member of
Contained by
core: lg
textstructure: back body closer div front opener
May contain
Example
<dateline>Walden, this 29. of August 1592</dateline>
Example
<div type="chapter">
 <p>
<!-- ... --> and his heart was going like mad and yes I said yes I will Yes.</p>
 <closer>
  <dateline>
   <name type="place">Trieste-Zürich-Paris,</name>
   <date>1914–1921</date>
  </dateline>
 </closer>
</div>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <classRef key="model.global"/>
  <elementRef key="docDate"/>
 </alternate>
</content>
Schema Declaration
element dateline
{
   tei_att.global.attributes,
   ( text | tei_model.gLike | tei_model.phrase | tei_model.global | docDate )*
}

<decoNote>

<decoNote> (note on decoration) contains a note describing either a decorative component of a manuscript or other object, or a fairly homogenous class of such components. [10.7.3. Bindings, Seals, and Additional Material]
Modulemsdescription
Attributes
Member of
Contained by
core: bibl
May contain
Example
<decoDesc>
 <decoNote type="initial">
  <p>The start of each book of the Bible with
     a 10-line historiated illuminated initial;
     prefaces decorated with 6-line blue initials
     with red penwork flourishing; chapters marked by
     3-line plain red initials; verses with 1-line initials,
     alternately blue or red.</p>
 </decoNote>
</decoDesc>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element decoNote
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_macro.specialPara
}

<del>

<del> (deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, or a previous annotator or corrector. [3.5.3. Additions, Deletions, and Omissions]
Modulecore
Attributes
Member of
Contained by
May contain
Note

This element should be used for deletion of shorter sequences of text, typically single words or phrases. The delSpan element should be used for longer sequences of text, for those containing structural subdivisions, and for those containing overlapping additions and deletions.

The text deleted must be at least partially legible in order for the encoder to be able to transcribe it (unless it is restored in a supplied tag). Illegible or lost text within a deletion may be marked using the gap tag to signal that text is present but has not been transcribed, or is no longer visible. Attributes on the gap element may be used to indicate how much text is omitted, the reason for omitting it, etc. If text is not fully legible, the unclear element (available when using the additional tagset for transcription of primary sources) should be used to signal the areas of text which cannot be read with confidence in a similar way.

Degrees of uncertainty over what can still be read, or whether a deletion was intended may be indicated by use of the <certainty> element (see 21. Certainty, Precision, and Responsibility).

There is a clear distinction in the TEI between del and <surplus> on the one hand and gap or unclear on the other. del indicates a deletion present in the source being transcribed, which states the author's or a later scribe's intent to cancel or remove text. <surplus> indicates material present in the source being transcribed which should have been so deleted, but which is not in fact. gap or unclear, by contrast, signal an editor's or encoder's decision to omit something or their inability to read the source text. See sections 11.3.1.7. Text Omitted from or Supplied in the Transcription and 11.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination for the relationship between these and other related elements used in detailed transcription.

Example
<l>
 <del rend="overtyped">Mein</del> Frisch <del rend="overstriketype="primary">schwebt</del>
weht der Wind
</l>
Example
<del rend="overstrike">
 <gap quantity="5reason="illegible"
  unit="character"/>

</del>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element del
{
   tei_att.global.attributes,
   tei_att.transcriptional.attributes,
   tei_att.typed.attributes,
   tei_att.dimensions.attributes,
   tei_macro.paraContent
}

<delSpan>

<delSpan> (deleted span of text) marks the beginning of a longer sequence of text deleted, marked as deleted, or otherwise signaled as superfluous or spurious by an author, scribe, annotator, or corrector. [11.3.1.4. Additions and Deletions]
Moduletranscr
Attributes
Member of
Contained by
May containEmpty element
Note

Both the beginning and ending of the deleted sequence must be marked: the beginning by the delSpan element, the ending by the target of the spanTo attribute.

The text deleted must be at least partially legible, in order for the encoder to be able to transcribe it. If it is not legible at all, the delSpan tag should not be used. Rather, the gap tag should be employed to signal that text cannot be transcribed, with the value of the reason attribute giving the cause for the omission from the transcription as deletion. If it is not fully legible, the unclear element should be used to signal the areas of text which cannot be read with confidence. See further sections 11.3.1.7. Text Omitted from or Supplied in the Transcription and, for the close association of the delSpan tag with the gap, <damage>, unclear and supplied elements, 11.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination.

The delSpan tag should not be used for deletions made by editors or encoders. In these cases, either the corr tag or the gap tag should be used.

Example
<p>Paragraph partially deleted. This is the undeleted
portion <delSpan spanTo="#a23"/>and this the deleted
portion of the paragraph.</p>
<p>Paragraph deleted together with adjacent material.</p>
<p>Second fully deleted paragraph.</p>
<p>Paragraph partially deleted; in the middle of this
paragraph the deletion ends and the anchor point marks
the resumption <anchor xml:id="a23"/> of the text. ...</p>
Schematron

<sch:assert test="@spanTo">The @spanTo attribute of <sch:name/> is required.</sch:assert>
Schematron

<sch:assert test="@spanTo">L'attribut spanTo est requis.</sch:assert>
Content model
<content>
 <empty/>
</content>
Schema Declaration
element delSpan
{
   tei_att.global.attributes,
   tei_att.transcriptional.attributes,
   tei_att.typed.attributes,
   tei_att.spanning.attributes,
   tei_att.dimensions.attributes,
   empty
}

<desc>

<desc> (description) contains a short description of the purpose, function, or use of its parent element, or when the parent is a documentation element, describes or defines the object being documented. [22.4.1. Description of Components]
Modulecore
Attributes
typecharacterizes the element in some sense, using any convenient classification scheme or typology.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Suggested values include:
deprecationInfo
(deprecation information) This element describes why or how its parent element is being deprecated, typically including recommendations for alternate encoding.
<dataSpec ident="teidata.point"
 module="teivalidUntil="2050-02-25">

 <desc type="deprecationInfo"
  versionDate="2018-09-14xml:lang="en">
Several standards bodies, including NIST in the USA,
   strongly recommend against ending the representation of a number
   with a decimal point. So instead of <q>3.</q> use either <q>3</q>
   or <q>3.0</q>.</desc>
<!-- ... -->
</dataSpec>
Member of
Contained by
May contain
Note

When used in a specification element such as <elementSpec>, TEI convention requires that this be expressed as a finite clause, begining with an active verb.

Example

Example of a desc element inside a documentation element.

<dataSpec ident="teidata.point"
 module="tei">

 <desc versionDate="2010-10-17"
  xml:lang="en">
defines the data type used to express a point in cartesian space.</desc>
 <content>
  <dataRef name="token"
   restriction="(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)"/>

 </content>
<!-- ... -->
</dataSpec>
Example

Example of a desc element in a non-documentation element.

<place xml:id="KERG2">
 <placeName>Kerguelen Islands</placeName>
<!-- ... -->
 <terrain>
  <desc>antarctic tundra</desc>
 </terrain>
<!-- ... -->
</place>
SchematronA desc with a type of deprecationInfo should only occur when its parent element is being deprecated. Furthermore, it should always occur in an element that is being deprecated when desc is a valid child of that element.

<sch:rule context="tei:desc[ @type eq 'deprecationInfo']">
<sch:assert test="../@validUntil">Information about a
deprecation should only be present in a specification element
that is being deprecated: that is, only an element that has a
@validUntil attribute should have a child <desc
type="deprecationInfo">.</sch:assert>
</sch:rule>
Content model
<content>
 <macroRef key="macro.limitedContent"/>
</content>
Schema Declaration
element desc
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   attribute type { "deprecationInfo" | teidata.enumerated }?,
   tei_macro.limitedContent
}

<distributor>

<distributor> (distributor) supplies the name of a person or other agency responsible for the distribution of a text. [2.2.4. Publication, Distribution, Licensing, etc.]
Moduleheader
Attributes
Member of
Contained by
May contain
Example
<distributor>Oxford Text Archive</distributor>
<distributor>Redwood and Burn Ltd</distributor>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element distributor
{
   tei_att.global.attributes,
   tei_att.canonical.attributes,
   tei_macro.phraseSeq
}

<div>

<div> (text division) contains a subdivision of the front, body, or back of a text. [4.1. Divisions of the Body]
Moduletextstructure
Attributes
Member of
Contained by
textcrit: rdg
textstructure: back body div front
May contain
Example
<body>
 <div type="part">
  <head>Fallacies of Authority</head>
  <p>The subject of which is Authority in various shapes, and the object, to repress all
     exercise of the reasoning faculty.</p>
  <div n="1type="chapter">
   <head>The Nature of Authority</head>
   <p>With reference to any proposed measures having for their object the greatest
       happiness of the greatest number [...]</p>
   <div n="1.1type="section">
    <head>Analysis of Authority</head>
    <p>What on any given occasion is the legitimate weight or influence to be attached to
         authority [...] </p>
   </div>
   <div n="1.2type="section">
    <head>Appeal to Authority, in What Cases Fallacious.</head>
    <p>Reference to authority is open to the charge of fallacy when [...] </p>
   </div>
  </div>
 </div>
</body>
Schematron

<sch:report test="(ancestor::tei:l or ancestor::tei:lg) and not(ancestor::tei:floatingText)"> Abstract model violation: Lines may not contain higher-level structural elements such as div, unless div is a descendant of floatingText.
</sch:report>
Schematron

<sch:report test="(ancestor::tei:p or ancestor::tei:ab) and not(ancestor::tei:floatingText)"> Abstract model violation: p and ab may not contain higher-level structural elements such as div, unless div is a descendant of floatingText.
</sch:report>
Content model
<content>
 <sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.divTop"/>
   <classRef key="model.global"/>
  </alternate>
  <sequence minOccurs="0">
   <alternate>
    <sequence maxOccurs="unbounded"
     minOccurs="1">

     <alternate>
      <classRef key="model.divLike"/>
      <classRef key="model.divGenLike"/>
     </alternate>
     <classRef key="model.global"
      maxOccurs="unboundedminOccurs="0"/>

    </sequence>
    <sequence>
     <sequence maxOccurs="unbounded"
      minOccurs="1">

      <alternate maxOccurs="1"
       minOccurs="1">

       <elementRef key="schemaSpec"/>
       <classRef key="model.common"/>
      </alternate>
      <classRef key="model.global"
       maxOccurs="unboundedminOccurs="0"/>

     </sequence>
     <sequence maxOccurs="unbounded"
      minOccurs="0">

      <alternate>
       <classRef key="model.divLike"/>
       <classRef key="model.divGenLike"/>
      </alternate>
      <classRef key="model.global"
       maxOccurs="unboundedminOccurs="0"/>

     </sequence>
    </sequence>
   </alternate>
   <sequence maxOccurs="unbounded"
    minOccurs="0">

    <classRef key="model.divBottom"/>
    <classRef key="model.global"
     maxOccurs="unboundedminOccurs="0"/>

   </sequence>
  </sequence>
 </sequence>
</content>
Schema Declaration
element div
{
   tei_att.global.attributes,
   tei_att.divLike.attributes,
   tei_att.typed.attributes,
   tei_att.declaring.attributes,
   tei_att.written.attributes,
   (
      ( tei_model.divTop | tei_model.global )*,
      (
         (
            ( ( tei_model.divLike | tei_model.divGenLike ), tei_model.global* )+
          | (
               ( ( schemaSpec | tei_model.common ), tei_model.global* )+,
               (
                  ( tei_model.divLike | tei_model.divGenLike ),
                  tei_model.global*
               )*
            )
         ),
         ( tei_model.divBottom, tei_model.global* )*
      )?
   )
}

<docAuthor>

<docAuthor> (document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline). [4.6. Title Pages]
Moduletextstructure
Attributes
Member of
Contained by
core: lg
msdescription: msItem
textcrit: rdg
May contain
Note

The document author's name often occurs within a byline, but the docAuthor element may be used whether the byline element is used or not. It should be used only for the author(s) of the entire document, not for author(s) of any subset or part of it. (Attributions of authorship of a subset or part of the document, for example of a chapter in a textbook or an article in a newspaper, may be encoded with byline without docAuthor.)

Example
<titlePage>
 <docTitle>
  <titlePart>Travels into Several Remote Nations of the World, in Four
     Parts.</titlePart>
 </docTitle>
 <byline> By <docAuthor>Lemuel Gulliver</docAuthor>, First a Surgeon,
   and then a Captain of several Ships</byline>
</titlePage>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element docAuthor
{
   tei_att.global.attributes,
   tei_att.canonical.attributes,
   tei_macro.phraseSeq
}

<docImprint>

<docImprint> (document imprint) contains the imprint statement (place and date of publication, publisher name), as given (usually) at the foot of a title page. [4.6. Title Pages]
Moduletextstructure
Attributes
Member of
Contained by
msdescription: msItem
textcrit: rdg
textstructure: back front titlePage
May contain
Note

Cf. the imprint element of bibliographic citations. As with title, author, and editions, the shorter name is reserved for the element likely to be used more often.

Example
<docImprint>Oxford, Clarendon Press, 1987</docImprint>
Imprints may be somewhat more complex:
<docImprint>
 <pubPlace>London</pubPlace>
Printed for <name>E. Nutt</name>,
at
<pubPlace>Royal Exchange</pubPlace>;
<name>J. Roberts</name> in
<pubPlace>wick-Lane</pubPlace>;
<name>A. Dodd</name> without
<pubPlace>Temple-Bar</pubPlace>;
and <name>J. Graves</name> in
<pubPlace>St. James's-street.</pubPlace>
 <date>1722.</date>
</docImprint>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <elementRef key="pubPlace"/>
  <elementRef key="docDate"/>
  <elementRef key="publisher"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element docImprint
{
   tei_att.global.attributes,
   (
      text
    | tei_model.gLike
    | tei_model.phrase
    | tei_pubPlace
    | docDate
    | tei_publisher
    | tei_model.global
   )*
}

<docTitle>

<docTitle> (document title) contains the title of a document, including all its constituents, as given on a title page. [4.6. Title Pages]
Moduletextstructure
Attributes
Member of
Contained by
msdescription: msItem
textcrit: rdg
textstructure: back front titlePage
May contain
analysis: span
linking: anchor
textcrit: app witDetail
textstructure: titlePart
Example
<docTitle>
 <titlePart type="main">The DUNCIAD, VARIOURVM.</titlePart>
 <titlePart type="sub">WITH THE PROLEGOMENA of SCRIBLERUS.</titlePart>
</docTitle>
Content model
<content>
 <sequence>
  <classRef key="model.global"
   maxOccurs="unboundedminOccurs="0"/>

  <sequence maxOccurs="unbounded"
   minOccurs="1">

   <elementRef key="titlePart"/>
   <classRef key="model.global"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
 </sequence>
</content>
Schema Declaration
element docTitle
{
   tei_att.global.attributes,
   tei_att.canonical.attributes,
   ( tei_model.global*, ( tei_titlePart, tei_model.global* )+ )
}

<editor>

<editor> contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc. [3.12.2.2. Titles, Authors, and Editors]
Modulecore
Attributes
Member of
Contained by
core: bibl monogr
header: titleStmt
May contain
Note

A consistent format should be adopted.

Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use generally recognized authority lists for the exact form of personal names.

Example
<editor role="Technical_Editor">Ron Van den Branden</editor>
<editor role="Editor-in-Chief">John Walsh</editor>
<editor role="Managing_Editor">Anne Baillot</editor>
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element editor
{
   tei_att.global.attributes,
   tei_att.naming.attributes,
   tei_att.datable.attributes,
   tei_macro.phraseSeq
}

<editorialDecl>

<editorialDecl> (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. [2.3.3. The Editorial Practices Declaration 2.3. The Encoding Description 15.3.2. Declarable Elements]
Moduleheader
Attributes
Member of
Contained by
header: encodingDesc
May contain
core: p
linking: ab
Example
<editorialDecl>
 <p>All words converted to Modern American spelling using
   Websters 9th Collegiate dictionary</p>
 <p>All opening quotation marks converted to “ all closing
   quotation marks converted to &amp;cdq;.</p>
</editorialDecl>
Content model
<content>
 <classRef key="model.pLike"
  maxOccurs="unboundedminOccurs="1"/>

</content>
Schema Declaration
element editorialDecl
{
   tei_att.global.attributes,
   tei_att.declarable.attributes,
   tei_model.pLike+
}

<emph>

<emph> (emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect. [3.3.2.2. Emphatic Words and Phrases 3.3.2. Emphasis, Foreign Words, and Unusual Language]
Modulecore
Attributes
Member of
Contained by
May contain
Example
You took the car and did <emph>what</emph>?!!
Example
<q>What it all comes to is this,</q> he said.
<q>
 <emph>What
   does Christopher Robin do in the morning nowadays?</emph>
</q>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element emph { tei_att.global.attributes, tei_macro.paraContent }

<encodingDesc>

<encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. [2.3. The Encoding Description 2.1.1. The TEI Header and Its Components]
Moduleheader
Attributes
Member of
Contained by
header: teiHeader
May contain
core: p
linking: ab
textcrit: variantEncoding
Example
<encodingDesc>
 <p>Basic encoding, capturing lexical information only. All
   hyphenation, punctuation, and variant spellings normalized. No
   formatting or layout information preserved.</p>
</encodingDesc>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="1">

  <classRef key="model.encodingDescPart"/>
  <classRef key="model.pLike"/>
 </alternate>
</content>
Schema Declaration
element encodingDesc
{
   tei_att.global.attributes,
   ( tei_model.encodingDescPart | tei_model.pLike )+
}

<epigraph>

<epigraph> (epigraph) contains a quotation, anonymous or attributed, appearing at the start or end of a section or on a title page. [4.2.3. Arguments, Epigraphs, and Postscripts 4.2. Elements Common to All Divisions 4.6. Title Pages]
Moduletextstructure
Attributes
Member of
Contained by
core: lg
msdescription: msItem
textcrit: rdg
May contain
Example
<epigraph xml:lang="la">
 <cit>
  <bibl>Lucret.</bibl>
  <quote>
   <l part="F">petere inde coronam,</l>
   <l>Vnde prius nulli velarint tempora Musae.</l>
  </quote>
 </cit>
</epigraph>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <classRef key="model.common"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element epigraph
{
   tei_att.global.attributes,
   ( tei_model.common | tei_model.global )*
}

<ex>

<ex> (editorial expansion) contains a sequence of letters added by an editor or transcriber when expanding an abbreviation. [11.3.1.2. Abbreviation and Expansion]
Moduletranscr
Attributes
Member of
Contained by
May containCharacter data only
Example
The address is Southmoor <choice>
 <expan>R<ex>oa</ex>d</expan>
 <abbr>Rd</abbr>
</choice>
Content model
<content>
 <macroRef key="macro.xtext"/>
</content>
Schema Declaration
element ex
{
   tei_att.global.attributes,
   tei_att.editLike.attributes,
   tei_att.dimensions.attributes,
   tei_macro.xtext
}

<expan>

<expan> (expansion) contains the expansion of an abbreviation. [3.6.5. Abbreviations and Their Expansions]
Modulecore
Attributes
Member of
Contained by
May contain
Note

The content of this element should be the expanded abbreviation, usually (but not always) a complete word or phrase. The ex element provided by the transcr module may be used to mark up sequences of letters supplied within such an expansion.

If abbreviations are expanded silently, this practice should be documented in the editorialDecl, either with a normalization element or a p.

Example
The address is Southmoor
<choice>
 <expan>Road</expan>
 <abbr>Rd</abbr>
</choice>
Example
<choice xml:lang="la">
 <abbr>Imp</abbr>
 <expan>Imp<ex>erator</ex>
 </expan>
</choice>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element expan
{
   tei_att.global.attributes,
   tei_att.editLike.attributes,
   tei_macro.phraseSeq
}

<extent>

<extent> (extent) describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units. [2.2.3. Type and Extent of File 2.2. The File Description 3.12.2.4. Imprint, Size of a Document, and Reprint Information 10.7.1. Object Description]
Moduleheader
Attributes
Member of
Contained by
core: bibl monogr
header: fileDesc
msdescription: supportDesc
May contain
Example
<extent>3200 sentences</extent>
<extent>between 10 and 20 Mb</extent>
<extent>ten 3.5 inch high density diskettes</extent>
Example

The measure element may be used to supply normalized or machine tractable versions of the size or sizes concerned.

<extent>
 <measure quantity="4.2unit="MiB">About four megabytes</measure>
 <measure quantity="245unit="pages">245 pages of source
   material</measure>
</extent>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element extent { tei_att.global.attributes, tei_macro.phraseSeq }

<facsimile>

<facsimile> contains a representation of some written source in the form of a set of images rather than as transcribed or encoded text. [11.1. Digital Facsimiles]
Moduletranscr
Attributes
Member of
Contained by
textstructure: TEI
May contain
core: graphic
textstructure: back front
transcr: surface
Example
<facsimile>
 <graphic url="page1.png"/>
 <surface>
  <graphic url="page2-highRes.png"/>
  <graphic url="page2-lowRes.png"/>
 </surface>
 <graphic url="page3.png"/>
 <graphic url="page4.png"/>
</facsimile>
Example
<facsimile>
 <surface lrx="200lry="300ulx="0uly="0">
  <graphic url="Bovelles-49r.png"/>
 </surface>
</facsimile>
Schematron

<sch:rule context="tei:facsimile//tei:line | tei:facsimile//tei:zone">
<sch:report test="child::text()[ normalize-space(.) ne '']"> A facsimile element represents a text with images, thus
transcribed text should not be present within it.
</sch:report>
</sch:rule>
Content model
<content>
 <sequence>
  <elementRef key="frontminOccurs="0"/>
  <alternate maxOccurs="unbounded"
   minOccurs="1">

   <classRef key="model.graphicLike"/>
   <elementRef key="surface"/>
   <elementRef key="surfaceGrp"/>
  </alternate>
  <elementRef key="backminOccurs="0"/>
 </sequence>
</content>
Schema Declaration
element facsimile
{
   tei_att.global.attributes,
   tei_att.declaring.attributes,
   (
      tei_front?,
      ( tei_model.graphicLike | tei_surface | surfaceGrp )+,
      tei_back?
   )
}

<fileDesc>

<fileDesc> (file description) contains a full bibliographic description of an electronic file. [2.2. The File Description 2.1.1. The TEI Header and Its Components]
Moduleheader
Attributes
Contained by
header: teiHeader
May contain
Note

The major source of information for those seeking to create a catalogue entry or bibliographic citation for an electronic file. As such, it provides a title and statements of responsibility together with details of the publication or distribution of the file, of any series to which it belongs, and detailed bibliographic notes for matters not addressed elsewhere in the header. It also contains a full bibliographic description for the source or sources from which the electronic text was derived.

Example
<fileDesc>
 <titleStmt>
  <title>The shortest possible TEI document</title>
 </titleStmt>
 <publicationStmt>
  <p>Distributed as part of TEI P5</p>
 </publicationStmt>
 <sourceDesc>
  <p>No print source exists: this is an original digital text</p>
 </sourceDesc>
</fileDesc>
Content model
<content>
 <sequence>
  <sequence>
   <elementRef key="titleStmt"/>
   <elementRef key="editionStmt"
    minOccurs="0"/>

   <elementRef key="extentminOccurs="0"/>
   <elementRef key="publicationStmt"/>
   <elementRef key="seriesStmt"
    maxOccurs="unboundedminOccurs="0"/>

   <elementRef key="notesStmt"
    minOccurs="0"/>

  </sequence>
  <elementRef key="sourceDesc"
   maxOccurs="unboundedminOccurs="1"/>

 </sequence>
</content>
Schema Declaration
element fileDesc
{
   tei_att.global.attributes,
   (
      (
         tei_titleStmt,
         editionStmt?,
         tei_extent?,
         tei_publicationStmt,
         seriesStmt*,
         tei_notesStmt?
      ),
      tei_sourceDesc+
   )
}

<filiation>

<filiation> (filiation) contains information concerning the manuscript or other object's filiation, i.e. its relationship to other surviving manuscripts or other objects of the same text or contents, its protographs, antigraphs and apographs. [10.6.1. The msItem and msItemStruct Elements]
Modulemsdescription
Attributes
Contained by
msdescription: msItemStruct
May contain
Example
<msContents>
 <msItem>
  <title>Beljakovski sbornik</title>
  <filiation type="protograph">Bulgarian</filiation>
  <filiation type="antigraph">Middle Bulgarian</filiation>
  <filiation type="apograph">
   <ref target="#DN17">Dujchev N 17</ref>
  </filiation>
 </msItem>
</msContents>
<!-- ... -->
<msDesc xml:id="DN17">
 <msIdentifier>
  <settlement>Faraway</settlement>
 </msIdentifier>
<!-- ... -->
</msDesc>

In this example, the reference to ‘Dujchev N17’ includes a link to some other manuscript description which has the identifier DN17.

Example
<msItem>
 <title>Guan-ben</title>
 <filiation>
  <p>The "Guan-ben" was widely current among mathematicians in the
     Qing dynasty, and "Zhao Qimei version" was also read. It is
     therefore difficult to know the correct filiation path to follow.
     The study of this era is much indebted to Li Di. We explain the
     outline of his conclusion here. Kong Guangsen
     (1752-1786)(17) was from the same town as Dai Zhen, so he obtained
     "Guan-ben" from him and studied it(18). Li Huang (d. 1811)
     (19) took part in editing Si Ku Quan Shu, so he must have had
     "Guan-ben". Then Zhang Dunren (1754-1834) obtained this version,
     and studied "Da Yan Zong Shu Shu" (The General Dayan
     Computation). He wrote Jiu Yi Suan Shu (Mathematics
     Searching for One, 1803) based on this version of Shu Xue Jiu
     Zhang (20).</p>
  <p>One of the most important persons in restoring our knowledge
     concerning the filiation of these books was Li Rui (1768(21)
     -1817)(see his biography). ... only two volumes remain of this
     manuscript, as far as chapter 6 (chapter 3 part 2) p.13, that is,
     question 2 of "Huan Tian San Ji" (square of three loops),
     which later has been lost.</p>
 </filiation>
</msItem>
<!--http://www2.nkfust.edu.tw/~jochi/ed1.htm-->
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element filiation
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_macro.specialPara
}

<foliation>

<foliation> (foliation) describes the numbering system or systems used to count the leaves or pages in a codex or similar object. [10.7.1.4. Foliation]
Modulemsdescription
Attributes
Contained by
msdescription: supportDesc
May contain
Example
<foliation>Contemporary foliation in red
roman numerals in the centre
of the outer margin.</foliation>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element foliation { tei_att.global.attributes, tei_macro.specialPara }

<foreign>

<foreign> (foreign) identifies a word or phrase as belonging to some language other than that of the surrounding text. [3.3.2.1. Foreign Words or Expressions]
Modulecore
Attributes
Member of
Contained by
May contain
Note

The global xml:lang attribute should be supplied for this element to identify the language of the word or phrase marked. As elsewhere, its value should be a language tag as defined in 6.1. Language Identification.

This element is intended for use only where no other element is available to mark the phrase or words concerned. The global xml:lang attribute should be used in preference to this element where it is intended to mark the language of the whole of some text element.

The <distinct> element may be used to identify phrases belonging to sublanguages or registers not generally regarded as true languages.

Example
This is
heathen Greek to you still? Your <foreign xml:lang="la">lapis
philosophicus</foreign>?
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element foreign { tei_att.global.attributes, tei_macro.phraseSeq }

<forename>

<forename> (forename) contains a forename, given or baptismal name. [13.2.1. Personal Names]
Modulenamesdates
Attributes
Member of
Contained by
May contain
Example
<persName>
 <roleName>Ex-President</roleName>
 <forename>George</forename>
 <surname>Bush</surname>
</persName>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element forename
{
   tei_att.global.attributes,
   tei_att.personal.attributes,
   tei_att.typed.attributes,
   tei_macro.phraseSeq
}

<front>

<front> (front matter) contains any prefatory matter (headers, abstracts, title page, prefaces, dedications, etc.) found at the start of a document, before the main body. [4.6. Title Pages 4. Default Text Structure]
Moduletextstructure
Attributes
Contained by
textstructure: text
transcr: facsimile
May contain
Note

Because cultural conventions differ as to which elements are grouped as front matter and which as back matter, the content models for the front and back elements are identical.

Example
<front>
 <epigraph>
  <quote>Nam Sibyllam quidem Cumis ego ipse oculis meis vidi in ampulla
     pendere, et cum illi pueri dicerent: <q xml:lang="grc">Σίβυλλα τί
       θέλεις</q>; respondebat illa: <q xml:lang="grc">ὰποθανεῖν θέλω.</q>
  </quote>
 </epigraph>
 <div type="dedication">
  <p>For Ezra Pound <q xml:lang="it">il miglior fabbro.</q>
  </p>
 </div>
</front>
Example
<front>
 <div type="dedication">
  <p>To our three selves</p>
 </div>
 <div type="preface">
  <head>Author's Note</head>
  <p>All the characters in this book are purely imaginary, and if the
     author has used names that may suggest a reference to living persons
     she has done so inadvertently. ...</p>
 </div>
</front>
Example
<front>
 <div type="abstract">
  <div>
   <head> BACKGROUND:</head>
   <p>Food insecurity can put children at greater risk of obesity because
       of altered food choices and nonuniform consumption patterns.</p>
  </div>
  <div>
   <head> OBJECTIVE:</head>
   <p>We examined the association between obesity and both child-level
       food insecurity and personal food insecurity in US children.</p>
  </div>
  <div>
   <head> DESIGN:</head>
   <p>Data from 9,701 participants in the National Health and Nutrition
       Examination Survey, 2001-2010, aged 2 to 11 years were analyzed.
       Child-level food insecurity was assessed with the US Department of
       Agriculture's Food Security Survey Module based on eight
       child-specific questions. Personal food insecurity was assessed with
       five additional questions. Obesity was defined, using physical
       measurements, as body mass index (calculated as kg/m2) greater than
       or equal to the age- and sex-specific 95th percentile of the Centers
       for Disease Control and Prevention growth charts. Logistic
       regressions adjusted for sex, race/ethnic group, poverty level, and
       survey year were conducted to describe associations between obesity
       and food insecurity.</p>
  </div>
  <div>
   <head> RESULTS:</head>
   <p>Obesity was significantly associated with personal food insecurity
       for children aged 6 to 11 years (odds ratio=1.81; 95% CI 1.33 to
       2.48), but not in children aged 2 to 5 years (odds ratio=0.88; 95%
       CI 0.51 to 1.51). Child-level food insecurity was not associated
       with obesity among 2- to 5-year-olds or 6- to 11-year-olds.</p>
  </div>
  <div>
   <head> CONCLUSIONS:</head>
   <p>Personal food insecurity is associated with an increased risk of
       obesity only in children aged 6 to 11 years. Personal
       food-insecurity measures may give different results than aggregate
       food-insecurity measures in children.</p>
  </div>
 </div>
</front>
Content model
<content>
 <sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.frontPart"/>
   <classRef key="model.pLike"/>
   <classRef key="model.pLike.front"/>
   <classRef key="model.global"/>
  </alternate>
  <sequence minOccurs="0">
   <alternate>
    <sequence>
     <classRef key="model.div1Like"/>
     <alternate maxOccurs="unbounded"
      minOccurs="0">

      <classRef key="model.div1Like"/>
      <classRef key="model.frontPart"/>
      <classRef key="model.global"/>
     </alternate>
    </sequence>
    <sequence>
     <classRef key="model.divLike"/>
     <alternate maxOccurs="unbounded"
      minOccurs="0">

      <classRef key="model.divLike"/>
      <classRef key="model.frontPart"/>
      <classRef key="model.global"/>
     </alternate>
    </sequence>
   </alternate>
   <sequence minOccurs="0">
    <classRef key="model.divBottom"/>
    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <classRef key="model.divBottom"/>
     <classRef key="model.global"/>
    </alternate>
   </sequence>
  </sequence>
 </sequence>
</content>
Schema Declaration
element front
{
   tei_att.global.attributes,
   tei_att.declaring.attributes,
   (
      (
         tei_model.frontPart
       | tei_model.pLike
       | tei_model.pLike.front
       | tei_model.global
      )*,
      (
         (
            (
               tei_model.div1Like,
               ( tei_model.div1Like | tei_model.frontPart | tei_model.global )*
            )
          | (
               tei_model.divLike,
               ( tei_model.divLike | tei_model.frontPart | tei_model.global )*
            )
         ),
         ( tei_model.divBottom, ( tei_model.divBottom | tei_model.global )* )?
      )?
   )
}

<fw>

<fw> (forme work) contains a running head (e.g. a header, footer), catchword, or similar material appearing on the current page. [11.6. Headers, Footers, and Similar Matter]
Moduletranscr
Attributes
typeclassifies the material encoded according to some useful typology.
Derived fromatt.typed
Status Recommended
Datatype teidata.enumerated
Sample values include:
header
a running title at the top of the page
footer
a running title at the bottom of the page
pageNum
(page number) a page number or foliation symbol
lineNum
(line number) a line number, either of prose or poetry
sig
(signature) a signature or gathering symbol
catch
(catchword) a catch-word
Member of
Contained by
May contain
Note

Where running heads are consistent throughout a chapter or section, it is usually more convenient to relate them to the chapter or section, e.g. by use of the rend attribute. The fw element is intended for cases where the running head changes from page to page, or where details of page layout and the internal structure of the running heads are of paramount importance.

Example
<fw place="bottomtype="sig">C3</fw>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element fw
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   tei_att.placement.attributes,
   tei_att.written.attributes,
   attribute type { teidata.enumerated }?,
   tei_macro.phraseSeq
}

<gap>

<gap> (gap) indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible. [3.5.3. Additions, Deletions, and Omissions]
Modulecore
Attributes
reason(reason) gives the reason for omission
Status Optional
Datatype 1–∞ occurrences of teidata.enumerated separated by whitespace
Suggested values include:
cancelled
(cancelled)
deleted
(deleted)
editorial
(editorial) for features omitted from transcription due to editorial policy
illegible
(illegible)
inaudible
(inaudible)
irrelevant
(irrelevant)
sampling
(sampling)
agent(agent) in the case of text omitted because of damage, categorizes the cause of the damage, if it can be identified.
Status Optional
Datatype teidata.enumerated
Sample values include:
rubbing
(rubbing) damage results from rubbing of the leaf edges
mildew
(mildew) damage results from mildew on the leaf surface
smoke
(smoke) damage results from smoke
Member of
Contained by
May contain
core: desc
Note

The gap, unclear, and del core tag elements may be closely allied in use with the <damage> and supplied elements, available when using the additional tagset for transcription of primary sources. See section 11.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination for discussion of which element is appropriate for which circumstance.

The gap tag simply signals the editors decision to omit or inability to transcribe a span of text. Other information, such as the interpretation that text was deliberately erased or covered, should be indicated using the relevant tags, such as del in the case of deliberate deletion.

Example
<gap quantity="4reason="illegible"
 unit="chars"/>
Example
<gap quantity="1reason="sampling"
 unit="essay"/>
Example
<del>
 <gap atLeast="4atMost="8"
  reason="illegibleunit="chars"/>

</del>
Example
<gap extent="several linesreason="lost"/>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <classRef key="model.descLike"/>
  <classRef key="model.certLike"/>
 </alternate>
</content>
Schema Declaration
element gap
{
   tei_att.global.attributes,
   tei_att.timed.attributes,
   tei_att.editLike.attributes,
   tei_att.dimensions.attributes,
   attribute reason
   {
      list
      {
         (
            "cancelled"
          | "deleted"
          | "editorial"
          | "illegible"
          | "inaudible"
          | "irrelevant"
          | "sampling"
          | teidata.enumerated
         )+
      }
   }?,
   attribute agent { teidata.enumerated }?,
   ( tei_model.descLike | tei_model.certLike )*
}

<gb>

<gb> (gathering beginning) marks the beginning of a new gathering or quire in a transcribed codex. [3.11.3. Milestone Elements]
Modulecore
Attributes
Member of
Contained by
May containEmpty element
Note

By convention, gb elements should appear at the start of the first page in the gathering. The global n attribute indicates the number or other value used to identify this gathering in a collation.

The type attribute may be used to further characterize the gathering in any respect.

Example
<gb n="iii"/>
<pb n="2r"/>
<!-- material from page 2 recto of gathering iii here -->
<pb n="2v"/>
<!-- material from page 2 verso of gathering iii here -->
Content model
<content>
 <empty/>
</content>
Schema Declaration
element gb
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_att.spanning.attributes,
   tei_att.breaking.attributes,
   tei_att.edition.attributes,
   empty
}

<graphic>

<graphic> (graphic) indicates the location of a graphic or illustration, either forming part of a text, or providing an image of it. [3.10. Graphics and Other Non-textual Components 11.1. Digital Facsimiles]
Modulecore
Attributes
Member of
Contained by
May contain
core: desc
Note

The mimeType attribute should be used to supply the MIME media type of the image specified by the url attribute.

Within the body of a text, a graphic element indicates the presence of a graphic component in the source itself. Within the context of a facsimile or <sourceDoc> element, however, a graphic element provides an additional digital representation of some part of the source being encoded.

Example
<figure>
 <graphic url="fig1.png"/>
 <head>Figure One: The View from the Bridge</head>
 <figDesc>A Whistleresque view showing four or five sailing boats in the foreground, and a
   series of buoys strung out between them.</figDesc>
</figure>
Example
<facsimile>
 <surfaceGrp n="leaf1">
  <surface>
   <graphic url="page1.png"/>
  </surface>
  <surface>
   <graphic url="page2-highRes.png"/>
   <graphic url="page2-lowRes.png"/>
  </surface>
 </surfaceGrp>
</facsimile>
Example
<facsimile>
 <surfaceGrp n="leaf1xml:id="spi001">
  <surface xml:id="spi001r">
   <graphic subtype="thumbnail"
    type="normalurl="spi/thumb/001r.jpg"/>

   <graphic subtype="low-restype="normal"
    url="spi/normal/lowRes/001r.jpg"/>

   <graphic subtype="high-res"
    type="normalurl="spi/normal/highRes/001r.jpg"/>

   <graphic subtype="low-res"
    type="high-contrasturl="spi/contrast/lowRes/001r.jpg"/>

   <graphic subtype="high-res"
    type="high-contrasturl="spi/contrast/highRes/001r.jpg"/>

  </surface>
  <surface xml:id="spi001v">
   <graphic subtype="thumbnail"
    type="normalurl="spi/thumb/001v.jpg"/>

   <graphic subtype="low-restype="normal"
    url="spi/normal/lowRes/001v.jpg"/>

   <graphic subtype="high-res"
    type="normalurl="spi/normal/highRes/001v.jpg"/>

   <graphic subtype="low-res"
    type="high-contrasturl="spi/contrast/lowRes/001v.jpg"/>

   <graphic subtype="high-res"
    type="high-contrasturl="spi/contrast/highRes/001v.jpg"/>

   <zone xml:id="spi001v_detail01">
    <graphic subtype="thumbnail"
     type="normalurl="spi/thumb/001v-detail01.jpg"/>

    <graphic subtype="low-res"
     type="normal"
     url="spi/normal/lowRes/001v-detail01.jpg"/>

    <graphic subtype="high-res"
     type="normal"
     url="spi/normal/highRes/001v-detail01.jpg"/>

    <graphic subtype="low-res"
     type="high-contrast"
     url="spi/contrast/lowRes/001v-detail01.jpg"/>

    <graphic subtype="high-res"
     type="high-contrast"
     url="spi/contrast/highRes/001v-detail01.jpg"/>

   </zone>
  </surface>
 </surfaceGrp>
</facsimile>
Content model
<content>
 <classRef key="model.descLike"
  maxOccurs="unboundedminOccurs="0"/>

</content>
Schema Declaration
element graphic
{
   tei_att.global.attributes,
   tei_att.media.attributes,
   tei_att.resourced.attributes,
   tei_att.declaring.attributes,
   tei_att.typed.attributes,
   tei_model.descLike*
}

<handDesc>

<handDesc> (description of hands) contains a description of all the different hands used in a manuscript or other object. [10.7.2. Writing, Decoration, and Other Notations]
Modulemsdescription
Attributes
hands(hands) specifies the number of distinct hands identified within the manuscript
Status Optional
Datatype teidata.count
Member of
Contained by
msdescription: physDesc
May contain
core: p
header: handNote
linking: ab
Example
<handDesc>
 <handNote scope="major">Written throughout in <term>angelicana formata</term>.</handNote>
</handDesc>
Example
<handDesc hands="2">
 <p>The manuscript is written in two contemporary hands, otherwise
   unknown, but clearly those of practised scribes. Hand I writes
   ff. 1r-22v and hand II ff. 23 and 24. Some scholars, notably
   Verner Dahlerup and Hreinn Benediktsson, have argued for a third hand
   on f. 24, but the evidence for this is insubstantial.</p>
</handDesc>
Content model
<content>
 <alternate>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

  <sequence>
   <elementRef key="summaryminOccurs="0"/>
   <elementRef key="handNote"
    maxOccurs="unboundedminOccurs="1"/>

  </sequence>
 </alternate>
</content>
Schema Declaration
element handDesc
{
   tei_att.global.attributes,
   attribute hands { teidata.count }?,
   ( tei_model.pLike+ | ( summary?, tei_handNote+ ) )
}

<handNote>

<handNote> (note on hand) describes a particular style or hand distinguished within a manuscript. [10.7.2. Writing, Decoration, and Other Notations]
Moduleheader
Attributes
Contained by
msdescription: handDesc
transcr: handNotes
May contain
Example
<handNote scope="sole">
 <p>Written in insular
   phase II half-uncial with interlinear Old English gloss in an Anglo-Saxon pointed
   minuscule.</p>
</handNote>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element handNote
{
   tei_att.global.attributes,
   tei_att.handFeatures.attributes,
   tei_macro.specialPara
}

<handNotes>

<handNotes> contains one or more handNote elements documenting the different hands identified within the source texts. [11.3.2.1. Document Hands]
Moduletranscr
Attributes
Member of
Contained by
header: profileDesc
May contain
header: handNote
Example
<handNotes>
 <handNote medium="brown-ink"
  script="copperplatexml:id="H1">
Carefully written with regular descenders</handNote>
 <handNote medium="pencilscript="print"
  xml:id="H2">
Unschooled scrawl</handNote>
</handNotes>
Content model
<content>
 <elementRef key="handNote"
  maxOccurs="unboundedminOccurs="1"/>

</content>
Schema Declaration
element handNotes { tei_att.global.attributes, tei_handNote+ }

<handShift>

<handShift> (handwriting shift) marks the beginning of a sequence of text written in a new hand, or the beginning of a scribal stint. [11.3.2.1. Document Hands]
Moduletranscr
Attributes
newindicates a handNote element describing the hand concerned.
Status Recommended
Datatype teidata.pointer
Note

This attribute serves the same function as the hand attribute provided for those elements which are members of the att.transcriptional class. It may be renamed at a subsequent major release.

Member of
Contained by
May containEmpty element
Note

The handShift element may be used either to denote a shift in the document hand (as from one scribe to another, on one writing style to another). Or, it may indicate a shift within a document hand, as a change of writing style, character or ink. Like other milestone elements, it should appear at the point of transition from some other state to the state which it describes.

Example
<l>When wolde the cat dwelle in his ynne</l>
<handShift medium="greenish-ink"/>
<l>And if the cattes skynne be slyk <handShift medium="black-ink"/> and gaye</l>
Content model
<content>
 <empty/>
</content>
Schema Declaration
element handShift
{
   tei_att.global.attributes,
   tei_att.handFeatures.attributes,
   attribute new { teidata.pointer }?,
   empty
}
<head> (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc. [4.2.1. Headings and Trailers]
Modulecore
Attributes
Member of
Contained by
core: lg
msdescription: msDesc
textcrit: listApp
textstructure: back body div front
May contain
Note

The head element is used for headings at all levels; software which treats (e.g.) chapter headings, section headings, and list titles differently must determine the proper processing of a head element based on its structural position. A head occurring as the first element of a list is the title of that list; one occurring as the first element of a <div1> is the title of that chapter or section.

Example

The most common use for the head element is to mark the headings of sections. In older writings, the headings or incipits may be rather longer than usual in modern works. If a section has an explicit ending as well as a heading, it should be marked as a <trailer>, as in this example:

<div1 n="Itype="book">
 <head>In the name of Christ here begins the first book of the ecclesiastical history of
   Georgius Florentinus, known as Gregory, Bishop of Tours.</head>
 <div2 type="section">
  <head>In the name of Christ here begins Book I of the history.</head>
  <p>Proposing as I do ...</p>
  <p>From the Passion of our Lord until the death of Saint Martin four hundred and twelve
     years passed.</p>
  <trailer>Here ends the first Book, which covers five thousand, five hundred and ninety-six
     years from the beginning of the world down to the death of Saint Martin.</trailer>
 </div2>
</div1>
Example

When headings are not inline with the running text (see e.g. the heading "Secunda conclusio") they might however be encoded as if. The actual placement in the source document can be captured with the place attribute.

<div type="subsection">
 <head place="margin">Secunda conclusio</head>
 <p>
  <lb n="1251"/>
  <hi rend="large">Potencia: habitus: et actus: recipiunt speciem ab obiectis<supplied>.</supplied>
  </hi>
  <lb n="1252"/>Probatur sic. Omne importans necessariam habitudinem ad proprium
   [...]
 </p>
</div>
Example

The head element is also used to mark headings of other units, such as lists:

With a few exceptions, connectives are equally
useful in all kinds of discourse: description, narration, exposition, argument. <list rend="bulleted">
 <head>Connectives</head>
 <item>above</item>
 <item>accordingly</item>
 <item>across from</item>
 <item>adjacent to</item>
 <item>again</item>
 <item>
<!-- ... -->
 </item>
</list>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <elementRef key="lg"/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <classRef key="model.inter"/>
  <classRef key="model.lLike"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element head
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_att.placement.attributes,
   tei_att.written.attributes,
   (
      text
    | tei_lg
    | tei_model.gLike
    | tei_model.phrase
    | tei_model.inter
    | tei_model.lLike
    | tei_model.global
   )*
}

<hi>

<hi> (highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made. [3.3.2.2. Emphatic Words and Phrases 3.3.2. Emphasis, Foreign Words, and Unusual Language]
Modulecore
Attributes
Member of
Contained by
May contain
Example
<hi rend="gothic">And this Indenture further witnesseth</hi>
that the said <hi rend="italic">Walter Shandy</hi>, merchant,
in consideration of the said intended marriage ...
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element hi
{
   tei_att.global.attributes,
   tei_att.written.attributes,
   tei_macro.paraContent
}

<history>

<history> (history) groups elements describing the full history of a manuscript, manuscript part, or other object. [10.8. History]
Modulemsdescription
Attributes
Contained by
msdescription: msDesc
May contain
core: p
linking: ab
msdescription: acquisition provenance
Example
<history>
 <origin>
  <p>Written in Durham during the mid twelfth
     century.</p>
 </origin>
 <provenance>
  <p>Recorded in two medieval
     catalogues of the books belonging to Durham Priory, made in 1391 and
     1405.</p>
 </provenance>
 <provenance>
  <p>Given to W. Olleyf by William Ebchester, Prior (1446-56)
     and later belonged to Henry Dalton, Prior of Holy Island (Lindisfarne)
     according to inscriptions on ff. 4v and 5.</p>
 </provenance>
 <acquisition>
  <p>Presented to Trinity College in 1738 by
     Thomas Gale and his son Roger.</p>
 </acquisition>
</history>
Content model
<content>
 <alternate>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

  <sequence>
   <elementRef key="summaryminOccurs="0"/>
   <elementRef key="originminOccurs="0"/>
   <elementRef key="provenance"
    maxOccurs="unboundedminOccurs="0"/>

   <elementRef key="acquisition"
    minOccurs="0"/>

  </sequence>
 </alternate>
</content>
Schema Declaration
element history
{
   tei_att.global.attributes,
   (
      tei_model.pLike+
    | ( summary?, origin?, tei_provenance*, tei_acquisition? )
   )
}

<idno>

<idno> (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way. [13.3.1. Basic Principles 2.2.4. Publication, Distribution, Licensing, etc. 2.2.5. The Series Statement 3.12.2.4. Imprint, Size of a Document, and Reprint Information]
Moduleheader
Attributes
typecategorizes the identifier, for example as an ISBN, Social Security number, etc.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Suggested values include:
ISBN
International Standard Book Number: a 13- or (if assigned prior to 2007) 10-digit identifying number assigned by the publishing industry to a published book or similar item, registered with the International ISBN Agency.
ISSN
International Standard Serial Number: an eight-digit number to uniquely identify a serial publication.
DOI
Digital Object Identifier: a unique string of letters and numbers assigned to an electronic document.
URI
Uniform Resource Identifier: a string of characters to uniquely identify a resource, following the syntax of RFC 3986.
VIAF
A data number in the Virtual Internet Authority File assigned to link different names in catalogs around the world for the same entity.
ESTC
English Short-Title Catalogue number: an identifying number assigned to a document in English printed in the British Isles or North America before 1801.
OCLC
OCLC control number (record number) for the union catalog record in WorldCat, a union catalog for member libraries in the Online Computer Library Center global cooperative.
Member of
Contained by
May contain
header: idno
character data
Note

idno should be used for labels which identify an object or concept in a formal cataloguing system such as a database or an RDF store, or in a distributed system such as the World Wide Web. Some suggested values for type on idno are ISBN, ISSN, DOI, and URI.

Example
<idno type="ISBN">978-1-906964-22-1</idno>
<idno type="ISSN">0143-3385</idno>
<idno type="DOI">10.1000/123</idno>
<idno type="URI">http://www.worldcat.org/oclc/185922478</idno>
<idno type="URI">http://authority.nzetc.org/463/</idno>
<idno type="LT">Thomason Tract E.537(17)</idno>
<idno type="Wing">C695</idno>
<idno type="oldCat">
 <g ref="#sym"/>345
</idno>

In the last case, the identifier includes a non-Unicode character which is defined elsewhere by means of a <glyph> or <char> element referenced here as #sym.

Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <elementRef key="idno"/>
 </alternate>
</content>
Schema Declaration
element idno
{
   tei_att.global.attributes,
   tei_att.sortable.attributes,
   tei_att.datable.attributes,
   tei_att.typed.attribute.subtype,
   attribute type
   {
      "ISBN"
    | "ISSN"
    | "DOI"
    | "URI"
    | "VIAF"
    | "ESTC"
    | "OCLC"
    | teidata.enumerated
   }?,
   ( text | tei_model.gLike | tei_idno )*
}

<imprint>

<imprint> groups information relating to the publication or distribution of a bibliographic item. [3.12.2.4. Imprint, Size of a Document, and Reprint Information]
Modulecore
Attributes
Contained by
core: monogr
May contain
Example
<imprint>
 <pubPlace>Oxford</pubPlace>
 <publisher>Clarendon Press</publisher>
 <date>1987</date>
</imprint>
Content model
<content>
 <sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <elementRef key="classCode"/>
   <elementRef key="catRef"/>
  </alternate>
  <sequence maxOccurs="unbounded"
   minOccurs="1">

   <alternate>
    <classRef key="model.imprintPart"/>
    <classRef key="model.dateLike"/>
   </alternate>
   <elementRef key="respStmt"
    maxOccurs="unboundedminOccurs="0"/>

   <classRef key="model.global"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
 </sequence>
</content>
Schema Declaration
element imprint
{
   tei_att.global.attributes,
   (
      ( classCode | catRef )*,
      (
         ( tei_model.imprintPart | tei_model.dateLike ),
         respStmt*,
         tei_model.global*
      )+
   )
}

<index>

<index> (index entry) marks a location to be indexed for whatever purpose. [3.9.2. Index Entries]
Modulecore
Attributes
indexNamea single word which follows the rules defining a legal XML name (see https://www.w3.org/TR/REC-xml/#dt-name), supplying a name to specify which index (of several) the index entry belongs to.
Status Optional
Datatype teidata.name
Note

This attribute makes it possible to create multiple indexes for a text.

Member of
Contained by
May contain
core: index term
Example
David's other principal backer, Josiah ha-Kohen
<index indexName="NAMES">
 <term>Josiah ha-Kohen b. Azarya</term>
</index> b. Azarya, son of one of the last gaons of Sura <index indexName="PLACES">
 <term>Sura</term>
</index> was David's own first cousin.
Content model
<content>
 <sequence maxOccurs="unbounded"
  minOccurs="0">

  <elementRef key="term"/>
  <elementRef key="indexminOccurs="0"/>
 </sequence>
</content>
Schema Declaration
element index
{
   tei_att.global.attributes,
   tei_att.spanning.attributes,
   attribute indexName { teidata.name }?,
   ( tei_term, tei_index? )*
}

<institution>

<institution> (institution) contains the name of an organization such as a university or library, with which a manuscript or other object is identified, generally its holding institution. [10.4. The Manuscript Identifier]
Modulemsdescription
Attributes
Contained by
msdescription: msIdentifier
May contain
Example
<msIdentifier>
 <settlement>Oxford</settlement>
 <institution>University of Oxford</institution>
 <repository>Bodleian Library</repository>
 <idno>MS. Bodley 406</idno>
</msIdentifier>
Content model
<content>
 <macroRef key="macro.phraseSeq.limited"/>
</content>
Schema Declaration
element institution
{
   tei_att.global.attributes,
   tei_att.naming.attributes,
   tei_macro.phraseSeq.limited
}

<item>

<item> (item) contains one component of a list. [3.8. Lists 2.6. The Revision Description]
Modulecore
Attributes
Contained by
May contain
Note

May contain simple prose or a sequence of chunks.

Whatever string of characters is used to label a list item in the copy text may be used as the value of the global n attribute, but it is not required that numbering be recorded explicitly. In ordered lists, the n attribute on the item element is by definition synonymous with the use of the <label> element to record the enumerator of the list item. In glossary lists, however, the term being defined should be given with the <label> element, not n.

Example
<list rend="numbered">
 <head>Here begin the chapter headings of Book IV</head>
 <item n="4.1">The death of Queen Clotild.</item>
 <item n="4.2">How King Lothar wanted to appropriate one third of the Church revenues.</item>
 <item n="4.3">The wives and children of Lothar.</item>
 <item n="4.4">The Counts of the Bretons.</item>
 <item n="4.5">Saint Gall the Bishop.</item>
 <item n="4.6">The priest Cato.</item>
 <item> ...</item>
</list>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element item
{
   tei_att.global.attributes,
   tei_att.sortable.attributes,
   tei_macro.specialPara
}

<keywords>

<keywords> (keywords) contains a list of keywords or phrases identifying the topic or nature of a text. [2.4.3. The Text Classification]
Moduleheader
Attributes
schemeidentifies the controlled vocabulary within which the set of keywords concerned is defined, for example by a <taxonomy> element, or by some other resource.
Status Optional
Datatype teidata.pointer
Contained by
header: textClass
May contain
core: term
Note

Each individual keyword (including compound subject headings) should be supplied as a term element directly within the keywords element. An alternative usage, in which each term appears within an item inside a <list> is permitted for backwards compatibility, but is deprecated.

If no control list exists for the keywords used, then no value should be supplied for the scheme attribute.

Example
<keywords scheme="http://classificationweb.net">
 <term>Babbage, Charles</term>
 <term>Mathematicians - Great Britain - Biography</term>
</keywords>
Example
<keywords>
 <term>Fermented beverages</term>
 <term>Central Andes</term>
 <term>Schinus molle</term>
 <term>Molle beer</term>
 <term>Indigenous peoples</term>
 <term>Ethnography</term>
 <term>Archaeology</term>
</keywords>
Content model
<content>
 <alternate>
  <elementRef key="term"
   maxOccurs="unboundedminOccurs="1"/>

  <elementRef key="list"/>
 </alternate>
</content>
Schema Declaration
element keywords
{
   tei_att.global.attributes,
   attribute scheme { teidata.pointer }?,
   ( tei_term+ | list )
}

<l>

<l> (verse line) contains a single, possibly incomplete, line of verse. [3.13.1. Core Tags for Verse 3.13. Passages of Verse or Drama 7.2.5. Speech Contents]
Modulecore
Attributes
Member of
Contained by
May contain
Example
<l met="x/x/x/x/x/real="/xx/x/x/x/">Shall I compare thee to a summer's day?</l>
Schematron

<sch:report test="ancestor::tei:l[not(.//tei:note//tei:l[. = current()])]"> Abstract model violation: Lines may not contain lines or lg elements.
</sch:report>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <classRef key="model.inter"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element l
{
   tei_att.global.attributes,
   tei_att.metrical.attributes,
   tei_att.enjamb.attributes,
   tei_att.fragmentable.attributes,
   (
      text
    | tei_model.gLike
    | tei_model.phrase
    | tei_model.inter
    | tei_model.global
   )*
}

<layout>

<layout> (layout) describes how text is laid out on the page or surface of the object, including information about any ruling, pricking, or other evidence of page-preparation techniques. [10.7.2. Writing, Decoration, and Other Notations]
Modulemsdescription
Attributes
columns(columns) specifies the number of columns per page
Status Optional
Datatype 1–2 occurrences of teidata.count separated by whitespace
Note

If a single number is given, all pages referenced have this number of columns. If two numbers are given, the number of columns per page varies between the values supplied. Where columns is omitted the number is assumed to be 1.

Columns may be independent of page orientation or reading direction, and a single textual stream may have one or more columns.

streams(textual streams) indicates the number of streams per page, each of which contains an independent textual stream
Status Optional
Datatype 1–2 occurrences of teidata.count separated by whitespace
Note

If a single number is given, all pages referenced have this number of textual streams. If two numbers are given, the number of textual streams per page varies between the values supplied. Where streams is omitted the number is assumed to be 1 and unless specified elsewhere the script orientation of the source is identical to that used in the TEI document.

ruledLines(ruled lines) specifies the number of ruled lines per column
Status Optional
Datatype 1–2 occurrences of teidata.count separated by whitespace
Note

If a single number is given, all columns have this number of ruled lines. If two numbers are given, the number of ruled lines per column varies between the values supplied.

writtenLines(written lines) specifies the number of written lines per column
Status Optional
Datatype 1–2 occurrences of teidata.count separated by whitespace
Note

If a single number is given, all columns have this number of written lines. If two numbers are given, the number of written lines per column varies between the values supplied.

Contained by
msdescription: layoutDesc
May contain
Example
<layout columns="1ruledLines="25 32">Most pages have between 25 and 32 long lines ruled in lead.</layout>
Example
<layout columns="2ruledLines="42">
 <p>2 columns of 42 lines ruled in ink, with central rule
   between the columns.</p>
</layout>
Example
<layout columns="1 2writtenLines="40 50">
 <p>Some pages have 2 columns, with central rule
   between the columns; each column with between 40 and 50 lines of writing.</p>
</layout>
Example
<layout columns="3streams="3"/>
<!-- Further down in document body -->
<div type="page">
 <ab>
  <pb/>
   一二三<cb type="top-stream"/>
   一二三<cb type="mid-stream"/>
   一二三<cb type="bottom-stream"/>
<!-- cb here for demo purposes -->
 </ab>
</div>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element layout
{
   tei_att.global.attributes,
   attribute columns { list { teidata.count, teidata.count? } }?,
   attribute streams { list { teidata.count, teidata.count? } }?,
   attribute ruledLines { list { teidata.count, teidata.count? } }?,
   attribute writtenLines { list { teidata.count, teidata.count? } }?,
   tei_macro.specialPara
}

<layoutDesc>

<layoutDesc> (layout description) collects the set of layout descriptions applicable to a manuscript or other object. [10.7.2. Writing, Decoration, and Other Notations]
Modulemsdescription
Attributes
Contained by
msdescription: objectDesc
May contain
core: p
linking: ab
msdescription: layout
Example
<layoutDesc>
 <p>Most pages have between 25 and 32 long lines ruled in lead.</p>
</layoutDesc>
Example
<layoutDesc>
 <layout columns="2ruledLines="42">
  <p>
   <locus from="f12rto="f15v"/>
     2 columns of 42 lines pricked and ruled in ink, with
     central rule between the columns.</p>
 </layout>
 <layout columns="3">
  <p>
   <locus from="f16"/>Prickings for three columns are visible.</p>
 </layout>
</layoutDesc>
Content model
<content>
 <alternate>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

  <sequence>
   <elementRef key="summaryminOccurs="0"/>
   <elementRef key="layout"
    maxOccurs="unboundedminOccurs="1"/>

  </sequence>
 </alternate>
</content>
Schema Declaration
element layoutDesc
{
   tei_att.global.attributes,
   ( tei_model.pLike+ | ( summary?, tei_layout+ ) )
}

<lb>

<lb> (line beginning) marks the beginning of a new (typographic) line in some edition or version of a text. [3.11.3. Milestone Elements 7.2.5. Speech Contents]
Modulecore
Attributes
Member of
Contained by
May containEmpty element
Note

By convention, lb elements should appear at the point in the text where a new line starts. The n attribute, if used, indicates the number or other value associated with the text between this point and the next lb element, typically the sequence number of the line within the page, or other appropriate unit. This element is intended to be used for marking actual line breaks on a manuscript or printed page, at the point where they occur; it should not be used to tag structural units such as lines of verse (for which the l element is available) except in circumstances where structural units cannot otherwise be marked.

The type attribute may be used to characterize the line break in any respect. The more specialized attributes break, ed, or edRef should be preferred when the intent is to indicate whether or not the line break is word-breaking, or to note the source from which it derives.

Example

This example shows typographical line breaks within metrical lines, where they occur at different places in different editions:

<l>Of Mans First Disobedience,<lb ed="1674"/> and<lb ed="1667"/> the Fruit</l>
<l>Of that Forbidden Tree, whose<lb ed="1667 1674"/> mortal tast</l>
<l>Brought Death into the World,<lb ed="1667"/> and all<lb ed="1674"/> our woe,</l>
Example

This example encodes typographical line breaks as a means of preserving the visual appearance of a title page. The break attribute is used to show that the line break does not (as elsewhere) mark the start of a new word.

<titlePart>
 <lb/>With Additions, ne-<lb break="no"/>ver before Printed.
</titlePart>
Content model
<content>
 <empty/>
</content>
Schema Declaration
element lb
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_att.edition.attributes,
   tei_att.spanning.attributes,
   tei_att.breaking.attributes,
   empty
}

<lg>

<lg> (line group) contains one or more verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc. [3.13.1. Core Tags for Verse 3.13. Passages of Verse or Drama 7.2.5. Speech Contents]
Modulecore
Attributes
Member of
Contained by
May contain
Note

contains verse lines or nested line groups only, possibly prefixed by a heading.

Example
<lg type="free">
 <l>Let me be my own fool</l>
 <l>of my own making, the sum of it</l>
</lg>
<lg type="free">
 <l>is equivocal.</l>
 <l>One says of the drunken farmer:</l>
</lg>
<lg type="free">
 <l>leave him lay off it. And this is</l>
 <l>the explanation.</l>
</lg>
Schematron

<sch:assert test="count(descendant::tei:lg|descendant::tei:l|descendant::tei:gap) > 0">An lg element
must contain at least one child l, lg, or gap element.</sch:assert>
Schematron

<sch:report test="ancestor::tei:l[not(.//tei:note//tei:lg[. = current()])]"> Abstract model violation: Lines may not contain line groups.
</sch:report>
Content model
<content>
 <sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.divTop"/>
   <classRef key="model.global"/>
  </alternate>
  <alternate>
   <classRef key="model.lLike"/>
   <classRef key="model.stageLike"/>
   <classRef key="model.labelLike"/>
   <classRef key="model.pPart.transcriptional"/>
   <elementRef key="lg"/>
  </alternate>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.lLike"/>
   <classRef key="model.stageLike"/>
   <classRef key="model.labelLike"/>
   <classRef key="model.pPart.transcriptional"/>
   <classRef key="model.global"/>
   <elementRef key="lg"/>
  </alternate>
  <sequence maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.divBottom"/>
   <classRef key="model.global"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
 </sequence>
</content>
Schema Declaration
element lg
{
   tei_att.global.attributes,
   tei_att.divLike.attributes,
   tei_att.typed.attributes,
   tei_att.declaring.attributes,
   (
      ( tei_model.divTop | tei_model.global )*,
      (
         tei_model.lLike
       | tei_model.stageLike
       | tei_model.labelLike
       | tei_model.pPart.transcriptional
       | tei_lg
      ),
      (
         tei_model.lLike
       | tei_model.stageLike
       | tei_model.labelLike
       | tei_model.pPart.transcriptional
       | tei_model.global
       | tei_lg
      )*,
      ( tei_model.divBottom, tei_model.global* )*
   )
}

<licence>

<licence> contains information about a licence or other legal agreement applicable to the text. [2.2.4. Publication, Distribution, Licensing, etc.]
Moduleheader
Attributes
Member of
Contained by
header: availability
May contain
Note

A licence element should be supplied for each licence agreement applicable to the text in question. The target attribute may be used to reference a full version of the licence. The when, notBefore, notAfter, from or to attributes may be used in combination to indicate the date or dates of applicability of the licence.

Example
<licence target="http://www.nzetc.org/tm/scholarly/tei-NZETC-Help.html#licensing"> Licence: Creative Commons Attribution-Share Alike 3.0 New Zealand Licence
</licence>
Example
<availability>
 <licence notBefore="2013-01-01"
  target="http://creativecommons.org/licenses/by/3.0/">

  <p>The Creative Commons Attribution 3.0 Unported (CC BY 3.0) Licence
     applies to this document.</p>
  <p>The licence was added on January 1, 2013.</p>
 </licence>
</availability>
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element licence
{
   tei_att.global.attributes,
   tei_att.pointing.attributes,
   tei_att.datable.attributes,
   tei_macro.specialPara
}

<listApp>

<listApp> (list of apparatus entries) contains a list of apparatus entries. [12.2. Linking the Apparatus to the Text]
Moduletextcrit
Attributes
Member of
Contained by
May contain
core: desc head
textcrit: app listApp
Note

listApp elements would normally be located in the back part of a document, but they may appear elsewhere.

Example

In the following example from the exegetical Yasna, the base text is encoded in the body of the document, and two separate listApp elements are used in the back, containing variant readings written in different scripts.

<body>
 <div>
  <lg rend="italictype="stanza"
   xml:id="Y-36.01xml:lang="pal-Avst">

   <l xml:id="Y-36.01_L-1">
    <w xml:id="Y-36.01_L1_W-01">ahiiā</w>
    <w xml:id="Y-36.01_L1_W-02">ϑβā</w>
    <w xml:id="Y-36.01_L1_W-03">āϑrō</w>
    <w xml:id="Y-36.01_L1_W-04">vərəzə̄nā</w>
    <w xml:id="Y-36.01_L1_W-05">paouruiiē</w>
    <w xml:id="Y-36.01_L1_W-06">pairijasāmaiδē</w>
    <w xml:id="Y-36.01_L1_W-07">mazdā</w>
    <w xml:id="Y-36.01_L1_W-08">ahurā</w>
   </l>
<!-- ... -->
  </lg>
 </div>
</body>
<!-- ... -->
<back>
 <div>
  <listApp xml:id="CA_Y-36"
   xml:lang="pal-Avst">

   <head>Variants from witnesses in Avestan script</head>
   <app from="#Y-36.01_L1_W-01">
    <rdg wit="#Pt4 #F2 #J2 #M1">ahiiā</rdg>
   </app>
   <app from="#Y-36.01_L1_W-02">
    <rdg wit="#Pt4 #F2 #J2 #M1">ϑβā</rdg>
   </app>
   <app from="#Y-36.01_L1_W-03">
    <rdg wit="#Pt4 #J2 #M1">āϑrō</rdg>
    <rdg wit="#F2">āϑrōi</rdg>
   </app>
<!-- ... -->
  </listApp>
  <listApp xml:id="CA_PY-36"
   xml:lang="pal-Phlv">

   <head>Variants from witnesses written in Pahlavi script</head>
   <app from="#PY-36.01_L1_W-01">
    <rdg wit="#Pt4 #F2 #J2 #M1">ʾytwnˈ</rdg>
   </app>
   <app from="#PY-36.01_L1_W-02">
    <rdg wit="#Pt4 #F2 #J2 #M1">ʾwˈ</rdg>
   </app>
   <app from="#PY-36.01_L1_W-03">
    <rdg wit="#Pt4 #F2 #J2 #M1">ḤNʾ</rdg>
   </app>
<!-- ... -->
  </listApp>
 </div>
</back>
Content model
<content>
 <sequence>
  <classRef key="model.headLike"
   maxOccurs="unboundedminOccurs="0"/>

  <elementRef key="desc"
   maxOccurs="unboundedminOccurs="0"/>

  <alternate maxOccurs="unbounded"
   minOccurs="1">

   <elementRef key="app"/>
   <elementRef key="listApp"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element listApp
{
   tei_att.global.attributes,
   tei_att.sortable.attributes,
   tei_att.declarable.attributes,
   tei_att.typed.attributes,
   ( tei_model.headLike*, tei_desc*, ( tei_app | tei_listApp )+ )
}

<listChange>

<listChange> groups a number of change descriptions associated with either the creation of a source text or the revision of an encoded text. [2.6. The Revision Description 11.7. Identifying Changes and Revisions]
Moduleheader
Attributes
orderedindicates whether the ordering of its child change elements is to be considered significant or not
Status Optional
Datatype teidata.truthValue
Default true
Member of
Contained by
May contain
core: desc
Note

When this element appears within the creation element it documents the set of revision campaigns or stages identified during the evolution of the original text. When it appears within the revisionDesc element, it documents only changes made during the evolution of the encoded representation of that text.

Example
<revisionDesc>
 <listChange>
  <change when="1991-11-11who="#LB"> deleted chapter 10 </change>
  <change when="1991-11-02who="#MSM"> completed first draft </change>
 </listChange>
</revisionDesc>
Example
<profileDesc>
 <creation>
  <listChange ordered="true">
   <change xml:id="CHG-1">First stage, written in ink by a writer</change>
   <change xml:id="CHG-2">Second stage, written in Goethe's hand using pencil</change>
   <change xml:id="CHG-3">Fixation of the revised passages and further revisions by
       Goethe using ink</change>
   <change xml:id="CHG-4">Addition of another stanza in a different hand,
       probably at a later stage</change>
  </listChange>
 </creation>
</profileDesc>
Content model
<content>
 <sequence>
  <elementRef key="desc"
   maxOccurs="unboundedminOccurs="0"/>

  <alternate maxOccurs="unbounded"
   minOccurs="1">

   <elementRef key="listChange"/>
   <elementRef key="change"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element listChange
{
   tei_att.global.attributes,
   tei_att.sortable.attributes,
   tei_att.typed.attributes,
   attribute ordered { teidata.truthValue }?,
   ( tei_desc*, ( tei_listChange | tei_change )+ )
}

<listTranspose>

<listTranspose> supplies a list of transpositions, each of which is indicated at some point in a document typically by means of metamarks. [11.3.4.5. Transpositions]
Moduletranscr
Attributes
Member of
Contained by
May contain
core: desc
transcr: transpose
Example
<listTranspose>
 <transpose>
  <ptr target="#ib02"/>
  <ptr target="#ib01"/>
 </transpose>
</listTranspose>

This example might be used for a source document which indicates in some way that the elements identified by ib02 and code ib01 should be read in that order (ib02 followed by ib01), rather than in the reading order in which they are presented in the source.

Content model
<content>
 <sequence>
  <elementRef key="desc"
   maxOccurs="unboundedminOccurs="0"/>

  <elementRef key="transpose"
   maxOccurs="unboundedminOccurs="1"/>

 </sequence>
</content>
Schema Declaration
element listTranspose
{
   tei_att.global.attributes,
   ( tei_desc*, tei_transpose+ )
}

<locus>

<locus> (locus) defines a location within a manuscript, manuscript part, or other object typically as a (possibly discontinuous) sequence of folio references. [10.3.5. References to Locations within a Manuscript]
Modulemsdescription
Attributes
scheme(scheme) identifies the foliation scheme in terms of which the location is being specified by pointing to some foliation element defining it, or to some other equivalent resource.
Status Optional
Datatype teidata.pointer
from(from) specifies the starting point of the location in a normalized form, typically a page number.
Status Optional
Datatype teidata.word
to(to) specifies the end-point of the location in a normalized form, typically as a page number.
Status Optional
Datatype teidata.word
Member of
Contained by
May contain
core: hi
msdescription: locus
character data
Note

The target attribute should only be used to point to elements that contain or indicate a transcription of the locus being described, as in the ‘Ben Jonson’ example.

To associate a locus element with a page image or other comparable representation, the global facs attribute should be used, as shown in the ‘Birds Praise of Love’ example. The facs attribute may be used to indicate one or more image files, as in that example, or alternatively it may point to one or more appropriate XML elements, such as the surface, zone, graphic, or <binaryObject> elements.

When a single page is being cited, use the from and to attributes with an identical value. When no clear endpoint is provided, the from attribute may be used without to; for example a citation such as ‘p. 3ff’ might be encoded <locus from="3">p. 3ff</locus>.

Example

<!-- within ms description --><msItem n="1">
 <locus from="1rtarget="#F1r #F1v #F2r"
  to="2r">
ff. 1r-2r</locus>
 <author>Ben Jonson</author>
 <title>Ode to himself</title>
 <rubric rend="italics"> An Ode<lb/> to him selfe.</rubric>
 <incipit>Com leaue the loathed stage</incipit>
 <explicit>And see his chariot triumph ore his wayne.</explicit>
 <bibl>
  <name>Beal</name>, <title>Index 1450-1625</title>, JnB 380</bibl>
</msItem>
<!-- within transcription ... -->
<pb xml:id="F1r"/>
<!-- ... -->
<pb xml:id="F1v"/>
<!-- ... -->
<pb xml:id="F2r"/>
<!-- ... -->
Example

The facs attribute is available globally when the transcr module is included in a schema. It may be used to point directly to an image file, as in the following example:

<msItem>
 <locus facs="images/08v.jpg images/09r.jpg images/09v.jpg images/10r.jpg images/10v.jpg">fols. 8v-10v</locus>
 <title>Birds Praise of Love</title>
 <bibl>
  <title>IMEV</title>
  <biblScope>1506</biblScope>
 </bibl>
</msItem>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <elementRef key="hi"/>
  <elementRef key="locus"/>
 </alternate>
</content>
Schema Declaration
element locus
{
   tei_att.global.attributes,
   tei_att.pointing.attributes,
   tei_att.typed.attributes,
   attribute scheme { teidata.pointer }?,
   attribute from { teidata.word }?,
   attribute to { teidata.word }?,
   ( text | tei_model.gLike | tei_hi | tei_locus )*
}

<material>

<material> (material) contains a word or phrase describing the material of which the object being described is composed. [10.3.2. Material and Object Type]
Modulemsdescription
Attributes
functiondescribes the function or use of the material in relation to the object as a whole.
Status Optional
Datatype teidata.enumerated
Sample values include:
binding
covering material of a codex
endband
sewing at the head or tail of the codex spine to strengthen the binding, often decorative
slipcase
removable protective cover for a set of one or more codices
support
the surface for writing
tie
a ribbon or string used to bind or close a codex or rolled scroll
Note

The sample values here are for descriptive bibliography. Other sets of sample values might include armrests, legs, tabletop, pan, and back for furniture; or wall, floor, window, column, ceiling, roof, stairs, chimney for architecture.

targetidentifies one or more elements to which the metamark applies.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Member of
Contained by
May contain
Note

The ref attribute may be used to point to one or more items within a taxonomy of types of material, defined either internally or externally.

Example
<physDesc>
 <p>
  <material>Parchment</material> leaves with a
 <material>sharkskin</material> binding.</p>
</physDesc>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element material
{
   tei_att.global.attributes,
   tei_att.canonical.attributes,
   tei_att.typed.attributes,
   attribute function { teidata.enumerated }?,
   attribute target { list { teidata.pointer+ } }?,
   tei_macro.phraseSeq
}

<measure>

<measure> (measure) contains a word or phrase referring to some quantity of an object or commodity, usually comprising a number, a unit, and a commodity name. [3.6.3. Numbers and Measures]
Modulecore
Attributes
typespecifies the type of measurement in any convenient typology.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Member of
Contained by
May contain
Example

This example references a definition of a measurement unit declared in the TEI header:

<measure type="weight">
 <num>2</num> pounds of flesh
</measure>
<measure type="currency">£10-11-6d</measure>
<measure type="areaunitRef="#merk">2 <unit>merks</unit> of old extent</measure>
<!-- In the TEI Header: -->
<encodingDesc>
 <unitDecl>
  <unitDef type="areaxml:id="merk">
   <label>merk</label>
   <placeName ref="#Scotland"/>
   <desc>A merk was an area of land determined variably by its agricultural
       productivity.</desc>
  </unitDef>
 </unitDecl>
</encodingDesc>
Example
<measure commodity="rumquantity="40"
 unit="hogshead">
2 score hh rum</measure>
<measure commodity="rosesquantity="12"
 unit="count">
1 doz. roses</measure>
<measure commodity="tulipsquantity="1"
 unit="count">
a yellow tulip</measure>
Example
<head>Long papers.</head>
<p>Speakers will be given 30 minutes each: 20 minutes for
presentation, 10 minutes for discussion. Proposals should not
exceed <measure commodity="wordsmax="500"
  unit="count">
500
   words</measure>. This presentation type is suitable for
substantial research, theoretical or critical discussions.</p>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element measure
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   tei_att.measurement.attributes,
   tei_att.ranging.attributes,
   attribute type { teidata.enumerated }?,
   tei_macro.phraseSeq
}

<metamark>

<metamark> contains or describes any kind of graphic or written signal within a document the function of which is to determine how it should be read rather than forming part of the actual content of the document. [11.3.4.2. Metamarks]
Moduletranscr
Attributes
functiondescribes the function (for example status, insertion, deletion, transposition) of the metamark.
Status Optional
Datatype teidata.word
targetidentifies one or more elements to which the metamark applies.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Member of
Contained by
May contain
Example
<surface>
 <metamark function="usedrend="line"
  target="#X2"/>

 <zone xml:id="zone-X2">
  <line>I am that halfgrown <add>angry</add> boy, fallen asleep</line>
  <line>The tears of foolish passion yet undried</line>
  <line>upon my cheeks.</line>
<!-- ... -->
  <line>I pass through <add>the</add> travels and <del>fortunes</del> of
  <retrace>thirty</retrace>
  </line>
  <line>years and become old,</line>
  <line>Each in its due order comes and goes,</line>
  <line>And thus a message for me comes.</line>
  <line>The</line>
 </zone>
 <metamark function="used"
  target="#zone-X2">
Entered - Yes</metamark>
</surface>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element metamark
{
   tei_att.spanning.attributes,
   tei_att.placement.attributes,
   tei_att.global.attributes,
   attribute function { teidata.word }?,
   attribute target { list { teidata.pointer+ } }?,
   tei_macro.specialPara
}

<milestone>

<milestone> (milestone) marks a boundary point separating any kind of section of a text, typically but not necessarily indicating a point at which some part of a standard reference system changes, where the change is not represented by a structural element. [3.11.3. Milestone Elements]
Modulecore
Attributes
Member of
Contained by
May containEmpty element
Note

For this element, the global n attribute indicates the new number or other value for the unit which changes at this milestone. The special value unnumbered should be used in passages which fall outside the normal numbering scheme, such as chapter or other headings, poem numbers or titles, etc.

The order in which milestone elements are given at a given point is not normally significant.

Example
<milestone ed="Lan="23unit="Dreissiger"/>
... <milestone ed="AVn="24unit="verse"/> ...
Content model
<content>
 <empty/>
</content>
Schema Declaration
element milestone
{
   tei_att.global.attributes,
   tei_att.milestoneUnit.attributes,
   tei_att.typed.attributes,
   tei_att.edition.attributes,
   tei_att.spanning.attributes,
   tei_att.breaking.attributes,
   empty
}

<mod>

<mod> represents any kind of modification identified within a single document. [11.3.4.1. Generic Modification]
Moduletranscr
Attributes
Member of
Contained by
May contain
Example
<mod type="subst">
 <add>pleasing</add>
 <del>agreable</del>
</mod>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element mod
{
   tei_att.global.attributes,
   tei_att.transcriptional.attributes,
   tei_att.typed.attributes,
   tei_att.spanning.attributes,
   tei_att.dimensions.attributes,
   tei_macro.paraContent
}

<monogr>

<monogr> (monographic level) contains bibliographic elements describing an item (e.g. a book or journal) published as an independent item (i.e. as a separate physical object). [3.12.2.1. Analytic, Monographic, and Series Levels]
Modulecore
Attributes
Contained by
May contain
Note

May contain specialized bibliographic elements, in a prescribed order.

The monogr element may only occur only within a biblStruct, where its use is mandatory for the description of a monographic-level bibliographic item.

Example
<biblStruct>
 <analytic>
  <author>Chesnutt, David</author>
  <title>Historical Editions in the States</title>
 </analytic>
 <monogr>
  <title level="j">Computers and the Humanities</title>
  <imprint>
   <date when="1991-12">(December, 1991):</date>
  </imprint>
  <biblScope>25.6</biblScope>
  <biblScope from="377to="380unit="page">377–380</biblScope>
 </monogr>
</biblStruct>
Example
<biblStruct type="book">
 <monogr>
  <author>
   <persName>
    <forename>Leo Joachim</forename>
    <surname>Frachtenberg</surname>
   </persName>
  </author>
  <title level="mtype="main">Lower Umpqua Texts</title>
  <imprint>
   <pubPlace>New York</pubPlace>
   <publisher>Columbia University Press</publisher>
   <date>1914</date>
  </imprint>
 </monogr>
 <series>
  <title level="stype="main">Columbia University Contributions to
     Anthropology</title>
  <biblScope unit="volume">4</biblScope>
 </series>
</biblStruct>
Content model
<content>
 <sequence>
  <alternate minOccurs="0">
   <sequence>
    <alternate>
     <elementRef key="author"/>
     <elementRef key="editor"/>
     <elementRef key="meeting"/>
     <elementRef key="respStmt"/>
    </alternate>
    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <elementRef key="author"/>
     <elementRef key="editor"/>
     <elementRef key="meeting"/>
     <elementRef key="respStmt"/>
    </alternate>
    <elementRef key="title"
     maxOccurs="unboundedminOccurs="1"/>

    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <classRef key="model.ptrLike"/>
     <elementRef key="idno"/>
     <elementRef key="textLang"/>
     <elementRef key="editor"/>
     <elementRef key="respStmt"/>
    </alternate>
   </sequence>
   <sequence>
    <alternate maxOccurs="unbounded"
     minOccurs="1">

     <elementRef key="title"/>
     <classRef key="model.ptrLike"/>
     <elementRef key="idno"/>
    </alternate>
    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <elementRef key="textLang"/>
     <elementRef key="author"/>
     <elementRef key="editor"/>
     <elementRef key="meeting"/>
     <elementRef key="respStmt"/>
    </alternate>
   </sequence>
   <sequence>
    <elementRef key="authority"/>
    <elementRef key="idno"/>
   </sequence>
  </alternate>
  <elementRef key="availability"
   maxOccurs="unboundedminOccurs="0"/>

  <classRef key="model.noteLike"
   maxOccurs="unboundedminOccurs="0"/>

  <sequence maxOccurs="unbounded"
   minOccurs="0">

   <elementRef key="edition"/>
   <alternate maxOccurs="unbounded"
    minOccurs="0">

    <elementRef key="idno"/>
    <classRef key="model.ptrLike"/>
    <elementRef key="editor"/>
    <elementRef key="sponsor"/>
    <elementRef key="funder"/>
    <elementRef key="respStmt"/>
   </alternate>
  </sequence>
  <elementRef key="imprint"/>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <elementRef key="imprint"/>
   <elementRef key="extent"/>
   <elementRef key="biblScope"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element monogr
{
   tei_att.global.attributes,
   (
      (
         (
            ( tei_author | tei_editor | meeting | respStmt ),
            ( tei_author | tei_editor | meeting | respStmt )*,
            tei_title+,
            ( tei_model.ptrLike | tei_idno | textLang | tei_editor | respStmt )*
         )
       | (
            ( tei_title | tei_model.ptrLike | tei_idno )+,
            ( textLang | tei_author | tei_editor | meeting | respStmt )*
         )
       | ( tei_authority, tei_idno )
      )?,
      tei_availability*,
      tei_model.noteLike*,
      (
         edition,
         (
            tei_idno
          | tei_model.ptrLike
          | tei_editor
          | sponsor
          | funder
          | respStmt
         )*
      )*,
      tei_imprint,
      ( tei_imprint | tei_extent | tei_biblScope )*
   )
}

<msContents>

<msContents> (manuscript contents) describes the intellectual content of a manuscript, manuscript part, or other object either as a series of paragraphs or as a series of structured manuscript items. [10.6. Intellectual Content]
Modulemsdescription
Attributes
Contained by
msdescription: msDesc
May contain
core: p
linking: ab
msdescription: msItem msItemStruct
textstructure: titlePage
Note

Unless it contains a simple prose description, this element should contain at least one of the elements <summary>, msItem, or msItemStruct. This constraint is not currently enforced by the schema.

Example
<msContents class="#sermons">
 <p>A collection of Lollard sermons</p>
</msContents>
Example
<msContents>
 <msItem n="1">
  <locus>fols. 5r-7v</locus>
  <title>An ABC</title>
  <bibl>
   <title>IMEV</title>
   <biblScope>239</biblScope>
  </bibl>
 </msItem>
 <msItem n="2">
  <locus>fols. 7v-8v</locus>
  <title xml:lang="frm">Lenvoy de Chaucer a Scogan</title>
  <bibl>
   <title>IMEV</title>
   <biblScope>3747</biblScope>
  </bibl>
 </msItem>
 <msItem n="3">
  <locus>fol. 8v</locus>
  <title>Truth</title>
  <bibl>
   <title>IMEV</title>
   <biblScope>809</biblScope>
  </bibl>
 </msItem>
 <msItem n="4">
  <locus>fols. 8v-10v</locus>
  <title>Birds Praise of Love</title>
  <bibl>
   <title>IMEV</title>
   <biblScope>1506</biblScope>
  </bibl>
 </msItem>
 <msItem n="5">
  <locus>fols. 10v-11v</locus>
  <title xml:lang="la">De amico ad amicam</title>
  <title xml:lang="la">Responcio</title>
  <bibl>
   <title>IMEV</title>
   <biblScope>16 &amp; 19</biblScope>
  </bibl>
 </msItem>
 <msItem n="6">
  <locus>fols. 14r-126v</locus>
  <title>Troilus and Criseyde</title>
  <note>Bk. 1:71-Bk. 5:1701, with additional losses due to mutilation throughout</note>
 </msItem>
</msContents>
Content model
<content>
 <alternate>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

  <sequence>
   <elementRef key="summaryminOccurs="0"/>
   <elementRef key="textLangminOccurs="0"/>
   <elementRef key="titlePage"
    minOccurs="0"/>

   <alternate maxOccurs="unbounded"
    minOccurs="0">

    <elementRef key="msItem"/>
    <elementRef key="msItemStruct"/>
   </alternate>
  </sequence>
 </alternate>
</content>
Schema Declaration
element msContents
{
   tei_att.global.attributes,
   tei_att.msExcerpt.attributes,
   tei_att.msClass.attributes,
   (
      tei_model.pLike+
    | (
         summary?,
         textLang?,
         tei_titlePage?,
         ( tei_msItem | tei_msItemStruct )*
      )
   )
}

<msDesc>

<msDesc> (manuscript description) contains a description of a single identifiable manuscript or other text-bearing object such as an early printed book. [10.1. Overview]
Modulemsdescription
Attributes
Member of
Contained by
May contain
core: head p
linking: ab
Note

Although the msDesc has primarily been designed with a view to encoding manuscript descriptions, it may also be used for other objects such as early printed books, fascicles, epigraphs, or any text-bearing objects that require substantial description. If an object is not text-bearing or the reasons for describing the object is not primarily the textual content, the more general <object> may be more suitable.

Example
<msDesc>
 <msIdentifier>
  <settlement>Oxford</settlement>
  <repository>Bodleian Library</repository>
  <idno type="Bod">MS Poet. Rawl. D. 169.</idno>
 </msIdentifier>
 <msContents>
  <msItem>
   <author>Geoffrey Chaucer</author>
   <title>The Canterbury Tales</title>
  </msItem>
 </msContents>
 <physDesc>
  <objectDesc>
   <p>A parchment codex of 136 folios, measuring approx
       28 by 19 inches, and containing 24 quires.</p>
   <p>The pages are margined and ruled throughout.</p>
   <p>Four hands have been identified in the manuscript: the first 44
       folios being written in two cursive anglicana scripts, while the
       remainder is for the most part in a mixed secretary hand.</p>
  </objectDesc>
 </physDesc>
</msDesc>
Schematron

<sch:rule context="tei:msContents|tei:physDesc|tei:history|tei:additional">
<sch:let name="givalue="name(.)"/>
<sch:report test="preceding-sibling::*[ name(.) eq $gi ] and not( following-sibling::*[ name(.) eq $gi ] )"> Only one <sch:name/> is allowed as a child of <sch:value-of select="name(..)"/>.
</sch:report>
</sch:rule>
Content model
<content>
 <sequence>
  <elementRef key="msIdentifier"/>
  <classRef key="model.headLike"
   maxOccurs="unboundedminOccurs="0"/>

  <alternate>
   <classRef key="model.pLike"
    maxOccurs="unboundedminOccurs="1"/>

   <alternate maxOccurs="unbounded"
    minOccurs="0">

    <elementRef key="msContents"/>
    <elementRef key="physDesc"/>
    <elementRef key="history"/>
    <elementRef key="additional"/>
    <elementRef key="msPart"/>
    <elementRef key="msFrag"/>
   </alternate>
  </alternate>
 </sequence>
</content>
Schema Declaration
element msDesc
{
   tei_att.global.attributes,
   tei_att.sortable.attributes,
   tei_att.typed.attributes,
   tei_att.declaring.attributes,
   tei_att.docStatus.attributes,
   (
      tei_msIdentifier,
      tei_model.headLike*,
      (
         tei_model.pLike+
       | (
            tei_msContents
          | tei_physDesc
          | tei_history
          | additional
          | msPart
          | msFrag
         )*
      )
   )
}

<msIdentifier>

<msIdentifier> (manuscript identifier) contains the information required to identify the manuscript or similar object being described. [10.4. The Manuscript Identifier]
Modulemsdescription
Attributes
Member of
Contained by
core: bibl
msdescription: msDesc
May contain
header: idno
msdescription: institution repository
namesdates: placeName settlement
Example
<msIdentifier>
 <settlement>San Marino</settlement>
 <repository>Huntington Library</repository>
 <idno>MS.El.26.C.9</idno>
</msIdentifier>
Schematron

<sch:report test="not(parent::tei:msPart) and (local-name(*[1])='idno' or local-name(*[1])='altIdentifier' or normalize-space(.)='')">An msIdentifier must contain either a repository or location.</sch:report>
Content model
<content>
 <sequence>
  <sequence>
   <classRef expand="sequenceOptional"
    key="model.placeNamePart"/>

   <elementRef key="institution"
    minOccurs="0"/>

   <elementRef key="repository"
    minOccurs="0"/>

   <elementRef key="collection"
    maxOccurs="unboundedminOccurs="0"/>

   <elementRef key="idno"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <elementRef key="msName"/>
   <elementRef key="objectName"/>
   <elementRef key="altIdentifier"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element msIdentifier
{
   tei_att.global.attributes,
   (
      (
         tei_placeName?,
         tei_settlement?,
         tei_institution?,
         tei_repository?,
         collection*,
         tei_idno*
      ),
      ( msName | objectName | altIdentifier )*
   )
}

<msItem>

<msItem> (manuscript item) describes an individual work or item within the intellectual content of a manuscript, manuscript part, or other object. [10.6.1. The msItem and msItemStruct Elements]
Modulemsdescription
Attributes
Contained by
msdescription: msContents
May contain
Example
<msItem class="#saga">
 <locus>ff. 1r-24v</locus>
 <title>Agrip af Noregs konunga sögum</title>
 <incipit>regi oc h<ex>ann</ex> setiho
 <gap extent="7reason="illegible"/>sc
   heim se<ex>m</ex> þio</incipit>
 <explicit>h<ex>on</ex> hev<ex>er</ex>
  <ex>oc</ex>þa buit hesta .ij. aNan viþ
   fé enh<ex>on</ex>o<ex>m</ex> aNan til
   reiþ<ex>ar</ex>
 </explicit>
 <textLang mainLang="non">Old Norse/Icelandic</textLang>
</msItem>
Content model
<content>
 <sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <elementRef key="locus"/>
   <elementRef key="locusGrp"/>
  </alternate>
  <alternate>
   <classRef key="model.pLike"
    maxOccurs="unboundedminOccurs="1"/>

   <alternate maxOccurs="unbounded"
    minOccurs="1">

    <classRef key="model.titlepagePart"/>
    <classRef key="model.msItemPart"/>
    <classRef key="model.global"/>
   </alternate>
  </alternate>
 </sequence>
</content>
Schema Declaration
element msItem
{
   tei_att.global.attributes,
   tei_att.msExcerpt.attributes,
   tei_att.msClass.attributes,
   (
      ( tei_locus | locusGrp )*,
      (
         tei_model.pLike+
       | ( tei_model.titlepagePart | model.msItemPart | tei_model.global )+
      )
   )
}

<msItemStruct>

<msItemStruct> (structured manuscript item) contains a structured description for an individual work or item within the intellectual content of a manuscript, manuscript part, or other object. [10.6.1. The msItem and msItemStruct Elements]
Modulemsdescription
Attributes
Contained by
msdescription: msContents msItemStruct
May contain
Example
<msItemStruct class="#biblComm"
 defective="falsen="2">

 <locus from="24vto="97v">24v-97v</locus>
 <author>Apringius de Beja</author>
 <title type="uniformxml:lang="la">Tractatus in Apocalypsin</title>
 <rubric>Incipit Trac<supplied reason="omitted">ta</supplied>tus
   in apoka<lb/>lipsin eruditissimi uiri <lb/> Apringi ep<ex>iscop</ex>i
   Pacensis eccl<ex>esi</ex>e</rubric>
 <finalRubric>EXPLIC<ex>IT</ex> EXPO<lb/>SITIO APOCALIPSIS
   QVA<ex>M</ex> EXPOSVIT DOM<lb/>NVS APRINGIUS EP<ex>ISCOPU</ex>S.
   DEO GR<ex>ACI</ex>AS AGO. FI<lb/>NITO LABORE ISTO.</finalRubric>
 <bibl>
  <ref target="http://amiBibl.xml#Apringius1900">Apringius</ref>, ed. Férotin</bibl>
 <textLang mainLang="la">Latin</textLang>
</msItemStruct>
Content model
<content>
 <sequence>
  <alternate minOccurs="0">
   <elementRef key="locus"/>
   <elementRef key="locusGrp"/>
  </alternate>
  <alternate>
   <classRef key="model.pLike"
    maxOccurs="unboundedminOccurs="1"/>

   <sequence>
    <elementRef key="author"
     maxOccurs="unboundedminOccurs="0"/>

    <elementRef key="respStmt"
     maxOccurs="unboundedminOccurs="0"/>

    <elementRef key="title"
     maxOccurs="unboundedminOccurs="0"/>

    <elementRef key="rubricminOccurs="0"/>
    <elementRef key="incipitminOccurs="0"/>
    <elementRef key="msItemStruct"
     maxOccurs="unboundedminOccurs="0"/>

    <elementRef key="explicit"
     minOccurs="0"/>

    <elementRef key="finalRubric"
     minOccurs="0"/>

    <elementRef key="colophon"
     maxOccurs="unboundedminOccurs="0"/>

    <elementRef key="decoNote"
     maxOccurs="unboundedminOccurs="0"/>

    <elementRef key="listBibl"
     maxOccurs="unboundedminOccurs="0"/>

    <alternate maxOccurs="unbounded"
     minOccurs="0">

     <elementRef key="bibl"/>
     <elementRef key="biblStruct"/>
    </alternate>
    <elementRef key="filiation"
     minOccurs="0"/>

    <classRef key="model.noteLike"
     maxOccurs="unboundedminOccurs="0"/>

    <elementRef key="textLang"
     minOccurs="0"/>

   </sequence>
  </alternate>
 </sequence>
</content>
Schema Declaration
element msItemStruct
{
   tei_att.global.attributes,
   tei_att.msExcerpt.attributes,
   tei_att.msClass.attributes,
   (
      ( tei_locus | locusGrp )?,
      (
         tei_model.pLike+
       | (
            tei_author*,
            respStmt*,
            tei_title*,
            rubric?,
            incipit?,
            tei_msItemStruct*,
            explicit?,
            finalRubric?,
            colophon*,
            tei_decoNote*,
            listBibl*,
            ( tei_bibl | tei_biblStruct )*,
            tei_filiation?,
            tei_model.noteLike*,
            textLang?
         )
      )
   )
}
<nameLink> (name link) contains a connecting phrase or link used within a name but not regarded as part of it, such as van der or of. [13.2.1. Personal Names]
Modulenamesdates
Attributes
Member of
Contained by
May contain
Example
<persName>
 <forename>Frederick</forename>
 <nameLink>van der</nameLink>
 <surname>Tronck</surname>
</persName>
Example
<persName>
 <forename>Alfred</forename>
 <nameLink>de</nameLink>
 <surname>Musset</surname>
</persName>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element nameLink
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_macro.phraseSeq
}

<normalization>

<normalization> (normalization) indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements]
Moduleheader
Attributes
methodindicates the method adopted to indicate normalizations within the text.
Status Optional
Datatype teidata.enumerated
Legal values are:
silent
normalization made silently [Default]
markup
normalization represented using markup
Contained by
May contain
core: p
linking: ab
Example
<editorialDecl>
 <normalization method="markup">
  <p>Where both upper- and lower-case i, j, u, v, and vv have been normalized, to modern
     20th century typographical practice, the <gi>choice</gi> element has been used to
     enclose <gi>orig</gi> and <gi>reg</gi> elements giving the original and new values
     respectively. ... </p>
 </normalization>
 <normalization method="silent">
  <p>Spacing between words and following punctuation has been regularized to zero spaces;
     spacing between words has been regularized to one space.</p>
 </normalization>
 <normalization source="http://www.dict.sztaki.hu/webster">
  <p>Spelling converted throughout to Modern American usage, based on Websters 9th
     Collegiate dictionary.</p>
 </normalization>
</editorialDecl>
Content model
<content>
 <classRef key="model.pLike"
  maxOccurs="unboundedminOccurs="1"/>

</content>
Schema Declaration
element normalization
{
   tei_att.global.attributes,
   tei_att.declarable.attributes,
   attribute method { "silent" | "markup" }?,
   tei_model.pLike+
}

<note>

<note> (note) contains a note or annotation. [3.9.1. Notes and Simple Annotation 2.2.6. The Notes Statement 3.12.2.8. Notes and Statement of Language 9.3.5.4. Notes within Entries]
Modulecore
Attributes
Member of
Contained by
May contain
Example

In the following example, the translator has supplied a footnote containing an explanation of the term translated as "painterly":

And yet it is not only
in the great line of Italian renaissance art, but even in the
painterly <note place="bottomresp="#MDMH"
 type="gloss">

 <term xml:lang="de">Malerisch</term>. This word has, in the German, two
distinct meanings, one objective, a quality residing in the object,
the other subjective, a mode of apprehension and creation. To avoid
confusion, they have been distinguished in English as
<mentioned>picturesque</mentioned> and
<mentioned>painterly</mentioned> respectively.
</note> style of the
Dutch genre painters of the seventeenth century that drapery has this
psychological significance.

<!-- elsewhere in the document -->
<respStmt xml:id="MDMH">
 <resp>translation from German to English</resp>
 <name>Hottinger, Marie Donald Mackie</name>
</respStmt>

For this example to be valid, the code MDMH must be defined elsewhere, for example by means of a responsibility statement in the associated TEI header.

Example

The global n attribute may be used to supply the symbol or number used to mark the note's point of attachment in the source text, as in the following example:

Mevorakh b. Saadya's mother, the matriarch of the
family during the second half of the eleventh century, <note anchored="truen="126"> The
alleged mention of Judah Nagid's mother in a letter from 1071 is, in fact, a reference to
Judah's children; cf. above, nn. 111 and 54. </note> is well known from Geniza documents
published by Jacob Mann.

However, if notes are numbered in sequence and their numbering can be reconstructed automatically by processing software, it may well be considered unnecessary to record the note numbers.

Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element note
{
   tei_att.global.attributes,
   tei_att.placement.attributes,
   tei_att.pointing.attributes,
   tei_att.typed.attributes,
   tei_att.written.attributes,
   tei_att.anchoring.attributes,
   tei_macro.specialPara
}

<notesStmt>

<notesStmt> (notes statement) collects together any notes providing information about a text additional to that recorded in other parts of the bibliographic description. [2.2.6. The Notes Statement 2.2. The File Description]
Moduleheader
Attributes
Contained by
header: fileDesc
May contain
Note

Information of different kinds should not be grouped together into the same note.

Example
<notesStmt>
 <note>Historical commentary provided by Mark Cohen</note>
 <note>OCR scanning done at University of Toronto</note>
</notesStmt>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="1">

  <classRef key="model.noteLike"/>
  <elementRef key="relatedItem"/>
 </alternate>
</content>
Schema Declaration
element notesStmt
{
   tei_att.global.attributes,
   ( tei_model.noteLike | tei_relatedItem )+
}

<num>

<num> (number) contains a number, written in any form. [3.6.3. Numbers and Measures]
Modulecore
Attributes
typeindicates the type of numeric value.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Suggested values include:
cardinal
absolute number, e.g. 21, 21.5
ordinal
ordinal number, e.g. 21st
fraction
fraction, e.g. one half or three-quarters
percentage
a percentage
Note

If a different typology is desired, other values can be used for this attribute.

valuesupplies the value of the number in standard form.
Status Optional
Datatype teidata.numeric
Values a numeric value.
Note

The standard form used is defined by the TEI datatype teidata.numeric.

Member of
Contained by
May contain
Note

Detailed analyses of quantities and units of measure in historical documents may also use the feature structure mechanism described in chapter 18. Feature Structures. The num element is intended for use in simple applications.

Example
<p>I reached <num type="cardinalvalue="21">twenty-one</num> on
my <num type="ordinalvalue="21">twenty-first</num> birthday</p>
<p>Light travels at <num value="3E10">3×10<hi rend="sup">10</hi>
 </num> cm per second.</p>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element num
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   tei_att.ranging.attributes,
   attribute type
   {
      "cardinal" | "ordinal" | "fraction" | "percentage" | teidata.enumerated
   }?,
   attribute value { teidata.numeric }?,
   tei_macro.phraseSeq
}

<objectDesc>

<objectDesc> (object description) contains a description of the physical components making up the object which is being described. [10.7.1. Object Description]
Modulemsdescription
Attributes
form(form) a short project-specific name identifying the physical form of the carrier, for example as a codex, roll, fragment, partial leaf, cutting etc.
Status Optional
Datatype teidata.enumerated
Note

Definitions for the terms used may typically be provided by a <valList> element in the project schema specification.

Member of
Contained by
msdescription: physDesc
May contain
core: p
linking: ab
msdescription: layoutDesc supportDesc
Example
<objectDesc form="codex">
 <supportDesc material="mixed">
  <p>Early modern
  <material>parchment</material> and
  <material>paper</material>.</p>
 </supportDesc>
 <layoutDesc>
  <layout ruledLines="25 32"/>
 </layoutDesc>
</objectDesc>
Content model
<content>
 <alternate>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

  <sequence>
   <elementRef key="supportDesc"
    minOccurs="0"/>

   <elementRef key="layoutDesc"
    minOccurs="0"/>

  </sequence>
 </alternate>
</content>
Schema Declaration
element objectDesc
{
   tei_att.global.attributes,
   attribute form { teidata.enumerated }?,
   ( tei_model.pLike+ | ( tei_supportDesc?, tei_layoutDesc? ) )
}

<opener>

<opener> (opener) groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter. [4.2. Elements Common to All Divisions]
Moduletextstructure
Attributes
Member of
Contained by
core: lg
textstructure: body div
May contain
Example
<opener>
 <dateline>Walden, this 29. of August 1592</dateline>
</opener>
Example
<opener>
 <dateline>
  <name type="place">Great Marlborough Street</name>
  <date>November 11, 1848</date>
 </dateline>
 <salute>My dear Sir,</salute>
</opener>
<p>I am sorry to say that absence from town and other circumstances have prevented me from
earlier enquiring...</p>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <elementRef key="argument"/>
  <elementRef key="byline"/>
  <elementRef key="dateline"/>
  <elementRef key="epigraph"/>
  <elementRef key="salute"/>
  <elementRef key="signed"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element opener
{
   tei_att.global.attributes,
   tei_att.written.attributes,
   (
      text
    | tei_model.gLike
    | tei_model.phrase
    | argument
    | tei_byline
    | tei_dateline
    | tei_epigraph
    | tei_salute
    | tei_signed
    | tei_model.global
   )*
}

<orig>

<orig> (original form) contains a reading which is marked as following the original, rather than being normalized or corrected. [3.5.2. Regularization and Normalization 12. Critical Apparatus]
Modulecore
Attributes
Member of
Contained by
May contain
Example

If all that is desired is to call attention to the original version in the copy text, orig may be used alone:

<l>But this will be a <orig>meere</orig> confusion</l>
<l>And hardly shall we all be <orig>vnderstoode</orig>
</l>
Example

More usually, an orig will be combined with a regularized form within a choice element:

<l>But this will be a <choice>
  <orig>meere</orig>
  <reg>mere</reg>
 </choice> confusion</l>
<l>And hardly shall we all be <choice>
  <orig>vnderstoode</orig>
  <reg>understood</reg>
 </choice>
</l>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element orig { tei_att.global.attributes, tei_macro.paraContent }

<p>

<p> (paragraph) marks paragraphs in prose. [3.1. Paragraphs 7.2.5. Speech Contents]
Modulecore
Attributes
Member of
Contained by
May contain
Example
<p>Hallgerd was outside. <q>There is blood on your axe,</q> she said. <q>What have you
   done?</q>
</p>
<p>
 <q>I have now arranged that you can be married a second time,</q> replied Thjostolf.
</p>
<p>
 <q>Then you must mean that Thorvald is dead,</q> she said.
</p>
<p>
 <q>Yes,</q> said Thjostolf. <q>And now you must think up some plan for me.</q>
</p>
Schematron

<sch:report test="(ancestor::tei:ab or ancestor::tei:p) and not( ancestor::tei:floatingText |parent::tei:exemplum |parent::tei:item |parent::tei:note |parent::tei:q |parent::tei:quote |parent::tei:remarks |parent::tei:said |parent::tei:sp |parent::tei:stage |parent::tei:cell |parent::tei:figure )"> Abstract model violation: Paragraphs may not occur inside other paragraphs or ab elements.
</sch:report>
Schematron

<sch:report test="(ancestor::tei:l or ancestor::tei:lg) and not( ancestor::tei:floatingText |parent::tei:figure |parent::tei:note )"> Abstract model violation: Lines may not contain higher-level structural elements such as div, p, or ab, unless p is a child of figure or note, or is a descendant of floatingText.
</sch:report>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element p
{
   tei_att.global.attributes,
   tei_att.declaring.attributes,
   tei_att.fragmentable.attributes,
   tei_att.written.attributes,
   tei_macro.paraContent
}

<pb>

<pb> (page beginning) marks the beginning of a new page in a paginated document. [3.11.3. Milestone Elements]
Modulecore
Attributes
Member of
Contained by
May containEmpty element
Note

A pb element should appear at the start of the page which it identifies. The global n attribute indicates the number or other value associated with this page. This will normally be the page number or signature printed on it, since the physical sequence number is implicit in the presence of the pb element itself.

The type attribute may be used to characterize the page break in any respect. The more specialized attributes break, ed, or edRef should be preferred when the intent is to indicate whether or not the page break is word-breaking, or to note the source from which it derives.

Example

Page numbers may vary in different editions of a text.

<p> ... <pb ed="ed2n="145"/>
<!-- Page 145 in edition "ed2" starts here --> ... <pb ed="ed1n="283"/>
<!-- Page 283 in edition "ed1" starts here--> ... </p>
Example

A page break may be associated with a facsimile image of the page it introduces by means of the facs attribute

<body>
 <pb facs="page1.pngn="1"/>
<!-- page1.png contains an image of the page; the text it contains is encoded here -->
 <p>
<!-- ... -->
 </p>
 <pb facs="page2.pngn="2"/>
<!-- similarly, for page 2 -->
 <p>
<!-- ... -->
 </p>
</body>
Content model
<content>
 <empty/>
</content>
Schema Declaration
element pb
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_att.edition.attributes,
   tei_att.spanning.attributes,
   tei_att.breaking.attributes,
   empty
}

<pc>

<pc> (punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark. [17.1.2. Below the Word Level 17.4.2. Lightweight Linguistic Annotation]
Moduleanalysis
Attributes
forceindicates the extent to which this punctuation mark conventionally separates words or phrases
Status Optional
Datatype teidata.enumerated
Legal values are:
strong
the punctuation mark is a word separator
weak
the punctuation mark is not a word separator
inter
the punctuation mark may or may not be a word separator
unitprovides a name for the kind of unit delimited by this punctuation mark.
Status Optional
Datatype teidata.enumerated
preindicates whether this punctuation mark precedes or follows the unit it delimits.
Status Optional
Datatype teidata.truthValue
Member of
Contained by
May contain
Example
<phr>
 <w>do</w>
 <w>you</w>
 <w>understand</w>
 <pc type="interrogative">?</pc>
</phr>
Example

Example encoding of the German sentence Wir fahren in den Urlaub., encoded with attributes from att.linguistic discussed in section [ID AILALW in TEI Guidelines].

<s>
 <w msd="1.Pl.*.Nompos="PPER">Wir</w>
 <w msd="1.Pl.Pres.Indpos="VVFIN">fahren</w>
 <w msd="--pos="APPR">in</w>
 <w msd="Def.Masc.Akk.Sg.pos="ART">den</w>
 <w msd="Masc.Akk.Sg.pos="NN">Urlaub</w>
 <pc join="leftmsd="--pos="$.">.</pc>
</s>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <elementRef key="c"/>
  <classRef key="model.pPart.edit"/>
 </alternate>
</content>
Schema Declaration
element pc
{
   tei_att.global.attributes,
   tei_att.segLike.attributes,
   tei_att.typed.attributes,
   tei_att.linguistic.attributes,
   attribute force { "strong" | "weak" | "inter" }?,
   attribute unit { teidata.enumerated }?,
   attribute pre { teidata.truthValue }?,
   ( text | tei_model.gLike | tei_c | tei_model.pPart.edit )*
}

<persName>

<persName> (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc. [13.2.1. Personal Names]
Modulenamesdates
Attributes
calendarindicates one or more systems or calendars to which the date represented by the content of this element belongs.
Deprecatedwill be removed on 2024-11-11
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Member of
Contained by
May contain
Example
<persName>
 <forename>Edward</forename>
 <forename>George</forename>
 <surname type="linked">Bulwer-Lytton</surname>, <roleName>Baron Lytton of
 <placeName>Knebworth</placeName>
 </roleName>
</persName>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element persName
{
   tei_att.global.attributes,
   tei_att.datable.attributes,
   tei_att.editLike.attributes,
   tei_att.personal.attributes,
   tei_att.typed.attributes,
   attribute calendar { list { teidata.pointer+ } }?,
   tei_macro.phraseSeq
}

<physDesc>

<physDesc> (physical description) contains a full physical description of a manuscript, manuscript part, or other object optionally subdivided using more specialized elements from the model.physDescPart class. [10.7. Physical Description]
Modulemsdescription
Attributes
Contained by
msdescription: msDesc
May contain
core: p
linking: ab
Example
<physDesc>
 <objectDesc form="codex">
  <supportDesc material="perg">
   <support>Parchment.</support>
   <extent>i + 55 leaves
   <dimensions scope="alltype="leaf"
     unit="inch">

     <height></height>
     <width>5⅜</width>
    </dimensions>
   </extent>
  </supportDesc>
  <layoutDesc>
   <layout columns="2">In double columns.</layout>
  </layoutDesc>
 </objectDesc>
 <handDesc>
  <p>Written in more than one hand.</p>
 </handDesc>
 <decoDesc>
  <p>With a few coloured capitals.</p>
 </decoDesc>
</physDesc>
Content model
<content>
 <sequence>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="0"/>

  <classRef expand="sequenceOptional"
   key="model.physDescPart"/>

 </sequence>
</content>
Schema Declaration
element physDesc
{
   tei_att.global.attributes,
   (
      tei_model.pLike*,
      tei_objectDesc?,
      tei_handDesc?,
      tei_bindingDesc?,
      tei_accMat?
   )
}

<placeName>

<placeName> (place name) contains an absolute or relative place name. [13.2.3. Place Names]
Modulenamesdates
Attributes
calendarindicates one or more systems or calendars to which the date represented by the content of this element belongs.
Deprecatedwill be removed on 2024-11-11
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Member of
Contained by
May contain
Example
<placeName>
 <settlement>Rochester</settlement>
 <region>New York</region>
</placeName>
Example
<placeName>
 <geogName>Arrochar Alps</geogName>
 <region>Argylshire</region>
</placeName>
Example
<placeName>
 <measure>10 miles</measure>
 <offset>Northeast of</offset>
 <settlement>Attica</settlement>
</placeName>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element placeName
{
   tei_att.datable.attributes,
   tei_att.editLike.attributes,
   tei_att.global.attributes,
   tei_att.personal.attributes,
   tei_att.typed.attributes,
   attribute calendar { list { teidata.pointer+ } }?,
   tei_macro.phraseSeq
}

<profileDesc>

<profileDesc> (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting. [2.4. The Profile Description 2.1.1. The TEI Header and Its Components]
Moduleheader
Attributes
Member of
Contained by
header: teiHeader
May contain
Note

Although the content model permits it, it is rarely meaningful to supply multiple occurrences for any of the child elements of profileDesc unless these are documenting multiple texts.

Example
<profileDesc>
 <langUsage>
  <language ident="fr">French</language>
 </langUsage>
 <textDesc n="novel">
  <channel mode="w">print; part issues</channel>
  <constitution type="single"/>
  <derivation type="original"/>
  <domain type="art"/>
  <factuality type="fiction"/>
  <interaction type="none"/>
  <preparedness type="prepared"/>
  <purpose degree="hightype="entertain"/>
  <purpose degree="mediumtype="inform"/>
 </textDesc>
 <settingDesc>
  <setting>
   <name>Paris, France</name>
   <time>Late 19th century</time>
  </setting>
 </settingDesc>
</profileDesc>
Content model
<content>
 <classRef key="model.profileDescPart"
  maxOccurs="unboundedminOccurs="0"/>

</content>
Schema Declaration
element profileDesc { tei_att.global.attributes, tei_model.profileDescPart* }

<projectDesc>

<projectDesc> (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. [2.3.1. The Project Description 2.3. The Encoding Description 15.3.2. Declarable Elements]
Moduleheader
Attributes
Member of
Contained by
header: encodingDesc
May contain
core: p
linking: ab
Example
<projectDesc>
 <p>Texts collected for use in the Claremont Shakespeare Clinic, June 1990</p>
</projectDesc>
Content model
<content>
 <classRef key="model.pLike"
  maxOccurs="unboundedminOccurs="1"/>

</content>
Schema Declaration
element projectDesc
{
   tei_att.global.attributes,
   tei_att.declarable.attributes,
   tei_model.pLike+
}

<provenance>

<provenance> (provenance) contains any descriptive or other information concerning a single identifiable episode during the history of a manuscript, manuscript part, or other object after its creation but before its acquisition. [10.8. History]
Modulemsdescription
Attributes
calendarindicates one or more systems or calendars to which the date represented by the content of this element belongs.
Deprecatedwill be removed on 2024-11-11
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Contained by
msdescription: history
May contain
Example
<provenance>Listed as the property of Lawrence Sterne in 1788.</provenance>
<provenance>Sold at Sothebys in 1899.</provenance>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element provenance
{
   tei_att.global.attributes,
   tei_att.datable.attributes,
   tei_att.typed.attributes,
   attribute calendar { list { teidata.pointer+ } }?,
   tei_macro.specialPara
}

<ptr>

<ptr> (pointer) defines a pointer to another location. [3.7. Simple Links and Cross-References 16.1. Links]
Modulecore
Attributes
Member of
Contained by
May containEmpty element
Example
<ptr target="#p143 #p144"/>
<ptr target="http://www.tei-c.org"/>
<ptr cRef="1.3.4"/>
Schematron

<sch:report test="@target and @cRef">Only one of the
attributes @target and @cRef may be supplied on <sch:name/>.</sch:report>
Content model
<content>
 <empty/>
</content>
Schema Declaration
element ptr
{
   tei_att.cReferencing.attributes,
   tei_att.declaring.attributes,
   tei_att.global.attributes,
   tei_att.pointing.attributes,
   tei_att.typed.attributes,
   empty
}

<publicationStmt>

<publicationStmt> (publication statement) groups information concerning the publication or distribution of an electronic or other text. [2.2.4. Publication, Distribution, Licensing, etc. 2.2. The File Description]
Moduleheader
Attributes
Contained by
header: fileDesc
May contain
Note

Where a publication statement contains several members of the model.publicationStmtPart.agency or model.publicationStmtPart.detail classes rather than one or more paragraphs or anonymous blocks, care should be taken to ensure that the repeated elements are presented in a meaningful order. It is a conformance requirement that elements supplying information about publication place, address, identifier, availability, and date be given following the name of the publisher, distributor, or authority concerned, and preferably in that order.

Example
<publicationStmt>
 <publisher>C. Muquardt </publisher>
 <pubPlace>Bruxelles &amp; Leipzig</pubPlace>
 <date when="1846"/>
</publicationStmt>
Example
<publicationStmt>
 <publisher>Chadwyck Healey</publisher>
 <pubPlace>Cambridge</pubPlace>
 <availability>
  <p>Available under licence only</p>
 </availability>
 <date when="1992">1992</date>
</publicationStmt>
Example
<publicationStmt>
 <publisher>Zea Books</publisher>
 <pubPlace>Lincoln, NE</pubPlace>
 <date>2017</date>
 <availability>
  <p>This is an open access work licensed under a Creative Commons Attribution 4.0 International license.</p>
 </availability>
 <ptr target="http://digitalcommons.unl.edu/zeabook/55"/>
</publicationStmt>
Content model
<content>
 <alternate>
  <sequence maxOccurs="unbounded"
   minOccurs="1">

   <classRef key="model.publicationStmtPart.agency"/>
   <classRef key="model.publicationStmtPart.detail"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

 </alternate>
</content>
Schema Declaration
element publicationStmt
{
   tei_att.global.attributes,
   (
      (
         tei_model.publicationStmtPart.agency,
         tei_model.publicationStmtPart.detail*
      )+
    | tei_model.pLike+
   )
}

<publisher>

<publisher> (publisher) provides the name of the organization responsible for the publication or distribution of a bibliographic item. [3.12.2.4. Imprint, Size of a Document, and Reprint Information 2.2.4. Publication, Distribution, Licensing, etc.]
Modulecore
Attributes
Member of
Contained by
textstructure: docImprint
May contain
Note

Use the full form of the name by which a company is usually referred to, rather than any abbreviation of it which may appear on a title page

Example
<imprint>
 <pubPlace>Oxford</pubPlace>
 <publisher>Clarendon Press</publisher>
 <date>1987</date>
</imprint>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element publisher
{
   tei_att.global.attributes,
   tei_att.canonical.attributes,
   tei_macro.phraseSeq
}

<pubPlace>

<pubPlace> (publication place) contains the name of the place where a bibliographic item was published. [3.12.2.4. Imprint, Size of a Document, and Reprint Information]
Modulecore
Attributes
Member of
Contained by
textstructure: docImprint
May contain
Example
<publicationStmt>
 <publisher>Oxford University Press</publisher>
 <pubPlace>Oxford</pubPlace>
 <date>1989</date>
</publicationStmt>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element pubPlace
{
   tei_att.global.attributes,
   tei_att.naming.attributes,
   tei_macro.phraseSeq
}

<q>

<q> (quoted) contains material which is distinguished from the surrounding text using quotation marks or a similar method, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used. [3.3.3. Quotation]
Modulecore
Attributes
type(type) may be used to indicate whether the offset passage is spoken or thought, or to characterize it more finely.
Status Optional
Datatype teidata.enumerated
Suggested values include:
spoken
(spoken) representation of speech
thought
(thought) representation of thought, e.g. internal monologue
written
(written) quotation from a written source
soCalled
(so called) authorial distance
foreign
(foreign) foreign words
distinct
(distinct) linguistically distinct
term
technical term
emph
(emph) rhetorically emphasized
mentioned
(mentioned) refering to itself, not its normal referent
Member of
Contained by
May contain
Note

May be used to indicate that a passage is distinguished from the surrounding text for reasons concerning which no claim is made. When used in this manner, q may be thought of as syntactic sugar for hi with a value of rend that indicates the use of such mechanisms as quotation marks.

Example
It is spelled <q>Tübingen</q> — to enter the
letter <q>u</q> with an umlaut hold down the <q>option</q> key and press
<q>0 0 f c</q>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element q
{
   tei_att.global.attributes,
   tei_att.ascribed.directed.attributes,
   attribute type
   {
      "spoken"
    | "thought"
    | "written"
    | "soCalled"
    | "foreign"
    | "distinct"
    | "term"
    | "emph"
    | "mentioned"
    | teidata.enumerated
   }?,
   tei_macro.specialPara
}

<quote>

<quote> (quotation) contains a phrase or passage attributed by the narrator or author to some agency external to the text. [3.3.3. Quotation 4.3.1. Grouped Texts]
Modulecore
Attributes
Member of
Contained by
May contain
Note

If a bibliographic citation is supplied for the source of a quotation, the two may be grouped using the cit element.

Example
Lexicography has shown little sign of being affected by the
work of followers of J.R. Firth, probably best summarized in his
slogan, <quote>You shall know a word by the company it
keeps</quote>
<ref>(Firth, 1957)</ref>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element quote
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_att.msExcerpt.attributes,
   tei_att.notated.attributes,
   tei_macro.specialPara
}

<rdg>

<rdg> (reading) contains a single reading within a textual variation. [12.1. The Apparatus Entry, Readings, and Witnesses]
Moduletextcrit
Attributes
Member of
Contained by
textcrit: app rdgGrp
May contain
Example
<rdg wit="#Ra2">Eryment</rdg>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.divLike"/>
  <classRef key="model.divPart"/>
  <elementRef key="titlePage"/>
  <elementRef key="argument"/>
  <elementRef key="byline"/>
  <elementRef key="docAuthor"/>
  <elementRef key="docDate"/>
  <elementRef key="docEdition"/>
  <elementRef key="docImprint"/>
  <elementRef key="docTitle"/>
  <elementRef key="epigraph"/>
  <elementRef key="imprimatur"/>
  <elementRef key="titlePart"/>
  <elementRef key="epilogue"/>
  <elementRef key="performance"/>
  <elementRef key="prologue"/>
  <elementRef key="set"/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <classRef key="model.inter"/>
  <classRef key="model.global"/>
  <classRef key="model.rdgPart"/>
 </alternate>
</content>
Schema Declaration
element rdg
{
   tei_att.global.attributes,
   tei_att.textCritical.attributes,
   tei_att.witnessed.attributes,
   (
      text
    | tei_model.divLike
    | tei_model.divPart
    | tei_titlePage
    | argument
    | tei_byline
    | tei_docAuthor
    | docDate
    | docEdition
    | tei_docImprint
    | tei_docTitle
    | tei_epigraph
    | imprimatur
    | tei_titlePart
    | epilogue
    | performance
    | prologue
    | set
    | tei_model.gLike
    | tei_model.phrase
    | tei_model.inter
    | tei_model.global
    | tei_model.rdgPart
   )*
}

<rdgGrp>

<rdgGrp> (reading group) within a textual variation, groups two or more readings perceived to have a genetic relationship or other affinity. [12.1. The Apparatus Entry, Readings, and Witnesses]
Moduletextcrit
Attributes
Contained by
textcrit: app rdgGrp
May contain
core: note
Note

May contain readings and nested reading groups.

Usually, only one <lem> element should appear within a single apparatus entry, whether it appears outside a rdgGrp element or within it.

Example
<app>
 <lem wit="#El #Ra2">though</lem>
 <rdgGrp type="orthographic">
  <rdg wit="#Hg">thogh</rdg>
  <rdg wit="#La">thouhe</rdg>
 </rdgGrp>
</app>
Schematron

<sch:assert test="count(tei:lem) < 2">Only one <lem> element may appear within a <rdgGrp></sch:assert>
Content model
<content>
 <sequence>
  <elementRef key="lemminOccurs="0"/>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.rdgLike"/>
   <classRef key="model.noteLike"/>
   <elementRef key="witDetail"/>
   <elementRef key="wit"/>
   <elementRef key="rdgGrp"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element rdgGrp
{
   tei_att.global.attributes,
   tei_att.textCritical.attributes,
   (
      lem?,
      (
         tei_model.rdgLike
       | tei_model.noteLike
       | tei_witDetail
       | tei_wit
       | tei_rdgGrp
      )*
   )
}

<redo>

<redo> indicates one or more cancelled interventions in a document which have subsequently been marked as reaffirmed or repeated. [11.3.4.4. Confirmation, Cancellation, and Reinstatement of Modifications]
Moduletranscr
Attributes
targetpoints to one or more elements representing the interventions which are being reasserted.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Member of
Contained by
May containEmpty element
Example
<line>
 <redo cause="fixhand="#g_t"
  target="#redo-1"/>

 <mod hand="#g_blrend="strikethrough"
  spanTo="#anchor-1xml:id="redo-1"/>
Ihr hagren, triſten, krummgezog<mod rend="strikethrough">nen</mod>ener Nacken
</line>
<line>Wenn ihr nur piepſet iſt die Welt ſchon matt.<anchor xml:id="anchor-1"/>
</line>

This encoding represents the following sequence of events:

  • "Ihr hagren, triſten, krummgezog nenener Nacken/ Wenn ihr nur piepſet iſt die Welt ſchon matt." is written
  • the redundant letters "nen" in "nenener" are deleted
  • the whole passage is deleted by hand g_bl using strikethrough
  • the deletion is reasserted by another hand (identified here as g_t)
Content model
<content>
 <empty/>
</content>
Schema Declaration
element redo
{
   tei_att.global.attributes,
   tei_att.spanning.attributes,
   tei_att.transcriptional.attributes,
   tei_att.dimensions.attributes,
   attribute target { list { teidata.pointer+ } }?,
   empty
}

<ref>

<ref> (reference) defines a reference to another location, possibly modified by additional text or comment. [3.7. Simple Links and Cross-References 16.1. Links]
Modulecore
Attributes
Member of
Contained by
May contain
Note

The target and cRef attributes are mutually exclusive.

Example
See especially <ref target="http://www.natcorp.ox.ac.uk/Texts/A02.xml#s2">the second
sentence</ref>
Example
See also <ref target="#locution">s.v. <term>locution</term>
</ref>.
Schematron

<sch:report test="@target and @cRef">Only one of the
attributes @target' and @cRef' may be supplied on <sch:name/>
</sch:report>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element ref
{
   tei_att.cReferencing.attributes,
   tei_att.declaring.attributes,
   tei_att.global.attributes,
   tei_att.pointing.attributes,
   tei_att.typed.attributes,
   tei_macro.paraContent
}

<reg>

<reg> (regularization) contains a reading which has been regularized or normalized in some sense. [3.5.2. Regularization and Normalization 12. Critical Apparatus]
Modulecore
Attributes
Member of
Contained by
May contain
Example

If all that is desired is to call attention to the fact that the copy text has been regularized, reg may be used alone:

<q>Please <reg>knock</reg> if an <reg>answer</reg> is <reg>required</reg>
</q>
Example

It is also possible to identify the individual responsible for the regularization, and, using the choice and orig elements, to provide both the original and regularized readings:

<q>Please <choice>
  <reg resp="#LB">knock</reg>
  <orig>cnk</orig>
 </choice> if an <choice>
  <reg>answer</reg>
  <orig>nsr</orig>
 </choice> is <choice>
  <reg>required</reg>
  <orig>reqd</orig>
 </choice>
</q>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element reg
{
   tei_att.global.attributes,
   tei_att.editLike.attributes,
   tei_att.typed.attributes,
   tei_macro.paraContent
}

<relatedItem>

<relatedItem> contains or references some other bibliographic item which is related to the present one in some specified manner, for example as a constituent or alternative version of it. [3.12.2.7. Related Items]
Modulecore
Attributes
targetpoints to the related bibliographic element by means of an absolute or relative URI reference
Status Optional
Datatype teidata.pointer
Member of
Contained by
May contain
msdescription: msDesc
Note

If the target attribute is used to reference the related bibliographic item, the element must be empty.

Example
<biblStruct>
 <monogr>
  <author>Shirley, James</author>
  <title type="main">The gentlemen of Venice</title>
  <imprint>
   <pubPlace>New York</pubPlace>
   <publisher>Readex Microprint</publisher>
   <date>1953</date>
  </imprint>
  <extent>1 microprint card, 23 x 15 cm.</extent>
 </monogr>
 <series>
  <title>Three centuries of drama: English, 1642–1700</title>
 </series>
 <relatedItem type="otherForm">
  <biblStruct>
   <monogr>
    <author>Shirley, James</author>
    <title type="main">The gentlemen of Venice</title>
    <title type="sub">a tragi-comedie presented at the private house in Salisbury
         Court by Her Majesties servants</title>
    <imprint>
     <pubPlace>London</pubPlace>
     <publisher>H. Moseley</publisher>
     <date>1655</date>
    </imprint>
    <extent>78 p.</extent>
   </monogr>
  </biblStruct>
 </relatedItem>
</biblStruct>
Schematron

<sch:report test="@target and count( child::* ) > 0">If the @target attribute on <sch:name/> is used, the
relatedItem element must be empty</sch:report>
<sch:assert test="@target or child::*">A relatedItem element should have either a 'target' attribute
or a child element to indicate the related bibliographic item</sch:assert>
Content model
<content>
 <alternate minOccurs="0">
  <classRef key="model.biblLike"/>
  <classRef key="model.ptrLike"/>
 </alternate>
</content>
Schema Declaration
element relatedItem
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   attribute target { teidata.pointer }?,
   ( tei_model.biblLike | tei_model.ptrLike )?
}

<repository>

<repository> (repository) contains the name of a repository within which manuscripts or other objects are stored, possibly forming part of an institution. [10.4. The Manuscript Identifier]
Modulemsdescription
Attributes
Contained by
msdescription: msIdentifier
May contain
Example
<msIdentifier>
 <settlement>Oxford</settlement>
 <institution>University of Oxford</institution>
 <repository>Bodleian Library</repository>
 <idno>MS. Bodley 406</idno>
</msIdentifier>
Content model
<content>
 <macroRef key="macro.phraseSeq.limited"/>
</content>
Schema Declaration
element repository
{
   tei_att.global.attributes,
   tei_att.naming.attributes,
   tei_macro.phraseSeq.limited
}

<restore>

<restore> (restore) indicates restoration of text to an earlier state by cancellation of an editorial or authorial marking or instruction. [11.3.1.6. Cancellation of Deletions and Other Markings]
Moduletranscr
Attributes
Member of
Contained by
May contain
Note

On this element, the type attribute categorizes the way that the cancelled intervention has been indicated in some way, for example by means of a marginal note, over-inking, additional markup, etc.

Example
For I hate this
<restore hand="#dhl"
 type="marginalStetNote">

 <del>my</del>
</restore> body
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element restore
{
   tei_att.global.attributes,
   tei_att.transcriptional.attributes,
   tei_att.typed.attributes,
   tei_att.dimensions.attributes,
   tei_macro.paraContent
}

<retrace>

<retrace> contains a sequence of writing which has been retraced, for example by over-inking, to clarify or fix it. [11.3.4.3. Fixation and Clarification]
Moduletranscr
Attributes
Member of
Contained by
May contain
Note

Multiple retraces are indicated by nesting one retrace within another. In principle, a retrace differs from a substitution in that second and subsequent rewrites do not materially alter the content of an element. Where minor changes have been made during the retracing action however these may be marked up using del, add, etc. with an appropriate value for the change attribute.

Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element retrace
{
   tei_att.global.attributes,
   tei_att.spanning.attributes,
   tei_att.transcriptional.attributes,
   tei_att.dimensions.attributes,
   tei_macro.paraContent
}

<revisionDesc>

<revisionDesc> (revision description) summarizes the revision history for a file. [2.6. The Revision Description 2.1.1. The TEI Header and Its Components]
Moduleheader
Attributes
Contained by
header: teiHeader
May contain
Note

If present on this element, the status attribute should indicate the current status of the document. The same attribute may appear on any change to record the status at the time of that change. Conventionally change elements should be given in reverse date order, with the most recent change at the start of the list.

Example
<revisionDesc status="embargoed">
 <change when="1991-11-11who="#LB"> deleted chapter 10 </change>
</revisionDesc>
Content model
<content>
 <alternate>
  <elementRef key="list"
   maxOccurs="unboundedminOccurs="1"/>

  <elementRef key="listChange"
   maxOccurs="unboundedminOccurs="1"/>

  <elementRef key="change"
   maxOccurs="unboundedminOccurs="1"/>

 </alternate>
</content>
Schema Declaration
element revisionDesc
{
   tei_att.global.attributes,
   tei_att.docStatus.attributes,
   ( list+ | tei_listChange+ | tei_change+ )
}

<rhyme>

<rhyme> marks the rhyming part of a metrical line. [6.5. Rhyme]
Moduleverse
Attributes
labelprovides a label (usually a single letter) to identify which part of a rhyme scheme this rhyming string instantiates.
Status Recommended
Datatype teidata.word
Note

Within a particular scope, all rhyme elements with the same value for their label attribute are assumed to rhyme with each other. The scope is defined by the nearest ancestor element for which the rhyme attribute has been supplied.

Member of
Contained by
May contain
Example
<lg rhyme="abababcc">
 <l>'Tis pity learned virgins ever <rhyme label="a">wed</rhyme>
 </l>
 <l>With persons of no sort of edu<rhyme label="b">cation</rhyme>,</l>
 <l>Or gentlemen, who, though well born and <rhyme label="a">bred</rhyme>,</l>
 <l>Grow tired of scientific conver<rhyme label="b">sation</rhyme>:</l>
 <l>I don't choose to say much on this <rhyme label="a">head</rhyme>,</l>
 <l>I'm a plain man, and in a single <rhyme label="b">station</rhyme>,</l>
 <l>But — Oh! ye lords of ladies inte<rhyme label="c">llectual</rhyme>,</l>
 <l>Inform us truly, have they not hen-<rhyme label="c">peck'd you all</rhyme>?</l>
</lg>
Example
<lg>
 <l>Tyger! Tyger! burning <rhyme label="a">bright</rhyme>
 </l>
 <l>In the forests of the <rhyme label="a">night</rhyme>,</l>
 <l>What immortal hand or <rhyme label="b">eye</rhyme>
 </l>
 <l>Could frame thy fearful <rhyme label="btype="eye-rhyme">symmetry</rhyme>?</l>
</lg>
Example
<lg>
 <l>"Hark! Lakshman! Hark, again that <rhyme label="a">cry</rhyme>!</l>
 <l>It is, — it is my husband's <rhyme label="b">voice</rhyme>!</l>
 <l>hasten, to his succour <rhyme label="a">fly</rhyme>,</l>
 <l>No more hast thou, dear friend, a <rhyme label="b">choice</rhyme>.</l>
 <l>He calls on thee, perhaps his <rhyme label="c">foes</rhyme>
 </l>
 <l>Environ him on all sides <rhyme label="d">round</rhyme>,</l>
 <l>That wail, — it means death's final <rhyme label="c">throes</rhyme>!</l>
 <l>Why standest thou, as magic-<rhyme label="d">bound</rhyme>?</l>
</lg>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element rhyme
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   attribute label { teidata.word }?,
   tei_macro.paraContent
}

<rs>

<rs> (referencing string) contains a general purpose name or referring string. [13.2.1. Personal Names 3.6.1. Referring Strings]
Modulecore
Attributes
Member of
Contained by
May contain
Example
<q>My dear <rs type="person">Mr. Bennet</rs>, </q> said <rs type="person">his lady</rs>
to him one day,
<q>have you heard that <rs type="place">Netherfield Park</rs> is let at
last?</q>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element rs
{
   tei_att.global.attributes,
   tei_att.naming.attributes,
   tei_att.typed.attributes,
   tei_macro.phraseSeq
}

<s>

<s> (s-unit) contains a sentence-like division of a text. [17.1. Linguistic Segment Categories 8.4.1. Segmentation]
Moduleanalysis
Attributes
Member of
Contained by
May contain
Note

The s element may be used to mark orthographic sentences, or any other segmentation of a text, provided that the segmentation is end-to-end, complete, and non-nesting. For segmentation which is partial or recursive, the seg should be used instead.

The type attribute may be used to indicate the type of segmentation intended, according to any convenient typology.

Example
<head>
 <s>A short affair</s>
</head>
<s>When are you leaving?</s>
<s>Tomorrow.</s>
Schematron

<sch:report test="tei:s">You may not nest one s element within
another: use seg instead</sch:report>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element s
{
   tei_att.global.attributes,
   tei_att.segLike.attributes,
   tei_att.typed.attributes,
   tei_att.notated.attributes,
   tei_macro.phraseSeq
}

<said>

<said> (speech or thought) indicates passages thought or spoken aloud, whether explicitly indicated in the source or not, whether directly or indirectly reported, whether by real people or fictional characters. [3.3.3. Quotation]
Modulecore
Attributes
aloudmay be used to indicate whether the quoted matter is regarded as having been vocalized or signed.
Status Optional
Datatype teidata.xTruthValue
<p> Celia thought privately, <said aloud="false">Dorothea quite despises Sir James Chettam;
   I believe she would not accept him.</said> Celia felt that this was a pity.

<!-- ... -->
</p>
Note

The value true indicates the encoded passage was expressed outwardly (whether spoken, signed, sung, screamed, chanted, etc.); the value false indicates that the encoded passage was thought, but not outwardly expressed.

directmay be used to indicate whether the quoted matter is regarded as direct or indirect speech.
Status Optional
Datatype teidata.xTruthValue
Default true

<!-- in the header --><editorialDecl>
 <quotation marks="none"/>
</editorialDecl>
<!-- ... -->
<p>Tantripp had brought a card, and said that <said direct="false">there was a gentleman waiting in the lobby</said>.
The courier had told him that <said direct="false">only Mrs. Casaubon was at home</said>,
but he said <said direct="false">he was a relation of Mr. Casaubon's: would she see him?</said>
</p>
Note

The value true indicates the speech or thought is represented directly; the value false that speech or thought is represented indirectly, e.g. by use of a marked verbal aspect.

Member of
Contained by
May contain
Example

<!-- in the header --><editorialDecl>
 <quotation marks="all"/>
</editorialDecl>
<!-- ... -->
<p>
 <said>"Our minstrel here will warm the old man's heart with song, dazzle him with jewels and
   gold"</said>, a troublemaker simpered. <said>"He'll trample on the Duke's camellias, spill
   his wine, and blunt his sword, and say his name begins with X, and in the end the Duke
   will say, <said>'Take Saralinda, with my blessing, O lordly Prince of Rags and Tags, O
     rider of the sun!'</said>"</said>
</p>
Example
<p>
 <said aloud="truerend="pre(“) post(”)">Hmmm</said>, said a small voice in his ear.
<said aloud="truerend="pre(“) post(”)">Difficult. Very difficult. Plenty of courage, I see.
   Not a bad mind either. there's talent, oh my goodness, yes — and a nice thirst to prove
   yourself, now that's interesting. … So where shall I put you?</said>
</p>
<p>Harry gripped the edges of the stool and thought, <said aloud="falserend="italic">Not
   Slytherin, not Slytherin</said>.</p>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element said
{
   tei_att.global.attributes,
   tei_att.ascribed.directed.attributes,
   attribute aloud { teidata.xTruthValue }?,
   attribute direct { teidata.xTruthValue }?,
   tei_macro.specialPara
}

<salute>

<salute> (salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc. [4.2.2. Openers and Closers]
Moduletextstructure
Attributes
Member of
Contained by
core: lg
textstructure: body closer div front opener
May contain
Example
<salute>To all courteous mindes, that will voutchsafe the readinge.</salute>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element salute
{
   tei_att.global.attributes,
   tei_att.written.attributes,
   tei_macro.paraContent
}

<secl>

<secl> (secluded text) Secluded. Marks text present in the source which the editor believes to be genuine but out of its original place (which is unknown). [11.3.1.7. Text Omitted from or Supplied in the Transcription]
Moduletranscr
Attributes
reasonone or more words indicating why this text has been secluded, e.g. interpolated etc.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
Member of
Contained by
May contain
Example
<rdg source="#Pescani">
 <secl>
  <l n="15xml:id="l15">Alphesiboea suos ulta est pro coniuge fratres,</l>
  <l n="16xml:id="l16">sanguinis et cari vincula rupit amor.</l>
 </secl>
</rdg>
<note>secl. Pescani</note>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element secl
{
   tei_att.global.attributes,
   tei_att.editLike.attributes,
   tei_att.dimensions.attributes,
   attribute reason { list { teidata.word+ } }?,
   tei_macro.paraContent
}

<seg>

<seg> (arbitrary segment) represents any segmentation of text below the ‘chunk’ level. [16.3. Blocks, Segments, and Anchors 6.2. Components of the Verse Line 7.2.5. Speech Contents]
Modulelinking
Attributes
Member of
Contained by
May contain
Note

The seg element may be used at the encoder's discretion to mark any segments of the text of interest for processing. One use of the element is to mark text features for which no appropriate markup is otherwise defined. Another use is to provide an identifier for some segment which is to be pointed at by some other element—i.e. to provide a target, or a part of a target, for a ptr or other similar element.

Example
<seg>When are you leaving?</seg>
<seg>Tomorrow.</seg>
Example
<s>
 <seg rend="capstype="initial-cap">So father's only</seg> glory was the ballfield.
</s>
Example
<seg type="preamble">
 <seg>Sigmund, <seg type="patronym">the son of Volsung</seg>, was a king in Frankish country.</seg>
 <seg>Sinfiotli was the eldest of his sons ...</seg>
 <seg>Borghild, Sigmund's wife, had a brother ... </seg>
</seg>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element seg
{
   tei_att.global.attributes,
   tei_att.segLike.attributes,
   tei_att.typed.attributes,
   tei_att.written.attributes,
   tei_att.notated.attributes,
   tei_macro.paraContent
}

<settlement>

<settlement> (settlement) contains the name of a settlement such as a city, town, or village identified as a single geo-political or administrative unit. [13.2.3. Place Names]
Modulenamesdates
Attributes
calendarindicates one or more systems or calendars to which the date represented by the content of this element belongs.
Deprecatedwill be removed on 2024-11-11
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Member of
Contained by
May contain
Example
<placeName>
 <settlement type="town">Glasgow</settlement>
 <region>Scotland</region>
</placeName>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element settlement
{
   tei_att.global.attributes,
   tei_att.naming.attributes,
   tei_att.typed.attributes,
   tei_att.datable.attributes,
   attribute calendar { list { teidata.pointer+ } }?,
   tei_macro.phraseSeq
}

<sic>

<sic> (Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate. [3.5.1. Apparent Errors]
Modulecore
Attributes
Member of
Contained by
May contain
Example
for his nose was as sharp as
a pen, and <sic>a Table</sic> of green fields.
Example

If all that is desired is to call attention to the apparent problem in the copy text, sic may be used alone:

I don't know, Juan. It's so far in the past now
— how <sic>we can</sic> prove or disprove anyone's theories?
Example

It is also possible, using the choice and corr elements, to provide a corrected reading:

I don't know, Juan. It's so far in the past now
— how <choice>
 <sic>we can</sic>
 <corr>can we</corr>
</choice> prove or disprove anyone's theories?
Example
for his nose was as sharp as
a pen, and <choice>
 <sic>a Table</sic>
 <corr>a' babbld</corr>
</choice> of green fields.
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element sic { tei_att.global.attributes, tei_macro.paraContent }

<signed>

<signed> (signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text. [4.2.2. Openers and Closers]
Moduletextstructure
Attributes
Member of
Contained by
core: lg
textstructure: back body closer div front opener
May contain
Example
<signed>Thine to command <name>Humph. Moseley</name>
</signed>
Example
<closer>
 <signed>Sign'd and Seal'd,
 <list>
   <item>John Bull,</item>
   <item>Nic. Frog.</item>
  </list>
 </signed>
</closer>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element signed
{
   tei_att.global.attributes,
   tei_att.written.attributes,
   tei_macro.paraContent
}

<soCalled>

<soCalled> (so called) contains a word or phrase for which the author or narrator indicates a disclaiming of responsibility, for example by the use of scare quotes or italics. [3.3.3. Quotation]
Modulecore
Attributes
Member of
Contained by
May contain
Example
To edge his way along
the crowded paths of life, warning all human sympathy to keep its distance, was what the
knowing ones call <soCalled>nuts</soCalled> to Scrooge.
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element soCalled { tei_att.global.attributes, tei_macro.phraseSeq }

<sourceDesc>

<sourceDesc> (source description) describes the source(s) from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence. [2.2.7. The Source Description]
Moduleheader
Attributes
Contained by
header: fileDesc
May contain
linking: ab
msdescription: msDesc
textcrit: listApp
Example
<sourceDesc>
 <bibl>
  <title level="a">The Interesting story of the Children in the Wood</title>. In
 <author>Victor E Neuberg</author>, <title>The Penny Histories</title>.
 <publisher>OUP</publisher>
  <date>1968</date>. </bibl>
</sourceDesc>
Example
<sourceDesc>
 <p>Born digital: no previous source exists.</p>
</sourceDesc>
Content model
<content>
 <alternate>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

  <alternate maxOccurs="unbounded"
   minOccurs="1">

   <classRef key="model.biblLike"/>
   <classRef key="model.sourceDescPart"/>
   <classRef key="model.listLike"/>
  </alternate>
 </alternate>
</content>
Schema Declaration
element sourceDesc
{
   tei_att.global.attributes,
   tei_att.declarable.attributes,
   (
      tei_model.pLike+
    | ( tei_model.biblLike | tei_model.sourceDescPart | tei_model.listLike )+
   )
}

<space>

<space> (space) indicates the location of a significant space in the text. [11.4.1. Space]
Moduletranscr
Attributes
resp(responsible party) (responsible party) indicates the individual responsible for identifying and measuring the space
Derived fromatt.global.responsibility
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
dim(dimension) indicates whether the space is horizontal or vertical.
Status Recommended
Datatype teidata.enumerated
Legal values are:
horizontal
the space is horizontal.
vertical
the space is vertical.
Note

For irregular shapes in two dimensions, the value for this attribute should reflect the more important of the two dimensions. In conventional left-right scripts, a space with both vertical and horizontal components should be classed as vertical.

Member of
Contained by
May contain
core: desc
Note

This element should be used wherever it is desired to record an unusual space in the source text, e.g. space left for a word to be filled in later, for later rubrication, etc. It is not intended to be used to mark normal inter-word space or the like.

Example
By god if wommen had writen storyes
As <space quantity="7unit="minims"/> han within her oratoryes
Example
στρατηλάτ<space quantity="1unit="chars"/>ου
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <classRef key="model.descLike"/>
  <classRef key="model.certLike"/>
 </alternate>
</content>
Schema Declaration
element space
{
   tei_att.global.attribute.xmlid,
   tei_att.global.attribute.n,
   tei_att.global.attribute.xmllang,
   tei_att.global.attribute.xmlspace,
   tei_att.global.rendition.attribute.rend,
   tei_att.global.linking.attribute.corresp,
   tei_att.global.linking.attribute.next,
   tei_att.global.linking.attribute.prev,
   tei_att.global.analytic.attribute.ana,
   tei_att.global.facs.attribute.facs,
   tei_att.global.change.attribute.change,
   tei_att.global.responsibility.attribute.cert,
   tei_att.global.source.attribute.source,
   tei_att.typed.attributes,
   tei_att.dimensions.attributes,
   attribute resp { list { teidata.pointer+ } }?,
   attribute dim { "horizontal" | "vertical" }?,
   ( tei_model.descLike | tei_model.certLike )*
}

<span>

<span> associates an interpretative annotation directly with a span of text. [17.3. Spans and Interpretations]
Moduleanalysis
Attributes
fromgives the identifier of the node which is the starting point of the span of text being annotated; if not accompanied by a to attribute, gives the identifier of the node of the entire span of text being annotated.
Status Optional
Datatype teidata.pointer
togives the identifier of the node which is the end-point of the span of text being annotated.
Status Optional
Datatype teidata.pointer
Member of
Contained by
May contain
Example
<p xml:id="para2">(The "aftermath" starts here)</p>
<p xml:id="para3">(The "aftermath" continues here)</p>
<p xml:id="para4">(The "aftermath" ends in this paragraph)</p>
<!-- ... -->
<span from="#para2to="#para4"
 type="structure">
aftermath</span>
Schematron

<sch:report test="@from and @target">Only one of the attributes @target and @from may be supplied on <sch:name/>
</sch:report>
Schematron

<sch:report test="@to and @target">Only one of the attributes @target and @to may be supplied on <sch:name/>
</sch:report>
Schematron

<sch:report test="@to and not(@from)">If @to is supplied on <sch:name/>, @from must be supplied as well</sch:report>
Schematron

<sch:report test="contains(normalize-space(@to),' ') or contains(normalize-space(@from),' ')">The attributes @to and @from on <sch:name/> may each contain only a single value</sch:report>
Content model
<content>
 <macroRef key="macro.phraseSeq.limited"/>
</content>
Schema Declaration
element span
{
   tei_att.global.attributes,
   tei_att.interpLike.attributes,
   tei_att.pointing.attributes,
   attribute from { teidata.pointer }?,
   attribute to { teidata.pointer }?,
   tei_macro.phraseSeq.limited
}

<standOff>

<standOff> Functions as a container element for linked data, contextual information, and stand-off annotations embedded in a TEI document. [16.10. The standOff Container]
Modulelinking
Attributes
Member of
Contained by
textstructure: TEI
May contain
analysis: span
header: listChange
linking: seg
msdescription: msDesc
textcrit: listApp
Example

This example shows an encoding of morphosyntactic features similar to the encoding system used by ISO 24611 (MAF).

<TEI xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
<!-- ... -->
 </teiHeader>
 <text>
  <body>
<!-- ... -->
   <p>
    <w xml:id="w51">I</w>
    <w xml:id="w52">wanna</w>
    <w xml:id="w53">put</w>
    <w xml:id="w54">up</w>
    <w xml:id="w55">new</w>
    <w xml:id="w56">wallpaper</w>
    <pc>.</pc>
   </p>
<!-- ... -->
  </body>
 </text>
 <standOff type="morphosyntax">
  <spanGrp type="wordForm">
   <span ana="#fs01target="#w51"/>
   <span ana="#fs02target="#w52"/>
   <span ana="#fs03target="#w52"/>
   <span ana="#fs04target="#w53 #w54"/>
   <span ana="#fs05target="#w55"/>
   <span ana="#fs06target="#w56"/>
  </spanGrp>
  <fs xml:id="fs01">
   <f name="lemma">
    <string>I</string>
   </f>
   <f name="pos">
    <symbol value="PP"/>
   </f>
  </fs>
  <fs xml:id="fs02">
   <f name="lemma">
    <string>want</string>
   </f>
   <f name="pos">
    <symbol value="VBP"/>
   </f>
  </fs>
  <fs xml:id="fs03">
   <f name="lemma">
    <string>to</string>
   </f>
   <f name="pos">
    <symbol value="TO"/>
   </f>
  </fs>
  <fs xml:id="fs04">
   <f name="lemma">
    <string>put up</string>
   </f>
   <f name="pos">
    <symbol value="VB"/>
   </f>
  </fs>
  <fs xml:id="fs05">
   <f name="lemma">
    <string>new</string>
   </f>
   <f name="pos">
    <symbol value="JJ"/>
   </f>
  </fs>
  <fs xml:id="fs06">
   <f name="lemma">
    <string>wallpaper</string>
   </f>
   <f name="pos">
    <symbol value="NN"/>
   </f>
  </fs>
 </standOff>
</TEI>
Example

This example shows an encoding of contextual information which is referred to from the main text.

<TEI xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
<!-- ... -->
 </teiHeader>
 <standOff>
  <listPlace>
   <place xml:id="LATL">
    <placeName>Atlanta</placeName>
    <location>
     <region key="US-GA">Georgia</region>
     <country key="USA">United States of America</country>
     <geo>33.755 -84.39</geo>
    </location>
    <population quantity="489359"
     source="https://www.biggestuscities.com/city/atlanta-georgiatype="interpolatedCensuswhen="1963"/>

   </place>
   <place xml:id="LBHM">
    <placeName>Birmingham</placeName>
    <location>
     <region key="US-AL">Alabama</region>
     <country key="USA">United States of America</country>
     <geo>33.653333 -86.808889</geo>
    </location>
    <population quantity="332891"
     source="https://www.biggestuscities.com/city/birmingham-alabamatype="interpolatedCensuswhen="1963"/>

   </place>
  </listPlace>
 </standOff>
 <text>
  <body>
<!-- ... -->
   <p>Moreover, I am <choice>
     <sic>congnizant</sic>
     <corr>cognizant</corr>
    </choice> of the interrelatedness of all communities and
   <lb/>states. I cannot sit idly by in <placeName ref="#LATL">Atlanta</placeName> and not be concerned about what happens
   <lb/>in <placeName ref="#LBHM">Birmingham</placeName>. <seg xml:id="FQ17">Injustice anywhere is a threat to justice everywhere.</seg> We
   <lb/>are caught in an inescapable network of mutuality, tied in a single garment
   <lb/>of destiny. Whatever affects one directly affects all indirectly. Never
   <lb/>again can we afford to live with the narrow, provincial <soCalled rendition="#Rqms">outside agitator</soCalled>
    <lb/>idea. Anyone who lives inside the United States can never be considered
   <lb/>an outsider anywhere in this country.</p>
<!-- ... -->
  </body>
 </text>
</TEI>
Schematron

<sch:assert test="@type or not(ancestor::tei:standOff)">This
<sch:name/> element must have a @type attribute, since it is
nested inside a <sch:name/>
</sch:assert>
Content model
<content>
 <classRef key="model.standOffPart"
  maxOccurs="unboundedminOccurs="1"/>

</content>
Schema Declaration
element standOff
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   tei_att.declaring.attributes,
   tei_model.standOffPart+
}

<supplied>

<supplied> (supplied) signifies text supplied by the transcriber or editor for any reason; for example because the original cannot be read due to physical damage, or because of an obvious omission by the author or scribe. [11.3.3.1. Damage, Illegibility, and Supplied Text]
Moduletranscr
Attributes
reasonone or more words indicating why the text has had to be supplied, e.g. overbinding, faded-ink, lost-folio, omitted-in-original.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
Member of
Contained by
May contain
Note

The <damage>, gap, del, unclear and supplied elements may be closely allied in use. See section 11.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination for discussion of which element is appropriate for which circumstance.

Example
I am dr Sr yr
<supplied reason="illegible"
 source="#amanuensis_copy">
very humble Servt</supplied>
Sydney Smith
Example
<supplied reason="omitted-in-original">Dedication</supplied> to the duke of Bejar
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element supplied
{
   tei_att.global.attributes,
   tei_att.editLike.attributes,
   tei_att.dimensions.attributes,
   attribute reason { list { teidata.word+ } }?,
   tei_macro.paraContent
}

<support>

<support> (support) contains a description of the materials etc. which make up the physical support for the written part of a manuscript or other object. [10.7.1. Object Description]
Modulemsdescription
Attributes
Contained by
msdescription: supportDesc
May contain
Example
<objectDesc form="roll">
 <supportDesc>
  <support> Parchment roll with <material>silk</material> ribbons.
  </support>
 </supportDesc>
</objectDesc>
Content model
<content>
 <macroRef key="macro.specialPara"/>
</content>
Schema Declaration
element support { tei_att.global.attributes, tei_macro.specialPara }

<supportDesc>

<supportDesc> (support description) groups elements describing the physical support for the written part of a manuscript or other object. [10.7.1. Object Description]
Modulemsdescription
Attributes
material(material) a short project-defined name for the material composing the majority of the support
Status Optional
Datatype teidata.enumerated
Suggested values include:
paper
parch
(parchment)
mixed
Contained by
msdescription: objectDesc
May contain
core: p
header: extent
linking: ab
msdescription: collation foliation support
Example
<supportDesc>
 <support> Parchment roll with <material>silk</material> ribbons.
 </support>
</supportDesc>
Content model
<content>
 <alternate>
  <classRef key="model.pLike"
   maxOccurs="unboundedminOccurs="1"/>

  <sequence>
   <elementRef key="supportminOccurs="0"/>
   <elementRef key="extentminOccurs="0"/>
   <elementRef key="foliation"
    maxOccurs="unboundedminOccurs="0"/>

   <elementRef key="collation"
    minOccurs="0"/>

   <elementRef key="condition"
    minOccurs="0"/>

  </sequence>
 </alternate>
</content>
Schema Declaration
element supportDesc
{
   tei_att.global.attributes,
   attribute material { "paper" | "parch" | "mixed" | teidata.enumerated }?,
   (
      tei_model.pLike+
    | ( tei_support?, tei_extent?, tei_foliation*, tei_collation?, condition? )
   )
}

<surface>

<surface> defines a written surface as a two-dimensional coordinate space, optionally grouping one or more graphic representations of that space, zones of interest within that space, and transcriptions of the writing within them. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription]
Moduletranscr
Attributes
attachmentdescribes the method by which this surface is or was connected to the main surface
Status Optional
Datatype teidata.enumerated
Sample values include:
glued
glued in place
pinned
pinned or stapled in place
sewn
sewn in place
flippingindicates whether the surface is attached and folded in such a way as to provide two writing surfaces
Status Optional
Datatype teidata.truthValue
Contained by
May contain
Note

The surface element represents any two-dimensional space on some physical surface forming part of the source material, such as a piece of paper, a face of a monument, a billboard, a scroll, a leaf etc.

The coordinate space defined by this element may be thought of as a grid lrx - ulx units wide and uly - lry units high.

The surface element may contain graphic representations or transcriptions of written zones, or both. The coordinate values used by every zone element contained by this element are to be understood with reference to the same grid.

Where it is useful or meaningful to do so, any grouping of multiple surface elements may be indicated using the <surfaceGrp> element.

Example
<facsimile>
 <surface lrx="200lry="300ulx="0uly="0">
  <graphic url="Bovelles-49r.png"/>
 </surface>
</facsimile>
Content model
<content>
 <sequence>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.global"/>
   <classRef key="model.labelLike"/>
   <classRef key="model.graphicLike"/>
  </alternate>
  <sequence maxOccurs="unbounded"
   minOccurs="0">

   <alternate>
    <elementRef key="zone"/>
    <elementRef key="line"/>
    <elementRef key="path"/>
    <elementRef key="surface"/>
    <elementRef key="surfaceGrp"/>
   </alternate>
   <classRef key="model.global"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
 </sequence>
</content>
Schema Declaration
element surface
{
   tei_att.global.attributes,
   tei_att.coordinated.attributes,
   tei_att.declaring.attributes,
   tei_att.typed.attributes,
   attribute attachment { teidata.enumerated }?,
   attribute flipping { teidata.truthValue }?,
   (
      ( tei_model.global | tei_model.labelLike | tei_model.graphicLike )*,
      (
         ( tei_zone | line | path | tei_surface | surfaceGrp ),
         tei_model.global*
      )*
   )
}

<surname>

<surname> (surname) contains a family (inherited) name, as opposed to a given, baptismal, or nick name. [13.2.1. Personal Names]
Modulenamesdates
Attributes
Member of
Contained by
May contain
Example
<surname type="combine">St John Stevas</surname>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element surname
{
   tei_att.global.attributes,
   tei_att.personal.attributes,
   tei_att.typed.attributes,
   tei_macro.phraseSeq
}

<TEI>

<TEI> (TEI document) contains a single TEI-conformant document, combining a single TEI header with one or more members of the model.resource class. Multiple TEI elements may be combined within a TEI (or <teiCorpus>) element. [4. Default Text Structure 15.1. Varieties of Composite Text]
Moduletextstructure
Attributes
Contained by
textstructure: TEI
May contain
header: teiHeader
linking: standOff
textstructure: TEI text
transcr: facsimile
Note

This element is required. It is customary to specify the TEI namespace http://www.tei-c.org/ns/1.0 on it, for example: <TEI version="4.4.0" xml:lang="it" xmlns="http://www.tei-c.org/ns/1.0">.

Example
<TEI version="3.3.0" xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
  <fileDesc>
   <titleStmt>
    <title>The shortest TEI Document Imaginable</title>
   </titleStmt>
   <publicationStmt>
    <p>First published as part of TEI P2, this is the P5
         version using a namespace.</p>
   </publicationStmt>
   <sourceDesc>
    <p>No source: this is an original work.</p>
   </sourceDesc>
  </fileDesc>
 </teiHeader>
 <text>
  <body>
   <p>This is about the shortest TEI document imaginable.</p>
  </body>
 </text>
</TEI>
Example
<TEI version="2.9.1" xmlns="http://www.tei-c.org/ns/1.0">
 <teiHeader>
  <fileDesc>
   <titleStmt>
    <title>A TEI Document containing four page images </title>
   </titleStmt>
   <publicationStmt>
    <p>Unpublished demonstration file.</p>
   </publicationStmt>
   <sourceDesc>
    <p>No source: this is an original work.</p>
   </sourceDesc>
  </fileDesc>
 </teiHeader>
 <facsimile>
  <graphic url="page1.png"/>
  <graphic url="page2.png"/>
  <graphic url="page3.png"/>
  <graphic url="page4.png"/>
 </facsimile>
</TEI>
Schematron

<sch:ns prefix="tei"
 uri="http://www.tei-c.org/ns/1.0"/>

<sch:ns prefix="xs"
 uri="http://www.w3.org/2001/XMLSchema"/>
Schematron

<sch:ns prefix="rng"
 uri="http://relaxng.org/ns/structure/1.0"/>

<sch:ns prefix="rna"
 uri="http://relaxng.org/ns/compatibility/annotations/1.0"/>
Schematron

<sch:ns prefix="sch"
 uri="http://purl.oclc.org/dsdl/schematron"/>

<sch:ns prefix="sch1x"
 uri="http://www.ascc.net/xml/schematron"/>
Content model
<content>
 <sequence maxOccurs="1minOccurs="1">
  <elementRef key="teiHeader"/>
  <alternate maxOccurs="1minOccurs="1">
   <sequence maxOccurs="1minOccurs="1">
    <classRef key="model.resource"
     maxOccurs="unboundedminOccurs="1"/>

    <elementRef key="TEI"
     maxOccurs="unboundedminOccurs="0"/>

   </sequence>
   <elementRef key="TEI"
    maxOccurs="unboundedminOccurs="1"/>

  </alternate>
 </sequence>
</content>
Schema Declaration
element TEI
{
   tei_att.global.attributes,
   tei_att.typed.attributes,
   ( tei_teiHeader, ( ( tei_model.resource+, tei_TEI* ) | tei_TEI+ ) )
}

<teiHeader>

<teiHeader> (TEI header) supplies descriptive and declarative metadata associated with a digital resource or set of resources. [2.1.1. The TEI Header and Its Components 15.1. Varieties of Composite Text]
Moduleheader
Attributes
Contained by
textstructure: TEI
May contain
Note

One of the few elements unconditionally required in any TEI document.

Example
<teiHeader>
 <fileDesc>
  <titleStmt>
   <title>Shakespeare: the first folio (1623) in electronic form</title>
   <author>Shakespeare, William (1564–1616)</author>
   <respStmt>
    <resp>Originally prepared by</resp>
    <name>Trevor Howard-Hill</name>
   </respStmt>
   <respStmt>
    <resp>Revised and edited by</resp>
    <name>Christine Avern-Carr</name>
   </respStmt>
  </titleStmt>
  <publicationStmt>
   <distributor>Oxford Text Archive</distributor>
   <address>
    <addrLine>13 Banbury Road, Oxford OX2 6NN, UK</addrLine>
   </address>
   <idno type="OTA">119</idno>
   <availability>
    <p>Freely available on a non-commercial basis.</p>
   </availability>
   <date when="1968">1968</date>
  </publicationStmt>
  <sourceDesc>
   <bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile,
       1968)</bibl>
  </sourceDesc>
 </fileDesc>
 <encodingDesc>
  <projectDesc>
   <p>Originally prepared for use in the production of a series of old-spelling
       concordances in 1968, this text was extensively checked and revised for use during the
       editing of the new Oxford Shakespeare (Wells and Taylor, 1989).</p>
  </projectDesc>
  <editorialDecl>
   <correction>
    <p>Turned letters are silently corrected.</p>
   </correction>
   <normalization>
    <p>Original spelling and typography is retained, except that long s and ligatured
         forms are not encoded.</p>
   </normalization>
  </editorialDecl>
  <refsDecl xml:id="ASLREF">
   <cRefPattern matchPattern="(\S+) ([^.]+)\.(.*)"
    replacementPattern="#xpath(//div1[@n='$1']/div2/[@n='$2']//lb[@n='$3'])">

    <p>A reference is created by assembling the following, in the reverse order as that
         listed here: <list>
      <item>the <att>n</att> value of the preceding <gi>lb</gi>
      </item>
      <item>a period</item>
      <item>the <att>n</att> value of the ancestor <gi>div2</gi>
      </item>
      <item>a space</item>
      <item>the <att>n</att> value of the parent <gi>div1</gi>
      </item>
     </list>
    </p>
   </cRefPattern>
  </refsDecl>
 </encodingDesc>
 <revisionDesc>
  <list>
   <item>
    <date when="1989-04-12">12 Apr 89</date> Last checked by CAC</item>
   <item>
    <date when="1989-03-01">1 Mar 89</date> LB made new file</item>
  </list>
 </revisionDesc>
</teiHeader>
Content model
<content>
 <sequence>
  <elementRef key="fileDesc"/>
  <classRef key="model.teiHeaderPart"
   maxOccurs="unboundedminOccurs="0"/>

  <elementRef key="revisionDesc"
   minOccurs="0"/>

 </sequence>
</content>
Schema Declaration
element teiHeader
{
   tei_att.global.attributes,
   ( tei_fileDesc, tei_model.teiHeaderPart*, tei_revisionDesc? )
}

<term>

<term> (term) contains a single-word, multi-word, or symbolic designation which is regarded as a technical term. [3.4.1. Terms and Glosses]
Modulecore
Attributes
Member of
Contained by
May contain
Note

When this element appears within an index element, it is understood to supply the form under which an index entry is to be made for that location. Elsewhere, it is understood simply to indicate that its content is to be regarded as a technical or specialised term. It may be associated with a <gloss> element by means of its ref attribute; alternatively a <gloss> element may point to a term element by means of its target attribute.

In formal terminological work, there is frequently discussion over whether terms must be atomic or may include multi-word lexical items, symbolic designations, or phraseological units. The term element may be used to mark any of these. No position is taken on the philosophical issue of what a term can be; the looser definition simply allows the term element to be used by practitioners of any persuasion.

As with other members of the att.canonical class, instances of this element occuring in a text may be associated with a canonical definition, either by means of a URI (using the ref attribute), or by means of some system-specific code value (using the key attribute). Because the mutually exclusive target and cRef attributes overlap with the function of the ref attribute, they are deprecated and may be removed at a subsequent release.

Example
A computational device that infers structure
from grammatical strings of words is known as a <term>parser</term>, and much of the history
of NLP over the last 20 years has been occupied with the design of parsers.
Example
We may define <term rend="scxml:id="TDPV1">discoursal point of view</term> as
<gloss target="#TDPV1">the relationship, expressed
through discourse structure, between the implied author or some other addresser, and the
fiction.</gloss>
Example
We may define <term ref="#TDPV2rend="sc">discoursal point of view</term> as
<gloss xml:id="TDPV2">the relationship, expressed
through discourse structure, between the implied author or some other addresser, and the
fiction.</gloss>
Example
We discuss Leech's concept of <term ref="myGlossary.xml#TDPV2rend="sc">discoursal point of view</term> below.
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element term
{
   tei_att.global.attributes,
   tei_att.declaring.attributes,
   tei_att.pointing.attributes,
   tei_att.typed.attributes,
   tei_att.canonical.attributes,
   tei_att.sortable.attributes,
   tei_att.cReferencing.attributes,
   tei_macro.phraseSeq
}

<text>

<text> (text) contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. [4. Default Text Structure 15.1. Varieties of Composite Text]
Moduletextstructure
Attributes
Member of
Contained by
textstructure: TEI
May contain
analysis: span
linking: anchor
textcrit: app witDetail
textstructure: back body front
Note

This element should not be used to represent a text which is inserted at an arbitrary point within the structure of another, for example as in an embedded or quoted narrative; the <floatingText> is provided for this purpose.

Example
<text>
 <front>
  <docTitle>
   <titlePart>Autumn Haze</titlePart>
  </docTitle>
 </front>
 <body>
  <l>Is it a dragonfly or a maple leaf</l>
  <l>That settles softly down upon the water?</l>
 </body>
</text>
Example

The body of a text may be replaced by a group of nested texts, as in the following schematic:

<text>
 <front>
<!-- front matter for the whole group -->
 </front>
 <group>
  <text>
<!-- first text -->
  </text>
  <text>
<!-- second text -->
  </text>
 </group>
</text>
Content model
<content>
 <sequence>
  <classRef key="model.global"
   maxOccurs="unboundedminOccurs="0"/>

  <sequence minOccurs="0">
   <elementRef key="front"/>
   <classRef key="model.global"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
  <alternate>
   <elementRef key="body"/>
   <elementRef key="group"/>
  </alternate>
  <classRef key="model.global"
   maxOccurs="unboundedminOccurs="0"/>

  <sequence minOccurs="0">
   <elementRef key="back"/>
   <classRef key="model.global"
    maxOccurs="unboundedminOccurs="0"/>

  </sequence>
 </sequence>
</content>
Schema Declaration
element text
{
   tei_att.global.attributes,
   tei_att.declaring.attributes,
   tei_att.typed.attributes,
   tei_att.written.attributes,
   (
      tei_model.global*,
      ( tei_front, tei_model.global* )?,
      ( tei_body | group ),
      tei_model.global*,
      ( tei_back, tei_model.global* )?
   )
}

<textClass>

<textClass> (text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc. [2.4.3. The Text Classification]
Moduleheader
Attributes
Member of
Contained by
header: profileDesc
May contain
header: keywords
Example
<taxonomy>
 <category xml:id="acprose">
  <catDesc>Academic prose</catDesc>
 </category>
<!-- other categories here -->
</taxonomy>
<!-- ... -->
<textClass>
 <catRef target="#acprose"/>
 <classCode scheme="http://www.udcc.org">001.9</classCode>
 <keywords scheme="http://authorities.loc.gov">
  <list>
   <item>End of the world</item>
   <item>History - philosophy</item>
  </list>
 </keywords>
</textClass>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <elementRef key="classCode"/>
  <elementRef key="catRef"/>
  <elementRef key="keywords"/>
 </alternate>
</content>
Schema Declaration
element textClass
{
   tei_att.global.attributes,
   tei_att.declarable.attributes,
   ( classCode | catRef | tei_keywords )*
}

<title>

<title> (title) contains a title for any kind of work. [3.12.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.5. The Series Statement]
Modulecore
Attributes
typeclassifies the title according to some convenient typology.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Sample values include:
main
main title
sub
(subordinate) subtitle, title of part
alt
(alternate) alternate title, often in another language, by which the work is also known
short
abbreviated form of title
desc
(descriptive) descriptive paraphrase of the work functioning as a title
Note

This attribute is provided for convenience in analysing titles and processing them according to their type; where such specialized processing is not necessary, there is no need for such analysis, and the entire title, including subtitles and any parallel titles, may be enclosed within a single title element.

levelindicates the bibliographic level for a title, that is, whether it identifies an article, book, journal, series, or unpublished material.
Status Optional
Datatype teidata.enumerated
Legal values are:
a
(analytic) the title applies to an analytic item, such as an article, poem, or other work published as part of a larger item.
m
(monographic) the title applies to a monograph such as a book or other item considered to be a distinct publication, including single volumes of multi-volume works
j
(journal) the title applies to any serial or periodical publication such as a journal, magazine, or newspaper
s
(series) the title applies to a series of otherwise distinct publications such as a collection
u
(unpublished) the title applies to any unpublished material (including theses and dissertations unless published by a commercial press)
Note

The level of a title is sometimes implied by its context: for example, a title appearing directly within an <analytic> element is ipso facto of level ‘a’, and one appearing within a <series> element of level ‘s’. For this reason, the level attribute is not required in contexts where its value can be unambiguously inferred. Where it is supplied in such contexts, its value should not contradict the value implied by its parent element.

Member of
Contained by
May contain
Note

The attributes key and ref, inherited from the class att.canonical may be used to indicate the canonical form for the title; the former, by supplying (for example) the identifier of a record in some external library system; the latter by pointing to an XML element somewhere containing the canonical form of the title.

Example
<title>Information Technology and the Research Process: Proceedings of
a conference held at Cranfield Institute of Technology, UK,
18–21 July 1989</title>
Example
<title>Hardy's Tess of the D'Urbervilles: a machine readable
edition</title>
Example
<title type="full">
 <title type="main">Synthèse</title>
 <title type="sub">an international journal for
   epistemology, methodology and history of
   science</title>
</title>
Schematron

<sch:rule context="tei:*[@calendar]">
<sch:assert test="string-length( normalize-space(.) ) gt 0"> @calendar indicates one or more
systems or calendars to which the date represented by the content of this element belongs,
but this <sch:name/> element has no textual content.</sch:assert>
</sch:rule>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element title
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   tei_att.canonical.attributes,
   tei_att.datable.attributes,
   attribute type { teidata.enumerated }?,
   attribute level { "a" | "m" | "j" | "s" | "u" }?,
   tei_macro.paraContent
}

<titlePage>

<titlePage> (title page) contains the title page of a text, appearing within the front or back matter. [4.6. Title Pages]
Moduletextstructure
Attributes
typeclassifies the title page according to any convenient typology.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Note

This attribute allows the same element to be used for volume title pages, series title pages, etc., as well as for the ‘main’ title page of a work.

Member of
Contained by
msdescription: msContents
textcrit: rdg
textstructure: back front
May contain
Example
<titlePage>
 <docTitle>
  <titlePart type="main">THOMAS OF Reading.</titlePart>
  <titlePart type="alt">OR, The sixe worthy yeomen of the West.</titlePart>
 </docTitle>
 <docEdition>Now the fourth time corrected and enlarged</docEdition>
 <byline>By T.D.</byline>
 <figure>
  <head>TP</head>
  <p>Thou shalt labor till thou returne to duste</p>
  <figDesc>Printers Ornament used by TP</figDesc>
 </figure>
 <docImprint>Printed at <name type="place">London</name> for <name>T.P.</name>
  <date>1612.</date>
 </docImprint>
</titlePage>
Content model
<content>
 <sequence>
  <classRef key="model.global"
   maxOccurs="unboundedminOccurs="0"/>

  <classRef key="model.titlepagePart"/>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <classRef key="model.titlepagePart"/>
   <classRef key="model.global"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element titlePage
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   attribute type { teidata.enumerated }?,
   (
      tei_model.global*,
      tei_model.titlepagePart,
      ( tei_model.titlepagePart | tei_model.global )*
   )
}

<titlePart>

<titlePart> (title part) contains a subsection or division of the title of a work, as indicated on a title page. [4.6. Title Pages]
Moduletextstructure
Attributes
type(type) specifies the role of this subdivision of the title.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
Suggested values include:
main
(main) main title of the work [Default]
sub
(subordinate) subtitle of the work
alt
(alternate) alternative title of the work
short
(short) abbreviated form of title
desc
(descriptive) descriptive paraphrase of the work
Member of
Contained by
msdescription: msItem
textcrit: rdg
textstructure: back docTitle front titlePage
May contain
Example
<docTitle>
 <titlePart type="main">THE FORTUNES
   AND MISFORTUNES Of the FAMOUS
   Moll Flanders, &amp;c.
 </titlePart>
 <titlePart type="desc">Who was BORN in NEWGATE,
   And during a Life of continu'd Variety for
   Threescore Years, besides her Childhood, was
   Twelve Year a <hi>Whore</hi>, five times a <hi>Wife</hi> (wherof
   once to her own Brother) Twelve Year a <hi>Thief,</hi>
   Eight Year a Transported <hi>Felon</hi> in <hi>Virginia</hi>,
   at last grew <hi>Rich</hi>, liv'd <hi>Honest</hi>, and died a
 <hi>Penitent</hi>.</titlePart>
</docTitle>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element titlePart
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   attribute type
   {
      "main" | "sub" | "alt" | "short" | "desc" | teidata.enumerated
   }?,
   tei_macro.paraContent
}

<titleStmt>

<titleStmt> (title statement) groups information about the title of a work and those responsible for its content. [2.2.1. The Title Statement 2.2. The File Description]
Moduleheader
Attributes
Contained by
header: fileDesc
May contain
Example
<titleStmt>
 <title>Capgrave's Life of St. John Norbert: a machine-readable transcription</title>
 <respStmt>
  <resp>compiled by</resp>
  <name>P.J. Lucas</name>
 </respStmt>
</titleStmt>
Content model
<content>
 <sequence>
  <elementRef key="title"
   maxOccurs="unboundedminOccurs="1"/>

  <classRef key="model.respLike"
   maxOccurs="unboundedminOccurs="0"/>

 </sequence>
</content>
Schema Declaration
element titleStmt
{
   tei_att.global.attributes,
   ( tei_title+, tei_model.respLike* )
}

<transpose>

<transpose> describes a single textual transposition as an ordered list of at least two pointers specifying the order in which the elements indicated should be re-combined. [11.3.4.5. Transpositions]
Moduletranscr
Attributes
Contained by
transcr: listTranspose
May contain
core: ptr
Note

Transposition is usually indicated in a document by a metamark such as a wavy line or numbering.

The order in which ptr elements appear within a transpose element should correspond with the desired order, as indicated by the metamark.

Example
<transpose>
 <ptr target="#ib02"/>
 <ptr target="#ib01"/>
</transpose>

The transposition recorded here indicates that the content of the element with identifier ib02 should appear before the content of the element with identifier ib01.

Content model
<content>
 <elementRef key="ptr"
  maxOccurs="unboundedminOccurs="2"/>

</content>
Schema Declaration
element transpose { tei_att.global.attributes, ( tei_ptr, tei_ptr, tei_ptr* ) }

<unclear>

<unclear> (unclear) contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source. [11.3.3.1. Damage, Illegibility, and Supplied Text 3.5.3. Additions, Deletions, and Omissions]
Modulecore
Attributes
reasonindicates why the material is hard to transcribe.
Status Optional
Datatype 1–∞ occurrences of teidata.enumerated separated by whitespace
Suggested values include:
illegible
(illegible)
inaudible
(inaudible)
faded
(faded)
background_noise
(background noise)
eccentric_ductus
(eccentric ductus) indicates illegibility due to an unusual, awkward, or incompetent execution of a glyph or glyphs
<div>
 <head>Rx</head>
 <p>500 mg <unclear reason="illegible">placebo</unclear>
 </p>
</div>
Note

One or more words may be used to describe the reason; usually each word will refer to a single cause.

agentWhere the difficulty in transcription arises from damage, categorizes the cause of the damage, if it can be identified.
Status Optional
Datatype teidata.enumerated
Sample values include:
rubbing
damage results from rubbing of the leaf edges
mildew
damage results from mildew on the leaf surface
smoke
damage results from smoke
Member of
Contained by
May contain
Note

The same element is used for all cases of uncertainty in the transcription of element content, whether for written or spoken material. For other aspects of certainty, uncertainty, and reliability of tagging and transcription, see chapter 21. Certainty, Precision, and Responsibility.

The <damage>, gap, del, unclear and supplied elements may be closely allied in use. See section 11.3.3.2. Use of the gap, del, damage, unclear, and supplied Elements in Combination for discussion of which element is appropriate for which circumstance.

The hand attribute points to a definition of the hand concerned, as further discussed in section 11.3.2.1. Document Hands.

Example
<u> ...and then <unclear reason="background-noise">Nathalie</unclear> said ... </u>
Content model
<content>
 <macroRef key="macro.paraContent"/>
</content>
Schema Declaration
element unclear
{
   tei_att.global.attributes,
   tei_att.editLike.attributes,
   tei_att.dimensions.attributes,
   attribute reason
   {
      list
      {
         (
            "illegible"
          | "inaudible"
          | "faded"
          | "background_noise"
          | "eccentric_ductus"
          | teidata.enumerated
         )+
      }
   }?,
   attribute agent { teidata.enumerated }?,
   tei_macro.paraContent
}

<undo>

<undo> indicates one or more marked-up interventions in a document which have subsequently been marked for cancellation. [11.3.4.4. Confirmation, Cancellation, and Reinstatement of Modifications]
Moduletranscr
Attributes
targetpoints to one or more elements representing the interventions which are to be reverted or undone.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Member of
Contained by
May containEmpty element
Example
<line>This is <del change="#s2rend="overstrike">
  <seg xml:id="undo-a">just some</seg>
   sample <seg xml:id="undo-b">text</seg>,
   we need</del>
 <add change="#s2">not</add>
a real example.</line>
<undo change="#s3rend="dotted"
 target="#undo-a #undo-b"/>

This encoding represents the following sequence of events:

  • "This is just some sample text, we need a real example" is written
  • At stage s2, "just some sample text, we need" is deleted by overstriking, and "not" is added
  • At stage s3, parts of the deletion are cancelled by underdotting, thus reinstating the words "just some" and "text".
Content model
<content>
 <empty/>
</content>
Schema Declaration
element undo
{
   tei_att.global.attributes,
   tei_att.spanning.attributes,
   tei_att.transcriptional.attributes,
   tei_att.dimensions.attributes,
   attribute target { list { teidata.pointer+ } }?,
   empty
}

<variantEncoding>

<variantEncoding> (variant encoding) declares the method used to encode text-critical variants. [12.1.1. The Apparatus Entry]
Moduletextcrit
Attributes
methodindicates which method is used to encode the apparatus of variants.
Status Required
Datatype teidata.enumerated
Legal values are:
location-referenced
apparatus uses line numbers or other canonical reference scheme referenced in a base text.
double-end-point
apparatus indicates the precise locations of the beginning and ending of each lemma relative to a base text.
parallel-segmentation
alternate readings of a passage are given in parallel in the text; no notion of a base text is necessary.
Note

The value ‘parallel-segmentation’ requires in-line encoding of the apparatus.

locationindicates whether the apparatus appears within the running text or external to it.
Status Required
Datatype teidata.enumerated
Schematron

<sch:rule context="tei:variantEncoding">
<sch:report test="@location eq 'external' and @method eq 'parallel-segmentation'"> The @location value "external" is inconsistent with the
parallel-segmentation method of apparatus markup.</sch:report>
</sch:rule>
Legal values are:
internal
apparatus appears within the running text.
external
apparatus appears outside the base text.
Note

The value ‘external’ is inconsistent with the parallel-segmentation method of apparatus markup.

Member of
Contained by
header: encodingDesc
May containEmpty element
Example
<variantEncoding location="external"
 method="location-referenced"/>
Content model
<content>
 <empty/>
</content>
Schema Declaration
element variantEncoding
{
   tei_att.global.attributes,
   attribute method
   {
      "location-referenced" | "double-end-point" | "parallel-segmentation"
   },
   attribute location { "internal" | "external" },
   empty
}

<w>

<w> (word) represents a grammatical (not necessarily orthographic) word. [17.1. Linguistic Segment Categories 17.4.2. Lightweight Linguistic Annotation]
Moduleanalysis
Attributes
Member of
Contained by
May contain
Example

This example is adapted from the Folger Library’s Early Modern English Drama version of The Wits: a Comedy by William Davenant.

<l>
 <w lemma="itpos="pn"
  xml:id="A19883-003-a-0100">
IT</w>
 <w lemma="havepos="vvz"
  xml:id="A19883-003-a-0110">
hath</w>
 <w lemma="bepos="vvn"
  xml:id="A19883-003-a-0120">
been</w>
 <w lemma="saypos="vvn"
  xml:id="A19883-003-a-0130">
said</w>
 <w lemma="ofpos="acp-p"
  xml:id="A19883-003-a-0140">
of</w>
 <w lemma="oldpos="j"
  xml:id="A19883-003-a-0150">
old</w>
 <pc xml:id="A19883-003-a-0160">,</pc>
 <w lemma="thatpos="cs"
  xml:id="A19883-003-a-0170">
that</w>
 <w lemma="playpos="vvz"
  xml:id="A19883-003-a-0180">

  <choice>
   <orig>Playes</orig>
   <reg>Plays</reg>
  </choice>
 </w>
 <w lemma="bepos="vvb"
  xml:id="A19883-003-a-0190">
are</w>
 <w lemma="feastpos="n2"
  xml:id="A19883-003-a-0200">
Feasts</w>
 <pc xml:id="A19883-003-a-0210">,</pc>
</l>
<l xml:id="A19883-e100220">
 <w lemma="poetpos="n2"
  xml:id="A19883-003-a-0220">
Poets</w>
 <w lemma="thepos="d"
  xml:id="A19883-003-a-0230">
the</w>
 <w lemma="cookpos="n2"
  xml:id="A19883-003-a-0240">

  <choice>
   <orig>Cookes</orig>
   <reg>Cooks</reg>
  </choice>
 </w>
 <pc xml:id="A19883-003-a-0250">,</pc>
 <w lemma="andpos="cc"
  xml:id="A19883-003-a-0260">
and</w>
 <w lemma="thepos="d"
  xml:id="A19883-003-a-0270">
the</w>
 <w lemma="spectatorpos="n2"
  xml:id="A19883-003-a-0280">
Spectators</w>
 <w lemma="guestpos="n2"
  xml:id="A19883-003-a-0290">
Guests</w>
 <pc xml:id="A19883-003-a-0300">,</pc>
</l>
<l xml:id="A19883-e100230">
 <w lemma="thepos="d"
  xml:id="A19883-003-a-0310">
The</w>
 <w lemma="actorpos="n2"
  xml:id="A19883-003-a-0320">
Actors</w>
 <w lemma="waiterpos="n2"
  xml:id="A19883-003-a-0330">
Waiters</w>
 <pc xml:id="A19883-003-a-0340">:</pc>
<!-- ... -->
</l>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <elementRef key="seg"/>
  <elementRef key="w"/>
  <elementRef key="m"/>
  <elementRef key="c"/>
  <elementRef key="pc"/>
  <classRef key="model.global"/>
  <classRef key="model.lPart"/>
  <classRef key="model.hiLike"/>
  <classRef key="model.pPart.edit"/>
 </alternate>
</content>
Schema Declaration
element w
{
   tei_att.global.attributes,
   tei_att.segLike.attributes,
   tei_att.typed.attributes,
   tei_att.linguistic.attributes,
   tei_att.notated.attributes,
   (
      text
    | tei_model.gLike
    | tei_seg
    | tei_w
    | m
    | tei_c
    | tei_pc
    | tei_model.global
    | tei_model.lPart
    | tei_model.hiLike
    | tei_model.pPart.edit
   )*
}

<wit>

<wit> (wit) contains a list of one or more sigla of witnesses attesting a given reading, in a textual variation. [12.1.4. Witness Information]
Moduletextcrit
Attributes
Member of
Contained by
textcrit: app rdg rdgGrp
May contain
Note

This element represents the same information as that provided by the wit attribute of the reading; it may be used to record the exact form of the sigla given in the source edition, when that is of interest.

Example
<rdg wit="#El #Hg">Experience</rdg>
<wit>Ellesmere, Hengwryt</wit>
Content model
<content>
 <macroRef key="macro.phraseSeq"/>
</content>
Schema Declaration
element wit
{
   tei_att.global.attributes,
   tei_att.rdgPart.attributes,
   tei_macro.phraseSeq
}

<witDetail>

<witDetail> (witness detail) gives further information about a particular witness, or witnesses, to a particular reading. [12.1. The Apparatus Entry, Readings, and Witnesses]
Moduletextcrit
Attributes
typedescribes the type of information given about the witness.
Derived fromatt.typed
Status Optional
Datatype teidata.enumerated
wit(witnesses) indicates the sigil or sigla identifying the witness or witnesses to which the detail refers.
Status Required
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Member of
Contained by
May contain
Note

The witDetail element was formerly permitted anywhere that note could appear, but since it should only be used in association with <lem> and rdg, it is recommended that it be placed immediately following the reading that it modifies, in the same app. A witDetail without a target attribute should be assumed to refer to the closest preceding <lem> or rdg.

Example
<app type="substantive">
 <lem wit="#El #HGxml:id="W026x">Experience</lem>
 <rdg wit="#Ha4">Experiens</rdg>
 <witDetail resp="#PRtarget="#W026x"
  type="presentationwit="#El">
Ornamental capital.</witDetail>
</app>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <classRef key="model.global"/>
  <elementRef key="bibl"/>
 </alternate>
</content>
Schema Declaration
element witDetail
{
   tei_att.global.attributes,
   tei_att.placement.attributes,
   tei_att.pointing.attributes,
   tei_att.typed.attribute.subtype,
   attribute type { teidata.enumerated }?,
   attribute wit { list { teidata.pointer+ } },
   ( text | tei_model.gLike | tei_model.phrase | tei_model.global | tei_bibl )*
}

<witness>

<witness> (witness) contains either a description of a single witness referred to within the critical apparatus, or a list of witnesses which is to be referred to by a single sigil. [12.1. The Apparatus Entry, Readings, and Witnesses]
Moduletextcrit
Attributes
Contained by
May contain
Note

The content of the witness element may give bibliographic information about the witness or witness group, or it may be empty.

Example
<listWit>
 <witness xml:id="EL">Ellesmere, Huntingdon Library 26.C.9</witness>
 <witness xml:id="HG">Hengwrt, National Library of Wales,
   Aberystwyth, Peniarth 392D</witness>
 <witness xml:id="RA2">Bodleian Library Rawlinson Poetic 149
   (see further <ptr target="http://www.examples.com/MSdescs#MSRP149"/>)</witness>
</listWit>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.limitedPhrase"/>
  <classRef key="model.inter"/>
  <elementRef key="note"/>
  <elementRef key="object"/>
 </alternate>
</content>
Schema Declaration
element witness
{
   tei_att.global.attributes,
   tei_att.sortable.attributes,
   ( text | tei_model.limitedPhrase | tei_model.inter | tei_note | object )*
}

<xr>

<xr> (cross-reference phrase) contains a phrase, sentence, or icon referring the reader to some other location in this or another text. [9.3.5.3. Cross-References to Other Entries]
Moduledictionaries
Attributes
typeindicates the type of cross reference, using any convenient typology.
Derived fromatt.typed
Status Recommended
Datatype teidata.enumerated
Sample values include:
syn
(synonym) cross reference for synonym information
etym
(etymological) etymological information
cf
(compare or consult) related or similar term
illus
(illustration) illustration of an object
Member of
Contained by
core: cit
May contain
Note

May contain character data and phrase-level elements; usually contains a ref or a ptr element.

This element encloses both the actual indication of the location referred to, which may be tagged using the ref or ptr elements, and any accompanying material which gives more information about why the reader is being referred there.

Example
<entry>
 <form>
  <orth>lavage</orth>
 </form>
 <etym>[Fr. < <mentioned>laver</mentioned>; L. <mentioned>lavare</mentioned>, to wash;
 <xr>see <ref>lather</ref>
  </xr>]. </etym>
</entry>
Example
<entry>
 <form>
  <orth>lawful</orth>
 </form>
 <xr type="syn">SYN. see <ref>legal</ref>
 </xr>
</entry>
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <classRef key="model.inter"/>
  <elementRef key="usg"/>
  <elementRef key="lbl"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Schema Declaration
element xr
{
   tei_att.global.attributes,
   tei_att.typed.attribute.subtype,
   tei_att.lexicographic.attributes,
   attribute type { teidata.enumerated }?,
   (
      text
    | tei_model.gLike
    | tei_model.phrase
    | tei_model.inter
    | usg
    | lbl
    | tei_model.global
   )*
}

<zone>

<zone> defines any two-dimensional area within a surface element. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription]
Moduletranscr
Attributes
rotateindicates the amount by which this zone has been rotated clockwise, with respect to the normal orientation of the parent surface element as implied by the dimensions given in the msDesc element or by the coordinates of the surface itself. The orientation is expressed in arc degrees.
Status Optional
Datatype teidata.count
Default 0
Member of
Contained by
linking: standOff
transcr: surface zone
May contain
Note

The position of every zone for a given surface is always defined by reference to the coordinate system defined for that surface.

A graphic element contained by a zone represents the whole of the zone.

A zone may be of any shape. The attribute points may be used to define a polygonal zone, using the coordinate system defined by its parent surface.

A zone is always a closed polygon. Repeating the initial coordinate at the end of the sequence is optional. To encode an unclosed path, use the <path> element.

Example
<surface lrx="0lry="0ulx="14.54"
 uly="16.14">

 <graphic url="stone.jpg"/>
 <zone points="4.6,6.3 5.25,5.85 6.2,6.6 8.19222,7.4125 9.89222,6.5875 10.9422,6.1375 11.4422,6.7125 8.21722,8.3125 6.2,7.65"/>
</surface>

This example defines a non-rectangular zone: see the illustration in section [ID PH-surfzone in TEI Guidelines].

Example
<facsimile>
 <surface lrx="400lry="280ulx="50"
  uly="20">

  <zone lrx="500lry="321ulx="0uly="0">
   <graphic url="graphic.png"/>
  </zone>
 </surface>
</facsimile>

This example defines a zone which has been defined as larger than its parent surface in order to match the dimensions of the graphic it contains.

Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.graphicLike"/>
  <classRef key="model.global"/>
  <elementRef key="surface"/>
  <classRef key="model.linePart"/>
 </alternate>
</content>
Schema Declaration
element zone
{
   tei_att.global.attributes,
   tei_att.coordinated.attributes,
   tei_att.typed.attributes,
   tei_att.written.attributes,
   attribute rotate { teidata.count }?,
   (
      text
    | tei_model.gLike
    | tei_model.graphicLike
    | tei_model.global
    | tei_surface
    | tei_model.linePart
   )*
}

Schema tei_lite: Model classes

model.addressLike

model.addressLike groups elements used to represent a postal or email address. [1. The TEI Infrastructure]
Moduletei
Used by
Membersaddress

model.addrPart

model.addrPart groups elements such as names or postal codes which may appear as part of a postal address. [3.6.2. Addresses]
Moduletei
Used by
Membersmodel.nameLike [model.nameLike.agent [persName] model.offsetLike model.persNamePart [addName forename nameLink surname] model.placeStateLike [model.placeNamePart [placeName settlement] ] idno rs] addrLine

model.annotationLike

model.annotationLike groups elements used to represent annotations. [16.10. The standOff Container]
Moduletei
Used by
Membersnote

model.attributable

model.attributable groups elements that contain a word or phrase that can be attributed to a source. [3.3.3. Quotation 4.3.2. Floating Texts]
Moduletei
Used by
Membersmodel.quoteLike [cit quote] said

model.availabilityPart

model.availabilityPart groups elements such as licences and paragraphs of text which may appear as part of an availability statement [2.2.4. Publication, Distribution, Licensing, etc.]
Moduletei
Used by
Memberslicence

model.biblLike

model.biblLike groups elements containing a bibliographic description. [3.12. Bibliographic Citations and References]
Moduletei
Used by
Membersbibl biblStruct msDesc

model.biblPart

model.biblPart groups elements which represent components of a bibliographic description. [3.12. Bibliographic Citations and References]
Moduletei
Used by
Membersmodel.imprintPart [biblScope distributor pubPlace publisher] model.respLike [author editor] availability bibl decoNote extent msIdentifier relatedItem

model.choicePart

model.choicePart groups elements (other than choice itself) which can be used within a choice alternation. [3.5. Simple Editorial Changes]
Moduletei
Used by
Membersabbr am corr ex expan orig reg seg sic supplied unclear

model.common

model.common groups common chunk- and inter-level elements. [1.3. The TEI Class System]
Moduletei
Used by
Membersmodel.divPart [model.lLike [l] model.pLike [ab p] lg] model.entryLike model.inter [model.attributable [model.quoteLike [cit quote] said] model.biblLike [bibl biblStruct msDesc] model.egLike model.labelLike [desc] model.listLike [listApp] model.oddDecl model.stageLike] q
Note

This class defines the set of chunk- and inter-level elements; it is used in many content models, including those for textual divisions.

model.correspActionPart

model.correspActionPart groups elements which define the parts (usually names, dates and places) of one action related to the correspondence.
Moduletei
Used by
Membersmodel.addressLike [address] model.dateLike [date] model.nameLike [model.nameLike.agent [persName] model.offsetLike model.persNamePart [addName forename nameLink surname] model.placeStateLike [model.placeNamePart [placeName settlement] ] idno rs] note

model.correspDescPart

model.correspDescPart groups together metadata elements for describing correspondence
Moduletei
Used by
MemberscorrespAction note

model.dateLike

model.dateLike groups elements containing temporal expressions. [3.6.4. Dates and Times 13.4. Dates]
Moduletei
Used by
Membersdate

model.descLike

model.descLike groups elements which contain a description of their function.
Moduletei
Used by
Membersdesc

model.divBottom

model.divBottom groups elements appearing at the end of a text division. [4.2. Elements Common to All Divisions]
Moduletei
Used by
Membersmodel.divBottomPart [closer signed] model.divWrapper [byline dateline docAuthor epigraph salute]

model.divBottomPart

model.divBottomPart groups elements which can occur only at the end of a text division. [4.6. Title Pages]
Moduletei
Used by
Memberscloser signed

model.divLike

model.divLike groups elements used to represent un-numbered generic structural divisions.
Moduletei
Used by
Membersdiv

model.divPart

model.divPart groups paragraph-level elements appearing directly within divisions. [1.3. The TEI Class System]
Moduletei
Used by
Membersmodel.lLike [l] model.pLike [ab p] lg
Note

Note that this element class does not include members of the model.inter class, which can appear either within or between paragraph-level items.

model.divTop

model.divTop groups elements appearing at the beginning of a text division. [4.2. Elements Common to All Divisions]
Moduletei
Used by
Membersmodel.divTopPart [model.headLike [head] opener signed] model.divWrapper [byline dateline docAuthor epigraph salute]

model.divTopPart

model.divTopPart groups elements which can occur only at the beginning of a text division. [4.6. Title Pages]
Moduletei
Used by
Membersmodel.headLike [head] opener signed

model.divWrapper

model.divWrapper groups elements which can appear at either top or bottom of a textual division. [4.2. Elements Common to All Divisions]
Moduletei
Used by
Membersbyline dateline docAuthor epigraph salute

model.emphLike

model.emphLike groups phrase-level elements which are typographically distinct and to which a specific function can be attributed. [3.3. Highlighting and Quotation]
Moduletei
Used by
Membersemph foreign soCalled term title

model.encodingDescPart

model.encodingDescPart groups elements which may be used inside encodingDesc and appear multiple times.
Moduletei
Used by
MemberseditorialDecl projectDesc variantEncoding

model.entryPart

model.entryPart groups non-morphological elements appearing within a dictionary entry. [9.1. Dictionary Body and Overall Structure]
Moduletei
Used by
Membersxr

model.frontPart

model.frontPart groups elements which appear at the level of divisions within front or back matter. [7.1. Front and Back Matter ]
Moduletei
Used by
Membersmodel.frontPart.drama titlePage

model.global

model.global groups elements which may appear at any point within a TEI text. [1.3. The TEI Class System]
Moduletei
Used by
Membersmodel.global.edit [addSpan app delSpan gap space witDetail] model.global.meta [index listTranspose span] model.milestoneLike [anchor fw gb lb milestone pb] model.noteLike [note] metamark

model.global.edit

model.global.edit groups globally available elements which perform a specifically editorial function. [1.3. The TEI Class System]
Moduletei
Used by
MembersaddSpan app delSpan gap space witDetail

model.global.meta

model.global.meta groups globally available elements which describe the status of other elements. [1.3. The TEI Class System]
Moduletei
Used by
Membersindex listTranspose span
Note

Elements in this class are typically used to hold groups of links or of abstract interpretations, or by provide indications of certainty etc. It may find be convenient to localize all metadata elements, for example to contain them within the same divison as the elements that they relate to; or to locate them all to a division of their own. They may however appear at any point in a TEI text.

model.graphicLike

model.graphicLike groups elements containing images, formulae, and similar objects. [3.10. Graphics and Other Non-textual Components]
Moduletei
Used by
Membersgraphic

model.headLike

model.headLike groups elements used to provide a title or heading at the start of a text division.
Moduletei
Used by
Membershead

model.highlighted

model.highlighted groups phrase-level elements which are typographically distinct. [3.3. Highlighting and Quotation]
Moduletei
Used by
Membersmodel.emphLike [emph foreign soCalled term title] model.hiLike [hi q]

model.hiLike

model.hiLike groups phrase-level elements which are typographically distinct but to which no specific function can be attributed. [3.3. Highlighting and Quotation]
Moduletei
Used by
Membershi q

model.imprintPart

model.imprintPart groups the bibliographic elements which occur inside imprints. [3.12. Bibliographic Citations and References]
Moduletei
Used by
MembersbiblScope distributor pubPlace publisher

model.inter

model.inter groups elements which can appear either within or between paragraph-like elements. [1.3. The TEI Class System]
Moduletei
Used by
Membersmodel.attributable [model.quoteLike [cit quote] said] model.biblLike [bibl biblStruct msDesc] model.egLike model.labelLike [desc] model.listLike [listApp] model.oddDecl model.stageLike

model.labelLike

model.labelLike groups elements used to gloss or explain other parts of a document.
Moduletei
Used by
Membersdesc

model.limitedPhrase

model.limitedPhrase groups phrase-level elements excluding those elements primarily intended for transcription of existing sources. [1.3. The TEI Class System]
Moduletei
Used by
Membersmodel.emphLike [emph foreign soCalled term title] model.hiLike [hi q] model.pPart.data [model.addressLike [address] model.dateLike [date] model.measureLike [measure num] model.nameLike [model.nameLike.agent [persName] model.offsetLike model.persNamePart [addName forename nameLink surname] model.placeStateLike [model.placeNamePart [placeName settlement] ] idno rs] ] model.pPart.editorial [abbr am choice ex expan] model.pPart.msdesc [locus material] model.phrase.xml model.ptrLike [ptr ref]

model.linePart

model.linePart groups transcriptional elements which appear within lines or zones of a source-oriented transcription within a <sourceDoc> element.
Moduletei
Used by
Membersmodel.hiLike [hi q] add c choice del handShift mod pc redo restore retrace seg unclear undo w zone

model.listLike

model.listLike groups list-like elements. [3.8. Lists]
Moduletei
Used by
MemberslistApp

model.lLike

model.lLike groups elements representing metrical components such as verse lines.
Moduletei
Used by
Membersl

model.lPart

model.lPart groups phrase-level elements which may appear within verse only. [6.2. Components of the Verse Line]
Moduletei
Used by
Membersrhyme

model.measureLike

model.measureLike groups elements which denote a number, a quantity, a measurement, or similar piece of text that conveys some numerical meaning. [3.6.3. Numbers and Measures]
Moduletei
Used by
Membersmeasure num

model.milestoneLike

model.milestoneLike groups milestone-style elements used to represent reference systems. [1.3. The TEI Class System 3.11.3. Milestone Elements]
Moduletei
Used by
Membersanchor fw gb lb milestone pb

model.nameLike

model.nameLike groups elements which name or refer to a person, place, or organization.
Moduletei
Used by
Membersmodel.nameLike.agent [persName] model.offsetLike model.persNamePart [addName forename nameLink surname] model.placeStateLike [model.placeNamePart [placeName settlement] ] idno rs
Note

A superset of the naming elements that may appear in datelines, addresses, statements of responsibility, etc.

model.nameLike.agent

model.nameLike.agent groups elements which contain names of individuals or corporate bodies. [3.6. Names, Numbers, Dates, Abbreviations, and Addresses]
Moduletei
Used by
MemberspersName
Note

This class is used in the content model of elements which reference names of people or organizations.

model.noteLike

model.noteLike groups globally-available note-like elements. [3.9. Notes, Annotation, and Indexing]
Moduletei
Used by
Membersnote

model.paraPart

model.paraPart groups elements that may appear in paragraphs and similar elements [3.1. Paragraphs]
Moduletei
Used by
Membersmodel.gLike model.global [model.global.edit [addSpan app delSpan gap space witDetail] model.global.meta [index listTranspose span] model.milestoneLike [anchor fw gb lb milestone pb] model.noteLike [note] metamark] model.inter [model.attributable [model.quoteLike [cit quote] said] model.biblLike [bibl biblStruct msDesc] model.egLike model.labelLike [desc] model.listLike [listApp] model.oddDecl model.stageLike] model.lLike [l] model.phrase [model.graphicLike [graphic] model.highlighted [model.emphLike [emph foreign soCalled term title] model.hiLike [hi q] ] model.lPart [rhyme] model.pPart.data [model.addressLike [address] model.dateLike [date] model.measureLike [measure num] model.nameLike [model.nameLike.agent [persName] model.offsetLike model.persNamePart [addName forename nameLink surname] model.placeStateLike [model.placeNamePart [placeName settlement] ] idno rs] ] model.pPart.edit [model.pPart.editorial [abbr am choice ex expan] model.pPart.transcriptional [add corr del handShift mod orig redo reg restore retrace secl sic supplied unclear undo] ] model.pPart.msdesc [locus material] model.phrase.xml model.ptrLike [ptr ref] model.ptrLike.form model.segLike [c pc s seg w] model.specDescLike] lg

model.persNamePart

model.persNamePart groups elements which form part of a personal name. [13.2.1. Personal Names]
Modulenamesdates
Used by
MembersaddName forename nameLink surname

model.phrase

model.phrase groups elements which can occur at the level of individual words or phrases. [1.3. The TEI Class System]
Moduletei
Used by
Membersmodel.graphicLike [graphic] model.highlighted [model.emphLike [emph foreign soCalled term title] model.hiLike [hi q] ] model.lPart [rhyme] model.pPart.data [model.addressLike [address] model.dateLike [date] model.measureLike [measure num] model.nameLike [model.nameLike.agent [persName] model.offsetLike model.persNamePart [addName forename nameLink surname] model.placeStateLike [model.placeNamePart [placeName settlement] ] idno rs] ] model.pPart.edit [model.pPart.editorial [abbr am choice ex expan] model.pPart.transcriptional [add corr del handShift mod orig redo reg restore retrace secl sic supplied unclear undo] ] model.pPart.msdesc [locus material] model.phrase.xml model.ptrLike [ptr ref] model.ptrLike.form model.segLike [c pc s seg w] model.specDescLike
Note

This class of elements can occur within paragraphs, list items, lines of verse, etc.

model.physDescPart

model.physDescPart groups specialized elements forming part of the physical description of a manuscript or similar written source.
Modulemsdescription
Used by
MembersaccMat bindingDesc handDesc objectDesc

model.placeNamePart

model.placeNamePart groups elements which form part of a place name. [13.2.3. Place Names]
Moduletei
Used by
MembersplaceName settlement

model.placeStateLike

model.placeStateLike groups elements which describe changing states of a place.
Moduletei
Used by
Membersmodel.placeNamePart [placeName settlement]

model.pLike

model.pLike groups paragraph-like elements.
Moduletei
Used by
Membersab p

model.pLike.front

model.pLike.front groups paragraph-like elements which can occur as direct constituents of front matter. [4.6. Title Pages]
Moduletei
Used by
Membersbyline dateline docAuthor docImprint docTitle epigraph head titlePart

model.pPart.data

model.pPart.data groups phrase-level elements containing names, dates, numbers, measures, and similar data. [3.6. Names, Numbers, Dates, Abbreviations, and Addresses]
Moduletei
Used by
Membersmodel.addressLike [address] model.dateLike [date] model.measureLike [measure num] model.nameLike [model.nameLike.agent [persName] model.offsetLike model.persNamePart [addName forename nameLink surname] model.placeStateLike [model.placeNamePart [placeName settlement] ] idno rs]

model.pPart.edit

model.pPart.edit groups phrase-level elements for simple editorial correction and transcription. [3.5. Simple Editorial Changes]
Moduletei
Used by
Membersmodel.pPart.editorial [abbr am choice ex expan] model.pPart.transcriptional [add corr del handShift mod orig redo reg restore retrace secl sic supplied unclear undo]

model.pPart.editorial

model.pPart.editorial groups phrase-level elements for simple editorial interventions that may be useful both in transcribing and in authoring. [3.5. Simple Editorial Changes]
Moduletei
Used by
Membersabbr am choice ex expan

model.pPart.msdesc

model.pPart.msdesc groups phrase-level elements used in manuscript description. [10. Manuscript Description]
Moduletei
Used by
Memberslocus material

model.pPart.transcriptional

model.pPart.transcriptional groups phrase-level elements used for editorial transcription of pre-existing source materials. [3.5. Simple Editorial Changes]
Moduletei
Used by
Membersadd corr del handShift mod orig redo reg restore retrace secl sic supplied unclear undo

model.profileDescPart

model.profileDescPart groups elements which may be used inside profileDesc and appear multiple times.
Moduletei
Used by
MemberscorrespDesc creation handNotes listTranspose textClass

model.ptrLike

model.ptrLike groups elements used for purposes of location and reference. [3.7. Simple Links and Cross-References]
Moduletei
Used by
Membersptr ref

model.publicationStmtPart.agency

model.publicationStmtPart.agency groups the child elements of a publicationStmt element of the TEI header that indicate an authorising agent. [2.2.4. Publication, Distribution, Licensing, etc.]
Moduletei
Used by
Membersauthority distributor publisher
Note

The ‘agency’ child elements, while not required, are required if one of the ‘detail’ child elements is to be used. It is not valid to have a ‘detail’ child element without a preceding ‘agency’ child element.

See also model.publicationStmtPart.detail.

model.publicationStmtPart.detail

model.publicationStmtPart.detail groups the agency-specific child elements of the publicationStmt element of the TEI header. [2.2.4. Publication, Distribution, Licensing, etc.]
Moduletei
Used by
Membersmodel.ptrLike [ptr ref] address availability date idno pubPlace
Note

A ‘detail’ child element may not occur unless an ‘agency’ child element precedes it.

See also model.publicationStmtPart.agency.

model.quoteLike

model.quoteLike groups elements used to directly contain quotations.
Moduletei
Used by
Memberscit quote

model.rdgLike

model.rdgLike groups elements which contain a single reading, other than the lemma, within a textual variation. [12.1. The Apparatus Entry, Readings, and Witnesses]
Moduletextcrit
Used by
Membersrdg
Note

This class allows for variants of the rdg element to be easily created via TEI customizations.

model.rdgPart

model.rdgPart groups elements which mark the beginning or ending of a fragmentary manuscript or other witness. [12.1.5. Fragmentary Witnesses]
Moduletextcrit
Used by
Memberswit
Note

These elements may appear anywhere within the elements <lem> and rdg, and also within any of their constituent elements.

model.resource

model.resource groups separate elements which constitute the content of a digital resource, as opposed to its metadata. [1.3. The TEI Class System]
Moduletei
Used by
Membersfacsimile standOff text

model.respLike

model.respLike groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element.
Moduletei
Used by
Membersauthor editor

model.segLike

model.segLike groups elements used for arbitrary segmentation. [16.3. Blocks, Segments, and Anchors 17.1. Linguistic Segment Categories]
Moduletei
Used by
Membersc pc s seg w
Note

The principles on which segmentation is carried out, and any special codes or attribute values used, should be defined explicitly in the <segmentation> element of the encodingDesc within the associated TEI header.

model.standOffPart

model.standOffPart groups elements which may be used as children of standOff.
Moduletei
Used by
Membersmodel.annotationLike [note] model.biblLike [bibl biblStruct msDesc] model.entryLike model.global.meta [index listTranspose span] model.listLike [listApp] listChange seg zone

model.teiHeaderPart

model.teiHeaderPart groups high level elements which may appear more than once in a TEI header.
Moduletei
Used by
MembersencodingDesc profileDesc

model.titlepagePart

model.titlepagePart groups elements which can occur as direct constituents of a title page, such as docTitle, docAuthor, docImprint, or epigraph. [4.6. Title Pages]
Moduletei
Used by
Membersbyline docAuthor docImprint docTitle epigraph graphic titlePart

Schema tei_lite: Attribute classes

att.anchoring

att.anchoring (anchoring) provides attributes for use on annotations, e.g. notes and groups of notes describing the existence and position of an anchor for annotations.
Moduletei
Membersnote
Attributes
anchored(anchored) indicates whether the copy text shows the exact place of reference for the note.
Status Optional
Datatype teidata.truthValue
Default true
Note

In modern texts, notes are usually anchored by means of explicit footnote or endnote symbols. An explicit indication of the phrase or line annotated may however be used instead (e.g. ‘page 218, lines 3–4’). The anchored attribute indicates whether any explicit location is given, whether by symbol or by prose cross-reference. The value true indicates that such an explicit location is indicated in the copy text; the value false indicates that the copy text does not indicate a specific place of attachment for the note. If the specific symbols used in the copy text at the location the note is anchored are to be recorded, use the n attribute.

targetEnd(target end) points to the end of the span to which the note is attached, if the note is not embedded in the text at that point.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

This attribute is retained for backwards compatibility; it may be removed at a subsequent release of the Guidelines. The recommended way of pointing to a span of elements is by means of the range function of XPointer, as further described in 16.2.4.6. range().

Example
<p>(...) tamen reuerendos dominos archiepiscopum et canonicos Leopolienses
necnon episcopum in duplicibus Quatuortemporibus<anchor xml:id="A55234"/> totaliter expediui...</p>
<!-- elsewhere in the document -->
<noteGrp targetEnd="#A55234">
 <note xml:lang="en"> Quatuor Tempora, so called dry fast days.
 </note>
 <note xml:lang="pl"> Quatuor Tempora, tzw. Suche dni postne.
 </note>
</noteGrp>

att.ascribed

att.ascribed provides attributes for elements representing speech or action that can be ascribed to a specific individual. [3.3.3. Quotation 8.3. Elements Unique to Spoken Texts]
Moduletei
Membersatt.ascribed.directed [q said] change
Attributes
whoindicates the person, or group of people, to whom the element content is ascribed.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace

In the following example from Hamlet, speeches (<sp>) in the body of the play are linked to <role> elements in the <castList> using the who attribute.

<castItem type="role">
 <role xml:id="Barnardo">Bernardo</role>
</castItem>
<castItem type="role">
 <role xml:id="Francisco">Francisco</role>
 <roleDesc>a soldier</roleDesc>
</castItem>
<!-- ... -->
<sp who="#Barnardo">
 <speaker>Bernardo</speaker>
 <l n="1">Who's there?</l>
</sp>
<sp who="#Francisco">
 <speaker>Francisco</speaker>
 <l n="2">Nay, answer me: stand, and unfold yourself.</l>
</sp>
Note

For transcribed speech, this will typically identify a participant or participant group; in other contexts, it will point to any identified <person> element.

att.ascribed.directed

att.ascribed.directed provides attributes for elements representing speech or action that can be directed at a group or individual. [3.3.3. Quotation 8.3. Elements Unique to Spoken Texts]
Moduletei
Membersq said
Attributes
toWhomindicates the person, or group of people, to whom a speech act or action is directed.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace

In the following example from Mary Pix's The False Friend, speeches (<sp>) in the body of the play are linked to <castItem> elements in the <castList> using the toWhom attribute, which is used to specify who the speech is directed to. Additionally, the <stage> includes toWhom to indicate the directionality of the action.

<castItem type="role">
 <role xml:id="emil">Emilius.</role>
</castItem>
<castItem type="role">
 <role xml:id="lov">Lovisa</role>
</castItem>
<castItem type="role">
 <role xml:id="serv">A servant</role>
</castItem>
<!-- ... -->
<sp toWhom="#lovwho="#emil">
 <speaker>Emil.</speaker>
 <l n="1">My love!</l>
</sp>
<sp toWhom="#emilwho="#lov">
 <speaker>Lov.</speaker>
 <l n="2">I have no Witness of my Noble Birth</l>
 <stage toWhom="#servwho="emil">Pointing to her Woman.</stage>
 <l>But that poor helpless wretch——</l>
</sp>
Note

To indicate the recipient of written correspondence, use the elements used in section 2.4.6. Correspondence Description, rather than a toWhom attribute.

att.breaking

att.breaking provides attributes to indicate whether or not the element concerned is considered to mark the end of an orthographic token in the same way as whitespace. [3.11.3. Milestone Elements]
Moduletei
Membersgb lb milestone pb
Attributes
breakindicates whether or not the element bearing this attribute should be considered to mark the end of an orthographic token in the same way as whitespace.
Status Recommended
Datatype teidata.enumerated
Sample values include
yes
the element bearing this attribute is considered to mark the end of any adjacent orthographic token irrespective of the presence of any adjacent whitespace
no
the element bearing this attribute is considered not to mark the end of any adjacent orthographic token irrespective of the presence of any adjacent whitespace
maybe
the encoding does not take any position on this issue.

In the following lines from the Dream of the Rood, linebreaks occur in the middle of the words lāðost and reord-berendum.

<ab> ...eƿesa tome iu icƿæs ȝeƿorden ƿita heardoſt .
leodum la<lb break="no"/> ðost ærþan ichim lifes
ƿeȝ rihtne ȝerymde reord be<lb break="no"/>
rendum hƿæt me þaȝeƿeorðode ƿuldres ealdor ofer...
</ab>

att.canonical

att.canonical provides attributes that can be used to associate a representation such as a name or title with canonical information about the object being named or referenced. [13.1.1. Linking Names and Their Referents]
Moduletei
Membersatt.naming [att.personal [addName forename persName placeName surname] author editor institution pubPlace repository rs settlement] authority correspDesc date distributor docAuthor docTitle material publisher term title
Attributes
keyprovides an externally-defined means of identifying the entity (or entities) being named, using a coded value of some kind.
Status Optional
Datatype teidata.text
<author>
 <name key="Hugo, Victor (1802-1885)"
  ref="http://www.idref.fr/026927608">
Victor Hugo</name>
</author>
Note

The value may be a unique identifier from a database, or any other externally-defined string identifying the referent. No particular syntax is proposed for the values of the key attribute, since its form will depend entirely on practice within a given project.

ref(reference) provides an explicit means of locating a full definition or identity for the entity being named by means of one or more URIs.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
<name ref="http://viaf.org/viaf/109557338"
 type="person">
Seamus Heaney</name>
Note

The value must point directly to one or more XML elements or other resources by means of one or more URIs, separated by whitespace. If more than one is supplied the implication is that the name identifies several distinct entities.

Example

In this contrived example, a canonical reference to the same organisation is provided in four different ways.

<author n="1">
 <name ref="http://nzetc.victoria.ac.nz/tm/scholarly/name-427308.html"
  type="organisation">
New Zealand Parliament, Legislative Council</name>
</author>
 
<author n="2">
 <name ref="nzvn:427308"
  type="organisation">
New Zealand Parliament, Legislative Council</name>
</author>
 
<author n="3">
 <name ref="./named_entities.xml#o427308"
  type="organisation">
New Zealand Parliament, Legislative Council</name>
</author>
 

<author n="4">
 <name key="name-427308"
  type="organisation">
New Zealand Parliament, Legislative Council</name>
</author>

The first presumes the availability of an internet connection and a processor that can resolve a URI (most can). The second requires, in addition, a <prefixDef> that declares how the nzvm prefix should be interpreted. The third does not require an internet connection, but does require that a file named named_entities.xml be in the same directory as the TEI document. The fourth requires that an entire external system for key resolution be available.

Note

The key attribute is more flexible and general-purpose, but its use in interchange requires that documentation about how the key is to be resolved be sent to the recipient of the TEI document. In contrast values of the ref attribute are resolved using the widely accepted protocols for a URI, and thus less documentation, if any, is likely required by the recipient in data interchange.

These guidelines provide no semantic basis or suggested precedence when both key and ref are provided. For this reason simultaneous use of both is not recommended unless documentation explaining the use is provided, probably in an ODD customizaiton, for interchange.

att.citing

att.citing provides attributes for specifying the specific part of a bibliographic item being cited. [1.3.1. Attribute Classes]
Moduletei
MembersbiblScope
Attributes
unitidentifies the unit of information conveyed by the element, e.g. columns, pages, volume, entry.
Status Optional
Datatype teidata.enumerated
Suggested values include:
volume
(volume) the element contains a volume number.
issue
the element contains an issue number, or volume and issue numbers.
page
(page) the element contains a page number or page range.
line
the element contains a line number or line range.
chapter
(chapter) the element contains a chapter indication (number and/or title)
part
the element identifies a part of a book or collection.
column
the element identifies a column.
entry
the element identifies an entry number or label in a list of entries.
fromspecifies the starting point of the range of units indicated by the unit attribute.
Status Optional
Datatype teidata.word
tospecifies the end-point of the range of units indicated by the unit attribute.
Status Optional
Datatype teidata.word

att.coordinated

att.coordinated provides attributes that can be used to position their parent element within a two dimensional coordinate system.
Moduletranscr
Memberssurface zone
Attributes
startindicates the element within a transcription of the text containing at least the start of the writing represented by this zone or surface.
Status Optional
Datatype teidata.pointer
ulxgives the x coordinate value for the upper left corner of a rectangular space.
Status Optional
Datatype teidata.numeric
ulygives the y coordinate value for the upper left corner of a rectangular space.
Status Optional
Datatype teidata.numeric
lrxgives the x coordinate value for the lower right corner of a rectangular space.
Status Optional
Datatype teidata.numeric
lrygives the y coordinate value for the lower right corner of a rectangular space.
Status Optional
Datatype teidata.numeric
pointsidentifies a two dimensional area by means of a series of pairs of numbers, each of which gives the x,y coordinates of a point on a line enclosing the area.
Status Optional
Datatype 3–∞ occurrences of teidata.point separated by whitespace

att.cReferencing

att.cReferencing provides attributes that may be used to supply a canonical reference as a means of identifying the target of a pointer.
Moduletei
Membersptr ref term
Attributes
cRef(canonical reference) specifies the destination of the pointer by supplying a canonical reference expressed using the scheme defined in a <refsDecl> element in the TEI header
Status Optional
Datatype teidata.text
Note

The value of cRef should be constructed so that when the algorithm for the resolution of canonical references (described in section 16.2.5. Canonical References) is applied to it the result is a valid URI reference to the intended target.

The <refsDecl> to use may be indicated with the decls attribute.

Currently these Guidelines only provide for a single canonical reference to be encoded on any given ptr element.

att.datable

att.datable provides attributes for normalization of elements that contain dates, times, or datable events. [3.6.4. Dates and Times 13.4. Dates]
Moduletei
Membersacquisition author binding change creation date editor idno licence persName placeName provenance settlement title
Attributes
periodsupplies pointers to one or more definitions of named periods of time (typically <category>s, dates or <event>s) within which the datable item is understood to have occurred.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

This ‘superclass’ provides attributes that can be used to provide normalized values of temporal information. By default, the attributes from the att.datable.w3c class are provided. If the module for names & dates is loaded, this class also provides attributes from the att.datable.iso and att.datable.custom classes. In general, the possible values of attributes restricted to the W3C datatypes form a subset of those values available via the ISO 8601 standard. However, the greater expressiveness of the ISO datatypes may not be needed, and there exists much greater software support for the W3C datatypes.

att.datable.custom

att.datable.custom provides attributes for normalization of elements that contain datable events to a custom dating system (i.e. other than the Gregorian used by W3 and ISO). [13.4. Dates]
Modulenamesdates
Membersatt.datable [acquisition author binding change creation date editor idno licence persName placeName provenance settlement title]
Attributes
when-customsupplies the value of a date or time in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace

The following are examples of custom date or time formats that are not valid ISO or W3C format normalizations, normalized to a different dating system

<p>Alhazen died in Cairo on the
<date when="1040-03-06"
  when-custom="431-06-12">
12th day of Jumada t-Tania, 430 AH
 </date>.</p>
<p>The current world will end at the
<date when="2012-12-21"
  when-custom="13.0.0.0.0">
end of B'ak'tun 13</date>.</p>
<p>The Battle of Meggidu
(<date when-custom="Thutmose_III:23">23rd year of reign of Thutmose III</date>).</p>
<p>Esidorus bixit in pace annos LXX plus minus sub
<date when-custom="Ind:4-10-11">die XI mensis Octobris indictione IIII</date>
</p>

Not all custom date formulations will have Gregorian equivalents.

The when-custom attribute and other custom dating are not constrained to a datatype by the TEI, but individual projects are recommended to regularize and document their dating formats.

notBefore-customspecifies the earliest possible date for the event in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
notAfter-customspecifies the latest possible date for the event in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
from-customindicates the starting point of the period in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
<event datingMethod="#julian"
 from-custom="1666-09-02to-custom="1666-09-05xml:id="FIRE1">

 <head>The Great Fire of London</head>
 <p>The Great Fire of London burned through a large part
   of the city of London.</p>
</event>
to-customindicates the ending point of the period in some custom standard form.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
datingPointsupplies a pointer to some location defining a named point in time with reference to which the datable item is understood to have occurred
Status Optional
Datatype teidata.pointer
datingMethodsupplies a pointer to a <calendar> element or other means of interpreting the values of the custom dating attributes.
Status Optional
Datatype teidata.pointer
Contayning the Originall, Antiquity, Increaſe, Moderne
eſtate, and deſcription of that Citie, written in the yeare
<date calendar="#julian"
 datingMethod="#julianwhen-custom="1598">
1598</date>. by Iohn Stow
Citizen of London.

In this example, the calendar attribute points to a <calendar> element for the Julian calendar, specifying that the text content of the date element is a Julian date, and the datingMethod attribute also points to the Julian calendar to indicate that the content of the when-custom attribute value is Julian too.

<date datingMethod="#creationOfWorld"
 when="1382-06-28when-custom="6890-06-20">
μηνὶ Ἰουνίου εἰς <num>κ</num> ἔτους <num>ςωϞ</num>
</date>

In this example, a date is given in a Mediaeval text measured ‘from the creation of the world’, which is normalized (in when) to the Gregorian date, but is also normalized (in when-custom) to a machine-actionable, numeric version of the date from the Creation.

Note

Note that the datingMethod attribute (unlike calendar defined in att.datable) defines the calendar or dating system to which the date described by the parent element is normalized (i.e. in the when-custom or other X-custom attributes), not the calendar of the original date in the element.

att.datable.iso

att.datable.iso provides attributes for normalization of elements that contain datable events using the ISO 8601:2004 standard. [3.6.4. Dates and Times 13.4. Dates]
Modulenamesdates
Membersatt.datable [acquisition author binding change creation date editor idno licence persName placeName provenance settlement title]
Attributes
when-isosupplies the value of a date or time in a standard form.
Status Optional
Datatype teidata.temporal.iso

The following are examples of ISO date, time, and date & time formats that are not valid W3C format normalizations.

<date when-iso="1996-09-24T07:25+00">Sept. 24th, 1996 at 3:25 in the morning</date>
<date when-iso="1996-09-24T03:25-04">Sept. 24th, 1996 at 3:25 in the morning</date>
<time when-iso="1999-01-04T20:42-05">4 Jan 1999 at 8:42 pm</time>
<time when-iso="1999-W01-1T20,70-05">4 Jan 1999 at 8:42 pm</time>
<date when-iso="2006-05-18T10:03">a few minutes after ten in the morning on Thu 18 May</date>
<time when-iso="03:00">3 A.M.</time>
<time when-iso="14">around two</time>
<time when-iso="15,5">half past three</time>

All of the examples of the when attribute in the att.datable.w3c class are also valid with respect to this attribute.

He likes to be punctual. I said <q>
 <time when-iso="12">around noon</time>
</q>, and he showed up at <time when-iso="12:00:00">12 O'clock</time> on the dot.

The second occurence of <time> could have been encoded with the when attribute, as 12:00:00 is a valid time with respect to the W3C XML Schema Part 2: Datatypes Second Edition specification. The first occurence could not.

notBefore-isospecifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype teidata.temporal.iso
notAfter-isospecifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype teidata.temporal.iso
from-isoindicates the starting point of the period in standard form.
Status Optional
Datatype teidata.temporal.iso
to-isoindicates the ending point of the period in standard form.
Status Optional
Datatype teidata.temporal.iso
Note

The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by ISO 8601:2004, using the Gregorian calendar.

If both when-iso and dur-iso are specified, the values should be interpreted as indicating a span of time by its starting time (or date) and duration. That is,
<date dur-iso="P8Dwhen-iso="2007-06-01"/>
indicates the same time period as
<date when-iso="2007-06-01/P8D"/>

In providing a ‘regularized’ form, no claim is made that the form in the source text is incorrect; the regularized form is simply that chosen as the main form for purposes of unifying variant forms under a single heading.

att.datable.w3c

att.datable.w3c provides attributes for normalization of elements that contain datable events conforming to the W3C XML Schema Part 2: Datatypes Second Edition. [3.6.4. Dates and Times 13.4. Dates]
Moduletei
Membersatt.datable [acquisition author binding change creation date editor idno licence persName placeName provenance settlement title]
Attributes
whensupplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.
Status Optional
Datatype teidata.temporal.w3c

Examples of W3C date, time, and date & time formats.

<p>
 <date when="1945-10-24">24 Oct 45</date>
 <date when="1996-09-24T07:25:00Z">September 24th, 1996 at 3:25 in the morning</date>
 <time when="1999-01-04T20:42:00-05:00">Jan 4 1999 at 8 pm</time>
 <time when="14:12:38">fourteen twelve and 38 seconds</time>
 <date when="1962-10">October of 1962</date>
 <date when="--06-12">June 12th</date>
 <date when="---01">the first of the month</date>
 <date when="--08">August</date>
 <date when="2006">MMVI</date>
 <date when="0056">AD 56</date>
 <date when="-0056">56 BC</date>
</p>
This list begins in
the year 1632, more precisely on Trinity Sunday, i.e. the Sunday after
Pentecost, in that year the
<date calendar="#julianwhen="1632-06-06">27th of May (old style)</date>.
<opener>
 <dateline>
  <placeName>Dorchester, Village,</placeName>
  <date when="1828-03-02">March 2d. 1828.</date>
 </dateline>
 <salute>To
   Mrs. Cornell,</salute> Sunday <time when="12:00:00">noon.</time>
</opener>
Schematron

<sch:rule context="tei:*[@when]">
<sch:report role="nonfatal"
 test="@notBefore|@notAfter|@from|@to">
The @when attribute cannot be used with any other att.datable.w3c attributes.</sch:report>
</sch:rule>
Schematron

<sch:rule context="tei:*[@from]">
<sch:report role="nonfatal"
 test="@notBefore">
The @from and @notBefore attributes cannot be used together.</sch:report>
</sch:rule>
Schematron

<sch:rule context="tei:*[@to]">
<sch:report role="nonfatal"
 test="@notAfter">
The @to and @notAfter attributes cannot be used together.</sch:report>
</sch:rule>
Example
<date from="1863-05-28to="1863-06-01">28 May through 1 June 1863</date>
Note

The value of these attributes should be a normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by XML Schema Part 2: Datatypes Second Edition, using the Gregorian calendar.

The most commonly-encountered format for the date portion of a temporal attribute is yyyy-mm-dd, but yyyy, --mm, ---dd, yyyy-mm, or --mm-dd may also be used. For the time part, the form hh:mm:ss is used.

Note that this format does not currently permit use of the value 0000 to represent the year 1 BCE; instead the value -0001 should be used.

att.datcat

att.datcat provides attributes that are used to align XML elements or attributes with the appropriate Data Categories (DCs) defined by an external taxonomy, in this way establishing the identity of information containers and values, and providing means of interpreting them. [9.5.2. Lexical View 18.3. Other Atomic Feature Values]
Moduletei
Membersatt.lexicographic [xr] att.segLike [c pc s seg w]
Attributes
datcatprovides a pointer to a definition of, and/or general information about, (a) an information container (element or attribute) or (b) a value of an information container (element content or attribute value), by referencing an external taxonomy or ontology. If valueDatcat is present in the immediate context, this attribute takes on role (a), while valueDatcat performs role (b).
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
valueDatcatprovides a definition of, and/or general information about a value of an information container (element content or attribute value), by reference to an external taxonomy or ontology. Used especially where a contrast with datcat is needed.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
targetDatcatprovides a definition of, and/or general information about, information structure of an object referenced or modeled by the containing element, by reference to an external taxonomy or ontology. This attribute has the characteristics of the datcat attribute, except that it addresses not its containing element, but an object that is being referenced or modeled by its containing element.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Example

The example below presents the TEI encoding of the name-value pair <part of speech, common noun>, where the name (key) ‘part of speech’ is abbreviated as ‘POS’, and the value, ‘common noun’ is symbolized by ‘NN’. The entire name-value pair is encoded by means of the element <f>. In TEI XML, that element acts as the container, labeled with the name attribute. Its contents may be complex or simple. In the case at hand, the content is the symbol ‘NN’.

The datcat attribute relates the feature name (i.e., the key) to the data category ‘part of speech’, while the attribute valueDatcat relates the feature value to the data category common noun. Both these data categories should be defined in an external and preferably open reference taxonomy or ontology.

<fs>
 <f datcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3"
  name="POS">

  <symbol value="NN"
   valueDatcat="http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545"/>

 </f>
<!-- ... -->
</fs>

‘NN’ is the symbol for common noun used e.g. in the CLAWS-7 tagset defined by the University Centre for Computer Corpus Research on Language at the University of Lancaster. The very same data category used for tagging an early version of the British National Corpus, and coming from the BNC Basic (C5) tagset, uses the symbol ‘NN0’ (rather than ‘NN’). Making these values semantically interoperable would be extremely difficult without a human expert if they were not anchored in a single point of an established reference taxonomy of morphosyntactic data categories. In the case at hand, the string http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545 is both a persistent identifier of the data category in question, as well as a pointer to a shared definition of common noun.

While the symbols ‘NN’, ‘NN0’, and many others (often coming from languages other than English) are implicitly members of the container category ‘part of speech’, it is sometimes useful not to rely on such an implicit relationship but rather use an explicit identifier for that data category, to distinguish it from other morphosyntactic data categories, such as gender, tense, etc. For that purpose, the above example uses the datcat attribute to reference a definition of part of speech. The reference taxonomy in this example is the CLARIN Concept Registry.

If the feature structure markup exemplified above is to be repeated many times in a single document, it is much more efficient to gather the persistent identifiers in a single place and to only reference them, implicitly or directly, from feature structure markup. The following example is much more concise than the one above and relies on the concepts of feature structure declaration and feature value library, discussed in chapter [ID FS in TEI Guidelines].
<fs>
 <f fVal="#commonNounname="POS"/>
<!-- ... -->
</fs>

The assumption here is that the relevant feature values are collected in a place that the annotation document in question has access to — preferably, a single document per linguistic resource, for example an <fsdDecl> that is XIncluded as a sibling of text or a child of encodingDesc; a <taxonomy> available resource-wide (e.g., in a shared header) is also an option.

The example below presents an <fvLib> element that collects the relevant feature values (most of them omitted). At the same time, this example shows one way of encoding a tagset, i.e., an established inventory of values of (in the case at hand) morphosyntactic categories.
<fvLib n="POS values">
 <symbol datcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3"
  value="NNxml:id="commonNoun"/>

 <symbol datcat="http://hdl.handle.net/11459/CCR_C-1371_fbebd9ec-a7f4-9a36-d6e9-88ee16b944ae"
  value="NPxml:id="properNoun"/>

<!-- ... -->
</fvLib>

Note that these Guidelines do not prescribe a specific choice between datcat and valueDatcat in such cases. The former is the generic way of referencing a data category, whereas the latter is more specific, in that it references a data category that represents a value. The choice between them comes into play where a single element — or a tight element complex, such as the <f>/<symbol> complex illustrated above — make it necessary or useful to distinguish between the container data category and its value.

Example

In the context of dictionaries designed with semantic interoperability in mind, the following example ensures that the <pos> element is interpreted as the same information container as in the case of the example of <f name="POS"> above.

<gramGrp>
 <pos datcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3"
  valueDatcat="http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545">
NN</pos>
</gramGrp>
Efficiency of this type of interoperable markup demands that the references to the particular data categories should best be provided in a single place within the dictionary (or a single place within the project), rather than being repeated inside every entry. For the container elements, this can be achieved at the level of <tagUsage>, although here, the valueDatcat attribute should be used, because it is not the <tagUsage> element that is associated with the relevant data category, but rather the element <pos> (or <case>, etc.) that is described by <tagUsage>:
<tagsDecl partial="true">
<!-- ... -->
 <namespace name="http://www.tei-c.org/ns/1.0">
  <tagUsage gi="pos"
   targetDatcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3">
Contains the part of speech.</tagUsage>
  <tagUsage gi="case"
   targetDatcat="http://hdl.handle.net/11459/CCR_C-1840_9f4e319c-f233-6c90-9117-7270e215f039">
Contains information about the grammatical case that the described form is inflected for.</tagUsage>
<!-- ... -->
 </namespace>
</tagsDecl>
Another possibility is to shorten the URIs by means of the <prefixDef> mechanism, as illustrated below:
<listPrefixDef>
 <prefixDef ident="ccrmatchPattern="pos"
  replacementPattern="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3"/>

 <prefixDef ident="ccrmatchPattern="adj"
  replacementPattern="http://hdl.handle.net/11459/CCR_C-1230_23653c21-fca1-edf8-fd7c-3df2d6499157"/>

</listPrefixDef>
<!-- ... -->
<entry>
<!--...-->
 <form>
  <orth>isotope</orth>
 </form>
 <gramGrp>
  <pos datcat="ccr:pos"
   valueDatcat="ccr:adj">
adj</pos>
 </gramGrp>
<!--...-->
</entry>

This mechanism creates implications that are not always wanted, among others, in the case at hand, suggesting that the identifiers ‘pos’ and ‘adj’ belong to a namespace associated with the CLARIN Concept Repository (CCR), whereas that is solely a shorthand mechanism whose scope is the current resource. Documenting this clearly in the header of the dictionary is therefore advised.

Yet another possibility is to associate the information about the relationship between a TEI markup element and the data category that it is intended to model already at the level of modeling the dictionary resource, that is, at the level of the ODD, in the <equiv> element that is a child of <elementSpec> or <attDef>.

Example

The <taxonomy> element is a handy tool for encoding taxonomies that are later referenced by att.datcat attributes, but it can also act as an intermediary device, for example holding a fragment of an external taxonomy (or ‘flattening’ an external ontology) that is relevant to the project or document at hand. (It is also imaginable that, for the purpose of the project at hand, the local <taxonomy> element combines vocabularies that originate from more than one external taxonomy or ontology.) In such cases, the <taxonomy> creates a local layer of indirection: the att.datcat attributes internal to the resource may reference the <category> elements stored in the header (as well as the <taxonomy> element itself), whereas these same <category> and <taxonomy> elements use att.datcat attributes to reference the original taxonomy or ontology.

<encodingDesc>
<!-- ... -->
 <classDecl>
<!-- ... -->
  <taxonomy datcat="https://universaldependencies.org/u/dep/index.html"
   xml:id="UD-SYN">

   <desc>
    <term>UD syntactic relations</term>
   </desc>
   <category valueDatcat="https://universaldependencies.org/u/dep/acl.html"
    xml:id="acl">

    <catDesc>
     <term>acl</term>: Clausal modifier of noun (adjectival clause)</catDesc>
   </category>
   <category valueDatcat="https://universaldependencies.org/u/dep/acl-relcl.html"
    xml:id="acl_relcl">

    <catDesc>
     <term>acl:relcl</term>: relative clause modifier</catDesc>
   </category>
   <category valueDatcat="https://universaldependencies.org/u/dep/advcl.html"
    xml:id="advcl">

    <catDesc>
     <term>advcl</term>: Adverbial clause modifier</catDesc>
   </category>
<!-- ... -->
  </taxonomy>
 </classDecl>
</encodingDesc>

The above fragment was excerpted from the GB subset of the ParlaMint project in April 2023, and enriched with att.datcat attributes for the purpose of illustrating the mechanism described here.

Note that, in the ideal case, the values of att.datcat attributes should be persistent identifiers, and that the addressing scheme of Universal Dependencies is treated here as persistent for the sake of illustration. Note also that the contrast between datcat used on <taxonomy> on the one hand, and the valueDatcat used on <category> on the other, is not mandatory: both kinds of relations could be encoded by means of the generic datcat attribute, but using the former for the container and the latter for the content is more user-friendly.

Example

The targetDatcat attribute is designed to be used in, e.g., feature structure declarations, and is analogous to the targetLang attribute of the att.pointing class, in that it describes the object that is being referenced, rather than the referencing object.

<fDecl name="POS"
 targetDatcat="http://hdl.handle.net/11459/CCR_C-396_5a972b93-2294-ab5c-a541-7c344c5f26c3">

 <fDescr>part of speech (morphosyntactic category)</fDescr>
 <vRange>
  <vAlt>
   <symbol datcat="http://hdl.handle.net/11459/CCR_C-1256_7ec6083c-23d4-224d-6f94-eecbe6861545"
    value="NN"/>

   <symbol datcat="http://hdl.handle.net/11459/CCR_C-1371_fbebd9ec-a7f4-9a36-d6e9-88ee16b944ae"
    value="NP"/>

<!-- ... -->
  </vAlt>
 </vRange>
</fDecl>

Above, the <fDecl> uses targetDatcat, because if it were to use datcat, it would be asserting that it is an instance of the container data category part of speech, whereas it is not — it models a container (<f>) that encodes a part of speech. Note also that it is the <f> that is modeled above, not its values, which are used as direct references to data categories; hence the use of datcat in the <symbol> element.

Example

The att.datcat attributes can be used for any sort of taxonomies. The example below illustrates their usefulness for describing usage domain labels in dictionaries on the example of the Diccionario da Lingua Portugueza by António de Morais Silva, retro-digitised in the MORDigital project.


<!-- in the dictionary header --><encodingDesc>
 <classDecl>
  <taxonomy xml:id="domains">
<!--...-->
   <category xml:id="domain.medical_and_health_sciences">
    <catDesc xml:lang="en">Medical and Health Sciences</catDesc>
    <catDesc xml:lang="pt">Ciências Médicas e da Saúde</catDesc>
    <category valueDatcat="https://vocabs.rossio.fcsh.unl.pt/pub/morais_domains/pt/page/0025"
     xml:id="domain.medical_and_health_sciences.medicine">

     <catDesc xml:lang="en">
      <term>Medicine</term>
      <gloss>
<!--...-->
      </gloss>
     </catDesc>
     <catDesc xml:lang="pt">
      <term>Medicina</term>
      <gloss>
<!--...-->
      </gloss>
     </catDesc>
    </category>
   </category>
<!--...-->
  </taxonomy>
 </classDecl>
</encodingDesc>
<!-- inside an <entry> element: -->
<usg type="domain"
 valueDatcat="#domain.medical_and_health_sciences.medicine">
Med.</usg>

In the Morais dictionary, the relevant domain labels are in the header, getting referenced inside the dictionary, from <usg> elements. The vocabulary used for dictionary-internal labelling is in turn anchored in the MorDigital controlled vocabulary service of the NOVA University of Lisbon – School of Social Sciences and Humanities (NOVA FCSH).

Note

The TEI Abstract Model can be expressed as a hierarchy of attribute-value matrices (AVMs) of various types and of various levels of complexity, nested or grouped in various ways. At the most abstract level, an AVM consists of an information container and the value (contents) of that container.

A simple example of an XML serialization of such structures is, on the one hand, the opening and closing tags that delimit and name the container, and, on the other, the content enclosed by the two tags that constitues the value. An analogous example is an attribute name and the value of that attribute.

In a TEI XML example of two equivalent serializations expressing the name-value pair <part-of-speech,common-noun>, namely <pos>commonNoun</pos> and pos="common-noun", one would classify the element <pos> and the attribute pos as containers (mapping onto the first member of the relevant name-value pair), while the character data content of <pos> or the value of pos would be seen as mapping onto the second member of the pair.

The att.datcat class provides means of addressing the containers and their values, while at the same time providing a way to interpret them in the context of external taxonomies or ontologies. Aligning e.g. both the <pos> element and the pos attribute with the same value of an external reference point (i.e., an entry in an agreed taxonomy) affirms the identity of the concept serialised by both the element container and the attribute container, and optionally provides a definition of that concept (in the case at hand, the concept part of speech).

The value of the att.datcat attributes should be a PID (persistent identifier) that points to a specific — and, ideally, shared — taxonomy or ontology. Among the resources that can, to a lesser or greater extent, be used as inventories of (more or less) standardized linguistic categories are the GOLD ontology, CLARIN CCR, OLiA, or TermWeb's DatCatInfo, and also the Universal Dependencies inventory, on the assumption that its URIs are going to persist. It is imaginable that a project may choose to address a local taxonomy store instead, but this risks losing the advantage of interchangeability with other projects.

Historically, datcat and valueDatcat originate from the (now obsolete) ISO 12620:2009 standard, describing the data model and procedures for a Data Category Registry (DCR). The current version of that standard, ISO 12620-1, does not standardize the serialization of pointers, merely mentioning the TEI att.datcat as an example.

Note that no constraint prevents the occurrence of a combination of att.datcat attributes: the <fDecl> element, which is a natural bearer of the targetDatcat attribute, is an instance of a specific modeling element, and, in principle, could be semantically fixed by an appropriate reference taxonomy of modeling devices.

att.declarable

att.declarable provides attributes for those elements in the TEI header which may be independently selected by means of the special purpose decls attribute. [15.3. Associating Contextual Information with a Text]
Moduletei
Membersavailability bibl biblStruct correspDesc editorialDecl listApp normalization projectDesc sourceDesc textClass
Attributes
defaultindicates whether or not this element is selected by default when its parent is selected.
Status Optional
Datatype teidata.truthValue
Legal values are:
true
This element is selected if its parent is selected
false
This element can only be selected explicitly, unless it is the only one of its kind, in which case it is selected if its parent is selected. [Default]
Note

The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. Associating Contextual Information with a Text. Only one element of a particular type may have a default attribute with a value of true.

att.declaring

att.declaring provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element. [15.3. Associating Contextual Information with a Text]
Moduletei
Membersab back body div facsimile front graphic lg msDesc p ptr ref standOff surface term text
Attributes
decls(declarations) identifies one or more declarable elements within the header, which are understood to apply to the element bearing this attribute and its content.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. Associating Contextual Information with a Text.

att.dimensions

att.dimensions provides attributes for describing the size of physical objects.
Moduletei
Membersadd addSpan date del delSpan ex gap mod redo restore retrace secl space supplied unclear undo
Attributes
unitnames the unit used for the measurement
Status Optional
Datatype teidata.enumerated
Suggested values include:
cm
(centimetres)
mm
(millimetres)
in
(inches)
line
lines of text
char
(characters) characters of text
quantityspecifies the length in the units specified
Status Optional
Datatype teidata.numeric
extentindicates the size of the object concerned using a project-specific vocabulary combining quantity and units in a single string of words.
Status Optional
Datatype teidata.text
<gap extent="5 words"/>
<height extent="half the page"/>
precisioncharacterizes the precision of the values specified by the other attributes.
Status Optional
Datatype teidata.certainty
scopewhere the measurement summarizes more than one observation, specifies the applicability of this measurement.
Status Optional
Datatype teidata.enumerated
Sample values include:
all
measurement applies to all instances.
most
measurement applies to most of the instances inspected.
range
measurement applies to only the specified range of instances.

att.divLike

att.divLike provides attributes common to all elements which behave in the same way as divisions. [4. Default Text Structure]
Moduletei
Membersdiv lg
Attributes
org(organization) specifies how the content of the division is organized.
Status Optional
Datatype teidata.enumerated
Legal values are:
composite
no claim is made about the sequence in which the immediate contents of this division are to be processed, or their inter-relationships.
uniform
the immediate contents of this element are regarded as forming a logical unit, to be processed in sequence. [Default]
sampleindicates whether this division is a sample of the original source and if so, from which part.
Status Optional
Datatype teidata.enumerated
Legal values are:
initial
division lacks material present at end in source.
medial
division lacks material at start and end.
final
division lacks material at start.
unknown
position of sampled material within original unknown.
complete
division is not a sample. [Default]

att.docStatus

att.docStatus provides attributes for use on metadata elements describing the status of a document.
Moduletei
Membersbibl biblStruct change msDesc revisionDesc
Attributes
statusdescribes the status of a document either currently or, when associated with a dated element, at the time indicated.
Status Optional
Datatype teidata.enumerated
Sample values include:
approved
candidate
cleared
deprecated
draft
[Default]
embargoed
expired
frozen
galley
proposed
published
recommendation
submitted
unfinished
withdrawn
Example
<revisionDesc status="published">
 <change status="published"
  when="2010-10-21"/>

 <change status="clearedwhen="2010-10-02"/>
 <change status="embargoed"
  when="2010-08-02"/>

 <change status="frozenwhen="2010-05-01"
  who="#MSM"/>

 <change status="draftwhen="2010-03-01"
  who="#LB"/>

</revisionDesc>

att.edition

att.edition provides attributes identifying the source edition from which some encoded feature derives.
Moduletei
Membersgb lb milestone pb
Attributes
ed(edition) supplies a sigil or other arbitrary identifier for the source edition in which the associated feature (for example, a page, column, or line break) occurs at this point in the text.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
edRef(edition reference) provides a pointer to the source edition in which the associated feature (for example, a page, column, or line break) occurs at this point in the text.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Example
<l>Of Mans First Disobedience,<lb ed="1674"/> and<lb ed="1667"/> the Fruit</l>
<l>Of that Forbidden Tree, whose<lb ed="1667 1674"/> mortal tast</l>
<l>Brought Death into the World,<lb ed="1667"/> and all<lb ed="1674"/> our woe,</l>
Example
<listBibl>
 <bibl xml:id="stapledon1937">
  <author>Olaf Stapledon</author>,
 <title>Starmaker</title>, <publisher>Methuen</publisher>, <date>1937</date>
 </bibl>
 <bibl xml:id="stapledon1968">
  <author>Olaf Stapledon</author>,
 <title>Starmaker</title>, <publisher>Dover</publisher>, <date>1968</date>
 </bibl>
</listBibl>
<!-- ... -->
<p>Looking into the future aeons from the supreme moment of
the cosmos, I saw the populations still with all their
strength maintaining the<pb edRef="#stapledon1968n="411"/>essentials of their ancient culture,
still living their personal lives in zest and endless
novelty of action, … I saw myself still
preserving, though with increasing difficulty, my lucid
con-<pb edRef="#stapledon1937n="291"/>sciousness;</p>

att.editLike

att.editLike provides attributes describing the nature of an encoded scholarly intervention or interpretation of any kind. [3.5. Simple Editorial Changes 10.3.1. Origination 13.3.2. The Person Element 11.3.1.1. Core Elements for Transcriptional Work]
Moduletei
Membersatt.transcriptional [add addSpan del delSpan mod redo restore retrace undo] am corr date ex expan gap persName placeName reg secl supplied unclear
Attributes
evidenceindicates the nature of the evidence supporting the reliability or accuracy of the intervention or interpretation.
Status Optional
Datatype 1–∞ occurrences of teidata.enumerated separated by whitespace
Suggested values include:
internal
there is internal evidence to support the intervention.
external
there is external evidence to support the intervention.
conjecture
the intervention or interpretation has been made by the editor, cataloguer, or scholar on the basis of their expertise.
instantindicates whether this is an instant revision or not.
Status Optional
Datatype teidata.xTruthValue
Default false
Note

The members of this attribute class are typically used to represent any kind of editorial intervention in a text, for example a correction or interpretation, or to date or localize manuscripts etc.

Each pointer on the source (if present) corresponding to a witness or witness group should reference a bibliographic citation such as a witness, msDesc, or bibl element, or another external bibliographic citation, documenting the source concerned.

att.enjamb

att.enjamb (enjambement) provides attributes that may be used to indicate enjambement of the parent element. [6.2. Components of the Verse Line]
Moduleverse
Membersl
Attributes
enjamb(enjambement) indicates that the end of a verse line is marked by enjambement.
Status Optional
Datatype teidata.enumerated
Sample values include:
no
the line is end-stopped
yes
the line in question runs on into the next
weak
the line is weakly enjambed
strong
the line is strongly enjambed
Note

The usual practice will be to give the value ‘yes’ to this attribute when enjambement is being marked, or the values ‘weak’ and ‘strong’ if degrees of enjambement are of interest; if no value is given, however, the attribute does not default to a value of ‘no’; this allows the attribute to be omitted entirely when enjambement is not of particular interest.

att.fragmentable

att.fragmentable provides attributes for representing fragmentation of a structural element, typically as a consequence of some overlapping hierarchy.
Moduletei
Membersatt.divLike [div lg] att.segLike [c pc s seg w] ab l p
Attributes
partspecifies whether or not its parent element is fragmented in some way, typically by some other overlapping structure: for example a speech which is divided between two or more verse stanzas, a paragraph which is split across a page division, a verse line which is divided between two speakers.
Status Optional
Datatype teidata.enumerated
Legal values are:
Y
(yes) the element is fragmented in some (unspecified) respect
N
(no) the element is not fragmented, or no claim is made as to its completeness [Default]
I
(initial) this is the initial part of a fragmented element
M
(medial) this is a medial part of a fragmented element
F
(final) this is the final part of a fragmented element
Note

The values I, M, or F should be used only where it is clear how the element may be reconstituted.

att.global

att.global provides attributes common to all elements in the TEI encoding scheme. [1.3.1.1. Global Attributes]
Moduletei
MembersTEI ab abbr accMat acquisition add addName addSpan addrLine address am anchor app author authority availability back bibl biblScope biblStruct binding bindingDesc body byline c change choice cit closer collation corr correspAction correspDesc creation date dateline decoNote del delSpan desc distributor div docAuthor docImprint docTitle editor editorialDecl emph encodingDesc epigraph ex expan extent facsimile fileDesc filiation foliation foreign forename front fw gap gb graphic handDesc handNote handNotes handShift head hi history idno imprint index institution item keywords l layout layoutDesc lb lg licence listApp listChange listTranspose locus material measure metamark milestone mod monogr msContents msDesc msIdentifier msItem msItemStruct nameLink normalization note notesStmt num objectDesc opener orig p pb pc persName physDesc placeName profileDesc projectDesc provenance ptr pubPlace publicationStmt publisher q quote rdg rdgGrp redo ref reg relatedItem repository restore retrace revisionDesc rhyme rs s said salute secl seg settlement sic signed soCalled sourceDesc space span standOff supplied support supportDesc surface surname teiHeader term text textClass title titlePage titlePart titleStmt transpose unclear undo variantEncoding w wit witDetail witness xr zone
Attributes
xml:id(identifier) provides a unique identifier for the element bearing the attribute.
Status Optional
Datatype ID
Note

The xml:id attribute may be used to specify a canonical reference for an element; see section 3.11. Reference Systems.

n(number) gives a number (or other label) for an element, which is not necessarily unique within the document.
Status Optional
Datatype teidata.text
Note

The value of this attribute is always understood to be a single token, even if it contains space or other punctuation characters, and need not be composed of numbers only. It is typically used to specify the numbering of chapters, sections, list items, etc.; it may also be used in the specification of a standard reference system for the text.

xml:lang(language) indicates the language of the element content using a ‘tag’ generated according to BCP 47.
Status Optional
Datatype teidata.language
<p> … The consequences of
this rapid depopulation were the loss of the last
<foreign xml:lang="rap">ariki</foreign> or chief
(Routledge 1920:205,210) and their connections to
ancestral territorial organization.</p>
Note

The xml:lang value will be inherited from the immediately enclosing element, or from its parent, and so on up the document hierarchy. It is generally good practice to specify xml:lang at the highest appropriate level, noticing that a different default may be needed for the teiHeader from that needed for the associated resource element or elements, and that a single TEI document may contain texts in many languages.

Only attributes with free text values (rare in these guidelines) will be in the scope of xml:lang.

The authoritative list of registered language subtags is maintained by IANA and is available at https://www.iana.org/assignments/language-subtag-registry. For a good general overview of the construction of language tags, see https://www.w3.org/International/articles/language-tags/, and for a practical step-by-step guide, see https://www.w3.org/International/questions/qa-choosing-language-tags.en.php.

The value used must conform with BCP 47. If the value is a private use code (i.e., starts with x- or contains -x-), a <language> element with a matching value for its ident attribute should be supplied in the TEI header to document this value. Such documentation may also optionally be supplied for non-private-use codes, though these must remain consistent with their (IETF)Internet Engineering Task Force definitions.

xml:spacesignals an intention about how white space should be managed by applications.
Status Optional
Datatype teidata.enumerated
Legal values are:
default
signals that the application's default white-space processing modes are acceptable
preserve
indicates the intent that applications preserve all white space
Note

The XML specification provides further guidance on the use of this attribute. Note that many parsers may not handle xml:space correctly.

att.global.analytic

att.global.analytic provides additional global attributes for associating specific analyses or interpretations with appropriate portions of a text. [17.2. Global Attributes for Simple Analyses 17.3. Spans and Interpretations]
Moduleanalysis
Membersatt.global [TEI ab abbr accMat acquisition add addName addSpan addrLine address am anchor app author authority availability back bibl biblScope biblStruct binding bindingDesc body byline c change choice cit closer collation corr correspAction correspDesc creation date dateline decoNote del delSpan desc distributor div docAuthor docImprint docTitle editor editorialDecl emph encodingDesc epigraph ex expan extent facsimile fileDesc filiation foliation foreign forename front fw gap gb graphic handDesc handNote handNotes handShift head hi history idno imprint index institution item keywords l layout layoutDesc lb lg licence listApp listChange listTranspose locus material measure metamark milestone mod monogr msContents msDesc msIdentifier msItem msItemStruct nameLink normalization note notesStmt num objectDesc opener orig p pb pc persName physDesc placeName profileDesc projectDesc provenance ptr pubPlace publicationStmt publisher q quote rdg rdgGrp redo ref reg relatedItem repository restore retrace revisionDesc rhyme rs s said salute secl seg settlement sic signed soCalled sourceDesc space span standOff supplied support supportDesc surface surname teiHeader term text textClass title titlePage titlePart titleStmt transpose unclear undo variantEncoding w wit witDetail witness xr zone]
Attributes
ana(analysis) indicates one or more elements containing interpretations of the element on which the ana attribute appears.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

When multiple values are given, they may reflect either multiple divergent interpretations of an ambiguous text, or multiple mutually consistent interpretations of the same passage in different contexts.

att.global.change

att.global.change provides attributes allowing its member elements to specify one or more states or revision campaigns with which they are associated.
Moduletranscr
Membersatt.global [TEI ab abbr accMat acquisition add addName addSpan addrLine address am anchor app author authority availability back bibl biblScope biblStruct binding bindingDesc body byline c change choice cit closer collation corr correspAction correspDesc creation date dateline decoNote del delSpan desc distributor div docAuthor docImprint docTitle editor editorialDecl emph encodingDesc epigraph ex expan extent facsimile fileDesc filiation foliation foreign forename front fw gap gb graphic handDesc handNote handNotes handShift head hi history idno imprint index institution item keywords l layout layoutDesc lb lg licence listApp listChange listTranspose locus material measure metamark milestone mod monogr msContents msDesc msIdentifier msItem msItemStruct nameLink normalization note notesStmt num objectDesc opener orig p pb pc persName physDesc placeName profileDesc projectDesc provenance ptr pubPlace publicationStmt publisher q quote rdg rdgGrp redo ref reg relatedItem repository restore retrace revisionDesc rhyme rs s said salute secl seg settlement sic signed soCalled sourceDesc space span standOff supplied support supportDesc surface surname teiHeader term text textClass title titlePage titlePart titleStmt transpose unclear undo variantEncoding w wit witDetail witness xr zone]
Attributes
changepoints to one or more change elements documenting a state or revision campaign to which the element bearing this attribute and its children have been assigned by the encoder.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace

att.global.facs

att.global.facs provides attributes used to express correspondence between an element and all or part of a facsimile image or surface. [11.1. Digital Facsimiles]
Moduletranscr
Membersatt.global [TEI ab abbr accMat acquisition add addName addSpan addrLine address am anchor app author authority availability back bibl biblScope biblStruct binding bindingDesc body byline c change choice cit closer collation corr correspAction correspDesc creation date dateline decoNote del delSpan desc distributor div docAuthor docImprint docTitle editor editorialDecl emph encodingDesc epigraph ex expan extent facsimile fileDesc filiation foliation foreign forename front fw gap gb graphic handDesc handNote handNotes handShift head hi history idno imprint index institution item keywords l layout layoutDesc lb lg licence listApp listChange listTranspose locus material measure metamark milestone mod monogr msContents msDesc msIdentifier msItem msItemStruct nameLink normalization note notesStmt num objectDesc opener orig p pb pc persName physDesc placeName profileDesc projectDesc provenance ptr pubPlace publicationStmt publisher q quote rdg rdgGrp redo ref reg relatedItem repository restore retrace revisionDesc rhyme rs s said salute secl seg settlement sic signed soCalled sourceDesc space span standOff supplied support supportDesc surface surname teiHeader term text textClass title titlePage titlePart titleStmt transpose unclear undo variantEncoding w wit witDetail witness xr zone]
Attributes
facs(facsimile) points to one or more images, portions of an image, or surfaces which correspond to the current element.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace

att.global.linking

att.global.linking provides a set of attributes for hypertextual linking. [16. Linking, Segmentation, and Alignment]
Modulelinking
Membersatt.global [TEI ab abbr accMat acquisition add addName addSpan addrLine address am anchor app author authority availability back bibl biblScope biblStruct binding bindingDesc body byline c change choice cit closer collation corr correspAction correspDesc creation date dateline decoNote del delSpan desc distributor div docAuthor docImprint docTitle editor editorialDecl emph encodingDesc epigraph ex expan extent facsimile fileDesc filiation foliation foreign forename front fw gap gb graphic handDesc handNote handNotes handShift head hi history idno imprint index institution item keywords l layout layoutDesc lb lg licence listApp listChange listTranspose locus material measure metamark milestone mod monogr msContents msDesc msIdentifier msItem msItemStruct nameLink normalization note notesStmt num objectDesc opener orig p pb pc persName physDesc placeName profileDesc projectDesc provenance ptr pubPlace publicationStmt publisher q quote rdg rdgGrp redo ref reg relatedItem repository restore retrace revisionDesc rhyme rs s said salute secl seg settlement sic signed soCalled sourceDesc space span standOff supplied support supportDesc surface surname teiHeader term text textClass title titlePage titlePart titleStmt transpose unclear undo variantEncoding w wit witDetail witness xr zone]
Attributes
corresp(corresponds) points to elements that correspond to the current element in some way.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
<group>
 <text xml:id="t1-g1-t1xml:lang="mi">
  <body xml:id="t1-g1-t1-body1">
   <div type="chapter">
    <head>He Whakamaramatanga mo te Ture Hoko, Riihi hoki, i nga Whenua Maori, 1876.</head>
    <p></p>
   </div>
  </body>
 </text>
 <text xml:id="t1-g1-t2xml:lang="en">
  <body corresp="#t1-g1-t1-body1"
   xml:id="t1-g1-t2-body1">

   <div type="chapter">
    <head>An Act to regulate the Sale, Letting, and Disposal of Native Lands, 1876.</head>
    <p></p>
   </div>
  </body>
 </text>
</group>

In this example a <group> contains two texts, each containing the same document in a different language. The correspondence is indicated using corresp. The language is indicated using xml:lang, whose value is inherited; both the tag with the corresp and the tag pointed to by the corresp inherit the value from their immediate parent.


<!-- In a placeography called "places.xml" --><place corresp="people.xml#LOND2 people.xml#GENI1"
 xml:id="LOND1">

 <placeName>London</placeName>
 <desc>The city of London...</desc>
</place>
<!-- In a literary personography called "people.xml" -->
<person corresp="places.xml#LOND1 #GENI1"
 xml:id="LOND2">

 <persName type="lit">London</persName>
 <note>
  <p>Allegorical character representing the city of <placeName ref="places.xml#LOND1">London</placeName>.</p>
 </note>
</person>
<person corresp="places.xml#LOND1 #LOND2"
 xml:id="GENI1">

 <persName type="lit">London’s Genius</persName>
 <note>
  <p>Personification of London’s genius. Appears as an
     allegorical character in mayoral shows.
  </p>
 </note>
</person>

In this example, a <place> element containing information about the city of London is linked with two <person> elements in a literary personography. This correspondence represents a slightly looser relationship than the one in the preceding example; there is no sense in which an allegorical character could be substituted for the physical city, or vice versa, but there is obviously a correspondence between them.

nextpoints to the next element of a virtual aggregate of which the current element is part.
Status Optional
Datatype teidata.pointer
Note

It is recommended that the element indicated be of the same type as the element bearing this attribute.

prev(previous) points to the previous element of a virtual aggregate of which the current element is part.
Status Optional
Datatype teidata.pointer
Note

It is recommended that the element indicated be of the same type as the element bearing this attribute.

att.global.rendition

att.global.rendition provides rendering attributes common to all elements in the TEI encoding scheme. [1.3.1.1.3. Rendition Indicators]
Moduletei
Membersatt.global [TEI ab abbr accMat acquisition add addName addSpan addrLine address am anchor app author authority availability back bibl biblScope biblStruct binding bindingDesc body byline c change choice cit closer collation corr correspAction correspDesc creation date dateline decoNote del delSpan desc distributor div docAuthor docImprint docTitle editor editorialDecl emph encodingDesc epigraph ex expan extent facsimile fileDesc filiation foliation foreign forename front fw gap gb graphic handDesc handNote handNotes handShift head hi history idno imprint index institution item keywords l layout layoutDesc lb lg licence listApp listChange listTranspose locus material measure metamark milestone mod monogr msContents msDesc msIdentifier msItem msItemStruct nameLink normalization note notesStmt num objectDesc opener orig p pb pc persName physDesc placeName profileDesc projectDesc provenance ptr pubPlace publicationStmt publisher q quote rdg rdgGrp redo ref reg relatedItem repository restore retrace revisionDesc rhyme rs s said salute secl seg settlement sic signed soCalled sourceDesc space span standOff supplied support supportDesc surface surname teiHeader term text textClass title titlePage titlePart titleStmt transpose unclear undo variantEncoding w wit witDetail witness xr zone]
Attributes
rend(rendition) indicates how the element in question was rendered or presented in the source text.
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
<head rend="align(center) case(allcaps)">
 <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle,
<lb/>On Her <lb/>
 <hi rend="case(mixed)">New Blazing-World</hi>.
</head>
Note

These Guidelines make no binding recommendations for the values of the rend attribute; the characteristics of visual presentation vary too much from text to text and the decision to record or ignore individual characteristics varies too much from project to project. Some potentially useful conventions are noted from time to time at appropriate points in the Guidelines. The values of the rend attribute are a set of sequence-indeterminate individual tokens separated by whitespace.

att.global.responsibility

att.global.responsibility provides attributes indicating the agent responsible for some aspect of the text, the markup or something asserted by the markup, and the degree of certainty associated with it. [1.3.1.1.4. Sources, certainty, and responsibility 3.5. Simple Editorial Changes 11.3.2.2. Hand, Responsibility, and Certainty Attributes 17.3. Spans and Interpretations 13.1.1. Linking Names and Their Referents]
Moduletei
Membersatt.global [TEI ab abbr accMat acquisition add addName addSpan addrLine address am anchor app author authority availability back bibl biblScope biblStruct binding bindingDesc body byline c change choice cit closer collation corr correspAction correspDesc creation date dateline decoNote del delSpan desc distributor div docAuthor docImprint docTitle editor editorialDecl emph encodingDesc epigraph ex expan extent facsimile fileDesc filiation foliation foreign forename front fw gap gb graphic handDesc handNote handNotes handShift head hi history idno imprint index institution item keywords l layout layoutDesc lb lg licence listApp listChange listTranspose locus material measure metamark milestone mod monogr msContents msDesc msIdentifier msItem msItemStruct nameLink normalization note notesStmt num objectDesc opener orig p pb pc persName physDesc placeName profileDesc projectDesc provenance ptr pubPlace publicationStmt publisher q quote rdg rdgGrp redo ref reg relatedItem repository restore retrace revisionDesc rhyme rs s said salute secl seg settlement sic signed soCalled sourceDesc space span standOff supplied support supportDesc surface surname teiHeader term text textClass title titlePage titlePart titleStmt transpose unclear undo variantEncoding w wit witDetail witness xr zone]
Attributes
cert(certainty) signifies the degree of certainty associated with the intervention or interpretation.
Status Optional
Datatype teidata.probCert
resp(responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

To reduce the ambiguity of a resp pointing directly to a person or organization, we recommend that resp be used to point not to an agent (<person> or <org>) but to a <respStmt>, author, editor or similar element which clarifies the exact role played by the agent. Pointing to multiple <respStmt>s allows the encoder to specify clearly each of the roles played in part of a TEI file (creating, transcribing, encoding, editing, proofing etc.).

Example
Blessed are the
<choice>
 <sic>cheesemakers</sic>
 <corr cert="highresp="#editor">peacemakers</corr>
</choice>: for they shall be called the children of God.
Example

<!-- in the <text> ... --><lg>
<!-- ... -->
 <l>Punkes, Panders, baſe extortionizing
   sla<choice>
   <sic>n</sic>
   <corr resp="#JENS1_transcriber">u</corr>
  </choice>es,</l>
<!-- ... -->
</lg>
<!-- in the <teiHeader> ... -->
<!-- ... -->
<respStmt xml:id="JENS1_transcriber">
 <resp when="2014">Transcriber</resp>
 <name>Janelle Jenstad</name>
</respStmt>

att.global.source

att.global.source provides attributes used by elements to point to an external source. [1.3.1.1.4. Sources, certainty, and responsibility 3.3.3. Quotation 8.3.4. Writing]
Moduletei
Membersatt.global [TEI ab abbr accMat acquisition add addName addSpan addrLine address am anchor app author authority availability back bibl biblScope biblStruct binding bindingDesc body byline c change choice cit closer collation corr correspAction correspDesc creation date dateline decoNote del delSpan desc distributor div docAuthor docImprint docTitle editor editorialDecl emph encodingDesc epigraph ex expan extent facsimile fileDesc filiation foliation foreign forename front fw gap gb graphic handDesc handNote handNotes handShift head hi history idno imprint index institution item keywords l layout layoutDesc lb lg licence listApp listChange listTranspose locus material measure metamark milestone mod monogr msContents msDesc msIdentifier msItem msItemStruct nameLink normalization note notesStmt num objectDesc opener orig p pb pc persName physDesc placeName profileDesc projectDesc provenance ptr pubPlace publicationStmt publisher q quote rdg rdgGrp redo ref reg relatedItem repository restore retrace revisionDesc rhyme rs s said salute secl seg settlement sic signed soCalled sourceDesc space span standOff supplied support supportDesc surface surname teiHeader term text textClass title titlePage titlePart titleStmt transpose unclear undo variantEncoding w wit witDetail witness xr zone]
Attributes
sourcespecifies the source from which some aspect of this element is drawn.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Schematron

<sch:rule context="tei:*[@source]">
<sch:let name="srcs"
 value="tokenize( normalize-space(@source),' ')"/>

<sch:report test="( self::tei:classRef | self::tei:dataRef | self::tei:elementRef | self::tei:macroRef | self::tei:moduleRef | self::tei:schemaSpec ) and $srcs[2]"> When used on a schema description element (like
<sch:value-of select="name(.)"/>), the @source attribute
should have only 1 value. (This one has <sch:value-of select="count($srcs)"/>.)
</sch:report>
</sch:rule>
Note

The source attribute points to an external source. When used on an element describing a schema component (<classRef>, <dataRef>, <elementRef>, <macroRef>, <moduleRef>, or <schemaSpec>), it identifies the source from which declarations for the components should be obtained.

On other elements it provides a pointer to the bibliographical source from which a quotation or citation is drawn.

In either case, the location may be provided using any form of URI, for example an absolute URI, a relative URI, a private scheme URI of the form tei:x.y.z, where x.y.z indicates the version number, e.g. tei:4.3.2 for TEI P5 release 4.3.2 or (as a special case) tei:current for whatever is the latest release, or a private scheme URI that is expanded to an absolute URI as documented in a <prefixDef>.

When used on elements describing schema components, source should have only one value; when used on other elements multiple values are permitted.

Example
<p>
<!-- ... --> As Willard McCarty (<bibl xml:id="mcc_2012">2012, p.2</bibl>) tells us, <quote source="#mcc_2012">‘Collaboration’ is a problematic and should be a contested
   term.</quote>
<!-- ... -->
</p>
Example
<p>
<!-- ... -->
 <quote source="#chicago_15_ed">Grammatical theories are in flux, and the more we learn, the
   less we seem to know.</quote>
<!-- ... -->
</p>
<!-- ... -->
<bibl xml:id="chicago_15_ed">
 <title level="m">The Chicago Manual of Style</title>,
<edition>15th edition</edition>. <pubPlace>Chicago</pubPlace>: <publisher>University of
   Chicago Press</publisher> (<date>2003</date>), <biblScope unit="page">p.147</biblScope>.

</bibl>
Example
<elementRef key="psource="tei:2.0.1"/>

Include in the schema an element named p available from the TEI P5 2.0.1 release.

Example
<schemaSpec ident="myODD"
 source="mycompiledODD.xml">

<!-- further declarations specifying the components required -->
</schemaSpec>

Create a schema using components taken from the file mycompiledODD.xml.

att.handFeatures

att.handFeatures provides attributes describing aspects of the hand in which a manuscript is written. [11.3.2.1. Document Hands]
Moduletei
MembershandNote handShift
Attributes
scribegives a name or other identifier for the scribe believed to be responsible for this hand.
Status Optional
Datatype teidata.name
scribeRefpoints to a full description of the scribe concerned, typically supplied by a <person> element elsewhere in the description.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
scriptcharacterizes the particular script or writing style used by this hand, for example secretary, copperplate, Chancery, Italian, etc.
Status Optional
Datatype 1–∞ occurrences of teidata.name separated by whitespace
scriptRefpoints to a full description of the script or writing style used by this hand, typically supplied by a <scriptNote> element elsewhere in the description.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
mediumdescribes the tint or type of ink, e.g. brown, or other writing medium, e.g. pencil
Status Optional
Datatype 1–∞ occurrences of teidata.enumerated separated by whitespace
scopespecifies how widely this hand is used in the manuscript.
Status Optional
Datatype teidata.enumerated
Legal values are:
sole
only this hand is used throughout the manuscript
major
this hand is used through most of the manuscript
minor
this hand is used occasionally in the manuscript
Note

Usually either script or scriptRef, and similarly, either scribe or scribeRef, will be supplied.

att.interpLike

att.interpLike provides attributes for elements which represent a formal analysis or interpretation. [17.2. Global Attributes for Simple Analyses]
Moduletei
Membersspan
Attributes
typeindicates what kind of phenomenon is being noted in the passage.
Status Recommended
Datatype teidata.enumerated
Sample values include:
image
identifies an image in the passage.
character
identifies a character associated with the passage.
theme
identifies a theme in the passage.
allusion
identifies an allusion to another text.
subtype(subtype) provides a sub-categorization of the phenomenon is being noted in the passage, if needed
Status Optional
Datatype teidata.enumerated
Note

The subtype attribute may be used to provide any sub-classification for the element additional to that provided by its type attribute.

inst(instances) points to instances of the analysis or interpretation represented by the current element.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

The current element should be an analytic one. The element pointed at should be a textual one.

att.lexicographic

att.lexicographic provides a set of attributes for specifying standard and normalized values, grammatical functions, alternate or equivalent forms, and information about composite parts. [9.2. The Structure of Dictionary Entries]
Moduledictionaries
Membersxr
Attributes
expand(expand) gives an expanded form of information presented more concisely in the dictionary
Status Optional
Datatype teidata.text
<gramGrp>
 <pos expand="noun">n</pos>
</gramGrp>
split(split) gives the list of split values for a merged form
Status Optional
Datatype teidata.text
value(value) gives a value which lacks any realization in the printed source text.
Status Optional
Datatype teidata.text
location(location) indicates an anchor element typically elsewhere in the document, but possibly in another document, which is the original location of this component.
Status Optional
Datatype teidata.pointer
mergedIn(merged into) gives a reference to another element, where the original appears as a merged form.
Status Optional
Datatype teidata.pointer
opt(optional) indicates whether the element is optional or not
Status Optional
Datatype teidata.truthValue
Default false

att.lexicographic.normalized

att.lexicographic.normalized provides attributes for usage within word-level elements in the analysis module and within lexicographic microstructure in the dictionaries module.
Moduleanalysis
Membersatt.lexicographic [xr] att.linguistic [pc w]
Attributes
norm(normalized) provides the normalized/standardized form of information present in the source text in a non-normalized form
Status Optional
Datatype teidata.text

Normalization of part-of-speech information within a dictionary entry.

<gramGrp>
 <pos norm="noun">n</pos>
</gramGrp>

Normalization of a source form in a tokenized historical corpus.

<s>
 <w>for</w>
 <w norm="virtue's">vertues</w>
 <w>sake</w>
</s>
<s>
 <w norm="persuasion">perswasion</w>
 <w>of</w>
 <w norm="Unity">Vnitie</w>
</s>

Example of normalization from Aviso. Relation oder Zeitung. Wolfenbüttel, 1609. In: Deutsches Textarchiv.

<s>
 <w norm="freiwillig">freywillig</w>
 <pc join="leftnorm=",">/</pc>
 <w norm="unbedrängt">vnbedraͤngt</w>
 <w norm="und">vnd</w>
 <w norm="unverhindert">vnuerhindert</w>
</s>
<w norm="Teil">Theyll</w>
<w norm="Freude">Frewde</w>
orig(original) gives the original string or is the empty string when the element does not appear in the source text.
Status Optional
Datatype teidata.text

Example from a language documentation project of the Mixtepec-Mixtec language (ISO 639-3: 'mix'). This is a use case where speakers spell something incorrectly but we would like to preserve it for any number of reasons, the use of orig is essential and could have uses for both the speaker to see past mistakes, researchers to get insight into how untrained speakers write their language instinctually (in contrast to prescribed convention), etc.:

<w orig="ntsa sia'i">ntsasia'i</w>

Example from the EarlyPrint project. Fragment of text where obvious errors have been corrected but the original forms remain recorded:

<w lemma="hepos="pns"
 xml:id="b1afj-003-a-0950">
he</w>
<w lemma="havepos="vvz"
 xml:id="b1afj-003-a-0960">
hath</w>
<w lemma="bringpos="vvn"
 xml:id="b1afj-003-a-0970">
brought</w>
<w lemma="forthorig="sorthpos="av"
 xml:id="b1afj-003-a-0980">
forth</w>

An example from the EarlyPrint project showing the use of both norm and orig. The orig attribute preserves the original version (sometimes with spelling errors, often with printer abbreviations), the element content resolves printer abbreviations but retains the original orthography, and the norm attribute holds normalized values:

<w lemma="commandmentnorm="commandment"
 orig="commandemētpos="n1xml:id="b9avr-018-a-7720">
commandement</w>
Note

It needs to be stressed that the two attributes in this class are meant for strictly lexicographic and linguistic uses, and not for editorial interventions. For the latter, the mechanism based on choice, orig, and reg needs to be employed.

att.linguistic

att.linguistic provides a set of attributes concerning linguistic features of tokens, for usage within token-level elements, specifically w and pc in the analysis module. [17.4.2. Lightweight Linguistic Annotation]
Moduleanalysis
Memberspc w
Attributes
lemmaprovides a lemma (base form) for the word, typically uninflected and serving both as an identifier (e.g. in dictionary contexts, as a headword), and as a basis for potential inflections.
Status Optional
Datatype teidata.text
<w lemma="wife">wives</w>
<w lemma="Arznei">Artzeneyen</w>
lemmaRefprovides a pointer to a definition of the lemma for the word, for example in an online lexicon.
Status Optional
Datatype teidata.pointer
<w lemma="hit"
 lemmaRef="http://www.example.com/lexicon/hitvb.xmltype="verb">
hitt<m type="suffix">ing</m>
</w>
pos(part of speech) indicates the part of speech assigned to a token (i.e. information on whether it is a noun, adjective, or verb), usually according to some official reference vocabulary (e.g. for German: STTS, for English: CLAWS, for Polish: NKJP, etc.).
Status Optional
Datatype teidata.text

The German sentence ‘Wir fahren in den Urlaub.’ tagged with the Stuttgart-Tuebingen-Tagset (STTS).

<s>
 <w pos="PPER">Wir</w>
 <w pos="VVFIN">fahren</w>
 <w pos="APPR">in</w>
 <w pos="ART">den</w>
 <w pos="NN">Urlaub</w>
 <w pos="$.">.</w>
</s>

The English sentence ‘We're going to Brazil.’ tagged with the CLAWS-5 tagset, arranged inline (with significant whitespace).

<p><w pos="PNP">We</w><w pos="VBB">'re</w> <w pos="VVG">going</w> <w pos="PRP">to</w> <w pos="NP0">Brazil</w><pc pos="PUN">.</pc></p>         

The English sentence ‘We're going on vacation to Brazil for a month!’ tagged with the CLAWS-7 tagset and arranged sequentially.

<p>
 <w pos="PPIS2">We</w>
 <w pos="VBR">'re</w>
 <w pos="VVG">going</w>
 <w pos="II">on</w>
 <w pos="NN1">vacation</w>
 <w pos="II">to</w>
 <w pos="NP1">Brazil</w>
 <w pos="IF">for</w>
 <w pos="AT1">a</w>
 <w pos="NNT1">month</w>
 <pc pos="!">!</pc>
</p>
msd(morphosyntactic description) supplies morphosyntactic information for a token, usually according to some official reference vocabulary (e.g. for German: STTS-large tagset; for a feature description system designed as (pragmatically) universal, see Universal Features).
Status Optional
Datatype teidata.text
<ab>
 <w msd="1.Pl.*.Nompos="PPER">Wir</w>
 <w msd="1.Pl.Pres.Indpos="VVFIN">fahren</w>
 <w msd="--pos="APPR">in</w>
 <w msd="Def.Masc.Akk.Sgpos="ART">den</w>
 <w msd="Masc.Akk.Sgpos="NN">Urlaub</w>
 <pc msd="--pos="$.">.</pc>
</ab>
joinwhen present, provides information on whether the token in question is adjacent to another, and if so, on which side.
Status Optional
Datatype teidata.text
Legal values are:
no
the token is not adjacent to another
left
there is no whitespace on the left side of the token
right
there is no whitespace on the right side of the token
both
there is no whitespace on either side of the token
overlap
the token overlaps with another; other devices (specifying the extent and the area of overlap) are needed to more precisely locate this token in the character stream

The example below assumes that the lack of whitespace is marked redundantly, by using the appropriate values of join.

<s>
 <pc join="right">"</pc>
 <w join="left">Friends</w>
 <w>will</w>
 <w>be</w>
 <w join="right">friends</w>
 <pc join="both">.</pc>
 <pc join="left">"</pc>
</s>

Note that a project may make a decision to only indicate lack of whitespace in one direction, or do that non-redundantly. The existing proposal is the broadest possible, on the assumption that we adopt the "streamable view", where all the information on the current element needs to be represented locally.

The English sentence ‘We're going on vacation.’ tagged with the CLAWS-5 tagset, arranged sequentially, tagged on the assumption that only the lack of the preceding whitespace is indicated.

<p>
 <w pos="PNP">We</w>
 <w join="leftpos="VBB">'re</w>
 <w pos="VVG">going</w>
 <w pos="PRP">on</w>
 <w pos="NN1">vacation</w>
 <pc join="leftpos="PUN">.</pc>
</p>
Note

The definition of this attribute is adapted from ISO MAF (Morpho-syntactic Annotation Framework), ISO 24611:2012.

Note

These attributes make it possible to encode simple language corpora and to add a layer of linguistic information to any tokenized resource. See section 17.4.2. Lightweight Linguistic Annotation for discussion.

att.measurement

att.measurement provides attributes to represent a regularized or normalized measurement.
Moduletei
Membersmeasure
Attributes
unit(unit) indicates the units used for the measurement, usually using the standard symbol for the desired units.
Status Optional
Datatype teidata.enumerated
Suggested values include:
m
(metre) SI base unit of length
kg
(kilogram) SI base unit of mass
s
(second) SI base unit of time
Hz
(hertz) SI unit of frequency
Pa
(pascal) SI unit of pressure or stress
Ω
(ohm) SI unit of electric resistance
L
(litre) 1 dm³
t
(tonne) 10³ kg
ha
(hectare) 1 hm²
Å
(ångström) 10⁻¹⁰ m
mL
(millilitre)
cm
(centimetre)
dB
(decibel) see remarks, below
kbit
(kilobit) 10³ or 1000 bits
Kibit
(kibibit) 2¹⁰ or 1024 bits
kB
(kilobyte) 10³ or 1000 bytes
KiB
(kibibyte) 2¹⁰ or 1024 bytes
MB
(megabyte) 10⁶ or 1 000 000 bytes
MiB
(mebibyte) 2²⁰ or 1 048 576 bytes
Note

If the measurement being represented is not expressed in a particular unit, but rather is a number of discrete items, the unit count should be used, or the unit attribute may be left unspecified.

Wherever appropriate, a recognized SI unit name should be used (see further http://www.bipm.org/en/publications/si-brochure/; http://physics.nist.gov/cuu/Units/). The list above is indicative rather than exhaustive.

unitRefpoints to a unique identifier stored in the xml:id of a <unitDef> element that defines a unit of measure.
Status Optional
Datatype teidata.pointer
quantity(quantity) specifies the number of the specified units that comprise the measurement
Status Optional
Datatype teidata.numeric
commodity(commodity) indicates the substance that is being measured
Status Optional
Datatype 1–∞ occurrences of teidata.word separated by whitespace
Note

In general, when the commodity is made of discrete entities, the plural form should be used, even when the measurement is of only one of them.

Schematron

<sch:rule context="tei:*[@unitRef]">
<sch:report role="infotest="@unit">The @unit attribute may be unnecessary when @unitRef is present.</sch:report>
</sch:rule>
Note
This attribute class provides a triplet of attributes that may be used either to regularize the values of the measurement being encoded, or to normalize them with respect to a standard measurement system.
<l>So weren't you gonna buy <measure commodity="ice cream"
  quantity="0.5unit="gal">
half
   a gallon</measure>, baby</l>
<l>So won't you go and buy <measure commodity="ice cream"
  quantity="1.893unit="L">
half
   a gallon</measure>, baby?</l>

The unit should normally be named using the standard symbol for an SI unit (see further http://www.bipm.org/en/publications/si-brochure/; http://physics.nist.gov/cuu/Units/). However, encoders may also specify measurements using informally defined units such as lines or characters.

att.media

att.media provides attributes for specifying display and related properties of external media.
Moduletei
Membersgraphic
Attributes
widthWhere the media are displayed, indicates the display width
Status Optional
Datatype teidata.outputMeasurement
heightWhere the media are displayed, indicates the display height
Status Optional
Datatype teidata.outputMeasurement
scaleWhere the media are displayed, indicates a scale factor to be applied when generating the desired display size
Status Optional
Datatype teidata.numeric

att.metrical

att.metrical defines a set of attributes that certain elements may use to represent metrical information. [6.4. Rhyme and Metrical Analysis]
Moduleverse
Membersatt.divLike [div lg] att.segLike [c pc s seg w] l
Attributes
met(metrical structure, conventional) contains a user-specified encoding for the conventional metrical structure of the element.
Status Recommended
Datatype token
Note

The pattern may be specified by means of either a standard term for the kind of metrical unit (e.g. hexameter) or an encoded representation for the metrical pattern (e.g. +--+-+-+-+-). In either case, the notation used should be documented by a <metDecl> element within the encodingDesc of the associated header.

Where this attribute is not specified, the metrical pattern for the element concerned is understood to be inherited from its parent.

real(metrical structure, realized) contains a user-specified encoding for the actual realization of the conventional metrical structure applicable to the element.
Status Optional
Datatype token
Note

The pattern may be specified by means of either a standard term for the kind of metrical unit (e.g. hexameter) or an encoded representation for the metrical pattern (e.g. +--+-+-+-+-). In either case, the notation used should be documented by a <metDecl> element within the encodingDesc of the associated header.

Where this attribute is not specified, the metrical realization for the element concerned is understood to be identical to that specified or implied for the met attribute.

rhyme(rhyme scheme) specifies the rhyme scheme applicable to a group of verse lines.
Status Recommended
Datatype token
Note

By default, the rhyme scheme is expressed as a string of alphabetic characters each corresponding with a rhyming line. Any non-rhyming lines should be represented by a hyphen or an X. Alternative notations may be defined as for met by use of the <metDecl> element in the TEI header.

When the default notation is used, it does not make sense to specify this attribute on any unit smaller than a line. Nor does the default notation provide any way to record internal rhyme, or to specify non-conventional rhyming practice. These extensions would require user-defined alternative notations.

att.milestoneUnit

att.milestoneUnit provides attributes to indicate the type of section which is changing at a specific milestone. [3.11.3. Milestone Elements 2.3.6.3. Milestone Method 2.3.6. The Reference System Declaration]
Modulecore
Membersmilestone
Attributes
unitprovides a conventional name for the kind of section changing at this milestone.
Status Required
Datatype teidata.enumerated
Suggested values include:
page
physical page breaks (synonymous with the pb element).
column
column breaks.
line
line breaks (synonymous with the lb element).
book
any units termed book, liber, etc.
poem
individual poems in a collection.
canto
cantos or other major sections of a poem.
speaker
changes of speaker or narrator.
stanza
stanzas within a poem, book, or canto.
act
acts within a play.
scene
scenes within a play or act.
section
sections of any kind.
absent
passages not present in the reference edition.
unnumbered
passages present in the text, but not to be included as part of the reference.
<milestone ed="Lan="23unit="Dreissiger"/>
... <milestone ed="AVn="24unit="verse"/> ...
Note

If the milestone marks the beginning of a piece of text not present in the reference edition, the special value absent may be used as the value of unit. The normal interpretation is that the reference edition does not contain the text which follows, until the next milestone tag for the edition in question is encountered.

In addition to the values suggested, other terms may be appropriate (e.g. Stephanus for the Stephanus numbers in Plato).

The type attribute may be used to characterize the unit boundary in any respect other than simply identifying the type of unit, for example as word-breaking or not.

att.msClass

att.msClass provides attributes to indicate text type or classification. [10.6. Intellectual Content 10.6.1. The msItem and msItemStruct Elements]
Modulemsdescription
MembersmsContents msItem msItemStruct
Attributes
classidentifies the text types or classifications applicable to this item by pointing to other elements or resources defining the classification concerned.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace

att.msExcerpt

att.msExcerpt (manuscript excerpt) provides attributes used to describe excerpts from a manuscript placed in a description thereof. [10.6. Intellectual Content]
Modulemsdescription
MembersmsContents msItem msItemStruct quote
Attributes
defectiveindicates whether the passage being quoted is defective, i.e. incomplete through loss or damage.
Status Optional
Datatype teidata.xTruthValue
Note

In the case of an incipit, indicates whether the incipit as given is defective, i.e. the first words of the text as preserved, as opposed to the first words of the work itself. In the case of an explicit, indicates whether the explicit as given is defective, i.e. the final words of the text as preserved, as opposed to what the closing words would have been had the text of the work been whole.

att.naming

att.naming provides attributes common to elements which refer to named persons, places, organizations etc. [3.6.1. Referring Strings 13.3.7. Names and Nyms]
Moduletei
Membersatt.personal [addName forename persName placeName surname] author editor institution pubPlace repository rs settlement
Attributes
rolemay be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place.
Status Optional
Datatype 1–∞ occurrences of teidata.enumerated separated by whitespace
nymRef(reference to the canonical name) provides a means of locating the canonical form (nym) of the names associated with the object named by the element bearing it.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

The value must point directly to one or more XML elements by means of one or more URIs, separated by whitespace. If more than one is supplied, the implication is that the name is associated with several distinct canonical names.

att.notated

att.notated provides attributes to indicate any specialised notation used for element content.
Moduletei
Membersc quote s seg w
Attributes
notationnames the notation used for the content of the element.
Status Optional
Datatype teidata.enumerated

att.personal

att.personal (attributes for components of names usually, but not necessarily, personal names) common attributes for those elements which form part of a name usually, but not necessarily, a personal name. [13.2.1. Personal Names]
Moduletei
MembersaddName forename persName placeName surname
Attributes
fullindicates whether the name component is given in full, as an abbreviation or simply as an initial.
Status Optional
Datatype teidata.enumerated
Legal values are:
yes
(yes) the name component is spelled out in full. [Default]
abb
(abbreviated) the name component is given in an abbreviated form.
init
(initial letter) the name component is indicated only by one initial.
sort(sort) specifies the sort order of the name component in relation to others within the name.
Status Optional
Datatype teidata.count

att.placement

att.placement provides attributes for describing where on the source page or object a textual element appears. [3.5.3. Additions, Deletions, and Omissions 11.3.1.4. Additions and Deletions]
Moduletei
Membersadd addSpan fw head metamark note witDetail
Attributes
placespecifies where this item is placed.
Status Recommended
Datatype 1–∞ occurrences of teidata.enumerated separated by whitespace
Suggested values include:
top
at the top of the page
bottom
at the foot of the page
margin
in the margin (left, right, or both)
opposite
on the opposite, i.e. facing, page
overleaf
on the other side of the leaf
above
above the line
right
to the right, e.g. to the right of a vertical line of text, or to the right of a figure
below
below the line
left
to the left, e.g. to the left of a vertical line of text, or to the left of a figure
end
at the end of e.g. chapter or volume.
inline
within the body of the text.
inspace
in a predefined space, for example left by an earlier scribe.
<add place="margin">[An addition written in the margin]</add>
<add place="bottom opposite">[An addition written at the
foot of the current page and also on the facing page]</add>
<note place="bottom">Ibid, p.7</note>

att.pointing

att.pointing provides a set of attributes used by all elements which point to other elements by means of one or more URI references. [1.3.1.1.2. Language Indicators 3.7. Simple Links and Cross-References]
Moduletei
Memberslicence locus note ptr ref span term witDetail
Attributes
targetLangspecifies the language of the content to be found at the destination referenced by target, using a ‘language tag’ generated according to BCP 47.
Status Optional
Datatype teidata.language
Schematron

<sch:rule context="tei:*[not(self::tei:schemaSpec)][@targetLang]">
<sch:assert test="@target">@targetLang should only be used on <sch:name/> if @target is specified.</sch:assert>
</sch:rule>
<linkGrp xml:id="pol-swh_aln_2.1-linkGrp">
 <ptr target="pol/UDHR/text.xml#pol_txt_1-head"
  targetLang="pltype="tuvxml:id="pol-swh_aln_2.1.1-ptr"/>

 <ptr target="swh/UDHR/text.xml#swh_txt_1-head"
  targetLang="swtype="tuvxml:id="pol-swh_aln_2.1.2-ptr"/>

</linkGrp>

In the example above, the <linkGrp> combines pointers at parallel fragments of the Universal Declaration of Human Rights: one of them is in Polish, the other in Swahili.

Note

The value must conform to BCP 47. If the value is a private use code (i.e., starts with x- or contains -x-), a <language> element with a matching value for its ident attribute should be supplied in the TEI header to document this value. Such documentation may also optionally be supplied for non-private-use codes, though these must remain consistent with their (IETF)Internet Engineering Task Force definitions.

targetspecifies the destination of the reference by supplying one or more URI References
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

One or more syntactically valid URI references, separated by whitespace. Because whitespace is used to separate URIs, no whitespace is permitted inside a single URI. If a whitespace character is required in a URI, it should be escaped with the normal mechanism, e.g. TEI%20Consortium.

evaluate(evaluate) specifies the intended meaning when the target of a pointer is itself a pointer.
Status Optional
Datatype teidata.enumerated
Legal values are:
all
if the element pointed to is itself a pointer, then the target of that pointer will be taken, and so on, until an element is found which is not a pointer.
one
if the element pointed to is itself a pointer, then its target (whether a pointer or not) is taken as the target of this pointer.
none
no further evaluation of targets is carried out beyond that needed to find the element specified in the pointer's target.
Note

If no value is given, the application program is responsible for deciding (possibly on the basis of user input) how far to trace a chain of pointers.

att.ranging

att.ranging provides attributes for describing numerical ranges.
Moduletei
Membersatt.dimensions [add addSpan date del delSpan ex gap mod redo restore retrace secl space supplied unclear undo] measure num
Attributes
atLeastgives a minimum estimated value for the approximate measurement.
Status Optional
Datatype teidata.numeric
atMostgives a maximum estimated value for the approximate measurement.
Status Optional
Datatype teidata.numeric
minwhere the measurement summarizes more than one observation or a range, supplies the minimum value observed.
Status Optional
Datatype teidata.numeric
maxwhere the measurement summarizes more than one observation or a range, supplies the maximum value observed.
Status Optional
Datatype teidata.numeric
confidencespecifies the degree of statistical confidence (between zero and one) that a value falls within the range specified by min and max, or the proportion of observed values that fall within that range.
Status Optional
Datatype teidata.probability
Example
The MS. was lost in transmission by mail from <del rend="overstrike">
 <gap atLeast="1atMost="2"
  extent="one or two lettersreason="illegibleunit="chars"/>

</del> Philadelphia to the Graphic office, New York.
Example
Americares has been supporting the health sector in Eastern
Europe since 1986, and since 1992 has provided <measure atLeast="120000000"
 commodity="currencyunit="USD">
more than
$120m</measure> in aid to Ukrainians.

att.rdgPart

att.rdgPart provides attributes to mark the beginning or ending of a fragmentary manuscript or other witness. [12.1.5. Fragmentary Witnesses]
Moduletextcrit
Memberswit
Attributes
wit(witness or witnesses) contains a space-delimited list of one or more sigla indicating the witnesses to this reading beginning or ending at this point.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

These elements may appear anywhere within the elements <lem> and rdg, and also within any of their constituent elements.

att.resourced

att.resourced provides attributes by which a resource (such as an externally held media file) may be located.
Moduletei
Membersgraphic
Attributes
url(uniform resource locator) specifies the URL from which the media concerned may be obtained.
Status Required
Datatype teidata.pointer

att.segLike

att.segLike provides attributes for elements used for arbitrary segmentation. [16.3. Blocks, Segments, and Anchors 17.1. Linguistic Segment Categories]
Moduletei
Membersc pc s seg w
Attributes
function(function) characterizes the function of the segment.
Status Optional
Datatype teidata.enumerated
Note

Attribute values will often vary depending on the type of element to which they are attached. For example, a <cl>, may take values such as coordinate, subject, adverbial etc. For a <phr>, such values as subject, predicate etc. may be more appropriate. Such constraints will typically be implemented by a project-defined customization.

att.sortable

att.sortable provides attributes for elements in lists or groups that are sortable, but whose sorting key cannot be derived mechanically from the element content. [9.1. Dictionary Body and Overall Structure]
Moduletei
Membersbibl biblStruct correspAction idno item listApp listChange msDesc term witness
Attributes
sortKeysupplies the sort key for this element in an index, list or group which contains it.
Status Optional
Datatype teidata.word
David's other principal backer, Josiah
ha-Kohen <index indexName="NAMES">
 <term sortKey="Azarya_Josiah_Kohen">Josiah ha-Kohen b. Azarya</term>
</index> b. Azarya, son of one of the last gaons of Sura was David's own first
cousin.
Note

The sort key is used to determine the sequence and grouping of entries in an index. It provides a sequence of characters which, when sorted with the other values, will produced the desired order; specifics of sort key construction are application-dependent

Dictionary order often differs from the collation sequence of machine-readable character sets; in English-language dictionaries, an entry for 4-H will often appear alphabetized under ‘fourh’, and McCoy may be alphabetized under ‘maccoy’, while A1, A4, and A5 may all appear in numeric order ‘alphabetized’ between ‘a-’ and ‘AA’. The sort key is required if the orthography of the dictionary entry does not suffice to determine its location.

att.spanning

att.spanning provides attributes for elements which delimit a span of text by pointing mechanisms rather than by enclosing it. [11.3.1.4. Additions and Deletions 1.3.1. Attribute Classes]
Moduletei
MembersaddSpan delSpan gb index lb metamark milestone mod pb redo retrace undo
Attributes
spanToindicates the end of a span initiated by the element bearing this attribute.
Status Optional
Datatype teidata.pointer
SchematronThe @spanTo attribute must point to an element following the current element

<sch:rule context="tei:*[@spanTo]">
<sch:assert test="id(substring(@spanTo,2)) and following::*[@xml:id=substring(current()/@spanTo,2)]">The element indicated by @spanTo (<sch:value-of select="@spanTo"/>) must follow the current element <sch:name/>
</sch:assert>
</sch:rule>
Note

The span is defined as running in document order from the start of the content of the pointing element to the end of the content of the element pointed to by the spanTo attribute (if any). If no value is supplied for the attribute, the assumption is that the span is coextensive with the pointing element. If no content is present, the assumption is that the starting point of the span is immediately following the element itself.

att.textCritical

att.textCritical defines a set of attributes common to all elements representing variant readings in text critical work. [12.1. The Apparatus Entry, Readings, and Witnesses]
Moduletextcrit
Membersrdg rdgGrp
Attributes
typeclassifies the reading according to some useful typology.
Status Optional
Datatype teidata.enumerated
Sample values include:
substantive
(substantive) the reading offers a substantive variant.
orthographic
(orthographic) the reading differs only orthographically, not in substance, from other readings.
causeclassifies the cause for the variant reading, according to any appropriate typology of possible origins.
Status Optional
Datatype teidata.enumerated
Sample values include:
homeoteleuton
homeoarchy
paleographicConfusion
haplography
dittography
falseEmendation
varSeq(variant sequence) provides a number indicating the position of this reading in a sequence, when there is reason to presume a sequence to the variants.
Status Optional
Datatype teidata.count
Note

Different variant sequences could be coded with distinct number trails: 1-2-3 for one sequence, 5-6-7 for another. More complex variant sequences, with (for example) multiple branchings from single readings, may be expressed through the <join> element.

requirepoints to other readings that are required when adopting the current reading or lemma.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

This element class defines attributes inherited by rdg, <lem>, and rdgGrp.

att.timed

att.timed provides attributes common to those elements which have a duration in time, expressed either absolutely or by reference to an alignment map. [8.3.5. Temporal Information]
Moduletei
Membersgap
Attributes
startindicates the location within a temporal alignment at which this element begins.
Status Optional
Datatype teidata.pointer
Note

If no value is supplied, the element is assumed to follow the immediately preceding element at the same hierarchic level.

endindicates the location within a temporal alignment at which this element ends.
Status Optional
Datatype teidata.pointer
Note

If no value is supplied, the element is assumed to precede the immediately following element at the same hierarchic level.

att.transcriptional

att.transcriptional provides attributes specific to elements encoding authorial or scribal intervention in a text when transcribing manuscript or similar sources. [11.3.1.4. Additions and Deletions]
Moduletei
Membersadd addSpan del delSpan mod redo restore retrace undo
Attributes
statusindicates the effect of the intervention, for example in the case of a deletion, strikeouts which include too much or too little text, or in the case of an addition, an insertion which duplicates some of the text already present.
Status Optional
Datatype teidata.enumerated
Sample values include:
duplicate
all of the text indicated as an addition duplicates some text that is in the original, whether the duplication is word-for-word or less exact.
duplicate-partial
part of the text indicated as an addition duplicates some text that is in the original
excessStart
some text at the beginning of the deletion is marked as deleted even though it clearly should not be deleted.
excessEnd
some text at the end of the deletion is marked as deleted even though it clearly should not be deleted.
shortStart
some text at the beginning of the deletion is not marked as deleted even though it clearly should be.
shortEnd
some text at the end of the deletion is not marked as deleted even though it clearly should be.
partial
some text in the deletion is not marked as deleted even though it clearly should be.
unremarkable
the deletion is not faulty. [Default]
Note

Status information on each deletion is needed rather rarely except in critical editions from authorial manuscripts; status information on additions is even less common.

Marking a deletion or addition as faulty is inescapably an interpretive act; the usual test applied in practice is the linguistic acceptability of the text with and without the letters or words in question.

causedocuments the presumed cause for the intervention.
Status Optional
Datatype teidata.enumerated
seq(sequence) assigns a sequence number related to the order in which the encoded features carrying this attribute are believed to have occurred.
Status Optional
Datatype teidata.count

att.typed

att.typed provides attributes that can be used to classify or subclassify elements in any way. [1.3.1. Attribute Classes 17.1.1. Words and Above 3.6.1. Referring Strings 3.7. Simple Links and Cross-References 3.6.5. Abbreviations and Their Expansions 3.13.1. Core Tags for Verse 7.2.5. Speech Contents 4.1.1. Un-numbered Divisions 4.1.2. Numbered Divisions 4.2.1. Headings and Trailers 4.4. Virtual Divisions 13.3.2.3. Personal Relationships 11.3.1.1. Core Elements for Transcriptional Work 16.1.1. Pointers and Links 16.3. Blocks, Segments, and Anchors 12.2. Linking the Apparatus to the Text 22.5.1.2. Defining Content Models: RELAX NG 8.3. Elements Unique to Spoken Texts 23.3.1.3. Modification of Attribute and Attribute Value Lists]
Moduletei
MembersTEI ab abbr accMat add addName addSpan am anchor app bibl biblStruct c change cit corr correspAction correspDesc date decoNote del delSpan desc div filiation forename fw gb graphic head idno lb lg listApp listChange locus material measure milestone mod msDesc nameLink note num pb pc persName placeName provenance ptr quote ref reg relatedItem restore rhyme rs s seg settlement space standOff surface surname term text title titlePage titlePart w witDetail xr zone
Attributes
typecharacterizes the element in some sense, using any convenient classification scheme or typology.
Status Optional
Datatype teidata.enumerated
<div type="verse">
 <head>Night in Tarras</head>
 <lg type="stanza">
  <l>At evening tramping on the hot white road</l>
  <l></l>
 </lg>
 <lg type="stanza">
  <l>A wind sprang up from nowhere as the sky</l>
  <l></l>
 </lg>
</div>
Note

The type attribute is present on a number of elements, not all of which are members of att.typed, usually because these elements restrict the possible values for the attribute in a specific way.

subtype(subtype) provides a sub-categorization of the element, if needed
Status Optional
Datatype teidata.enumerated
Note

The subtype attribute may be used to provide any sub-classification for the element additional to that provided by its type attribute.

Schematron

<sch:rule context="tei:*[@subtype]">
<sch:assert test="@type">The <sch:name/> element should not be categorized in detail with @subtype unless also categorized in general with @type</sch:assert>
</sch:rule>
Note

When appropriate, values from an established typology should be used. Alternatively a typology may be defined in the associated TEI header. If values are to be taken from a project-specific list, this should be defined using the <valList> element in the project-specific schema description, as described in 23.3.1.3. Modification of Attribute and Attribute Value Lists .

att.witnessed

att.witnessed provides attributes used to identify the witnesses supporting a particular reading in a critical apparatus. [12.1. The Apparatus Entry, Readings, and Witnesses]
Moduletextcrit
Membersrdg
Attributes
wit(witness or witnesses) contains a space-delimited list of one or more pointers indicating the witnesses which attest to a given reading.
Status Optional
Datatype 1–∞ occurrences of teidata.pointer separated by whitespace
Note

If the apparatus contains readings only for a single witness, this attribute may be consistently omitted.

This attribute may occur both within an apparatus gathering variant readings in the transcription of an individual witness and within an apparatus gathering readings from different witnesses.

Additional descriptions or alternative versions of the sigla referenced may be supplied as the content of a child wit element.

att.written

att.written provides attributes to indicate the hand in which the content of an element was written in the source being transcribed. [1.3.1. Attribute Classes]
Moduletei
Membersatt.textCritical [rdg rdgGrp] att.transcriptional [add addSpan del delSpan mod redo restore retrace undo] ab closer div fw head hi note opener p salute seg signed text zone
Attributes
handpoints to a handNote element describing the hand considered responsible for the content of the element concerned.
Status Optional
Datatype teidata.pointer

Schema tei_lite: Datatypes

teidata.certainty

teidata.certainty defines the range of attribute values expressing a degree of certainty.
Moduletei
Used by
Content model
<content>
 <valList type="closed">
  <valItem ident="high"/>
  <valItem ident="medium"/>
  <valItem ident="low"/>
  <valItem ident="unknown"/>
 </valList>
</content>
Declaration
tei_teidata.certainty = "high" | "medium" | "low" | "unknown"
Note

Certainty may be expressed by one of the predefined symbolic values high, medium, or low. The value unknown should be used in cases where the encoder does not wish to assert an opinion about the matter.

teidata.count

teidata.count defines the range of attribute values used for a non-negative integer value used as a count.
Moduletei
Used by
Element:
Content model
<content>
 <dataRef name="nonNegativeInteger"/>
</content>
Declaration
tei_teidata.count = xsd:nonNegativeInteger
Note

Any positive integer value or zero is permitted

teidata.duration.iso

teidata.duration.iso defines the range of attribute values available for representation of a duration in time using ISO 8601 standard formats
Moduletei
Used by
Content model
<content>
 <dataRef name="token"
  restriction="[0-9.,DHMPRSTWYZ/:+\-]+"/>

</content>
Declaration
tei_teidata.duration.iso = token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" }
Example
<time dur-iso="PT0,75H">three-quarters of an hour</time>
Example
<date dur-iso="P1,5D">a day and a half</date>
Example
<date dur-iso="P14D">a fortnight</date>
Example
<time dur-iso="PT0.02S">20 ms</time>
Note

A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the last, which may have a decimal component (using either . or , as the decimal point; the latter is preferred). If any number is 0, then that number-letter pair may be omitted. If any of the H (hour), M (minute), or S (second) number-letter pairs are present, then the separator T must precede the first ‘time’ number-letter pair.

For complete details, see ISO 8601 Data elements and interchange formats — Information interchange — Representation of dates and times.

teidata.duration.w3c

teidata.duration.w3c defines the range of attribute values available for representation of a duration in time using W3C datatypes.
Moduletei
Used by
Content model
<content>
 <dataRef name="duration"/>
</content>
Declaration
tei_teidata.duration.w3c = xsd:duration
Example
<time dur="PT45M">forty-five minutes</time>
Example
<date dur="P1DT12H">a day and a half</date>
Example
<date dur="P7D">a week</date>
Example
<time dur="PT0.02S">20 ms</time>
Note

A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the S number, which may have a decimal component (using . as the decimal point). If any number is 0, then that number-letter pair may be omitted. If any of the H (hour), M (minute), or S (second) number-letter pairs are present, then the separator T must precede the first ‘time’ number-letter pair.

For complete details, see the W3C specification.

teidata.enumerated

teidata.enumerated defines the range of attribute values expressed as a single XML name taken from a list of documented possibilities.
Moduletei
Used by
Element:
Content model
<content>
 <dataRef key="teidata.word"/>
</content>
Declaration
tei_teidata.enumerated = teidata.word
Note

Attributes using this datatype must contain a single ‘word’ which contains only letters, digits, punctuation characters, or symbols: thus it cannot include whitespace.

Typically, the list of documented possibilities will be provided (or exemplified) by a value list in the associated attribute specification, expressed with a <valList> element.

teidata.language

teidata.language defines the range of attribute values used to identify a particular combination of human language and writing system. [6.1. Language Identification]
Moduletei
Used by
Content model
<content>
 <alternate>
  <dataRef name="language"/>
  <valList>
   <valItem ident=""/>
  </valList>
 </alternate>
</content>
Declaration
tei_teidata.language = xsd:language | ( "" )
Note

The values for this attribute are language ‘tags’ as defined in BCP 47. Currently BCP 47 comprises RFC 5646 and RFC 4647; over time, other IETF documents may succeed these as the best current practice.

A ‘language tag’, per BCP 47, is assembled from a sequence of components or subtags separated by the hyphen character (-, U+002D). The tag is made of the following subtags, in the following order. Every subtag except the first is optional. If present, each occurs only once, except the fourth and fifth components (variant and extension), which are repeatable.

language
The IANA-registered code for the language. This is almost always the same as the ISO 639 2-letter language code if there is one. The list of available registered language subtags can be found at https://www.iana.org/assignments/language-subtag-registry. It is recommended that this code be written in lower case.
script
The ISO 15924 code for the script. These codes consist of 4 letters, and it is recommended they be written with an initial capital, the other three letters in lower case. The canonical list of codes is maintained by the Unicode Consortium, and is available at https://unicode.org/iso15924/iso15924-codes.html. The IETF recommends this code be omitted unless it is necessary to make a distinction you need.
region
Either an ISO 3166 country code or a UN M.49 region code that is registered with IANA (not all such codes are registered, e.g. UN codes for economic groupings or codes for countries for which there is already an ISO 3166 2-letter code are not registered). The former consist of 2 letters, and it is recommended they be written in upper case; the list of codes can be searched or browsed at https://www.iso.org/obp/ui/#search/code/. The latter consist of 3 digits; the list of codes can be found at http://unstats.un.org/unsd/methods/m49/m49.htm.
variant
An IANA-registered variation. These codes ‘are used to indicate additional, well-recognized variations that define a language or its dialects that are not covered by other available subtags’.
extension
An extension has the format of a single letter followed by a hyphen followed by additional subtags. These exist to allow for future extension to BCP 47, but as of this writing no such extensions are in use.
private use
An extension that uses the initial subtag of the single letter x (i.e., starts with x-) has no meaning except as negotiated among the parties involved. These should be used with great care, since they interfere with the interoperability that use of RFC 4646 is intended to promote. In order for a document that makes use of these subtags to be TEI-conformant, a corresponding <language> element must be present in the TEI header.

There are two exceptions to the above format. First, there are language tags in the IANA registry that do not match the above syntax, but are present because they have been ‘grandfathered’ from previous specifications.

Second, an entire language tag can consist of only a private use subtag. These tags start with x-, and do not need to follow any further rules established by the IETF and endorsed by these Guidelines. Like all language tags that make use of private use subtags, the language in question must be documented in a corresponding <language> element in the TEI header.

Examples include

sn
Shona
zh-TW
Taiwanese
zh-Hant-HK
Chinese written in traditional script as used in Hong Kong
en-SL
English as spoken in Sierra Leone
pl
Polish
es-MX
Spanish as spoken in Mexico
es-419
Spanish as spoken in Latin America

The W3C Internationalization Activity has published a useful introduction to BCP 47, Language tags in HTML and XML.

teidata.name

teidata.name defines the range of attribute values expressed as an XML Name.
Moduletei
Used by
Element:
Content model
<content>
 <dataRef name="Name"/>
</content>
Declaration
tei_teidata.name = xsd:Name
Note

Attributes using this datatype must contain a single word which follows the rules defining a legal XML name (see https://www.w3.org/TR/REC-xml/#dt-name): for example they cannot include whitespace or begin with digits.

teidata.numeric

teidata.numeric defines the range of attribute values used for numeric values.
Moduletei
Used by
Element:
Content model
<content>
 <alternate>
  <dataRef name="double"/>
  <dataRef name="token"
   restriction="(\-?[\d]+/\-?[\d]+)"/>

  <dataRef name="decimal"/>
 </alternate>
</content>
Declaration
tei_teidata.numeric =
   xsd:double | token { pattern = "(\-?[\d]+/\-?[\d]+)" } | xsd:decimal
Note

Any numeric value, represented as a decimal number, in floating point format, or as a ratio.

To represent a floating point number, expressed in scientific notation, ‘E notation’, a variant of ‘exponential notation’, may be used. In this format, the value is expressed as two numbers separated by the letter E. The first number, the significand (sometimes called the mantissa) is given in decimal format, while the second is an integer. The value is obtained by multiplying the mantissa by 10 the number of times indicated by the integer. Thus the value represented in decimal notation as 1000.0 might be represented in scientific notation as 10E3.

A value expressed as a ratio is represented by two integer values separated by a solidus (/) character. Thus, the value represented in decimal notation as 0.5 might be represented as a ratio by the string 1/2.

teidata.outputMeasurement

teidata.outputMeasurement defines a range of values for use in specifying the size of an object that is intended for display.
Moduletei
Used by
Content model
<content>
 <dataRef name="token"
  restriction="[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|ch|rem|vw|vh|vmin|vmax)"/>

</content>
Declaration
tei_teidata.outputMeasurement =
   token
   {
      pattern = "[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|ch|rem|vw|vh|vmin|vmax)"
   }
Example
<figure>
 <head>The TEI Logo</head>
 <figDesc>Stylized yellow angle brackets with the letters <mentioned>TEI</mentioned> in
   between and <mentioned>text encoding initiative</mentioned> underneath, all on a white
   background.</figDesc>
 <graphic height="600px"
  url="http://www.tei-c.org/logos/TEI-600.jpgwidth="600px"/>

</figure>
Note

These values map directly onto the values used by XSL-FO and CSS. For definitions of the units see those specifications; at the time of this writing the most complete list is in the CSS3 working draft.

teidata.pattern

teidata.pattern defines attribute values which are expressed as a regular expression.
Moduletei
Used by
Content model
<content>
 <dataRef name="token"/>
</content>
Declaration
tei_teidata.pattern = token
Note
A regular expression, often called a pattern, is an expression that describes a set of strings. They are usually used to give a concise description of a set, without having to list all elements. For example, the set containing the three strings Handel, Händel, and Haendel can be described by the pattern H(ä|ae?)ndel (or alternatively, it is said that the pattern H(ä|ae?)ndel matches each of the three strings)
Wikipedia

This TEI datatype is mapped to the XSD token datatype, and may therefore contain any string of characters. However, it is recommended that the value used conform to the particular flavour of regular expression syntax supported by XSD Schema.

teidata.point

teidata.point defines the data type used to express a point in cartesian space.
Moduletei
Used by
Content model
<content>
 <dataRef name="token"
  restriction="(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)"/>

</content>
Declaration
tei_teidata.point =
   token { pattern = "(-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?)" }
Example
<facsimile>
 <surface lrx="400lry="280ulx="0uly="0">
  <zone points="220,100 300,210 170,250 123,234">
   <graphic url="handwriting.png"/>
  </zone>
 </surface>
</facsimile>
Note

A point is defined by two numeric values, which should be expressed as decimal numbers. Neither number can end in a decimal point. E.g., both 0.0,84.2 and 0,84 are allowed, but 0.,84. is not.

teidata.pointer

teidata.pointer defines the range of attribute values used to provide a single URI, absolute or relative, pointing to some other resource, either within the current document or elsewhere.
Moduletei
Used by
Element:
Content model
<content>
 <dataRef name="anyURIrestriction="\S+"/>
</content>
Declaration
tei_teidata.pointer = xsd:anyURI { pattern = "\S+" }
Note

The range of syntactically valid values is defined by RFC 3986 Uniform Resource Identifier (URI): Generic Syntax. Note that the values themselves are encoded using RFC 3987 Internationalized Resource Identifiers (IRIs) mapping to URIs. For example, https://secure.wikimedia.org/wikipedia/en/wiki/% is encoded as https://secure.wikimedia.org/wikipedia/en/wiki/%25 while http://موقع.وزارة-الاتصالات.مصر/ is encoded as http://xn--4gbrim.xn----rmckbbajlc6dj7bxne2c.xn--wgbh1c/

teidata.probability

teidata.probability defines the range of attribute values expressing a probability.
Moduletei
Used by
Content model
<content>
 <dataRef name="double"/>
</content>
Declaration
tei_teidata.probability = xsd:double
Note

Probability is expressed as a real number between 0 and 1; 0 representing certainly false and 1 representing certainly true.

teidata.probCert

teidata.probCert defines a range of attribute values which can be expressed either as a numeric probability or as a coded certainty value.
Moduletei
Used by
Content model
<content>
 <alternate>
  <dataRef key="teidata.probability"/>
  <dataRef key="teidata.certainty"/>
 </alternate>
</content>
Declaration
tei_teidata.probCert = teidata.probability | teidata.certainty

teidata.replacement

teidata.replacement defines attribute values which contain a replacement template.
Moduletei
Used by
Content model
<content>
 <textNode/>
</content>
Declaration
tei_teidata.replacement = text

teidata.temporal.iso

teidata.temporal.iso defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the international standard Data elements and interchange formats – Information interchange – Representation of dates and times.
Moduletei
Used by
Content model
<content>
 <alternate>
  <dataRef name="date"/>
  <dataRef name="gYear"/>
  <dataRef name="gMonth"/>
  <dataRef name="gDay"/>
  <dataRef name="gYearMonth"/>
  <dataRef name="gMonthDay"/>
  <dataRef name="time"/>
  <dataRef name="dateTime"/>
  <dataRef name="token"
   restriction="[0-9.,DHMPRSTWYZ/:+\-]+"/>

 </alternate>
</content>
Declaration
tei_teidata.temporal.iso =
   xsd:date
 | xsd:gYear
 | xsd:gMonth
 | xsd:gDay
 | xsd:gYearMonth
 | xsd:gMonthDay
 | xsd:time
 | xsd:dateTime
 | token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" }
Note

If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used.

For all representations for which ISO 8601:2004 describes both a basic and an extended format, these Guidelines recommend use of the extended format.

teidata.temporal.w3c

teidata.temporal.w3c defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the W3C XML Schema Part 2: Datatypes Second Edition specification.
Moduletei
Used by
Content model
<content>
 <alternate>
  <dataRef name="date"/>
  <dataRef name="gYear"/>
  <dataRef name="gMonth"/>
  <dataRef name="gDay"/>
  <dataRef name="gYearMonth"/>
  <dataRef name="gMonthDay"/>
  <dataRef name="time"/>
  <dataRef name="dateTime"/>
 </alternate>
</content>
Declaration
tei_teidata.temporal.w3c =
   xsd:date
 | xsd:gYear
 | xsd:gMonth
 | xsd:gDay
 | xsd:gYearMonth
 | xsd:gMonthDay
 | xsd:time
 | xsd:dateTime
Note

If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used.

teidata.text

teidata.text defines the range of attribute values used to express some kind of identifying string as a single sequence of Unicode characters possibly including whitespace.
Moduletei
Used by
Content model
<content>
 <dataRef name="string"/>
</content>
Declaration
tei_teidata.text = string
Note

Attributes using this datatype must contain a single ‘token’ in which whitespace and other punctuation characters are permitted.

teidata.truthValue

teidata.truthValue defines the range of attribute values used to express a truth value.
Moduletei
Used by
Element:
Content model
<content>
 <dataRef name="boolean"/>
</content>
Declaration
tei_teidata.truthValue = xsd:boolean
Note

The possible values of this datatype are 1 or true, or 0 or false.

This datatype applies only for cases where uncertainty is inappropriate; if the attribute concerned may have a value other than true or false, e.g. unknown, or inapplicable, it should have the extended version of this datatype: teidata.xTruthValue.

teidata.versionNumber

teidata.versionNumber defines the range of attribute values used for version numbers.
Moduletei
Used by
Content model
<content>
 <dataRef name="token"
  restriction="[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}"/>

</content>
Declaration
tei_teidata.versionNumber =
   token { pattern = "[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}" }

teidata.word

teidata.word defines the range of attribute values expressed as a single word or token.
Moduletei
Used by
teidata.enumeratedElement:
Content model
<content>
 <dataRef name="token"
  restriction="[^\p{C}\p{Z}]+"/>

</content>
Declaration
tei_teidata.word = token { pattern = "[^\p{C}\p{Z}]+" }
Note

Attributes using this datatype must contain a single ‘word’ which contains only letters, digits, punctuation characters, or symbols: thus it cannot include whitespace.

teidata.xpath

teidata.xpath defines attribute values which contain an XPath expression.
Moduletei
Used by
Content model
<content>
 <textNode/>
</content>
Declaration
tei_teidata.xpath = text
Note

Any XPath expression using the syntax defined in 6.2. .

When writing programs that evaluate XPath expressions, programmers should be mindful of the possibility of malicious code injection attacks. For further information about XPath injection attacks, see the article at OWASP.

teidata.xTruthValue

teidata.xTruthValue (extended truth value) defines the range of attribute values used to express a truth value which may be unknown.
Moduletei
Used by
Element:
Content model
<content>
 <alternate>
  <dataRef name="boolean"/>
  <valList>
   <valItem ident="unknown"/>
   <valItem ident="inapplicable"/>
  </valList>
 </alternate>
</content>
Declaration
tei_teidata.xTruthValue = xsd:boolean | ( "unknown" | "inapplicable" )
Note

In cases where where uncertainty is inappropriate, use the datatype teidata.TruthValue.

Schema tei_lite: Macros

macro.abContent

macro.abContent (anonymous block content) defines the content of anonymous block elements. [1.3. The TEI Class System]
Moduletei
Used by
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.paraPart"/>
  <elementRef key="ab"/>
 </alternate>
</content>
Declaration
tei_macro.abContent = ( text | tei_model.paraPart | tei_ab )*

macro.limitedContent

macro.limitedContent (paragraph content) defines the content of prose elements that are not used for transcription of extant materials. [1.3. The TEI Class System]
Moduletei
Used by
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.limitedPhrase"/>
  <classRef key="model.inter"/>
 </alternate>
</content>
Declaration
tei_macro.limitedContent =
   ( text | tei_model.limitedPhrase | tei_model.inter )*

macro.paraContent

macro.paraContent (paragraph content) defines the content of paragraphs and similar elements. [1.3. The TEI Class System]
Moduletei
Used by
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.paraPart"/>
 </alternate>
</content>
Declaration
tei_macro.paraContent = ( text | tei_model.paraPart )*

macro.phraseSeq

macro.phraseSeq (phrase sequence) defines a sequence of character data and phrase-level elements. [1.4.1. Standard Content Models]
Moduletei
Used by
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.attributable"/>
  <classRef key="model.phrase"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Declaration
tei_macro.phraseSeq =
   (
      text
    | tei_model.gLike
    | tei_model.attributable
    | tei_model.phrase
    | tei_model.global
   )*

macro.phraseSeq.limited

macro.phraseSeq.limited (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents. [1.4.1. Standard Content Models]
Moduletei
Used by
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.limitedPhrase"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Declaration
tei_macro.phraseSeq.limited =
   ( text | tei_model.limitedPhrase | tei_model.global )*

macro.specialPara

macro.specialPara ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements. [1.3. The TEI Class System]
Moduletei
Used by
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
  <classRef key="model.phrase"/>
  <classRef key="model.inter"/>
  <classRef key="model.divPart"/>
  <classRef key="model.global"/>
 </alternate>
</content>
Declaration
tei_macro.specialPara =
   (
      text
    | tei_model.gLike
    | tei_model.phrase
    | tei_model.inter
    | tei_model.divPart
    | tei_model.global
   )*

macro.xtext

macro.xtext (extended text) defines a sequence of character data and gaiji elements.
Moduletei
Used by
Content model
<content>
 <alternate maxOccurs="unbounded"
  minOccurs="0">

  <textNode/>
  <classRef key="model.gLike"/>
 </alternate>
</content>
Declaration
tei_macro.xtext = ( text | tei_model.gLike )*
Notes
1
A namespace is an XML concept. Its function is to identify the vocabulary from which a group of element names are drawn, using a standard identifier resembling a web address. The namespace for all TEI elements is http://www.tei-c.org/ns/1.0
2
The relevant standard is Best Current Practice 47 (http://tools.ietf.org/html/bcp47). The authoritative list of registered subtags is maintained by IANA and is available at http://www.iana.org/assignments/language-subtag-registry. For a general overview of the construction of language tags, see http://www.w3.org/International/articles/language-tags/, and for a practical step-by-step guide, see http://www.w3.org/International/questions/qa-choosing-language-tags.
3
The full TEI provides a range of elements for encoding metadata about manuscript production and description, which are not however included in TEI Lite
4
The analysis is taken, with permission, from Willard McCarty and Burton Wright, An Analytical Onomasticon to the Metamorphoses of Ovid (Princeton: Princeton University Press, forthcoming). Some simplifications have been undertaken.
Lou Burnard and C. M. Sperberg-McQueen. Date: August 2012