An Introduction to TEI:

Laura Mandell, Miami Univ. of Ohio

The TEI Header

For tutorials and instructions,

A picture is worth a thousand words. Here is teiHeader for a TEI P4 file by Laura Mandell:

 

 

(A) The first part of the TEI Header is the fileDesc which must include minimally three things: 1) a titleStmt 2) publicationStmt 3) sourceDesc

  1. title author and editing information or respStmt go inside the titleStmt
  2. publisher pubPlace date availability go inside publicationStmt
  3. biblFull or biblStruct, along with a statement enclosed by <p></p> codes about where the source resides, goes inside sourceDesc

Three additional items are the 4) editionStmt (important for putting a date at the foot of your document); 5) extent, and 6) seriesStmt

  1. Additional parts of the header appearing above are: extent and series statment

Note: You'll notice the title, the first slement in number 1, has not only "types" (main title, subtitle), but "levels":

a = analytic title (article, poem, or other item published as part of a larger item)
m = monographic title (book, collection, or other item published as a distinct item, including single volumes of multi-volume works)
j = journal title
s = series title
u = title of unpublished material (including theses and dissertations unless published by a commercial press)

For #3, inside biblFull or biblStruct, you will need to use the tags

<analytic> contains bibliographic elements describing an item (e.g. an article or poem) published within a monograph or journal and not as an independent publication

<monogr> contains bibliographic elements describing an item (e.g. a book or journal) published as an independent item

<series> contains information about the series in which a book or other bibliographic item has appeared

(B) The second part of the TEI Header is the encodingDesc, which can include very formal structures or simple paragraph descriptions (as here). To avoid confusion and errors that defy detection -- especially if you are cutting and pasting text from many different types of documents into your TEI document -- it is best to replace all punctuation marks with entity references.

-- for punctuation, see 4.2.1 under 4.2 Entry and Display Character References and also refer to The Unicode Standard General Punctuation and Controls and Basic Latin. Based on my discussions with and viewings of The Rossetti Archive and the Victorian Women Writers Project, I would say that library-quality texts in general use code to specify punctuation beyond commas, semicolons, and periods.

Whenever you type an ampersand in oXygen -- an ampersand always begins an entity reference; if you want to type a REAL ampersand, you must type "&amp;" -- a drop-down box will appear offering you possibilties for punctuation:

You can click on one of those possibilities for &, ', >, <, " -- OR you can type &hyphen; &colon; &mdash; for hyphens, colons, and longdashes.

(C) The third part of the TEI Header is the revisionDesc -- using TEI default XSL files (provided on the TEI site and by oXygen), the name that appears here will appear at the foot of your HTML page along with the author of the text

Go on to the next step, coding the text itself -- that is, the content

Back to the top

Back to Table of Contents