Introduction to the TEI


Introduction to the TEI

  • where the TEI came from
  • architecture of the TEI scheme
  • main and auxiliary DTDs
  • within the main DTD: core, base, and additional tag sets

Where does the TEI come from?

From the research community:

Sponsors

  • ACH Association for Computers and the Humanities
  • ACL Association for Computational inguistics
  • ALLC Association for Literary and Linguistic Computing

Funders

  • U.S. National Endowment for the Humanities
  • Mellon Foundation
  • Commission of European Communities DG XIII
  • Social Science and Humanities Research Council of Canada

TEI today

To ensure continued maintenance, a membership onsortium has been formed in 2001 and the further maintenance and development of the TEI has been transferred to it. Currently, the hosting members are:

  • Brown University
  • University of Virginia, USA
  • University of Bergen, Norway
  • Oxford University, United Kingdom

Goals of the TEI

  • better interchange and integration of data
  • support for all texts, in all languages, from all periods
  • guidance for the perplexed: what to encode
  • assistance for the specialist: how to encode any information of interest

TEI Deliverables

  • A coherent set of recommendations for text encoding
    • comprising several distinct SGML tag sets
    • based on existing practice
    • documented in a reference manual
  • Tutorials for general and specialized audiences (in progress)
  • Sample texts (not yet)

... but no TEI software

TEI DTD Structure

  • how to make one markup scheme handle infinite variety of requirements and interests
  • all texts are alike
  • every text is different
  • similar to the database design problem: one construct, many views
  • each view a selection from the whole

How Many DTDs?

How many DTDs are necessary for a project like the TEI?

  • one (a `Prussian' DTD)
  • none (a `Waterloo' DTD)
  • one per document (a `Californian' DTD)

The TEI DTDs

  • a single main DTD with many faces (a `British' DTD)
  • many tags (over 400)
  • organized into tag sets
  • grouped into classes
  • several auxiliary DTDs for specialized information:
    • writing system / character set
    • feature system (for feature-structure notation)
    • tag set documentation
    • independent, free-standing TEI header

The Pizza Model (XML version)

<!ELEMENT pizza     (base, (tomatoSauce & cheese), topping*) > 
<!ELEMENT base      (thinCrust | pan | stuffed) > 
<!ELEMENT topping   (mushrooms | pepperoni |  sausage | pepper | anchovies | ...) >


4 Next | First| Previous Introduction to XML, Markup and the TEI Guidelines