THE GEDCOM STANDARD


                                           DRAFT Release 5.3

                                            4 November 1993


                                            Prepared by the
                                       Family History Department
                            The Church of Jesus Christ of Latter-day Saints


Suggestions and Correspondence:
GEDCOM Coordinator - 3T
Family History Department
50 East North Temple
Salt Lake City, UT 84150
USA

Telephone (USA) 801-240-4534
                    240-5225


       "Copyright þ 1987,1989,1992,1993 by Corporation of the President of The Church of Jesus Christ of Latter-day
       Saints. This document may be copied for purposes of review or programming of genealogical software, provided this
       notice is included. All other rights reserved."TABLE OF CONTENTS


Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3


    Purpose and Content of Document. . . . . . . . . . . . . . . . . . . . . . . . . .3
       Changes in Version 5.x. . . . . . . . . . . . . . . . . . . . . . . . . . . . .3
       GEDCOM Product Registration . . . . . . . . . . . . . . . . . . . . . . . . . .5
       GEDCOM Software Library . . . . . . . . . . . . . . . . . . . . . . . . . . . .5


Chapter 1
    Data Representation Grammar. . . . . . . . . . . . . . . . . . . . . . . . . . . .6
       Concepts. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .6
       Grammar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7
       Usage Description . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9


Chapter 2
    Lineage-linked Grammar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
       Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
       Lineage-linked Grammar Organization . . . . . . . . . . . . . . . . . . . . . 14
       Record Structures of the Lineage-linked Form. . . . . . . . . . . . . . . . . 15
       Substructures of the Lineage-linked Form. . . . . . . . . . . . . . . . . . . 19
       Primitive Elements of the Lineage-linked Form . . . . . . . . . . . . . . . . 26
       Compatibility with other GEDCOM versions. . . . . . . . . . . . . . . . . . . 42
       Packaging the GEDCOM Transmission File  . . . . . . . . . . . . . . . . . . . 43
       User Defined Tags . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
       Sample Lineage-linked GEDCOM Transmission . . . . . . . . . . . . . . . . . . 44
       Sample EVENT_RECORD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46


Chapter 3
    Using Character Sets in GEDCOM . . . . . . . . . . . . . . . . . . . . . . . . . 47
       8-bit ANSEL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
       Unicode (ISO 10646) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48


Appendix:
    A  Lineage-linked GEDCOM Tag Definition. . . . . . . . . . . . . . . . . . . . . 50
    B  Proposed Event and Role Tags. . . . . . . . . . . . . . . . . . . . . . . . . 62
    C  Ansel Character Set . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
                                            Introduction

GEDCOM was developed by the Family History Department of the Church of Jesus Christ of
Latter-day Saints to provide a flexible uniform format for exchanging computerized genealogical
data.  GEDCOM is an acronym for GEnealogical Data Communication.  GEDCOM is provided to
foster the sharing of genealogical information and the development of a wide range of inter-operable
software products to assist genealogists, historians, and other researchers.

Purpose and Content of This Document

This technical document is written for computer programmers, system developers, and technically
sophisticated users. 

The chapters in this document contain the following GEDCOM specifications: 
       *  Data Representation Grammar              *  Values
       *  Lineage-linked GEDCOM Grammar            *  Character Sets
       *  GEDCOM Transmission File                       

This document describes GEDCOM at two different levels.  The lower level defines a general-
purpose data representation language for representing any kind of structured information in a
sequential media.  The higher level defines specific content for data to be exchanged between
compatible systems.

The lower level is known as the GEDCOM data format and deals with the syntax and
identification of structured information in general, but does not deal with the semantic content of
any particular kind of data.  The lower level GEDCOM format and the basic GEDCOM concepts
are presented in chapter 1.  This chapter will also be useful to those using GEDCOM for other
kinds of data, not just genealogical data.

The higher level is known as a GEDCOM form.  A GEDCOM form is defined for each kind of data
that uses the GEDCOM data format.  The only GEDCOM form presented in this document is called
the Lineage-linked GEDCOM form.  Other GEDCOM forms have been used for other kinds of
data, including several that are not related to genealogy.  The Lineage-linked GEDCOM form is
defined in chapter 2 and is the form used by commercial genealogical software systems for
exchanging compiled, linked information about individuals with accompanying source citations and
evidence records.  The other forms of GEDCOM are not publicly exchanged at this time, and are
not discussed in this document.

Changes in Version 5.x

Prior versions of The GEDCOM Standard were released in October 1987 (3.0) and August 1989
(4.0).  Versions 1 and 2 were drafts for public discussion and were not established as a standard.

This GEDCOM draft version (5.x) includes the first standard definition of the Lineage-linked form
of GEDCOM and also includes the first major expansion of the Lineage-linked form since its initial
use in GEDCOM 3.0.  The existing registered GEDCOM-compatible systems should still be able to
exchange most data with newer systems that use this version and will still be considered GEDCOM-
compatible for submitting information to the Family History Department. See chapter 2,
"Compatibility with previous GEDCOM releases", for compatibility detail.
There are several purposes for version 5.x of GEDCOM:

       *   Re-define the description of the GEDCOM data representation grammar in a shorter,
           more precise format, for ease of understanding (see chapter 1).  The GEDCOM format
           remains the same, even though the description of it is changed.

       *   Define the combinations of tags, values, and pointers allowed in the Lineage-linked form
           (see chapter 2).  This is the form of GEDCOM currently exchanged by commercial
           genealogical software systems, and it remains unchanged except for new tags and
           upward-compatible  structural extensions listed below.  (The Lineage-linked form should
           not be confused with other forms of GEDCOM, which apply the basic GEDCOM data
           format with different tag, value, and pointer combinations for other purposes.)

       *   Define representations for support information such as source citations, and or notes. 
           (See chapter 2 for suggested source citation structure in the Lineage-linked grammar.)

       *   Define additional EVENt and Role tags.

       *   Define user-defined ASSOciations with INDIviduals including direct family relationships.

       *   Require SOURce VERSion (product version) and GEDCom VERSion information in the
           HEADer record.

       *   Define DATE modifier (ABT, BEF, AFT, BET) and a more rigorously defined regular
           date format.
       
Some changes in Version 5.2 - 5.3 that were not in previous 5.x versions are:

       *   An address structure was defined to provide consistency to the addresses used in the
           many different structures.  The Phone number is now subordinate to address.

       *   A new tag for marrital status (MSTAT) at the time of an event was used added to the
           event structure.

       *   A mechanism for creating user-defined tags.  These are defined in a SCHEMA definition
           in the header record.

       *   The inclusion of the Unicode standard (ISO 10646) as an additional character set standard
           (see chapter 3).

       *   A MULTI_MEDIA_LINK structure was introduced to provide links to digitized video
           and sound files.

       *   The NAME tag used in the SOURCE_STRUCTURE was changed back to the TITLe tag
           to be used with the title of a book or article.

       *   The SOURCE_STRUCTURE was changed.  Compatibility may affect 5.x systems that
           was using the CPLR, XLTR, AUTH, INFT tags in substructures within the source
           structure.  See originator (ORIG) substructure for handling the name of the originator of
           the source data.
       *   Relocated all tags from the SUPPORT_INFO structure to the various structures where
           they specifically apply.

       *   Added the use of the FORM {FORMAT} tag in both the HEADER and
           PLACE_STRUCTURE.  The FORM tag in the header record subordinate to the PLAC
           tag indicates that all of the locality names are specified in a consistent hiarchy as
           specified by the value of the FORM.  For example; 2 FORM City, County, State. 
           GEDCOM 5.2 used the TYPE tag subordinate to the PLAC tag for this purpose.


GEDCOM Product Registration

Developers of GEDCOM-compatible products using the Lineage-linked form of GEDCOM (see
chapter 2) should register their product by submitting the following information to the GEDCOM
coordinator:

       *   A diskette containing a small sample of GEDCOM output from the product being
           registered.  This should be data which represents all of the fields managed by your
           system and that can be used for testing compatibility with other developer's systems.

       *   A proposed unique SOURce name in the GEDCOM header record to identify the product
           (not the company).  This name can be up to 40 characters long, allowing mixed upper
           and lower case, with no embedded spaces.  Use an underscore (_) to connect multiple
           words instead of spaces or a combination of upper and lower case letters i.e.
           FamilyRecords or Family_Records.  Family History reserves the right to require
           uniqueness within the first 10 characters of this name.

       *   An optional text file containing relevant technical documentation about the product's
           GEDCOM implementation.

GEDCOM Software Library

A library of unrestricted public domain source code, in the C programming language, is available to
help reduce the work required to achieve GEDCOM compatibility.Chapter 1
                                      DATA REPRESENTATION GRAMMAR

INTRODUCTION

This chapter describes the core GEDCOM data representation language.

The generic data representation language defined in this chapter may be used to represent any form
of structured information, not just genealogical data, using a sequential stream of characters. 

CONCEPTS

A GEDCOM transmission represents a database in the form of a sequential stream of related
records.  A record is represented as a sequence of tagged, variable-length lines, arranged in a
hierarchy.  A line always contains a hierarchical level number, a tag, and an optional value.  A line
may also contain a cross-reference identifier or a pointer.  The GEDCOM-line is terminated by a
carriage return, a line feed character, or any combination of these.

The tag in the GEDCOM-line identifies the type of information contained in the line, in the same
sense that a field-name identifies a field in a database record.  This means that the data is self-
defining.  Tags allow a field to occur any number of times within a record, including zero times. 
They also allow the use of different or new fields to be included in the GEDCOM data without
introducing incompatibility, because the receiving system will ignore data which it does not
understand and process only the data that it does understand.

The hierarchical relationships are indicated by the hierarchical level number.  Subordinate lines have
a higher level number.  The hierarchy allows a line to have sub-lines, which in turn may have their
own sub-lines, and so forth.  A line and its sub-lines constitute a context or enclosure, that is, a
cluster of information pertaining directly to the same thing.  This hierarchical arrangement
corresponds with the natural hierarchy found in most structured information.

A series of one or more lines constitutes a record.  The beginning of a new record is indicated by a
line whose level number is 0 (zero).

A GEDCOM receiver system scans the input for expected information by looking for specific tags
and processing the associated values.  Unrecognized tags (perhaps from a sending system whose
database contains some different information) are handled by not processing the associated value nor
its enclosed sub-lines; that is, the entire context is ignored. These are treated as exceptions by
printing them in an exception report or saving them in some generic way.  Saved exception lines
may be recombined when the data is exported.

In addition to hierarchical relationships, GEDCOM defines inter-record relationships which allow a
record to be logically related to other records, without introducing redundancy.  These relationships
are represented by two additional but optional parts of a line: a cross-reference pointer and a cross-
reference identifier.  The cross-reference pointer "points at" a related record, identified by a
required, matching unique cross-reference identifier.  The cross-reference identifier is analogous to a
primary key in relational database terminology.


GRAMMAR

The grammar for the GEDCOM data format--a data representation language--is defined in this
chapter.  The grammar is a set of rules that specify what sequences of characters are valid
GEDCOM expressions.  The rules are expressed as a set of pattern definitions, where each pattern
is defined in terms of either a more primitive sub-pattern, or a constant.  Pattern definitions consist
of the pattern name, a separator (:=), followed by either a constant, a more primitive sub-pattern,
or a set of alternatives of these.  When a set is used, the alternatives are enclosed in square brackets
[] with the alternatives separated by a vertical bar ([alternative_1 | alternative_2]).  Only one is to
be selected.  The user can read the grammar components of the selected sub-pattern by substituting
any sub-patterns until all sub-patterns are resolved. 

A GEDCOM transmission consists of a sequence of physical records, each of which consists of a
sequence of gedcom_lines, all contained in a sequential file or stream of characters.  The
following rules pertain to the gedcom_line:

       *   The beginning of a new physical record is designated by a line whose level number is 0.

       *   Physical records are intended to be small enough to fit within a memory buffer of typical
           size, though absolute limits are not established.

       *   The total length of a GEDCOM-line, including leading white space and terminators, does
           not exceed 255 characters.  Long text can be represented by using CONTinue or
           CONCatenate tags.

       *   Leading white space (tabs, spaces, and extra line terminators) preceding a GEDCOM-line
           should be ignored by the reading system.  Systems generating GEDCOM should not place
           any white space in front of the GEDCOM-line (at least for the near future, see
           "Compatibility With Previous GEDCOM Versions" at the end of chapter 2).

       *   Level numbers must not contain leading zeroes which are not significant, for example,
           level one must be 1, not 01.

       *   GEDCOM-lines constructed with user defined tags must include a tag definition in the a
           schema substructure in the transmission header record.  The user defined tag must begin
           with an underscore (_).  The schema allows a receiving system to interpret the associated
           data.  (See the User Defined Tags section in chapter 2 for more information).


GRAMMAR SYNTAX

A gedcom_line has the following syntax:

    gedcom_line:=
         level delim  opt_xref_id  tag  opt_line_value  terminator 
       
        for example:
                               1 OCCU Teacher

    The components of the sub-patterns above are defined below in alphabetical order.  Some of the
    components are defined in terms of more primitive sub-patterns:

    alpha:=
     [ (0x41)-(0x5A) | (0x61)-(0x7A) | 0x5F ]
       Any ASCII letter: A-Z, a-z, and (_) underscore 

    alphanum:=
     [ alpha | digit ]

    any_char:=
     [ alpha | digit | otherchar | (#) | ( ) | (@) (@) ]

    delim:=
     [ (0x20) ]
       space_character

    digit:=
     [ (0x30)-(0x39) ]
       One of the digits 0,1,2,3,4,5,6,7,8,9

    escape:=
     [ (@) (#) escape_text (@) non_at ]
      
    escape_text:=
     [ any_char | escape_text any_char ]
       The escape_text is coded to meet the rules of a particular GEDCOM form.  For the lineage-
       linked form the definitions are found in Chap. 2.

    level:=
     [ digit | level digit ]
       (Do not use non-significant leading zeroes such as 02.)

    line_item:=
     [ pointer | escape | any_char ]

    line_value:=
     [ line_item |    line_value  line_item ]

    non_at:=
     [ alpha | digit | otherchar | (#) | ( ) ]
 
    null:=
       () nothing

    opt_line_value:=
     [ null | delim | delim line_value ]

    opt_xref_id:=
     [ null | pointer delim ]

    otherchar:=
     [(0x21)-(0x22) | (0x24)-(0x2F) | (0x3A)-(0x3F) | (0x5B)-(0x5E) | (0x60) |
      (0x7B)-(0x7E) | (0x80)-(0xFF)]
    Any ASCII character except control characters (0x00 - 0x1F), alphanum, space ( ), number sign
    (#),  at character (@), and the DEL character (0x7F). 

    pointer:=
     [ "@" alphanum pointer_string "@" ]

    pointer_char:=
     [ non_at ]

    pointer_string:=
     [ null | pointer_char | pointer_string pointer_char ]

    tag:=
     [ alphanum | tag alphanum ]

    terminator:=
     [ carriage_return |     line_feed |   carriage_return line_feed |
       line_feed carriage_return ]


USAGE DESCRIPTION:

    alpha:=
       The alpha characters include the underscore which is used to link word pieces together in
       forming tag names or tag labels.

    any_char:=
       Any character except the control characters found in the range of 0x00 - 0x1F.  If an @ is
       desired as part of the line_value, it must be written in GEDCOM as a double @, ie., "3 doz.
       @  $20.00" must be stored as "3 doz. @@ $20.00".

    delim:=
       The delim (delimiter), a single space character, terminates both the variable-length level
       number and the variable-length tag.  Note that space characters may also be present in a
       value.

    escape:=
       The  escape is a sequence in the grammar used to specify special processing, such as
       switching character sets or calendars for date interpretation, or for indicating an inclusion of
       a non_GEDCOM data form into the GEDCOM structure. The form of the escape sequence
       is:
       @# escape_text @ non_at.
                      for example:
                      @#DJULIAN@.
       The non_at after the final at character (@) should be discarded if it is a space ( ). 
       Otherwise, it should be retained as part of the text following the escape.  Output systems
       should always place a space ( ) after the escape sequence.  

       The specific format of the escape sequence is defined for the specific GEDCOM form being
       defined. (See chapter 2 for the escape sequence definition for the lineage-linked form).  

    escape_text:=
       The escape_text is defined to meet the requirements of a particular GEDCOM form.  For the
       lineage-linked form the definitions are found in Chap. 2.

    level:=
       The level number works the same way as the level of indentation in an indented outline,
       where indented lines provide detail about the item under which they are indented.  A line at
       any level L is enclosed by and pertains directly to the nearest preceding line at level L-1. 
       The Level L may increase by 1 at most.  Level numbers must not contain leading zeroes
       which are not significant, for example level one must be (1), not (01).

       The enclosed subordinate lines at level L are said to be in the context of the enclosing
       superior line at level L-1.  The meaning of a tag (see tag below) is interpreted in the context
       of the tags of the enclosing line(s).  Take the following record about an individual's birth
       and death dates, for example:

              0 INDI
                1 BIRT
                  2 DATE 12 MAY 1920
                1 DEAT
                  2 DATE 1960

       In this example, the expression DATE 12 MAY 1920 is interpreted within the INDI
       (individual) BIRT (birth) context, representing the Individual's birth date.  The second
       DATE is in the INDI DEAT (death) context.  The complete meaning of DATE depends on
       the context.  (Note: the above example is indented according to the level numbers to make
       the concept more obvious.  In the actual GEDCOM data there is no indentation, just level
       numbers lined up vertically on the left margin).

       NOTE: Some existing systems provide an option to produce an indented GEDCOM output
       for user readability, using space or tab characters between the terminator and the level
       number of the next line to visibly show the hierarchy.  Also, some have suggested allowing
       extra blank lines to visibly separate physical records.  These features may be incorporated
       into the GEDCOM standard at some future time, but for now, such a change would render
       some existing systems incompatible.  Therefore, we recommend that new systems be
       prepared to discard extra carriage returns, line feeds, spaces and tabs immediately preceding
       the level number during input.  Output should still be constrained to level numbers without
       indentation or blank lines, until most receiving systems are prepared to deal with this change.

    line_value:=
       The line_value identifies an object within the domain of possible values allowed in the
       context of the tag.  The combination of the tag, the line_value, and the hierarchical context
       of the supporting gedcom_lines provides the understanding of the enclosed values.  This
       domain is defined by a specific grammar for representing a given GEDCOM form (see
       chapter 2 for Lineage-linked grammar).

       Values whose source information contains illegible parts of the value should be indicated by
       replacing the illegible part with ... (ellipses).

       Values are generally not encoded in binary or other abbreviation schemes for reducing space
       requirements, and they are generally constrained to be understandable by a typical user
       without decoding.  This is intended to reduce the decoding burden on the receiving software. 
       A GEDCOM-optimized data compression standard will be defined in the future to reduce
       space requirements.  Meanwhile, users may agree to compress and decompress GEDCOM
       files using any compression system available to both sender and receiver.

       The line_value within the context of a tag hierarchy of gedcom_lines represents one piece of
       information and corresponds to one field in traditional database or file terminology.

    opt_xref_id:=
       (See pointer.)
       The opt_xref_id is formed by any arbitrary combination of characters from the pointer_char
       set.  The first character must be an alpha or a digit.  The opt_xref_id is not retained in the
       receiving system, and may therefore be formed from any convenient combination of
       identifiers from the sending system.  No meaning is attributed by the receiver to any part of
       the opt_xref_id, other than its unique association with the associated record.  The use of the
       colon (:) character is also reserved.

    otherchar:=
     [(0x21)-(0x22) | (0x24)-(0x2F) | (0x3A)-(0x3F) | (0x5B)-(0x5E) | (0x60) |
      (0x7B)-(0x7E) | (0x80)-(0xFF)]
       Any ASCII character except control characters (0x00 - 0x1F), alphanum, space ( ), number
       sign (#),  at character (@), and the DEL character (0x7F). 

       If any of these characters appear in the level, xref_ID, or pointer segments of the GEDCOM
       line, then that substructure should be written to an exception file.  If any of these characters
       appear in the value segment and the proper escape processing has not been invoked, then
       they should be replaced by a (^) (0x5E) character, unless the character is a TAB (0x09)
       character which can be replaced with a space (0x20) character.  These changes should also
       be recorded on an exception file.

    pointer:=
       A pointer stands in the place of the context identified by the matching xref_id. 
       Theoretically, a receiving system should be prepared to follow a pointer to find any needed
       value in a manner that is transparent to the logic of the subsystem that is looking for specific
       tags.  This highly-flexible facility will probably be used more in the future.  For the time
       being, however, the use of pointers is explicitly defined within the GEDCOM form (Such as
       defined in Chapter 2).

       The pointer represents the association between two objects that usually reside in different
       records.  There can, however, be an association between objects within the same logical
       record.  If this condition exists it is indicated in the pointer record composition containing an
       (!) character that separates the parent record's cross-reference ID from the specific
       substructure's cross-reference ID which is at some subordinate level to the logical at level
       zero.  The cross-reference ID of the substructure subordinate to a zero level record is always
       composed of the Record ID number and the Substructure ID number, such as @I132!1@. 
       By including the Record Id number in the pointers which associate objects within a record
       will allow the GEDCOM processors to build the index only at the record level and then
       search sequentially for the appropriate substructure cross reference ID.

       Complex logical record structures are divided into small physical records to accommodate
       memory constraints, many-to-many relationships, and independent record creation and
       deletion.

       The pointer must match a corresponding xref_id within the transmission, unless the colon (:)
       character is present (future network reference to a permanent file record).  A pointer is
       given instead of duplicating an object, though the logical result is equivalent.  An expanded
       traversal of a record tree includes following the pointers to related records to some depth,
       and splicing those records (logically) into the resultant expanded tree. Pointers may refer to
       either records which have not yet appeared in the transmission (forward reference) or to
       records that have already appeared earlier in the transmission (backward reference).  This
       arrangement usually requires a preliminary pass to construct a look up table to support
       random access by xref_id during subsequent passes. 

    tag:=
       A tag consists of a variable length sequence of alphanum characters.  All user defined tags,
       that is tags used which have not been defined by the GEDCOM standard must begin with an
       underscore character. (0x95).  All user defined tags must be defined in the SCHEMA
       substructure of the HEADer record. 

       The tag represents the meaning of the line_value within the context of the enclosing lines,
       and contributes to the meaning of enclosed subordinate lines.  Specific tags are defined in
       Appendix A.  

       Although existing tags are only three or four characters long, systems should prepare to
       handle tags of any length. Tags will be unique within the first 15 characters.

       Valid combinations of specific tags, line_values, xref_ids, and pointers are constrained by
       the GEDCOM form defined for representing a given kind of information (see chapter 2 for
       the Lineage-linked form grammar).

    terminator:=
       The terminator delimits the variable-length line_value and signals the end of the 
       gedcom_line. The valid terminator characters are:
                       [     carriage_return |
                             line_feed |
                             carriage_return line_feed |
                             line_feed carriage_return ]

Examples:

    The following are examples of valid but unrelated GEDCOM-lines:

              0 @1234@ INDI
              . . .
              1 AGE 13
              . . .
              1 CHIL @1234@
              . . .
              1 NOTE This is a note field that is
              2 CONT continued on the next line.

    The first line has a level number 0, a xref_id of @1234@, an INDI tag, and no value.  The
    second line has a level number 1, no xref_id, an AGE tag, and a value of 13.  The third line
    has a level number 1, no xref_id, a CHIL tag, and a value of a pointer to a xref_id named
    @1234@.                                   Chapter 2
                                        LINEAGE-LINKED GRAMMAR

INTRODUCTION

This chapter describes the specific tag, value, and pointer combinations used for exchanging
lineage-linked genealogical information in the GEDCOM format.  Lineage-linked data pertains to
individuals linked in family relationships across multiple generations.  The chapter also addresses
specific compatibility issues pertaining to previous Lineage-linked GEDCOM releases and contains a
sample Lineage-linked GEDCOM transmission.

The Lineage-linked grammar defined in this chapter is based on the general framework of the
GEDCOM data representation grammar defined in the Chapter 1.  The lineage-linked grammar
defines the GEDCOM form used by commercial genealogical software systems to exchange data. 
Other specialized GEDCOM-based grammars have been created for different uses.  These other uses
of the general-purpose GEDCOM data representation should not be confused with this specific usage
for lineage-linked genealogical data, as defined in this chapter as the only approved form of
GEDCOM exchanged by commercial genealogical software systems at this time.

LINEAGE-LINKED GRAMMAR ORGANIZATION

This Lineage-linked GEDCOM grammar is organized into three sections:

    * Record structure components
    * Substructure patterns         (Arranged alphabetically by substructure name)
    * Primitive elements                   (Arranged alphabetically by primitive name)

Structures and substructures are indicated by enclosing the structure name within double angle
<<brackets>>.  Primitive element patterns are enclosed in single angle <brackets>.

The definition of each structure consists of the structure name, a separator (:=), and the structure's
component pattern.  This pattern consists of (a) GEDCOM-lines composed of primitive elements,
and/or (b) substructures.  Some primitive elements consist of two or more alternative sub-pattern
choices.  These choices are shown by listing the alternative sub-patterns between opening and
closing square [brackets] and separating each choice with a vertical bar (|), meaning that exactly
one of the alternate substitutions must be selected.  Some definitions of primitive elements use the
definition of other primitive elements to complete their definition. This is shown by including the
name of the detailed element type inside angle <brackets> in the definition.

The number of sub-pattern occurrences allowed within a pattern is defined in an occurrence
definition in curly {braces} on each line.  This number indicates the minimum and maximum
number of occurrences allowed for a pattern component in the form {minimum:maximum}.  Note
that minimum and maximum occurrence limits are defined relative to the enclosing superior line. 
This means that a required line (minimum = 1) is not required in an instance where the optional
enclosing line is not given.  Similarly, a line occurring only once (maximum = 1) may occur
multiple times as long as each occurs only once under its own multiple-occurring superior line.

The level numbers for any sub-structure are represented as (n), (+1), (+2), and so forth, so that
they may be used in more than one place at different starting level numbers. In these cases, (n)
equals the level number where the pattern first appears, and the (+1) means one level greater than
level n, (+2) means two levels greater than level n, and so forth.

Unless stated otherwise, the only ordering imposed on GEDCOM-lines within an enclosure arises
when multiple opinions or other items are presented for which only one may be expected by a
receiving system.  For example, a person may have been known by more than one name, or
evidence may suggest a birth either in 1840 in New York or in 1837 in Pennsylvania.  In these
cases, the most credible or preferred information is listed first, followed by less credible or less
preferred items. The QUAY tag may also be used to show the preferred data (see appendix A). 
Systems that support only a single field within a context should use the first item in the list.

Conflicting dates or places of an event should be represented in separate event structures to provide
a place for the accompanying source citations, rather than place multiple dates or multiple places
under the same enclosing event.

Even though no other ordering is defined beyond the one described above, some GEDCOM
programming tools optimize performance based on the assumption that tags generally appear in a
typical order.  Therefore, sending systems are encouraged to present GEDCOM structures in the
same general order as the one given in these patterns, unless there is a reason to use a different
sequence. 

This form uses the tag TYPE as a subordinate tag to names, places, events, etc.  The intent of this
tag is meant to further define its superior tag for the viewer only, it is not intended to inform a
computer program how to process the data.  The difference between this value and a note value
would be that displaying systems should always display the type value when they display the
associated data.  Therefore, cautious consideration should be used in using the TYPE tag. 

RECORD STRUCTURES OF THE LINEAGE-LINKED FORM

LINEAGE_LINKED_GEDCOM:=
      This is a model of the Lineage-linked GEDCOM structure for submitting data to other
      lineage-linked GEDCOM processing systems.  A header and a trailer record are required and
      they enclose any number of data records.

      0  <<HEADER>>                                                       {1:1}           
      0  <<RECORD>>                                                       {0:M}           
      0 TRLR                                                              {1:1}           

      There are specific subordinate GEDCOM-lines that may be used as subordinate GEDCOM-
      lines to other superior GEDCOM-lines.  For example:
         1 BIRT
         2 DATE 02 Oct 1937
         3 QUAY 1
      In the above example QUAY at level 3 indicates how reliable or correct the birth date value
      is.  The QUAY tag applies to any tag that contains a value.  This tag is not shown in any of
      the structures but the reader and writer of GEDCOM should expect that the QUAY tag
      could be present as a subordinate tag to any tag that has an associated value.

HEADER:=
      The header structure provides information about the entire transmission.  The SOURce system
      name identifies which system sent the data.  The DESTination system name identifies the
      receiving system.  Submission to the Family History Department for Ancestral File is
      ANSTFILE. For LDS temple submissions it is TempleReady.


      n  HEAD                                                             {1:1}
         +1 SOUR <SYSTEM_NAME>                                            {1:1}
            +2 VERS <VERSION_NUMBER>                                      {1:1}
            +2 NAME <PRODUCT_NAME>                                        {0:1}
            +2 CORP <CORPORATE_NAME>                                      {0:1}
               +3 <<ADDRESS_STRUCTURE>>                                   {0:1}
            +2 DATA <NAME_OF_SOURCE_DATA>                                 {0:1}           
               +3 DATE <PUBLICATION_DATE>                                 {0:1}           
         +1 DEST <SYSTEM_NAME>                                            {0:1}           
         +1 DATE <TRANSMISSION_DATE>                                      {0:1}
            +2 TIME <TIME_VALUE>                                          {0:1}           
         +1 SUBM @XREF:SUBM@                                              {1:1}           
         +1 FILE <FILE_NAME>                                              {0:M}           
         +1 COPR <COPYRIGHT_STATEMENT>                                    {0:1}           
            +2 CONT <TEXT>                                                {0:M}           
         +1 SCHEMA                                                        {0:1}
            +2 <<USER_TAG_SCHEMA>>                                        {1:M}
         +1 GEDC                                                          {1:1}           
            +2 VERS <VERSION_NUMBER>                                      {1:1}           
            +2 FORM <GEDCOM_FORM>                                         {0:1}
         +1 CHAR <CHARACTER_SET>                                          {0:1}           
            +2 VERS <VERSION_NUMBER>                                      {0:1}
         +1 LANG <LANGUAGE_OF_TEXT>                                       {0:1}
         +1 PLAC                                                          {0:1}
            +2 FORM <PLACE_HIERARCHY>                                     {1:1}

RECORD:=
      [   
         n  <<EVENT_RECORD>>                                              {0:1}    
      |                                                                 
         n  <<FAMILY_RECORD>>                                             {0:1}      
      |
         n  <<INDIVIDUAL_RECORD>>                                         {0:1}
      |
         n  <<NOTE_RECORD>>                                               {0:1}
      |
         n  <<REPOSITORY_RECORD>>                                         {0:1}
      |
         n  <<SOURCE_RECORD>>                                             {0:1}    
      |                                                                         
         n  <<SUBMITTER_RECORD>>                                          {1:1}
      ]

FAMILY_RECORD:=
      n  @XREF:FAM@ FAM                                                   {0:1}           
         +1 HUSB @XREF:INDI@                                              {0:1}           
         +1 WIFE @XREF:INDI@                                              {0:1}           
         +1 CHIL @XREF:INDI@                                              {0:M} 
         +1 REFN <USER_REFERENCE_NUMBER>                                  {0:M}
         +1 <FAM_EVNT_TAG>                                                {0:M}
            +2 TYPE <FAMILY_EVENT_DESCRIPTOR>                             {0:1}
            +2 DATE <DATE_VALUE>                                          {0:1}           
            +2 <<PLACE_STRUCTURE>>                                        {0:1}
         +1 <DIV_EVNT_TAG>                                                {0:M}
            +2 TYPE <DIVORCE_DESCRIPTOR>                                  {0:M}           
            +2 DATE <DATE_VALUE>                                          {0:1}           
            +2 <<PLACE_STRUCTURE>                                         {0:1}    
         +1 ASSO @XREF:ANY@                                               {0:M}
            +2 TYPE <ASSOCIATION_DESCRIPTOR>                              {0:1}    
         +1 NCHI <COUNT_OF_CHILDREN>                                      {0:1}
         +1 <<LDS_FAM_ORDINANCE_EVENT>>                                   {0:M}           
         +1 <<SOUR_STRUCTURE>>                                            {0:1}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}
         +1 <<MULTI_MEDIA_LINK>>                                          {0:M}
         +1 <<CHANGE_DATE>>                                               {0:M}

INDIVIDUAL_RECORD:=
      The occurrence of FAMS and FAMC tags show {0:1}, however; when an individual is
      referenced in a FAMily record as either a spouse or child, then this record must include a
      corresponding FAMS and/or FAMC tags.  The association of one individual to another can be
      represented by using the ASSO tag in the individual record to point to the record of the
      associated individual.  The relationship or association is shown in the value field of the
      subordinate TYPE tag.

      n  @XREF:INDI@ INDI                                                         
         +1 <<INDIVIDUAL>>                                                {1:1}    
         +1 FAMS @XREF:FAM@                                               {0:M}    
         +1 FAMC @XREF:FAM@                                               {0:M}    
            +2 <<CHILD_FAMILY_EVENT>>                                     {0:M}    
         +1 ASSO @XREF:REC@                                               {0:M}
            +2 TYPE <ASSOCIATION_DESCRIPTOR>                              {0:1}
         +1 <<LDS_INDI_ORDINANCE_EVENT>>                                  {0:M}
         +1 RFN <PERMANENT_RECORD_FILE_NUMBER>                            {0:M}    
         +1 REFN <USER_REFERENCE_NUMBER>                                  {0:M}    
         +1 AFN <ANCESTRAL_FILE_NUMBER>                                   {0:1}    
         +1 ALIA @XREF:INDI@                                              {0:M}    
         +1 ANCI @XREF:SUBM@                                              {0:M}    
         +1 DESI @XREF:SUBM@                                              {0:M}    
         +1 <<SOUR_STRUCTURE>>                                            {0:1}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}
         +1 <<MULTI_MEDIA_LINK>>                                          {0:M}
         +1 <<CHANGE_DATE>>                                               {0:M}

EVENT_RECORD:=
      This structure represents event-oriented evidence information that is claimed as a basis for a
      submitter's opinion expressed in Lineage-linked INDIVIDUAL and FAMILY records.  Event
      records define an event in terms of a what happened, where and when it happened, and what
      individuals are mentioned in the record.

      These event records in some cases will be the source for assertions made in compiling lineage-
      linked data. SOURce pointers to the bibliographic description of where this event information
      was recorded should be a part of this record.   
      Evidence records from historical sources are kept separate from opinion records created by the
      submitter.  The information contained in evidence records is not redundant with respect to the
      information contained in submitter's opinions, even when names, dates, or places are the
      same, because the authority for asserting the information is different.

      Roles of an event which pertain to the event itself are placed subordinate to the event tag. 
      Roles of individuals mentioned in the event which are relationship roles such as the
      "husband's father" is placed subordinate to the role tag of the groom.  For example, the
      minister at a wedding's role would be represented by the 0 EVENt-MARRiage-OFFIciator
      structure.  The father of the husband would be represented by the 0 EVENt-MARRiage-
      HUSBand-FATHer structure.  

      n  @XREF:EVEN@ EVEN
         +1 <<CHANGE_DATE>>                                               {0:M}
         +1 <EVENT_TAG>                                                   {1:1} 
            +2 TYPE <EVENT_DESCRIPTOR>                                    {0:1}
            +2 DATE <DATE_VALUE>                                          {0:1}    
            +2 <<PLACE_STRUCTURE>>                                        {0:1}    
            +2 PERI <TIME_PERIOD>                                         {0:M}
            +2 RELI <RELIGIOUS_AFFILIATION>                               {0:1}
            +2 <<MULTI_MEDIA_LINK>>                                       {0:M}
            +2 <<TEXT_STRUCTURE>>                                         {0:1}
            +2 <<SOUR_STRUCTURE>>                                         {0:M}
            +2 <<NOTE_STRUCTURE>>                                         {0:M}
            +2 <ROLE_TAG>                                                 {0:M}
               +3 TYPE <ROLE_DESCRIPTOR>                                  {0:1}    
               +3 <<INDIVIDUAL>>                                          {0:1}    
               +3 ASSO @XREF:INDI@                                        {0:M}
                  +4 TYPE <ASSOCIATION_DESCRIPTOR>                        {1:1}
               +3 <RELATIONSHIP_ROLE_TAG> [NULL | @XREF:INDI@ ]           {0:M}
                  +4 TYPE <ROLE_DESCRIPTOR>                               {0:1}    
                  +4 <<INDIVIDUAL>>                                       {0:1}    

NOTE_RECORD:=             /* must contain cross reference ID */
      n  <<NOTE_STRUCTURE>>                                               {1:1} 
         +1 <<CHANGE_DATE>>                                               {0:M}

REPOSITORY_RECORD:=       /* must contain cross reference ID */
      n  <<REPOSITORY_STRUCTURE>>                                         {1:1}
         +1 <<CHANGE_DATE>>                                               {0:M}

SOURCE_RECORD:=           /* must contain cross reference ID */
      n  <<SOURCE_STRUCTURE>>                                             {1:1}
         +1 <<CHANGE_DATE>>                                               {0:M}


SUBMITTER_RECORD:=
      The submitter record identifies individuals or organizations that contributed the opinion
      information contained within the GEDCOM transmission.  All records in the transmission are
      assumed to be submitted by the SUBMITTER referenced in the HEADer, unless a SUBMitter
      reference inside a specific record points at a different SUBMITTER.
                                                                              
      n  @XREF:SUBM@ SUBM                                                 {1:1}
         +1 <<NAME_STRUCTURE>>                                            {1:1}
         +1 <<ADDRESS_STRUCTURE>>                                         {0:1}
         +1 LANG <LANGUAGE_PREFERENCE>                                    {0:3}
         +1 <<CHANGE_DATE>>                                               {0:M}


SUBSTRUCTURES OF THE LINEAGE-LINKED FORM

ADDRESS_STRUCTURE:=
      n  SITE <SITE_NAME>                                                 {0:1}
      n  ADDR <ADDRESS_LINE>                                              {0:1}
         +1 CONT <ADDRESS_LINE>                                           {0:M}
         +1 PHON <PHONE_NUMBER>                                           {0:3}

BURIAL_STRUCTURE:=  
      Used only when cemetery information is managed separately from the burial place name.  It is
      permissible to include the cemetery name as the low level locality name; for example,
      Richmond Cemetery, Richmond, Cache, Utah, USA.
      n  CEME <CEMETERY_NAME>                                             {0:1}
         +1 PLOT <BURIAL_PLOT_ID>                                         {0:1}

CHANGE_DATE:=
      n  CHAN                                                             {1:1}           
         +1 DATE <CHANGE_DATE>                                            {1:1}           
            +2 TIME <TIME_VALUE>                                          {0:1}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}

CHILD_FAMILY_EVENT:=
      [
      n  ADOP                                                             {1:1}           
         +1 TYPE <CHILD_FAMILY_EVENT_DESCRIPTOR>                          {0:1}
         +1 AGE  <AGE_VALUE>                                              {0:1}
         +1 DATE <DATE_VALUE>                                             {0:1}           
         +1 <<PLACE_STRUCTURE>>                                           {0:1}           
         +1 <<SOUR_STRUCTURE>>                                            {0:1}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}
      |                                                                         
      n  <<LDS_CHILD_SEALING_EVENT>>                                      {0:1}           
      ]

CORRECTNESS_ASSESMENT:=
      n  QUAY <QUALITY_OF_DATA>                                           {0:1}
                         /* used subordinate to any tag containing a value */

EVENT_STRUCTURE:=
      Information about an individual with respect to a specific event, such as the age, marital
      status, religious affiliation of this individual at time of this event.  Keep in mind that this is
      data specific to the individual owning this event and not the data that belongs to the source in
      which this data was found.  For instance Immigration and Emigration events should use a
      reference a source structure to show the SHIP and PORT information concerning the event. 
      Roles of other individuals can be shown using the EVENt record.  A link to the event record
      can be made by using the SOURce structure to point to the EVENt record.  The event record
      in this case would be an evidence record supporting the assertions made in creating this event
      structure.

      n  <EVENT_TAG>                                                      {1:1} 
         +1 TYPE <EVENT_DESCRIPTOR>                                       {0:M}
         +1 DATE <DATE_VALUE>                                             {0:1}    
         +1 <<PLACE_STRUCTURE>>                                           {0:1}    
            +2 <<BURIAL_STRUCTURE>>                                       {0:1}
         +1 AGE <AGE_VALUE>                                               {0:1}
         +1 MSTAT <MARITAL_STATUS>                                        {0:1}
         +1 CAUS <CAUSE_OF_DEATH>                                         {0:1}
         +1 RELI <RELIGIOUS_AFFILIATION>                                  {0:1}
         +1 AGNC <GOVERNMENT_AGENCY>                                      {0:1}
         +1 <<TEXT_STRUCTURE>>                                            {0:1}
         +1 <<SOUR_STRUCTURE>>                                            {0:1}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}
         +1 <<CHANGE_DATE>>                                               {0:M}

INDIVIDUAL:=
      n  <<NAME_STRUCTURE>>                                               {1:M}
      n  TITL <INDI_TITLE>                                                {0:M}    
      n  SEX <SEX_VALUE>                                                  {0:1}    
      n  <<EVENT_STRUCTURE>>                                              {0:M}
      n  <<ADDRESS_STRUCTURE>>                                            {0:M}    
      n  RELI <RELIGIOUS_AFFILIATION>                                     {0:M}    
      n  NAMR <RELIGIOUS_NAME>                                            {0:M}    
         +1 RELI <RELIGIOUS_AFFILIATION>                                  {0:1} 
      n  EDUC <SCHOLASTIC_ACHIEVEMENT>                                    {0:M}
      n  OCCU <OCCUPATION>                                                {0:M}
      n  SSN  <SOCIAL_SECURITY_NUMBER>                                    {0:M}    
      n  IDNO <NATIONAL_ID_NUMBER>                                        {0:M}
         +1 TYPE <TYPE_OF>                                                {1:1}
      n  PROP <POSSESSIONS>                                               {0:M}
      n  DSCR <PHYSICAL_DESCRIPTION>                                      {0:M}    
         +1 CONT <PHYSICAL_DESCRIPTION>                                   {0:M}    
      n  SIGN <SIGNATURE_INFO>                                            {0:M}
      n  NMR <COUNT_OF_MARRIAGES>                                         {0:M}    
      n  NCHI <COUNT_OF_CHILDREN>                                         {0:M}    
      n  NATI <NATIONALITY>                                               {0:M}    
      n  CAST <CASTE_NAME>                                                {0:M}    

LDS_CHILD_SEALING_EVENT:=
      n  SLGC                                                             {1:1}
         +1 TYPE <LDS_CHILD_SEALING_DESCRIPTOR>                           {0:1} 
         +1 DATE <DATE_VALUE>                                             {0:1}           
         +1 TEMP <TEMPLE_VALUE>                                           {0:1}           

LDS_FAM_ORDINANCE_EVENT:=
      n  SLGS                                                             {1:1}           
         +1 TYPE <LDS_FAM_ORD_DESCRIPTOR>                                 {0:1}
         +1 DATE <DATE_VALUE>                                             {0:1}           
         +1 TEMP <TEMPLE_VALUE>                                           {0:1}           

LDS_INDI_ORDINANCE_EVENT:=
      n  <LDS_INDI_ORD>                                                   {1:1}           
         +1 TYPE <LDS_INDI_ORD_DESCRIPTOR>                                {0:1}           
         +1 DATE <DATE_VALUE>                                             {0:1}           
         +1 TEMP <TEMPLE_VALUE>                                           {0:1}           
         +1 <<SOUR_STRUCTURE>>                                            {0:1}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}

MULTI_MEDIA_LINK:=
      n  AUDIO <ESCAPE_TO_AUXILLARY_PROCESSING>                           {0:1}
      n  PHOTO <ESCAPE_TO_AUXILLARY_PROCESSING>                           {0:1}
      n  VIDEO <ESCAPE_TO_AUXILLARY_PROCESSING>                           {0:1}

NAME_STRUCTURE:=
      n  NAME <PERSONAL_NAME>                                             {1:1}
         +1 TYPE <NAME_TYPE_DESCRIPTOR>                                   {0:1}
         +1 <<SOUR_STRUCTURE>>                                            {0:1}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}

NOTE_STRUCTURE:=
      This structure contains information originated by the submitter.
      n  [ @XREF:NOTE@ | NULL ] NOTE [ <SUBMITTER_TEXT> | NULL ]          {1:1} 
         +1  CONT <SUBMITTER_TEXT>                                        {1:M}      
         +1  NOTE @XREF:NOTE@                                             {0:1}           

PLACE_STRUCTURE:=
      n  PLAC <PLACE_VALUE>                                               {1:1}
         +1 FORM <PLACE_HIERARCHY>                                        {0:1}
         +1 <<ADDRESS_STRUCTURE>>                                         {0:1}
         +1 <<SOUR_STRUCTURE>>                                            {0:1}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}

REPOSITORY_STRUCTURE:=
      n  [ @XREF:REPO@ | NULL ] REPO                                      {1:1}
            +2 NAME <NAME_OF_REPOSITORY>                                  {0:1}
            +2 CNTC <NAME_OF_CONTACT_PERSON>                              {0:1}
            +2 <<ADDRESS_STRUCTURE>>                                      {0:1}
            +2 MEDI <MEDIA_TYPE>                                          {0:1}
            +2 CALN <SOURCE_CALL_NUMBER>                                  {0:1}
               +3 ITEM <FILM_ITEM_IDENTIFICATION>                         {0:1}
               +3 SHEE <SHEET_NUMBER>                                     {0:1}
               +3 PAGE <PAGE_NUMBER>                                      {0:1}
            +2 REFN <MANUAL_FILING_IDENTIFICATION>                        {0:1}
            +2 <<NOTE_STRUCTURE>>                                         {0:1}

SOURCE_STRUCTURE
      The source structure represents the submitter's basis (justification) for the opinions asserted in
      a lineage linked transmission.  This information is used by other researchers to (1) determine
      how much confidence to place in the associated assertions, (2) compare new evidence to old
      evidence from prior research, and (3) locate and examine the evidence to make an independent
      evaluation of it.  If a source is not explicitly cited for a given context, the source is by default
      ascribed to be the personal opinion of the submitter, with no further basis for its credibility.

      The justification takes the form of a description of the source from which the evidence was
      obtained, and may include a machine-readable representation of the evidence itself, such as an
      image of a document or an extract of its contents.

      A given source may be the basis for many different assertions.  Thus, much of the information
      is the same for many different citations of that source, such as the publisher information; and
      yet, some of the information varies from one citation to the next, such as the page number for
      a specific item.  Consequently, the SOURCE_STRUCTURE includes a sophisticated
      mechanism for sharing general source description information that is common across multiple
      citations, while at the same time allowing more specific information to be more directly
      associated with individual citations.  All tags within the SOURCE_STRUCTURE participate in
      this approach.

      To implement the mechanism, the SOURCE_STRUCTURE includes a SOURce pointer that
      refers to another SOURCE_STRUCTURE containing more general information to be included
      in the citation.  This forms a chain of records, beginning within an individual or family record
      and ending in a source record that does not contain another SOURce pointer.

      A given tag may appear in more than one record along the chain.  In this case, the tag
      occurring in one link (source record) of the chain is said to shadow or supersede the same tag
      found in subsequent records of the chain.  A program looking for a particular tag (or tags) in
      the citation starts looking in the first record of the chain and continues looking in each
      subsequent record in the chain for the appropriate tag, succeeding when the tag is found or
      failing when the end of the chain is reached.  In effect, a complete logical source citation is
      the set of all tags of all records within the source chain, excluding shadowed tags.

      The chain may consist of only one SOURCE_STRUCTURE contained entirely inside an
      individual or family record, with no SOURce pointer leading out from the individual or family
      record.  More typically, the chain will begin in the individual or family record and end in an
      ordinary source description record.  Occasionally, a multiple volume source may be
      represented using a record in the middle of the chain for specific information about the
      volume.
      For example, in a multiple volume source where each volume covered a range of years, a
      volume description would contain the PERIod covered by the volume, and the more general
      description of the set of volumes would contain the PERIod covered by the entire set of
      volumes.  In assembling the complete source citation, the program would stop searching for
      the PERIod as soon as it found a PERIod tag, which in this case would be in the volume
      description.  In a multiple volume source where each volume covered a specific place as part
      of a larger grouping of places, the program would find the PLACE_STRUCTURE information
      in the intermediate volume description, and it would find the PERIod information in the final,
      more general description of the set of volumes.

      We encourage data entry systems to develop flexible entry screens which will prompt their
      users for information which will meet the minimum standards for citing sources.  At the
      minimum there should be an entry form for published sources and one for unpublished
      sources.  The elements below are marked if they were recommended by the National
      Genealogical Society as being a help in citing puplished (p) or unpublished (u) sources.

SOURCE_STRUCTURE:=  
                               /****** TYPE OF SOURCE ******/
      n  [ @XREF:SOUR@ | NULL ] SOUR [ <TEXT> | NULL ]             
         +1 [ CONT | CONC ] <TEXT>                                        {0:1}
         +1 CLAS <SOURCE_CLASSIFICATION_CODE>                             {1:1}up
         +1 EVEN  <EVENT_CLASSIFICATION_CODE>                             {0:1}
         +1 PERI <TIME_PERIOD_COVERED>                                    {0:M}up
                               /****** CITATION SPECIFIC INFO ******/
         +1 TITL [<DESCRIPTIVE_TITLE> | @XREF:SOUR@]                      {0:1}up
         +1 SOUR [ @XREF:SOUR@ | @XREF:EVEN ]                             {0:M}up
         +1 PAGE <PAGE_DESCRIPTION>                                       {0:1}up
         +1 DATE <ENTRY_RECORDED_DATE>                                    {0:1}u
         +1 CENS                                                          {0:1}
            +2 DATE <CENSUS_DATE>                                         {0:1}u
            +2 LINE <LINE_NUMBER>                                         {0:1}u
            +2 DWEL <DWELLING_NUMBER>                                     {0:1}u
            +2 FAMN <FAMILY_NUMBER>                                       {0:1}u
            +2 <<NOTE_STRUCTURE>>                                         {0:1}
                               /****** WHO CREATED IT ******/
      +1 ORIG                                                             {0:M}
            +2 NAME <ORIGINATOR_NAME>                                     {0:1}up
            +2 TYPE <ORIGINATOR_TYPE>                                     {1:1}up
            +2 <<NOTE_STRUCTURE>>                                         {0:1}

                               /****** PUBLICATION INFO ******/ 
         +1 PUBL                                                          {0:1}
            +2 TYPE <PUBLICATION_TYPE>                                    {1:1}up
            +2 NAME <NAME_OF_PUBLICATION>                                 {0:1}p
            +2 PUBR <PUBLISHER_NAME>                                      {0:1}p
            +2 <<ADDRESS_STRUCTURE>                                       {0:1}
            +2 DATE <PUBLICATION_DATE>                                    {0:1}up
            +2 EDTN <PUBLICATION_EDITION>                                 {0:1}p
            +2 SERS <SERIES_VOLUME_DESCRIPTION>                           {0:1}p
            +2 ISSU <PERIODICAL_ISSUE_NUMBER>                             {0:1}p
            +2 LCCN <LIBRARY_CONGRESS_CALL_NUMBER>                        {0:1}

                               /****** WHERE IS IT STORED ******/
         +1 <<REPOSITORY_STRUCTURE>>                                      {0:1}up

                               /****** IMMIGRATION/EMIGRATION ***/
            +2 NAME <NAME_OF_VESSEL>                                      {0:1}
            +2 PORT                                                       {0:1}
               +3 ARVL                                                    {0:1}
                  +4 DATE <ARRIVAL_DATE>                                  {0:1}
                  +4 PLAC <ARRIVAL_PLACE>                                 {0:1}
               +3 DPRT                                                    {0:1}
                  +4 DATE <DEPARTURE_DATE>                                {0:1}
                  +4 PLAC <DEPARTURE_PLACE>                               {0:1}
            +2 <<TEXT_STRUCTURE>>                                         {0:1}
            +2 <<NOTE_STRUCTURE>>                                         {0:1}
                               /****** SUPPORT DATA ******/
         +1 <<TEXT_STRUCTURE>>                                            {0:1}
         +1 <<MULTI_MEDIA_LINK>>                                          {0:M}
         +1 <<NOTE_STRUCTURE>>                                            {0:1}
         +1 STAT <SEARCH_STATUS>                                          {0:1}
            +2 DATE <SEARCH_STATUS_DATE>                                  {0:1}
         +1 REFS @XREF:SOUR@    /* REFERENCED SOURCE */                   {0:1}
         +1 FIDE <SOURCE_FIDELITY_CODE>                                   {0:1}
         +1 QUAY <QUALITY_OF_DATA>                                        {0:1}

TEXT_STRUCTURE:=
      This structure contains information from the source document.
      n  TEXT <SOURCE_TEXT>                                               {1:1} 
         +1  [ CONT | CONC ] <SOURCE_TEXT>                                {1:M}
         +1  <<NOTE_STRUCTURE>>                                           {0:1}           

USER_TAG_IN_CONTEXT:=
      A context structure which represents all of the superior level numbers and associated tags
      from level zero to the level of the new user tag.  All user tag names must start with and
      underscore (_).

      0  <OLD_TAG_1>                                                      {1:1}
         1 <OLD_TAG_2>                                                    {0:M}
            2 _<NEW_TAG>                                                  {0:M}
        
                /* always start user tag name with an underscore (_).*/        
      For example, two new user tags are to be defined as _HOSP and _NURS and placed
      subordinate to an individual's birth.  The user tag in context would be: (Example only)
      n  INDI
         +1 BIRT
            +2 _HOSP
            +2 _NURS

      The resulting USER_TAG_SCHEMA, to be included in the HEADer record, would then look
      like the following:

      (Example only)
      n  SCHEMA
         +1 INDI
            +2 BIRT
               +3 _HOSP
                  +4 LABL <FULL_TAG_NAME>
                  +4 DEFN <USER_TAG-DEFINITION>
                  +4 ISA  <IS_A_KIND_OF_TAG>
               +3 _NURSE
                  +4 LABL <FULL_TAG_NAME>
                  +4 DEFN <USER_TAG-DEFINITION>
                  +4 ISA  <IS_A_KIND_OF_TAG>
      See User Defined Tag section at the end of chapter 2 for additional information.

USER_TAG_SCHEMA:=
      n  <<USER_TAG_IN_CONTEXT>>                                          {1:M}
         +m LABL <FULL_TAG_NAME>                                          {1:1}
         +m DEFN <USER_TAG_DEFINITION>                                    {1:1}
         +m ISA  <IS_A_KIND_OF_TAG>                                       {1:1}
           /*  +m represents the first subordinate level to the new user defined tag level.  (See
               example shown under the substructure definition for USER_TAG_IN_CONTEXT). */
PRIMITIVE ELEMENTS OF THE LINEAGE-LINKED FORM

The fields sizes are to show the minimum recommended field length within a database that is
constrained to fixed length fields.  GEDCOM lines are limited to 255 characters.  However, data of
any length can be included in GEDCOM by using the CONCatenation or CONTinuation tag to
expand a field beyond the 255 limit.  These two tags are being used to extend text type messages
rather than extending, for example, a name line.  Text lines are used in ADDR, DSCR, NOTE,
SOUR, TEXT, etc.

ADDRESS_LINE:=                                            {Size=1:40}
    Address information that, when combined with NAME and CONTinuation lines, meets
    requirements for sending communications through the mail.

AGE_VALUE:=                                               {Size=1:30}
    A number that indicates the age in years, months, and/or days.  Any labels must come after their
    corresponding number, for example; 4 yr 8 mo 10 da.  The year is required, and listed first,
    even if it is 0 (zero).

ANCESTRAL_FILE_NUMBER:=                                   {Size=1:8}
    A unique permanent record number of an individual record contained in the LDS Ancestral File.

ARRIVAL_DATE:=                                            {Size=1:90}
    <DATE_VALUE> 
    A date associated with an arrival event, such as the arrival of a ship into a port.

ARRIVAL_PLACE:=                                           {Size=1:120}
    <PLACE_VALUE> 
    The place from which travel terminated, such as the locality name of a port of arrival, such as
    Ellis Island, New York, New York.

ASSOCIATION_DESCRIPTOR:=                                  {Size=1:90}
    A word or phrase that describes the association between this person and another person identified
    by a pointer. (For example, n ASSO great grandfather @XREF:SUBM@ would be read, this
    person is a great-grandfather of the person defined in the submitter record.)

AUXILLARY_FILE_REFERENCE:=                                {Size=1:30}
    A full file reference to the auxillary data to be linked to the GEDCOM context.

AUXILLARY_SET_FORMAT:=                                    {Size=1:10}
   [ OLE | GIF | TIF | WPG | etc. ]
    Indicates the format of the data that is being linked to the GEDCOM context.  This will allow
    the GEDCOM processor to determine whether they are able to process the auxillary data. The
    auxillary file should contain a header record with data required, by the indicated format, to
    process the file data.

CALENDAR_ESCAPE_SEQUENCE:=                                {Size=4:15}   
    [ @#DHEBREW@ | @#DROMAN@ | @#DFRENCH R@ | @#DGREGORIAN@ |
      @#DJULIAN@ | @#DUNKNOWN@ ]
    An escape sequence that allows dates from one of the indicated calendars to be represented.  The
    default calendar is the Gregorian calendar.
CASTE_NAME:=          {Size=1:90}
    A name assigned to a particular group that this person was associated with, such as a particular
    racial group, religious group, or a group with an inherited status.

CAUSE_OF_DEATH:=                                          {Size=1:90}
    The cause of death of this person. This should be the same cause as listed on the death
    certificate if known.  (A medical history structure may be developed for a future GEDCOM
    release.)

CEMETERY_NAME:=                                           {Size=1:90}
    The name of the cemetery where a person was buried.

CHANGE_DATE:=                                             {Size=10:11}
    <DATE_EXACT>
    The date that this data was last changed.

CHARACTER_SET:=                                           {Size=1:8}
    A code value that represents the character set to be used to interpret this data.  The default
    character set is ANSEL which includes ASCII as a subset.  UNICODE is also will be allowed. 
    See chapter 3.

CHILD_FAMILY_EVENT_DESCRIPTOR:=                           {Size=1:90} 
    A word or phrase that describes or modifies the adoption event being reported.

CONCATENATED_DATA:=                                       {Size=1:247}
    Adds new data to the end of the data in the preceding context.

CONTACT_PERSON:=                                          {Size=1:120}  
    <PERSONAL_NAME>
    The name of the person to whom communications should be addressed.
 
CONTINUED_DATA:=                                          {Size=1:247}
    A new line which logically is included in the preceding line.  This may be used in specified
    situations where the value length exceeds the maximum allowed length for the line.
                                                                             
COPYRIGHT_STATEMENT:=                                     {Size=1:90}
    A copyright statement needed to protect the rights of the owner of this data.

CORPORATE_NAME:=                                          {Size=1:90}
    The company, corporate or government agency name.

COUNT_OF_CHILDREN:=                                       {Size=1:3, Type=NUMBER}
    The number of children of this individual from all marriages or of this family, regardless of
    whether the associated children are represented in the GEDCOM file.
                                                                              
COUNT_OF_MARRIAGES:=                                      {Size=1:3, Type=NUMBER}
    The number of different families that this person was known to have been a member of as a
    spouse or parent, regardless of whether the associated families are represented in the GEDCOM
    file.

DATE_DUAL:=           {Size=1:90}
    <DATE_REGULAR/<YEAR_ALTERNATIVE> 
    A date which shows the possible date alternatives arising from a calendar change, for example,
    15 Dec 1752/3. 

DATE_EXACT:=          {Size=10:11}
    <DAY> <MONTH> <YEAR>
    A formatted date with one space between the day and the month and one space between the
    month and the year.

DATE_MODIFIER:=                                           {Size=3:15}
    [ ABT | AFT | BEF | EST | <CALENDAR_ESCAPE_SEQUENCE>]
    Qualifies the meaning of a date.
       ABT = About
       AFT = After
       BEF = Before
       EST = Estimated

DATE_PHRASE:=         {Size=1:90}
    <text>
    Any statement offered as a date when the specific year is not known, but which gives
    information about when an event occurred. 

DATE_RANGE:=          {Size=17:31}
    [ BET <DATE_REGULAR> AND <DATE_REGULAR> ]

DATE_REGULAR:=                                            {Size=4:35}
    [ <DATE_MODIFIER | blank ] [ <DATE_EXACT> | <MONTH> <YEAR> |
    <YEAR> ]
    
DATE_VALUE:=          {Size=1:90}
    [ <DATE_REGULAR> | <DATE_PHRASE> | <DATE_RANGE> |
    <DATE_WITH_BC> |
     <DATE_DUAL> | <DATE_MODIFIER> <DATE_REGULAR> ]
    Examples:
       15 JUN 1990
       2 days after easter 1790
       BET NOV 1830 AND 25 DEC 1830
       600 B.C.
       ABT 1 JAN 1440
       @#DFRENCH R@28 NIVOSE AN09

DATE_WITH_BC:=                                            {Size=1:90}
    [ <DATE_PHRASE> <YEAR> B.C. ]
    A date of an event that occurred before Christ.

DAY:=                                                     {Size=1:2, Type=NUMBER}
    dd
    Day of the month, where dd is a numeric digit whose value is within the valid range of the days
    for the associated month.
DEPARTURE_DATE:=                                          {Size=1:90}
    <DATE_VALUE> 
    A date associated with an departure event, such as the departure of a ship from a port.

DEPARTURE_PLACE:=                                         {Size=1:120}
    <PLACE_VALUE> 
    The place from which travel began, such as the locality name of a port of departure, such as
    Pier 37, San Francisco, California.

DESCRIPTIVE_TITLE:=                                       {Size=1:247}      
    A descriptive title of the information source, such as a description of:   
       *        A title of an article published in a periodical.                       
       *        A letter including the date, the sender and the receiver.
       *        A transaction between a buyer and seller including their names and date of
                transaction.       
       *        A Family Bible containing genealogical information including past and present
                owners and a physical description of the book.        
       *        A personal interview.                                 

DIVORCE_DESCRIPTOR:=                                      {Size=1:90}
    A word or phrase that commonly describes the kind of separation, such as  "divorce" or
    "separated", that took place between husband and wife.  The separation descriptor should use the
    same word or phrase and in the same language, whenever possible, that was used by the
    recorder of the event.

DIV_EVNT_TAG:=                                            {Size=3:4}
    [ ANUL | DIV | DIVF ]     (See Appendix B for additional Tags)
    A family event tag which describes the event of separation.

ENTRY_RECORDING_DATE:=                                    {Size=1:90}
    <DATE_VALUE>
    The date that the entry was entered into the source record by the recorder.

ESCAPE_TO_AUXILLARY_PROCESSING:=                          {Size=1:30}
   [ @#A<AUXILLARY_FILE_REFERENCE> <AUXILLARY_SET_FORMAT>
    An escape sequence which allows for alternate data formats to be linked to a specific context
    within the GEDCOM file.  The linked data referenced is for special processing and is tied to the
    context in which the escape was issued.  For instance, data specific to Window's Object linking
    and embedding servers would be referenced in this manner.  See Chapter 6, "Microsoft
    Windows Programmer's Reference" for the format of the standard OLE data stream.  This
    allows the transmission of images, sounds, or other auxillary processing associated with the
    enclosing context.  The format of the escape sequence has only been designed for including data
    by referencing a specific file name.  This means that there will be an unique auxillary data file
    for each link.  In the future we may adopt a method of including all of the auxillary data in a
    single auxillary transmission file.  Other auxillary process formats may also be defined in later
    GEDCOM versions.

EVENT_CLASSIFICATION_CODE:=                               {Size=1:90}
    [ <IND_EVNT_TAG> | <EVENT_DESCRIPTOR> ]
    A code that classifies the principal event that caused this source record to be created.
EVENT_DESCRIPTOR:=                                        {Size=1:90}
    A descriptor that should be used whenever the EVEN tag is used to define the event being cited. 
    For example, if the event was a purchase of a residence, the EVEN tag would be followed by
    the phrase "Purchased Residence."   When this descriptor is used with any of the defined event
    tags, it modifies the basic definition of the associated tag.  For example the BIRT tag could be
    used in connection with an EVENT_DESCRIPTOR of "Stillborn" to modify the birth event as a
    stillborn birth.  An EVENT_DESCRIPTOR of "DEAD" shows a person is dead but the death
    date is not known.  The event descriptor should use the same word or phrase and in the same
    language, when possible, that was used by the recorder of the event.  Systems that display data
    from the GEDCOM form should be able to display the descriptor value in their screen or printed
    output.

EVENT_TAG:=           {Size=3:4}
     [ <IND_EVNT_TAG> | <FAM_EVNT_TAG> | <DIV_EVNT_TAG> ]   
    An event tag chosen from the tags identifying either individual or family events, including the
    EVEN tag with an event descriptor.

FAMILY_EVENT_DESCRIPTOR:=                                 {Size=1:90}
    A word or phrase that best describes the circumstances that created this family.  The marriage
    descriptor should use the same word or phrase and in the same language, when possible, that
    was used by the recorder of the event. Possible descriptor values include "Childbirth-
    unmarried," "Common Law," "Tribal Custom," for example.  Systems that display data from
    the GEDCOM form should be able to display the descriptor value in their screen or printed
    output. (See also <DIV_EVNT_TAG>.)

FAM_EVNT_TAG:=                                            {Size=3:4}
    [ CENS | MARR | MARB | MARC | MARL | MARS | ENGA | EVEN ]
                (See Appendix B for additional Tags)
    An event tag indicating the reason for defining a family.

FILE_NAME:=                                               {Size=1:90}
    The name of the GEDCOM transmission file on the source operating system.  It includes the
    path, file name, and file extension.  The path may optionally include the drive letter.

FILM_ITEM_IDENTIFICATION:=                                {Size=1:90}
    A particular book or unit of material that may have been filmed with other books or units on the
    same microfilm.  The convention used in the Family History Department microfilms is to
    include a separator frame with a sequential item number to separate multiple books on a single
    film.

FULL_TAG_NAME:=                                           {Size=1:15}
    The long name of a user defined GEDCOM tag. For example, HOSP tag would have a long
    name of HOSPITAL. This name should be a name that could be used as a field label for reports
    and screens.  The name may include underscore characters (_).

GEDCOM_FORM:=                                             {Size=1:15}
    [ LINEAGE-LINKED | (others to be registered) ]
    The GEDCOM form used to construct this transmission.

GOVERNMENT_AGENCY:=                                       {Size=1:90}
    The name of the branch of government associated with this event or data.

IND_EVNT_TAG:=                                            {Size=3:4}
    [ ADOP | BIRT | BAPM | BARM | BASM | BLES | BURI | CENS | CHR  | CHRA |
      CONF | DEAT | EVEN | EMIG | GRAD | IMMI | MARR | NATU | ORDN | RETI |
      PROB | WILL ]

    An individual event tag.  The EVEN tag must be followed by a TYPE and an
    <EVENT_DESCRIPTOR>. The <EVENT_DESCRIPTOR> is optional for the defined event
    tags, for example: 
                1 EVEN
                2 TYPE Farley Family Reunion
                1 BIRT
                2 TYPE illegitimate

    (See Appendix A for tag definitions or see Appendix B for proposed Tags.  These proposed tags
    have not been standardized.  They may be used as a value for the TYPE tag under the EVEN
    tag or under the appropriate approved event tags.  Appropriate means that the event should be
    processed the same as the selected superior tag)

INDI_TITLE:=                                              {Size=1:90}
    A formal designation used by an individual in connection with the individuals name, for
    example, (Captain) John Smith. 

INFORMANTS_NAME:=                                         {Size=1:90}
    <PERSONAL_NAME>
    The name of a person who contributed evidence information.

INTERVIEWERS_NAME:=                                       {Size=1:90}
    <PERSONAL_NAME>
    The name of the person who conducted the interview for information.

IS_A_KIND_OF_TAG:=                                        {Size=1:25}
    [ <LANGUAGE_TABLE> ]
    The human language in which the data in the transmission is normally read or written.  It is used
    primarily by programs to select language-specific sorting sequences and phonetic name matching
    algorithms.

LANGUAGE_PREFERENCE:=                                     {Size=1:90}
    [ <LANGUAGE_TABLE> ]
    The language in which a person prefers to communicate.  Multiple language preference is shown
    by using multiple occurrences in order of priority.

LANGUAGE_TABLE:=                                          {Size=1:25}
    A table of valid language codes.  This table of valid languages may be found in the
    Encyclopedia Britannica 1989 Book of the Year.

LDS_CHILD_SEALING_DESCRIPTOR:=                            {Size=1:20}
    <LDS_ORDINANCE_DESCRIPTOR>
    A descriptor that describes the disposition of this ordinance.  The appropriate descriptor is one
    of the choices defined by <LDS_ORDINANCE_DESCRIPTOR>. 

LDS_FAM_ORD_DESCRIPTOR:=                                  {Size=1:20}
    <LDS_ORDINANCE_DESCRIPTOR>
    A descriptor that describes the disposition of this ordinance.  The appropriate descriptor is one
    of the choices defined by <LDS_ORDINANCE_DESCRIPTOR>. 

LDS_INDI_ORD:=        {Size=3:4}
    [ BAPL | CONL | WAC  | ENDL ]
    A tag that represents an individual's religious event associated with The Church of Jesus Christ
    of Latter-day Saints.   (See Appendix A for a definition of these tags.)

LDS_INDI_ORD_DESCRIPTOR:=                                 {Size=1:90}
    <LDS_ORDINANCE_DESCRIPTOR>
    A descriptor that specifies the disposition of this ordinance.  The appropriate descriptor is one of
    the choices defined by <LDS_ORDINANCE_DESCRIPTOR>. 

LDS_ORDINANCE_DESCRIPTOR:=                                {Size=1:20}
    [ BIC | CANCELED | COMPLETED | DNS | DONE | INFANT | STILLBORN |
    SUBMITTED ]
    A code indicating the status of an LDS ordinance.
    BIC       =       This person was born in the covenant, meaning that he or she automatically
                      receives the blessing of 'child to parent' sealing.
    COMPLETED=        This ordinances has been completed but the date is not known.
    DNS       =       This record is not being submitted for this temple ordinances.
    DONE      =       This ordinance has been completed but the date is not known.
    INFANT    =       This person died before eight years old.
    STILLBORN =       This person was stillborn.
    SUBMITTED =       This ordinance was previously submitted.

LIBRARY_CONGRESS_CALL_NUMBER:=                            {Size=1:20}       
    The call number assigned to this item by the U.S. Library of Congress. 

MANUAL_FILING_IDENTIFICATION:=                            {Size=1:90}
    A description of where the source is manually filed at this repository or personal collection. 
    Personal genealogical collections should be organized and filed so that items can be specifically
    identified and retrieved.  For example, "Probate file Drawer 83, File D, Number 18", or "Box
    3, Smith Folder".

MARITAL_STATUS:=                                          {Size=1:20}
    [ D | S | W | _<TEXT> ]
    The marital status at the time of the associated event.  Status values are:
       D =      Single but legally Divorced at time of event.
       M =      Married at time of event.
       S =      Single, never married at time of event.
       W =      Single because of the death of a spouse.
       _ =      If other information about marital status is to be shown add the appropriate text
                preceded by an underscore "_".
 
MEDIA_TYPE:=          {Size=1:15}
    [ AUDIO | BOOK | CARD | ELECTRONIC | FICHE | FILM | MAGAZINE |
    MANUSCRIPT | MAP | NEWSPAPER | PHOTO | TOMBSTONE | VIDEO ]
    A code, selected from one of the media classifications choices above that indicates the type of
    material in which the referenced source is stored.

MONTH:=                                                   {Size=3:3}
    [ JAN | FEB | MAR | APR | MAY | JUN |
      JUL | AUG | SEP | OCT | NOV | DEC ]
    A month name abbreviation selected from the choices above, used in forming dates.

NAME_OF_SOURCE_DATA:=                                     {Size=1:90}
    The name of the electronic data source that was used to obtain the data in this transmission.  For
    example, the data may have been obtained from a CD-ROM disc that was named "U.S. 1880
    CENSUS CD-ROM vol. 13."

NAME_OF_VESSEL:=                                          {Size=1:90}
    A name of the ship, air ship, or commercial vehicle used for travel, immigration, emigration,
    etc.

NATIONALITY:=         {Size=1:90}
    The person's national origin in common usage.  Examples: Irish, Native American, Swede, and
    so forth.

NATIONAL_ID_NUMBER:=                                      {Size=1:30}
    A nationally-controlled number assigned to an individual.  Commonly known national numbers
    should be assigned their own tag, such as SSN for U.S. Social Security Number.  The use of the
    IDNO tag requires a subordinate TYPE tag to identify what kind of number is being stored. For
    example:
       n  IDNO 43-456-1899
         +1 TYPE Canadian Health Registration 

NEW_TAG:=             {Size=3:15}
    A user defined tag that is contained in the GEDCOM current transmission.  This tag must be
    defined within the SCHEMA context in the HEADer record and its name must begin with an
    underscore (_).  The SCHEMA context defines the data associated with this new tag. (See tags
    LABL, DEFN, and ISA).

NULL:=                                                    {Size=0:0}
    convention that indicates the absence of any characters in the value including
    A  the null character (0x00) which is prohibited.

OCCUPATION:=                                              {Size=1:90}
    The kind of activity that an individual does for a job, profession, or principal activity.

OLD_TAG_1:=                                               {Size=3:15}
    This is any tag defined by the GEDCOM standard and is used in the SCHEMA context of the
    HEADer record to show the context in which a new user defined tag is being used.  This tag
    always represents a tag which was used at level 0.

OLD_TAG_2:=                                               {Size=3:15}
    This is any tag defined by the GEDCOM standard and is used in the SCHEMA context of the
    HEADer record to show the context in which a new user defined tag is being used.  Old_TAG_2
    represents any tag at any level between level 1 and the level in which the new user defined tag
    resides. For example, 
   n SCHEMA
    +1 INDI  (zero level)
       +2 BURI
         +3 PLAC
                +4 CEME
                  +5 _PLOT (new user tag)

ORD_BY_PATRON_CODE:=                                      {Size=1:1}
    [ Y | N ]
    A code that identifies whether the patron will provide proxies for the cleared ordinances
    specified by the associated tag.
       Y = Patron will provide proxies for the associated cleared ordinance.
       N = Temple is to provide proxies for the associated cleared ordinance.

ORIGINATOR_NAME:=                                         {Size=1:120}
    [ <PERSONAL_NAME> | <CORPORATE_NAME> ]
    The name of the person or organization that created this source.

ORIGINATOR_TYPE:=                                         {Size=3:15}
    [ AUTHOR | COMPILER | TRANSCRIBER | ABSTRACTOR | EDITOR |
      INFORMANT | INTERVIEWER | GOVERNMENT | BUSINESS | ORGANIZATION ]
    A classification of the type of the person or entity that created this source.

PAGE_DESCRIPTION:=                                        {Size=1:90}
    A field that identifies the page within the source.  This may be a page number range, a specific
    page number, or another way of defining how to find the specified information within the
    source.

PERIODICAL_ISSUE_NUMBER:=                                 {Size=1:90}
    The number or description of the specific periodical publication.

PERMANENT_RECORD_FILE_NUMBER:=                            {Size=1:18}    
    <REGISTERED_RESOURCE_IDENTIFIER>:<RECORD_IDENTIFIER>
    The record number that uniquely identifies this record within a registered network resource. 
    The number will be usable as a cross-reference pointer. The use of the colon (:) is reserved to
    indicate the separation of the 'registered resource identifier'(precedes the colon) and the unique
    'record identifier' within that resource (follows the colon).  In cases where the colon is used,
    implementations that check pointers should not expect to find a matching cross reference
    identifier in the transmission but would find them in the indicated database within a network. 
    Making resource files available to a public network is a future implementation.

PERSONAL_NAME:=                                           {Size=1:120}
    [
     <TEXT> |
    /<TEXT>/ |
     <TEXT> /<TEXT>/ |
    /<TEXT>/ <TEXT> |
     <TEXT> /<TEXT>/ <TEXT>
    ]
    The surname of an individual, if known, is enclosed between two slash (/) characters. The order
    of the name parts should be the order that the person would customarily have used when giving
    it to a recorder. If part of name is illegible, that part is indicated by ... (ellipses).

    Examples:
       William Lee
       /Parry/
       William Lee /Parry/
       William /Lee/ Parry
       William Lee /Pa.../

PHONE_NUMBER:=                                            {Size=1:25}
    A phone number.

PHYSICAL_DESCRIPTION:=                                    {Size=1:247}
    A comma delimited, unstructured list of the attributes that describe the physical characteristics of
    a person, place, or object.
    Example:
       1 DSCR Hair Brown, Eyes Brown, Height  5 ft 8 in

PLACE_VALUE:=         {Size=1:120}
    [
    <TEXT> |                                                                 
    <TEXT>, <PLACE_VALUE>                                                    
    ]                                                                          
    The jurisdictional name of the place where the event took place. Jurisdictions are separated by
    commas, that is, town, county, state or village, parish, country.  Receiving systems cannot
    assume that the nth locality position is necessarily a specific level of jurisdiction.  Some systems
    may include a PLAC context in the HEADer record which will specify the jurisdictional levels
    to the place names.  Missing intermediate jurisdictions is represented by adjacent placeholder
    commas.  If FORM value within the PLACe context of the HEADer record is present, then all
    levels of jurisdiction must be accounted in this way.  For example if the following was included
    in the header record:
       0 HEAD
        1 PLAC
         2 FORM city, county, state, country
    Then each place name would be expected to account for the four levels by using appropriately
    placed commas.

    A FORM tag showing a change to this default assumption shown in the HEADer record can be
    used subordinate to an individual place structure to show the variant jurisdictional levels.

    A place of origin that is not necessarily a birth place is shown by preceding the place name with
    the word "of."  Missing or illegible characters within a place name are indicated by ...
    (ellipses).

POSSESSIONS:=                                             {Size=1:247}
    A list of possessions (real estate or other property) belonging to this individual, separated by
    commas.

PRODUCT_NAME:=                                            {Size=1:90}
    The name of the software product that produced this transmission.

PUBLICATION_DATE:=                                        {Size=1:90}
    <DATE_REGULAR>
    The date this source was published or compiled.

PUBLICATION_EDITION:=                                     {Size=1:90}
    A description of the specific version of the publication which is being referenced.

PUBLICATION_NAME:=                                        {Size=1:90}
    The name of a publication such as a book, pamphlet, periodical, newspaper, or other
    monographic publication.

PUBLICATION_PLACE:=                                       {Size=1:120}
    <PLACE_VALUE>
    The name of the place (city, state) where an item was published or the location of the publisher's
    main office.

PUBLICATION_TYPE:=                                        {Size=4:12}
 [ BOOK | PERIODICAL | NEWSPAPER | UNPUBLISHED | ELECTRONIC ]

PUBLISHER_NAME:=                                          {Size=1:90}
    The name of the publisher of the referenced publication.

QUALITY_OF_DATA:=                                         {Size=1:1, Type=NUMBER}
    [ 0 | 1 | 2 | 3 ]
    The submitter's assessment of the reliability of the information for the associated fact:       
      0 = Unreliable evidence or data was estimated.                          
      1 = Direct or primary evidence with some question of reliability        
          or potential for bias for example, an autobiography).                     
      2 = Secondary evidence.                                                 
      3 = Direct and primary evidence used, or by dominance of the evidence.   

RECORD_IDENTIFIER:=                                       {Size=1:18}
    An identification number assigned to each record within a specific data base.  If this identifier is
    associated with a preceding colon (:), then it is the record number within the registered resource
    identified by the data that precedes the (:) else it is a specific reference to a record within the
    current database if no registered resource identifier precedes the (:).  If the colon is not present
    it is the identification of a record within the current GEDCOM transmission file.

REGISTERED_RESOURCE_IDENTIFIER:=                          {Size=1:18}
    This is an identifier assigned to a resource data base which is available through access to an
    available network. (Future plans.)

RELATIONSHIP_ROLE_TAG:=                                   {Size=1:90}       
   [ BROT | CHIL | FATH | HEIR | HUSB | MOTH | PARE | PHUS | PWIF | SIBL |
     SIST | WIFE ]  

RELIGIOUS_AFFILIATION:=                                   {Size=1:90}
    A name of the religion with which this person or record was affiliated.

RELIGIOUS_NAME:=                                          {Size=1:120}
    A name given to a person to be used in connection with a religion.

REPOSITORY_NAME:=                                         {Size=1:90}
    The official name of the archive in which the stated source material is stored.

ROLE_DESCRIPTOR:=                                         {Size=1:90}       
    A word or phrase that identifies the role of each person in the event being described. This
    should be the same word or phrase, and in the same language, that the recorder used to define
    the role in the actual record. This is used in connection with the ROLE_TAG.

ROLE_TAG:=                                                {Size=1:20}       
   [ BUYR | CHIL | FATH | GODP | HDOH | HDOG | HEIR | HFAT | HMOT | HUSB |
     INFT | LEGA | MEMBER| MOTH | OFFI | PARE | PHUS | PWIF | RECO | REL  |
     ROLE | SELR | TXPY | WFAT | WIFE | WITN | WMOT | INDI ]  
    A tag that indicates the role of the individuals mentioned in a source event record.  If the above
    list does not include the role being cited, use the ROLE_TAG followed by a
    ROLE_DESCRIPTOR to define the role. (See appendix A for the definition of these tags and
    Appendix B for additional ROLEs which have been proposed as GEDCOM tags).  Names of
    individuals mentioned in the event but their role was not mentioned, should be identified by
    using the INDI role tag.  Any associations between others of known roles and this individual can
    be shown by using the ASSOciation pointer.

SCHOLASTIC_ACHIEVEMENT:=                                  {Size=1:247}
    A description of a scholastic or educational achievement or pursuit.

SEARCH_STATUS:=                                           {Size=1:90}
    [ ACTIVE | FOUND | NO | ORDERED | PLANNED |  PROVED ]
    A field that shows the research status with respect to the cited source. Where:
    ACTIVE      =     This source is currently being searched.         
    FOUND       =     Part or all of the expected information has been found.
    NO          =     This source is no longer in use because the information could not be found.
    ORDERD      =     A request for this source has been sent to the Repository.   
    PLANNED=          This source is to be examined.                   
    PROVED      =     This source has been reconciled with the data in this record.
                                                                              
SEARCH_STATUS_DATE:=                                      {Size=1:90}
    <DATE_EXACT>
    The date on which the current SEARCH_STATUS was set.

SERIES_VOLUME_DESCRIPTION:=                               {Size=1:247}
    A description of a successive publication.  The description should identify the timing of the
    publication, for example, Spring, Summer, Fall, Winter.  The description should also state the
    volume number of periodicals or of multi-volume books.

SEX_VALUE:=                                               {Size=1:7}
    A code that indicates the sex of the individual:
     M = Male                                                               
     F = Female

SIGNATURE_INFO:=                                          {Size=1:90}  
    A description of the capabilities of this person to sign documents, the symbol used in signing, 
    did they know how to sign,  did they use a model to produce a signature.

SITE_NAME:=                                               {Size=1:90}
    The name of a specific site associated with an event, address, or place.

SOCIAL_SECURITY_NUMBER:=                                  {Size=9:11}
    A social security identification number assigned to this person.

SOURCE_CALL_NUMBER:=                                      {Size=1:90}
    An identification number used to file and retrieve items from the holdings of a repository.

SOURCE_CLASS_DESCRIPTOR:=                                 {Size=1:25}       
    A descriptive word or phrase that classifies the type of source being cited.  This descriptor is
    used only when none of the classifications defined under the
    <SOURCE_CLASSIFICATION_CODE> fit this source type.  Systems that display data from
    the GEDCOM form should be able to display the descriptor value in their screen or printed
    output.

SOURCE_CLASSIFICATION_CODE:=                              {Size=7:90}
    [ BOOK | CENSUS | CHURCH | COURT | HISTORY | INTERVIEW | JOURNAL |
      LAND | LETTER | MILITARY | NEWSPAPER | PERIODICAL | PERSONAL |
      RECITED | TRADITION | VITAL | OTHER!<SOURCE_CLASS_DESCRIPTOR> ]

    A code which classifies the source which contained the evidence data.  Where:
    BOOK           =  A published work including biographies and genealogies.
    CENSUS         =  A official census. 
    CHURCH         =  A church record.
    COURT          =  A record from a court, both criminal and civil.
    HISTORY        =  A published historical account. 
    INTERVIEW      =  An interview.
    JOURNAL        =  A personal record or diary.
    LAND           =  A record of land holdings or transactions, both federal and state.
    LETTER         =  A letter or other written communication.
    MILITARY       =  A military record.
    NEWSPAPER      =  A newspaper account.
    PERIODICAL     =  A work that is published at certain intervals, such as monthly, quarterly, or
                      yearly.
    PERSONAL       =  A source that was compiled from accounts given from a person's memory.
    RECITED        =  A recited genealogy, such as a tribal or clan genealogy.
    TRADITION      =  A source that was compiled from accounts communicated by word-of-mouth
                      from one generation to another.
    VITAL          =  A vital record created by a government agency of vital records such as births,
                      marriages, and divorces.
    OTHER!         =  Other sources can be identified by using (OTHER!) followed by
                      <SOURCE_CLASS_DESCRIPTOR>.
    Systems that display data from the GEDCOM form should be able to display the descriptor value
    in their output.

SOURCE_FIDELITY_CODE:=                                {Size=7:17}
    [ ORIGINAL | PHOTOCOPY | TRANSCRIPT | EXTRACT ]
    A code is a selected from the above choices that provides an assessment of the fidelity (the
    exactness) of this source material.
      ORIGINAL         =    This source is the original record being cited.
      PHOTOCOPY        =    This source is a photocopy of the original record. 
      TRANSCRIPT       =    This source is a complete transcription of the original record.
      EXTRACT          =    This source is an abridgement, subset, and/or interpretation.

SOURCE_FILM_NUMBER:=                                      {Size=1:15}
    A unique number assigned by the repository to identify the specific microfilm containing
    information about the event of interest.

SOURCE_JURISDICTION_PLACE:=                               {Size=1:120}
    <PLACE_VALUE>
    The name of the lowest jurisdiction that encompasses all lower-level places named in this source. 
     For example, "Franklin, Idaho" would be used as a source jurisdiction place for events
    occurring in the various towns within Franklin county but "Idaho" would be used as a source
    jurisdiction place if the source records referenced other counties in Idaho besides Franklin
    county.

SOURCE_TEXT:=         {Size=1:247}                        
    <TEXT>
    A verbatim copy of any description contained within the source.  This indicates notes that are
    actually contained in the source document, not the submitter's opinion about the source. 

SUBMITTER_TEXT:=                                          {Size=1:247}
    Comments or opinions from the submitter.

SYSTEM_NAME:=                                             {Size=1:20}
    The name of the sending or receiving GEDCOM-compatible product.  The system name for the
    sending system was obtained when the product was registered as a GEDCOM-compatible
    product.  All GEDCOM transmissions must be so identified.  The system name used with the
    DESTination tag should be:
      *   "ANSTFILE" when sending to the ancestral file.
      *   "TempleReady" when submitting for temple ordinances.
      *   The same DESTination system name as was used with the SOURce tag is used when the
          destination is unknown.

TEMPLE_VALUE:=                                            {Size=5:5}
    A 5-character abbreviation of the temple in which LDS temple ordinances are performed.
    (Contact the GEDCOM Coordinator for a table of valid abbreviations)

TEXT:=                                                    {Size=1:247}
    A string composed of any valid character or string of characters in the GEDCOM character set.
TIME_PERIOD:=         {Size=1:90}
    [ FROM <DATE_REGULAR> TO <DATE_REGULAR> |
      FROM <DATE_REGULAR> |
      TO <DATE_REGULAR> ]
    The range in time of an event or set of events, inclusive.  The choice FROM
    <DATE_REGULAR> indicates a range from a beginning date to an indefinite future date.
    This differs from the date range notation in that the date range is to indicate that an event took
    place on a given date within the range.  The time period date indicates that the event or events
    cover or happened over the time period specified.

    The choice TO <DATE_REGULAR> indicates from an indefinite beginning to a specified
    date.
    Examples:
       FROM 1904 to 1915
       FROM 1904    
       TO 1905

TIME_VALUE:=          {Size=1:10}
     [ hh:mm:ss.fs ]
    The time of a specific event, usually a computer-timed event, where:
    hh = hours on a 24 hour clock
    mm = minutes
    ss = seconds, (optional)
    fs = decimal fraction of a second, (optional)

TRANSMISSION_DATE:=                                       {Size=10:11}
    <DATE_EXACT>
    The date that this transmission was created.

TYPE_OF:=             {Size=1:20}
    A user-defined number or text that the submitter uses to identify this record.  For instance, it
    may be a record number within the submitter's automated or manual system, or it may be a page
    and position number on a pedigree chart.

USER_TAG_DEFINITION:=
    A formal description of the user defined tag.  This description can be used by the receiving
    system to give meaning to the user defined tags.  (See Chapter 2, User Defined Tags section.)

VERSION_NUMBER:=                                          {Size=1:15}
    An identifier that represents the version level assigned to the associated product.  It is defined
    and changed by the creators of the product.

XREF:=                                                    {Size=1:15}
    Either a pointer or a cross-reference identifier.  If this element appears before the tag in a
    GEDCOM-line, then it is a cross-reference identifier.  If it appears after the tag in a GEDCOM-
    line, then it is a pointer.  The method of delimiting a pointer or cross-reference identifier is to
    enclose the pointer or cross reference identifier within at-signs (@), for example, @I123@.  A
    XREF may not begin with a number sign (#).  This is to avoid confusion with an escape
    sequence prefix (@#). The use of a colon (:) in the XREF is reserved for creating future
    network cross-references.
XREF:ANY:=                                                {Size=1:15}       
   <XREF>                                                                     
   A universal pointer.  It may point to any other cross-reference identifier type.   

XREF:EVEN:=                                               {Size=1:15}       
   <XREF>                                                                     
   A pointer to or a cross reference identifier of a source event   
   record.                                                                    
                                                                              
XREF:FACT:=                                               {Size=1:15}       
   <XREF>                                                                     
   A pointer to or a cross reference identifier of a facts record.
                                                        
XREF:FAM:=                                                {Size=1:15}
    <XREF>
    A pointer to or a cross reference identifier of a family record.

XREF:INDI:=                                               {Size=1:15}
    <XREF>
    A pointer to or a cross reference identifier of an individual record.

XREF:NOTE:=                                               {Size=1:15}
    <XREF>
    A pointer to or a cross reference identifier of a note record.

XREF:REPO:=                                               {Size=1:15}
    <XREF>
    Either a pointer to a REPOsitory, a SUBMitter, or an INDIvidual record, or a cross reference
    identifier of a repository record.

XREF:REC!ID:=                                             {Size=1:15}
    [ <FILE:REC!ID> | <REC!ID> | <!ID> ] 
    Enclosed in at-signs (@), this is a pointer to a context within a record.  Normally the pointer
    will only be used to point to role contexts within the current event record but the principle
    should allow the reference to a context within a specific record within a specific file.  The
    following are valid ways of representing this pointer:
   
   @FILE:REC!ID@  =       A pointer to a specific context <!ID>, within a specific record
                          <REC> within a specific file <FILE:>, that logically replaces  the
                          context containing the cross reference pointer. (Future.)

    @REC!ID@      =       A pointer to a specific context <!ID> within a specific record within the
                          current GEDCOM transmission.
   not valid:
    @!ID@         =       A pointer to a specific context <!ID> within the current record of this
                          GEDCOM transmission must also contain the record level pointer, such
                          as @I13!3@.

XREF:SOUR:=                                               {Size=1:15}
    <XREF>
    Either a pointer to a SOURce, a SUBMitter, or an INDIvidual record, or a cross reference
    identifier of a source record.

XREF:SUBM:=                                               {Size=1:15}
    <XREF>
    Either a pointer to a SUBMitter, or an INDIvidual record, or a cross reference identifier of a
    submitter record.

YEAR:=                                                    {Size=3:4, Type=NUMBER}
    A numeric representation of the calendar year in which an event occurred.  

YEAR_ALTERNATIVE:=                                        {Size=1:1, Type=NUMBER}
    A year modifier which shows the possible date alternatives for pre-1752 date brought about by a
    calendar change, for example, 15 Dec 1752/3.

COMPATIBILITY WITH OTHER GEDCOM VERSIONS

Products based on GEDCOM 5.3 are generally compatible with products based on prior GEDCOM
versions.  However, there are four issues related to specific products that introduce incompatibilities
which can be accommodated by programming to handle the information in both the standard and the
non-standard way.  Compatibility with prior implementations may be maintained by doing the
following:

1.    Treat a TITL tag found at level 0 as if it were a SOUR record, including its subordinate
      structure.  Roots III points from a SOUR structure in an INDI record to a 0 TITL source
      record in this manner.  Likewise, the TITL tag must be used instead of the SOUR tag in the
      level 0 SOUR record to send source information to Roots III.

2.    The structure for LDS sealing of child to parents was changed in the standard from the FAM-
      CHIL-SLGC structure to the INDI-FAMC-SLGC structure to conform with the more natural
      access path to this information.  PAF 2.1 reads the sealing date in the FAM-CHIL-SLGC
      structure, while other products read it in the INDI-FAMC-SLGC structure.  To accommodate
      all implementations, systems handling the LDS ordinance events should look for the child
      sealing information in either place.  Systems should also write the child sealing information in
      both structures when preparing a transmission.  Other child events were also moved to the
      INDI-FAMC structure, namely ADOPtion, which should receive the same treatment.

3.    When an individual has multiple names, GEDCOM 5.x requires listing the preferred instance
      first, followed by less-preferred names.  However, PAF and other products take only the last
      instance during a transmission, causing the preferred name to be dropped when more than one
      name is present.  The same happens with all multiple-instance tags where only one instance is
      received.  When writing to GEDCOM 4.0 (or earlier) compatible systems you should only
      output the preferred name under the name tag and export the also-known-as name in a note
      field.

We anticipate a future change to allow use of indentation to make GEDCOM files easier to read. 
To make this transition easier, beginning with GEDCOM 5.3, leading white space in a GEDCOM
line should be handled by receiving systems by ignoring it.  Indentation should NOT be transmitted
in GEDCOM files until this change is established in a future version of The GEDCOM Standard.

PACKAGING THE GEDCOM TRANSMISSION FILE

The GEDCOM transmission is normally created on a DOS or Macintosh compatible diskette.  The
DOS filename extension is (.GED).  Macintosh filenames do not use file extensions.

When the GEDCOM file is too large to fit on a single diskette, the file is divided after any whole-
line (last character is the terminator), and the DOS filename extension becomes (G##) where (##) is
(00) for the second disk, (01) for the third, and so forth.  For Macintosh filenames, append the two
digits to the subsequent filenames in parentheses. (See example below.)  This allows the receiving
software to ensure that disks are read in the correct sequence.

Given that the user-supplied portion of the file name is SMITH, then the complete filenames for a
three-disk transmission would be:

Disk      DOS Filename         Macintosh Filename
  1       SMITH.GED                  SMITH
  2       SMITH.G00                  SMITH(00)
  3       SMITH.G01                  SMITH(01)

The required GEDCOM HEADer record appears only on the first disk and the required TRLR
(trailer) record appears only on the last disk and must be followed by the  terminator.

USER DEFINED TAGS
Data stored in different systems within a user defined context will not be easy to share between
other systems.  GEDCOM defines a schema that can be included within the HEADer record which
will give receiving systems the information to assist them in interpreting the user defined data. 
Utmost care should be taken when defining User tags.  The primary use would be for transmitting
data between the same software driven system, system developers are encouraged to find ways of
supporting user defined tags, but GEDCOM only provides a way to express the data, it usage is left
to the receiving software.

This schema is designed to show:
      a.  The context within which the new tag appears in the records.
      b.  The name of the new tag, which must start with an underscore (_).
      c.  The definition of the new tag.
      d.  The label or long name of the new tag, if different from the tag name.
      e.  The kind of data that this new tag represents in terms of a predefined standard GEDCOM
          tag. For Example, if HOSPital was being defined as a user tag, then we would use the
          SITE tag to show that hospital is a kind of SITE.

In the Sample Lineage-linked GEDCOM Transmission example below is the SCHEMA required for
defining a new user defined tag "_HOSP" which is intended to show the name of the name of the
hospital where a birth took place.

Included in the schema context is:
      1.  The LABL tag to define a longer tag name that can be used as a field label.
      2.  The DEFN tag which allows sharing of the definition of the new tag.
      3.  The ISA tag to show that this tag is a kind of another standardized tag.  In this case
          _HOSPital is a kind of SITE.

ESCAPE SEQUENCE FORMAT FOR THE LINEAGE-LINKED FORM

The Lineage-linked form utilizes the escape sequence feature provided in the GEDCOM grammar in
the following way:

      *  An escape sequence in the HEADer structure invokes variant processing for the entire
         transmission.

      *  An escape sequence that appears in subsequent structures affect only the line on which the
         escape sequence appears unless that line has subordinate CONTinuation or CONCatenation
         lines.  In this case the variant processing applies to the subordinate CONTinuation and
         CONCatenation substructure lines as well.

      *  The form of the escape sequence is @# escape_type_code escape_text @ where the
         escape_type_code indicates that:
                 A =     A auxillary data format or processing is being referenced.  Auxillary data
                         formats include such forms as images, sound, or other data requiring
                         auxillary processing. (See primitive element
                         ESCAPE_TO_AUXILLARY_PROCESSING above in this chapter).  

                 C =     Character set processing is being invoked.

                 D =     Date processing for special calendar is being invoked. (see primitive
                         element CALENDAR_ESCAPE_SEQUENCE above in this chapter).

         The escape_text specifies the specific processing to be done within that particular type, for
         example, @#DJULIAN@ indicates julian date processing.

SAMPLE LINEAGE-LINKED GEDCOM TRANSMISSION

The example below shows how some of these value types appear in a valid GEDCOM Lineage-
linked transmission.   The example is a sample transmission of genealogical information about three
individuals who are members of the same family--husband, wife, and child.  In the example,
"Joe/Williams/" is the value specified by the tag NAME under the INDI tag for the record (@3@). 
Other values in other lines, such as the birth date and place, provide additional information about
Joe Williams. The value (@4@) specified by the FAMC tag is a pointer to the FAMily record
(@4@) of which Joe Williams is a child. Included also in this transmission example are three other
record types: a source record, a submitter record, and a repository record.  These records are
pointed to from within other records in the transmission.  This shows how pointer values can be
used in creating the GEDCOM Lineage-linked form.

      Example:  (Indentation is for readability only.)
       0 HEAD
         1 SOUR PAF
           2 VERS 2.1
         1 DEST ANSTFILE
         1 SUBM @5@
         1 GEDC
           2 VERS 5.2
         1 SCHEMA
           2 INDI
               3 BIRT
                 4 _HOSP
                   5 LABL HOSPITAL
                   5 DEFN The name of a hospital
                   5 ISA  SITE
       0 @1@ INDI
         1 NAME Robert Eugene/Williams/
         1 SEX M
         1 BIRT              
           2 DATE 02 OCT 1822
           2 PLAC Weston, Madison, Connecticut
           2 _HOSP St. Marks
           2 SOUR @6@
         1 DEAT              
           2 DATE 14 APR 1905
           2 PLAC Stamford, Fairfield, CT
           2 QUAY 2
         1 BURI              
           2 PLAC Stamford, CT
             3 CEME Spring Hill Cemetery 
         1 OCCU Publisher
         1 FAMS @4@
       0 @2@ INDI
         1 NAME Mary Ann/Wilson/
         1 SEX F
         1 BIRT
           2 DATE BEF 1828
           2 PLAC Connecticut
         1 FAMS @4@
       0 @3@ INDI
         1 NAME Joe/Williams/
         1 SEX M
         1 BIRT
           2 DATE 11 JUN 1861
           2 PLAC Idaho Falls, Bonneville, Idaho
         1 FAMC @4@
       0 @4@ FAM
         1 HUSB @1@
         1 WIFE @2@
         1 CHIL @3@
         1 MARR
           2 DATE DEC 1859
       0 @5@ SUBM
         1 NAME Reldon /Poulson/
         1 ADDR 1900 43rd Street West
           2 CONT Billings, MT 68051
           2 PHON (406) 555-1232
       0 @6@ SOUR
         1 TYPE VITAL
         1 EVEN BIRT
         1 TITL County Birth Records 
         1 PERI FROM 1820 TO 1825
         1 PLAC ,Madison, Connecticut
         1 RECO CIVIL
         1 FIDE PHOTOCOPY
         1 REPO @7@
           2 MEDI FILM
           2 CALN 13B-1234.01
       0 @7@ REPO
         1 NAME Family History Library
         1 ADDR 35 N West Temple Street
           2 CONT Salt Lake City, UT 84150
       0 TRLR


SAMPLE EVENT_RECORD

This example shows how the Evidence_Record format might be used to store an extraction of a
christening record:

         0 @EV13@ EVEN
          1 TYPE CHR
             2 DATE 17 NOV 1830
             2 PLAC Littlehampton, West Sussex, England
                3 ADDR 9 Chiltern Close
                  4 CONT East Preston
             2 @EV13!1@ CHIL
                3 NAME Jason \Wilde\
                3 AGE 4 yrs
             2 @EV13!2@ MOTH
                3 NAME Wilma \Wilson\
                3 BIRT
                  4 DATE 15 MAY 1810
                  4 PLAC Nottingham, England
             2  @EV13!3@ FATH
                3 NAME William \Wilde\
                3 BIRT
                  4 DATE 15 OCT 1805
                  4 PLAC Nottingham, England
                3 ASSO @EV13!4@
                  4 TYPE BROTHER
             2 @EV13!4@ GODF
                3 NAME David \Wilde\          Chapter 3
                                    USING CHARACTER SETS IN GEDCOM

INTRODUCTION

GEDCOM needs to be designed to accommodate different character sets to facilitate the sharing of
genealogical data in different languages.  In order to minimize the number of differing standards to
accomplish this, we have chosen to have each system convert their usage to ANSEL and eventually
UNICODE.   In January of 1991 a Unicode Consortium was founded to promote the use of the
Unicode standard which accommodates all characters in one character set (see the section on
Unicode below).  Unicode Consortium has agreed with the ISO 10646 standard to merge and
Unicode will be a subset of the ISO 10646 international character encoding standard.  The difficulty
is in handling the two character code sequences.  Therefore, until the multi-byte handling becomes
more common, the usage of ANSEL to represent the latin-based international characters will be the
standard.

The GEDCOM specification does not address the implementation methods for multilingual
processing, such as keyboard arrangements, sorting sequences, or character and graphic
representations (font styles, proportional spacing, and so forth) on the CRT or printers, however,
Unicode standard has defined formatting characters which will indicate the direction of the text
presentation as well as other text formatting character codes.

Most of the genealogy systems developed so far utilize either ASCII or ANSEL, or both.  ANSEL
accommodates the set of Latin-based languages, as explained below.    

8-Bit ANSEL

The 8-Bit ANSEL (American National Standard for Extended Latin Alphabet Coded Character Set
for Bibliographic Use, Z39.47, 1985 copyright) is the default character set for GEDCOM. It is used
for all transmissions of information unless another character set is specified.  The use of this
character set standard makes it possible to preserve the full integrity of the language by providing a
method of using the standard ASCII character set and supplementing it with both non-spacing
character modifiers (diacritic) as well as spacing special characters.  Non-spacing means that the
diacritic is printed without advancing the device's print position.  The character being modified is
then printed in the same position, resulting in a combined image of both the character and the
diacritic(s). The storage of ANSEL requires storing the non-spacing graphic character(s) preceding
the ASCII character that the diacritic is to modify.  The ANSEL standard specifies an extended 8-bit
configuration (above 128) to represent the spacing and non-spacing graphic characters that make up
most of the Latin based languages.  ANSEL is a super-set of ASCII.  The standard ASCII
characters including the control characters are preserved.

ANSEL is known by two other names: (1) ANSI Z39.47-1985) and (2) the American Library
Association character set, used in library systems worldwide, including the MARC (MAchine-
Readable Catalog) format.

A description of the codes for the ANSEL character set has been reproduced with permission and is
included with the printed version of The GEDCOM Standard. The description of ANSEL codes is
not included in the electronic version.  This description may be purchased from the American
National Standards Institute at 1430 Broadway, New York, N.Y. 10018. 
The description of the ANSEL character set standard includes the following:
       *       An 8-Bit Code Table showing the ASCII and extended ANSEL codes
       *       An explanation or legend of these codes
       *       A chart that identifies the ANSEL Non-spacing Graphic Characters
       *       A chart that identifies the ASCII Control Characters
       *       A chart that identifies the ASCII Graphic Characters

Character-set codes 0 through 127 are the same for 8-Bit ANSEL and 8-Bit ASCII (USA version--
ANSI 8-Bit). Character-set codes 128 through 255 are unique to the ANSEL character set.

ASCII (USA version)

When there isn't a need for diacritics or other special characters, and if you are not transmitting
binary data, you will find it convenient to use ASCII (8-bit USA version) if your computer already
supports it. This is a standard of the American National Standards Institute (ANSI). Most of the
basic printable characters of ANSEL and ASCII (USA version--ANSI 8-Bit) are identical.

Binary Character Set

Binary formats for representing photographs and other bit-mapped graphics should use the escape
sequence "escape_to_supplementary_processing" for linking supplementary files to the GEDCOM
context (see chapter 2).

UNICODE (ISO 10646)

The Unicode standard is a new character code designed to encode text for storage in computer files. 
It is a subset of the upcoming ISO 10646 standard.  The design of the Unicode standard is based on
the simplicity and consistency of today's prevalent character code set, extended ASCII code set, but
goes far beyond ASCII's limited ability to encode only the Latin alphabet: the Unicode encoding
provides the capacity to encode all of the characters used for written languages throughout the
world.  In order to accommodate the many thousands of characters used in the international text, the
Unicode standard uses a 16-bit code set instead of extended ASCII's 8-bit code set.  This expansion
provides codes for more than 65,000 characters.  The Unicode standard assigns each character a
unique 16-bit value, and does not use complex modes or escape codes to specify modified characters
or special cases. The text representation of the Unicode 16-bit numbers is U+0041 which is
assigned to the letter A, 65 decimal.  The Unicode standard includes the Latin alphabet used for
English, the Cyrillic alphabet used for Russian, the Greek, Hebrew, and Arabic alphabets.  Other
alphabets used in countries across Europe, Africa, the Indian subcontinent, and Asia, such as
Japanese Kana, Korean Hangul, and Chinese Bopomofo are included.  The largest part of the
Unicode standard is devoted to thousands of unified character codes for Chinese, Japanese, and
Korean ideographs.  (See "The Unicode standard", vol. 1 and 2, published by Addison-Wesley
Publishing, for character code standards.)

The Unicode character set environment, which contains a character set for all languages, minimizes
previous GEDCOM requirements to provide escape_sequences for moving from one character set to
another.  If the Unicode environment is used to produce a GEDCOM transmission, the header
record would also be in Unicode, requiring receiving systems to determine whether the transmission
is Unicode or ASCII before they could interpret the GEDCOM header.  This would be done by
reading the first two bytes of the transmission.  If the first two bytes are 0x30 and 0x20 then the
transmission will be in either ASCII or ANSEL as determined by the header record.  If the first two
bytes are 0x30 and 0x00 then the transmission should be processed as a Unicode transmission.
(Different platforms may reverse the position of the null byte, in which case the test would be for
0x00 and 0x30.)

How to change character sets

The character set for an entire transmission is specified in the character-set line of the header
record. 

       The example below shows the specification in the header record.

       Example:
       Lvl     Tag    Value
       0       HEAD
       1       SOUR          PAF
       2       VERS          2.1
       1       DEST          ANSTFILE
       1       CHAR          ANSEL

The character-set change remains in effect until the TRLR record is encountered at the end of the
transmission.

The lineage_linked form no longer makes use of the character escape_sequence to change a
character set context inside of the transmission.  Unicode does not require shifting from character
set to character set and we should encourage its use for multi-language support.

For more information about character sets, see the following:

*      Extended Latin Alphabet Coded Character Set for Bibliographic Use. American National
       Standards (ANSI), Z39.47, 1985.

*      "8-Bit ASCII--Structure and Rules." American National Standards (ANSI) X3.134.1-198x.

*      "7-Bit and 8-Bit ASCII Supplemental Multilingual Graphic Character Set (ASCII Multilingual
       Set)" (manuscript). American National Standards (ANSI), X3.134.2-198x.

*      "The Unicode standard", vol. 1 and 2, published by Addison-Wesley Publishing.

                                             Appendix A
                                 LINEAGE-LINKED GEDCOM TAG DEFINITION

Introduction

Appendix A is a glossary of the tags approved for use with Lineage-linked GEDCOM. (See chapter
2 for an example of the tags in context that describes the Lineage-linked structure.)  Every tag must
be used within the context shown to ensure that all information transmitted by means of GEDCOM
is uniformly identified.

The tags vary in type, depending on their role or use in a transmission. They are used to identify
individuals, families, names, dates, places, events, roles, sex, sources, relationships, control codes
and other kinds of data for computers, computer programs, and computer systems.

Generally, the definition for each tag is broad enough to cover all uses of the tag. Any new tag
needed to extend the Lineage-linked form can be used for by a system that generates GEDCOM
output may be used and will not violate the Lineage-linked GEDCOM standard as long as the
context for the Lineage-linked GEDCOM grammar is not violated.  System builders using new tags
should register them and their definitions with the GEDCOM Coordinator at the address listed on
the title page of this document.  The Coordinator will evaluate the feasibility of including them as a
part of the next release of the standard.  Suggestions and proposed additions are welcome. 


Lineage-Linked GEDCOM Tag Definitions

This section provides the definition of the standardized GEDCOM tags and shows the formal name
of the tag inside of {braces}.

ADDR {ADDRESS}:=
       The contemporary place, usually required for postal purposes, of an individual, a submitter
       of information, a repository, a business, a school, or a company.

ADOP {ADOPTION}:=
       The event of a legal creation of the child-parent relationship that does not exist biologically.

AFN {AFN}:=
       A unique permanent record file number of an individual record stored in the Ancestral File.   
       
AGE  {AGE}:=
       The age of the individual at the time an event occurred, or the age listed in the document.

AGNC {AGENCY}:=
       The name of the branch of government.

ALIA {ALIAS}:=
       A pointer to which indicates that another record is suspected of being the same person. 
       When the suspicions are confirmed, drop the alias line, combine all data into one record, and
       delete the other record.  Alias should NOT be used to record alternate names for the same
       person. (See Name tag definition.)
ANCI {ANCES_INTEREST}:=
       Indicates an individual in which the submitter has interest in additional research for ancestors
       of this individual. (See also DESI)

ANUL {ANNULMENT}:=
       An event declaring a marriage void from the beginning (never existed).

ARVL {ARRIVAL}:=
       An event declaring the arrival or reaching of a destination.

ASSO {ASSOCIATES}:=
       Identifies friends, neighbors, or associates of an individual.

AUTH {AUTHOR}:=
       The name of the individual who created or compiled information.
                                                                             
BAPL {BAPTISM-LDS}:=
       The event of baptism performed at age eight or later by priesthood authority of The Church
       of Jesus Christ of Latter-day Saints. (See also BAPM.)

BAPM {BAPTISM}:=
       The event of baptism (not LDS), performed in infancy or later. (See also BAPL and CHR.

BARM {BAR_MITZVAH}:=
       The ceremonial event held when a Jewish boy reaches age 13.

BASM {BAS_MITZVAH}:=
       The ceremonial event held when a Jewish girl reaches age 12, also known as "Bat Mitzvah".

BIRT {BIRTH}:=
       The event of entering into life.

BLES {BLESSING}:=
       A religious event of bestowing divine care or intercession.

BROT {BROTHER}:=
       A male sibling.

BURI {BURIAL}:=
       The event of the proper disposing of the mortal remains of a deceased person.

BUYR {BUYER}:=
       A person who purchased or purchases from another.

CALN {CALL_NUMBER}:=
       The number used by a repository to identify the specific items in its collections.

CAST {CASTE}:=                                                                
   The name of an individual's rank or status in society, based    
   on racial or religious differences, or differences in wealth, inherited    
   rank, profession, occupation, etc.

CAUS {CAUSE}:=
       A description of the cause of the associated event or fact, such as the cause of death.  

CEME {CEMETERY}:=
       The name of the cemetery or other resting place where an individual is buried.

CENS {CENSUS}:=
       The event of the periodic count of the population for a designated locality, such as a national
       or state Census.

CHAN {CHANGE}:=
       Indicates a change, correction, or modification. Typically used in connection with a DATE to
       specify when a change in information occurred.

CHAR {CHARACTER}:=
       An indicator of the character set used in writing this automated information.

CHIL {CHILD}:=
       The natural, adopted, or sealed (LDS) child of a father and a mother.

CHR  {CHRISTENING}:=
       The religious event (not LDS) of baptizing and/or naming a child. 

CHRA {ADULT_CHRISTNG}:=
       The religious event (not LDS) of baptizing and/or naming an adult person.

CLAS {CLASSIFICATION}:=
       A classification name given to identify objects because they posses a set of similar attributes
       or characteristics.

CNTC {CONTACT_PERSON}:=
       The name of a person that is listed as the contact person at an institution such as a
       repository, college, business, etc.

CONC {CONCATENATION}:=
       An indicator that the additional value information follows and is to be connected to the value
       of the superior preceding line without a new line.

CONF {CONFIRMATION}:=
       The religious event (not LDS) of conferring the gift of the Holy Ghost and, among
       protestants, full church membership.

CONL {CONFIRMATION_L}:=
       The religious event by which a person receives membership in The Church of Jesus Christ of
       Latter-day Saints.

CONT {CONTINUED}:=
       An indicator that additional value information follows and is to be connected with the value
       of the superior preceding line as a new line.

COPR {COPYRIGHT}:=
       A statement that accompanies data to protect it from unlawful duplication and distribution.

CORP {CORPORATE}:=
       A name of an institution, agency, corporation, or company.

CPLR {COMPILER}:=
       The name of the person that compiled writings of others.

DATA {DATA}:=
       Pertaining to stored automated information.

DATE {DATE}:=
       The time of an event in calendar days.

DEAT {DEATH}:=
       The event when mortal life terminates.

DEFN {DEFINITION}:=
       A textual description of something.

DESI {DESCENDANT_INT}:=
       Indicates the submitter that has interest in research to identify additional descendants of this
       individual. (See also ANCI.)

DEST {DESTINATION}:=
       A system receiving data.

DIV  {DIVORCE}:=
       An event of dissolving a marriage through civil action.

DIVF {DIVORCE_FILED}:=
       An event of filing for a divorce by a spouse.

DPRT {DEPARTURE}:=
       An event declaring the departure or leaving for another destination.

DSCR {PHY_DESCRIPTION}:=
       The physical characteristics of a person, place, or thing.

EDTR {EDITOR}:=
       The name of a person who edited information.

EDUC {EDUCATION}:=
       Indicates the education attained.

ENDL {ENDOWMENT}:=
       A religious event where an endowment ordinance for an individual was performed by
       priesthood authority in an LDS Temple.

ENGA {ENGAGEMENT}:=
       An event of recording or announcing an agreement between two people to become married.

EMIG {EMIGRATION}:=
       An event of leaving one's homeland with the intent of residing elsewhere.

EVEN {EVENT}:=
       A noteworthy event related to an individual, a group, or an organization.

FAM {FAMILY}:=
       Identifies a legal, common law, or other customary relationship of husband and wife and
       their children, if any, or a family created by virtue of the birth of a child to its biological
       father and mother.

FAMC {FAMILY_CHILD}:=
       Identifies the family in which an individual appears as a child.

FAMS {FAMILY_SPOUSE}:=
       Identifies the family in which an individual appears as a spouse.

FATH {FATHER}:=
       Identifies the male parent in a family.  In the Lineage-linked form this tag is used only in the
       EVENT_RECORD role tag structure (See Chapter 2).  Direct parent relationships are
       represented using the HUSBand and WIFE tags as part of the FAMILY_RECORD.

FIDE {FIDELITY}:=
       A description of the state of originality of the record to permit an assessment of the potential
       for accuracy or errors due to the use of a copy of the record.

FILE {FILE}:=
       An information storage place that is ordered and arranged for preservation and reference.

FILM {FILM_NUMBER}:=
       An assigned, unique number used to identify a reel of film.

FORM {FORMAT}:=
       An assigned name given to a consistent format in which information can be conveyed.  

GEDC {GEDCOM}:=
       Information about the use of GEDCOM in a transmission.

GODP {GODPARENT}:=
       A sponsor at a religious rite (baptism).

GRAD {GRADUATION}:=
       An event of awarding educational diplomas or degrees to individuals.

HDOH {HEAD_HOUSEHOLD}:=
       Identifies a person whose role was recorded as "head of household" for an event such as a
       census.

HEAD {HEADER}:=
       Identifies information pertaining to an entire GEDCOM transmission.

HEIR {HEIR}:=
       A role of an individual who inherited or is entitled to inherit an estate.

HFAT {HUSB_FATHER}:=
       A role of an individual acting as the husband's father for a cited event.
               
HMOT {HUSB_MOTHER}:=
       A role of an individual acting as the husband's mother for a cited event.

HUSB {HUSBAND}:=
       An individual in the family role of a married man or father.

IDNO {IDENT_NUMBER}:=
       A number assigned to identify a person within some significant external system.

IMMI {IMMIGRATION}:=
       An event of entering into a new locality with the intent of residing there.

INDI {INDIVIDUAL}:=
       A person.

INDX {INDEXED}:=
       Specifies information about an index to simplify finding information in a source.

INFT {INFORMANT}:=
       An individual who reported facts concerning an event.

INTV {INTERVIEWER}:=
       The person who facilitated, recorded, and obtained information during an interview.

ISA  {IS_A_KIND_OF}:=
       Indicates the tag of an object of which this object inherits its characteristics from.

ISSUE  {ISSUE}:=
       An identifier used to differentiate one giving out from another, such as a number
       differentiating one periodical publication from another. 

ITEM {ITEM}:=
       Refers to a unit within a set of things that belong together.  The unit itself may be made up
       of other objects but collectively they are referred to as an unit (item) of the set.  A group of
       papers filmed together under one header page is referred to as an item on a film.

LABL {LABEL}:=
       A name assigned to a field or product which helps to identify it.
       
LANG {LANGUAGE}:=
       The name of the language used in a communication or transmission of information.

LCCN {LIB_CONGRS_CALL}:=
       The number assigned by the U.S. Library of Congress to a document, book, etc.

LGTE {LEGATEE}:=
       A role of an individual acting as a person receiving a bequest or legal devise.

MARB {MARRIAGE_BANN}:=
       An event of an official public notice given that two people intend to marry.

MARC {MARR_CONTRACT}:=
       An event of recording a formal agreement of marriage, including the prenuptial agreement in
       which marriage partners reach agreement about the property rights of one or both, securing
       property to their children.

MARL {MARR_LICENSE}:=
       An event of obtaining a legal license to marry.

MARR {MARRIAGE}:=
       A legal, common-law, or customary event of creating a family unit of a man and a woman as
       husband and wife.

MARS {MARR_SETTLEMENT}:=
       An event of creating an agreement between two people contemplating marriage, at which
       time they agree to release or modify property rights that would otherwise arise from the
       marriage.

MEDI {MEDIA}:=
       The medium used to store or transmit information.

MBR {MEMBER}:=
        Identifies an individual (element)  belonging to a group (set).

MOTH {MOTHER}:=
       Identifies the female parent in a family.  In the Lineage-linked form this tag is used only in
       the EVENT_RECORD role tag structure (See Chapter 2).  Parent relationships are
       represented using the HUSBand and WIFE tags as part of the FAMILY_RECORD.

NAME {NAME}:=
       A word or combination of words used to help identify an individual, title, or other item.
       More than one NAME line should be used for people who were known by multiple names.

NAMR {NAME_RELIGIOUS }:=
       A name given to an individual to be used in association with one's religion.

NAMS {NAME_SAKE}:=
       Identifies the person that an individual is named after to perpetuate the person's name.

NATI {NATIONALITY}:=
       The national heritage of an individual.

NATU {NATURALIZATION}:=
       The event of obtaining citizenship.

NCHI {CHILDREN_COUNT}:=
       The number of children that this person is known to be the parent of (all marriages), or that
       belong to this family.

NMR {MARRIAGE_COUNT}:=
       The number of times this person has participated in a family as a spouse or parent.

NOTE {NOTE}:=
       Additional information provided by the submitter for understanding the enclosing data.

OCCU {OCCUPATION}:=
       The type of work or profession of an individual.

OFFI {OFFICIATOR}:=
       A person acting in an authorized capacity as voice in performing an ordinance or ceremony.

ORDN {ORDINATION}:=
       A religious event of receiving authority to act in religious matters.

ORIG {ORIGINATION}:=
       Pertains to the creation or root of an object.

OWNR {OWNER}:=
       The name of the person who is the owner of the associated item or property.

PAGE {PAGE}:=
       A number or description to identify the page in a document.

PERI {PERIOD}:=
       Indicates the range of time during which an event took place.

PHON {PHONE}:=
       A unique number assigned to dial a specific telephone.

PHOTO {PHOTO}:=
       A photograph (graphic image) of a person, place, or thing, depending on the enclosing
       context.

PHUS {PREV_HUSB}:=
       An individual in the role of the principal's previous husband for a cited event.

PLAC {PLACE}:=
       A jurisdictional name to identify the place or location of an event.
PORT {PORT}:=
       A site identifier of entering or leaving, such as an air port, harbor, port of entry, or a data
       port where data enters or leaves a system.

PROB {PROBATE}:=
       An event of judicial determination of the validity of a will.  May indicate several related
       court activities over several dates.

PROP {PROPERTY}:=
       The name of land and/or other properties possessed by this individual. 

PUBL {PUBLICATION}:=
       A published work.

PUBR {PUBLISHER}:=
       The name of the company or individual who published a work.

PWIF {PREV_WIFE}:=
       An individual in the role of the principal's previous wife for a cited event.

QUAY {QUALITY_OF_DATA}:=
       An assessment of the reliability of the evidence to support the conclusion drawn from the
       evidence.

RECO {RECORDER}:=
       A person responsible for recording information about an event, place, or person.

REFN {REFERENCE}:=
       A description or number used to identify an item for filing, storage, or other reference
       purposes.

REFS {REFERENCED_SOUR}:=
       A source that was referenced by the cited source but was not examined by the submitter. 
       Examined sources are listed using a SOUR tag.

RELI {RELIGION}:=
       A religious denomination to which a person is affiliated or for which a record applies.

REPO {REPOSITORY}:=
       An institution that has the specified item as part of its collection(s).

RETI {RETIREMENT}:=
       An event of exiting an occupational relationship with an employer after a qualifying time
       period.

RFN  {REC_FILE_NUMBER}:=
       A permanent number assigned to a record that uniquely identifies it within a known file.

ROLE {ROLE}:=
       A name given to a role played by an individual in connection with an event.
SCHEMA {SCHEMA}:=
       A context pattern definition that specifies the meaning and the valid context(s) of a user
       defined tag.  See the SCHEMA_STRUCTURE substructure definition.

SELR {SELLER}:=
       A person who sold or sells to another.

SEQU {SEQUENCE}:=
       Indicates the sequence or order of an event or information.

SERS {SERIES}:=
       Designates the volume within a series in which a given work is a part.

SEX {SEX}:=
       Indicates the sex of an individual--male or female.  No SEX line is present if the sex is
       unknown.

SIBL {SIBLING}:=
       A male or female child of a common parent.

SIGN {SIGNATURE}:=
       Used to identify information about an individual's signature.

SIST {SISTER}:=
       A female sibling.

SITE {SITE}:=
       The name of the specific location, building, etc. that is in connection with the address or
       place value, such as, "Shriners Hospital" or "The Church of the Epiphany".

SLGC {SEALING_CHILD}:=
       A religious event pertaining to the sealing of a child to his or her parents in an LDS temple
       ceremony.

SLGS {SEALING_SPOUSE}:=
       A religious event pertaining to the sealing of a husband and wife in an LDS temple
       ceremony.

SOUND {SOUND}:=
       A collection of sound bits pertaining to the enclosed context.

SOUR {SOURCE}:=
       The initial or original material from which information was obtained.

SPOU {SPOUSE}:=
       A husband or wife of a person.

SSN  {SOC_SEC_NUMBER}:=
       A number assigned by the United States Social Security Administration.  Used for tax
       identification purposes.
STAT {STATUS}:=
       An assessment of the state or condition of something.

SUBM {SUBMITTER}:=
       An individual or organization who contributes genealogical data to a file or transfers it to
       someone else.

TEMP {TEMPLE}:=
       The name or code that represents the name of a temple of The Church of Jesus Christ of
       Latter-day Saints.

TEXT {TEXT}:=
       The exact wording found in an original source document.

TIME {TIME}:=   
       A time value in a 24-hour clock format, including hours, minutes, and optional seconds,
       separated by a colon ":".  Fractions of seconds are shown in decimal notation.

TITL {TITLE}:=
       A descriptive description of a specific writing, such as the title of a book when used in a
       source context, or a formal designation used by an individual in connection with individual's
       name, such as Captain.

TRLR {TRAILER}:=
       At level 0, specifies the end of a GEDCOM transmission.

TXPY {TAXPAYER}:=
       A role of a person who has been assessed a tax.

TYPE {TYPE}:=
       A further qualification to the meaning of the associated superior tag.  The value does not
       have any computer processing reliability.  It is more in the form of a short one or two word
       note that should be displayed any time the associated data is displayed.

VERS {VERSION}:=
       Indicates which version of a product, item, or publication is being used or referenced.

WFAT {WIFE_FATHER}:=
       A role of an individual acting as the wife's father for a cited event.

WIFE {WIFE}:=
      An individual in the family role of a married woman or mother.

WILL {WILL}:=
       A legal document treated as an event, by which a person disposes of his or her estate, to take
       effect after death.  The event date is the date the will was signed while the person was alive.
       (See also PROBate.)

WITN {WITNESS}:=
       An individual who attested that he or she saw an event take place.
WMOT {WIFE_MOTHER}:=
       A role of an individual acting as the wife's mother for a cited event.

XLTR {TRANSLATOR}:=
       The name of a person who translated words from one language to another.


                                          THE GEDCOM STANDARD


                                              Appendix B


                                PROPOSED EVENT AND ROLE TAG DEFINITIONS


The additional event and roll tags below have not yet been standardized.  They are shown here in
this draft form to obtain opinions as well as definitions.  We will standardize as many as makes
sense by the time the draft is finalized.  The underscore '_' in front of the tags indicate the tags
which have not been standardized and should be structured as user defined tags complete with your
own definition and classification using the ISA tag.  The other tags, the ones with the asterisk '*'
have been standardized and defined in the 5.x Appendix A.  Tags not appearing in Appendix A are
not used in any of the lineage-linked structures of 5.x and were therefore dropped from the standard
approved list.

Events:

TAG:                TAG NAME                     DEFINITION                         

_ABJUR              Abjuration
_ABSOL              Absolution
ADOP                Adoption*
_APPRN              Apprenticeship

BAPM                Baptism*
BIRT                Birth*

CENS                Census*
_CHARTR             Charter
CHR                 Christening*
_CITZN              Citizenship
_CIVIL              Court Civil
_CNFSCTN            Confiscation
_COMUN              Communion
CONF                Confirmation*           
_CRIME              Court Criminal
_CRTULRY            Cartulary

DEAT                Death*
_DEAT_NOTE          Death_Notice
DIV                 Divorce*
_DIV_ANUL           Divorce_Annulment
_DIV_SEP            Divorce_Separation
_DOWRY              Dowry
_DPORTN             Deportation

EDUC                Education*
EMIG                Emigration*
_EMPLYMT            Employment
_ENRLMNT            Enrollment - matriculation
_EXCUTN             Execution

_F_COMM             First_Communion
_FUNRL_HOME         Funeral Home
Events: (cont')

TAG:                TAG NAME                     DEFINITION                         

_GALLEY             Galley
GRAD                Graduation*

IMMI                Immigration*            
_INTRO              Introduction

_LAND               Land
_LND_LEAS           Land_Lease
_LND_PURC           Land_Purchase

_MARR_BTRO          Marriage_Betrothal
_MARR_CMLAW         Marriage Common Law
_MARR_CNSNU         Marriage_Consanguinity - marriage to blood relatives
_MARR_CNTRC         Marriage_Contract
_MARR_DIMIS         Marriage_Dimissorial    - permission to get married in another jurisdiction
_MARR_DISPN         Marriage_Dispensations
_MARR_ENGA          Marriage_Engagements
_MARR_INTNT         Marriage_Intention
_MARR_REHAB         Marriage_Rehabilitation
_MARR_BANN          Marriage_Banns - Announcements        
MARR                Marriage*

MILI                Military*
_MILI-INDU          Military_Induction
_MILI_DIS           Military_Discharge
_MISS_PRSN          Missing Person

_NAME_CHNG          Name Change     
NATU                Naturalization*

ORDN                Ordination*

_PASL               Passenger_List
_PASP               Passport
_POLI_RPT           Police_Reports
_POPL_REG           Population_Register
_POOR_LAW           Poor_law
PROB                Probate*

_ROSTR              Roster

_S_COMM             Solemn_Communion
_SASINE             Sasine
_SEPRTN             Separation
_SLAVE              Slavery
Events: (cont')

TAG:                TAG NAME                     DEFINITION                         

TXPY                Tax_payer*
_TSTMNT             Testament

_VOTE_REG           Voting_Registration
_VOW                Vow

WILL                Will*

Roles:
The following are roles which could be used to describe participants in events.  The status of these
tags are the same as those listed for the event tags listed above.

TAG:         TAG NAME                     DEFINITION                         

_ANCE               Ancestor
_APLCNT             Applicant       
_APPRN              Apprentice      
_APRSR              Appraiser       
_AUNT               Aunt     

_BISHP              Bishop   
_BOARDR             Boarder  
_BOROWR             Borrower        
_BRID               Bride    
_BRO                Brother  
BUYR                Buyer*

_CAPT               Captain
CHIL                Child*
_CLRGY              Clergymen       
_CMDR               Commander       
_COUSN              Cousins  
_CREW               Crew

_DEAD               Deceased        
_DESC               Descendant      

_EMPLYR             Employer
_EXCUTR             Executor
FATH                Father*
_FIANCE             Fiance
_FREND              Friend
TAG:         TAG NAME                     DEFINITION                         

_GODF               Godfather       
_GODM               Godmother       
GODP                Godparent*      
_GR_AUNT            Grand_Aunt      
_GR_FATH            Grand_Father    
_GR_MOTH            Grand Mother
_GR_UNCL            Grand_Uncle     
_GROO               Groom
_GUARDN             Guardian        

HDOH                Head_of_house*
_HEIR               Heir
HUSB                Husband*

INFT                Informant*      
_INSTR              Instructor      

_JRNYMN             Journeyman      
_JUDGE              Judge
_LENDR              Lender
_M_WIFE             Midwife
_MNSTR              Minister 
_MONK                        Monk   
MOTH                         Mother*
_MSTR                        Master

_NIECE              Niece
_NEPH                        Nephew         
_NLAW                        In_law         
_NLAW_BRO           Brother_in_law  
_NLAW_DAU           Daughter_in_law         
_NLAW_FATH          Father_in_law
_NLAW_MOTH          Mother_in_law   
_NLAW_SIS           Sister_in_law   
_NLAW_SON           Son_in_law
_NOTRY              Notary   
_NUN                Nun
_NURS               Nurse    

OFFI                Official*
_ORPHN              Orphan   

_PHYSN              Physician       
_PROF               Professor
_PRISNR             Prisoner 
_PATIENT            Patient  
_PASNGR             Passenger       
TAG:         TAG NAME                     DEFINITION                         

RECO                Recorder*
REL                 Relative*
_RNTR               Renter
_RSDNT              Resident        

_SASSIER            Sassier  
_SBLNG              Sibling
SELR                Seller*  
_SIS                Sister
_SLAV               Slave
_SOLDR              Soldier  
SPOU                Spouse*  
_SERVNT             Servant  
_STEWRT             Stewart
_STUD               Student  

_TEACHR             Teacher  
_TENANT             Tenant   

_UNCL               Uncle    

_WARD               Ward     
WIFE                Wife*
WITN                Witness*        


                                          THE GEDCOM STANDARD


                                              Appendix C


                                          ANSEL CHARACTER SET


                                     Reproduced by permission from
                                 American National Standards Institute
                                  1430 Broadway, New York, N.Y. 10018
The following tables show the spacing and non-spacing diacritic characters that are contained in the
ANSEL set.  This table was added to give help to those receiving the machine version to the
GEDCOM standard.  The graphic characters shown are not always accurate, however the name of
the diacritic and the decimal equivalent should agree with the ANSEL standard. 

          C/R column refers to the column and row of the American National Standard  Z39.47-
          1985 showing the ANSEL character graphic and its 8 bit binary representation.

          wpcode column shows the Wordperfect (code page , character number) (1,2) chosen as the
          closest representation of the diacritic as shown in Wordperfect Appendix P. of version
          (5.1) 

          Dec column shows to the decimal equivalent for that diacritic as is used in the ANSEL
          character set.

          Name column gives the english name of the diacritic.

          example of use column shows an example of words using this diacritic.  For the non-
          spacing diacritic, this mark appears before the character in which it should be
          superimposed.

ANSEL Non-spacing graphic characters
8-bit
C/R   wpcode    Dec  Graphic    Name              example of use

14/0      2,4      224    þ    low rising tone mark                  cþui

14/1      1,0      225    þ    grave accent                          rþegle

14/2      1,6      226    þ    acute accent                          estþa

14/3      1,3      227    þ    circumflex accent                     mþeme

14/4      1,2      228    þ    tilde                                 niþno

14/5      1,8      229    þ    macron                                gþajþejs

14/6      1,22     230    þ    breve                                 altþa

14/7      1,15     231    þ    dot above                             þzaba

14/8      1,7      232    þ    umlaut (diaeresis)                    þoppna

14/9      1,19     233    þ    hacek                                 vþzdy

14/10     1,14     234    ø    circle above (angstrm)                høarANSEL Non-spacing graphic characters
8-bit
C/R   wpcode    Dec  Graphic    Name              example of use

14/11     2,11     235    þ    ligature, left half                   akademiiþþa

14/12     2,12     236    þ    ligature, right hlf                   akademiiþþa

14/13     1,10     237    þ    high comma, off center                rozdeþlovac

14/14     1,16     238    þ    double acute accent                   idþoszaki

14/15     2,25     239    þ    candrabindu                           Aliþiev

15/0      2,15     240    þ    cedilla                               þca

15/1      2,17     241    þ    right hook                            vietþa

15/2      2,0      242    þ    dot below                             teþda

15/3      2,1      243    þ    double dot below                      khuþtbah

15/4      2,3      244    þ    circle below                          Maharþsicaritamþrtam

15/5      2,6      245    þ    double underscore                     þGhulam

15/6      2,7      246    þ    underscore                            samar

15/7      2,16     247    þ    left hook                             darziþna

15/8      2,14     248    þ    right cedilla                         khþong

15/9      2,9      249    þ    half circle below                     þhumantuþs

15/10              250         double tilde, left half               þngalan

15/11              251         double tilde, right hlf               þngalan

15/12     1,5      252    þ    diacritic slash through char (LDS extension)
15/13
15/14     1,9      254    þ    high comma, centered                  gþeotermika
ANSEL Spacing graphic characters
8-bit
C/R   wpcode    Dec  Graphic    Name              example of use
10/0      
10/1      1,152    161    þ    slash L - uppercase                   þ¢dþ

10/2      1,80     162    þ    slash O - uppercase                   þst

10/3      1,78     163    þ    slash D - uppercase                   þuro

10/4      1,88     164    þ    thorn - uppercase                     þann

10/5      1,36     165    ’    ligature AE - uppercase               ’gir

10/6      1,166    166    þ    ligature OE - uppercase               þuvre

10/7      1,6      167    þ    miagkii znak                          fakulþtet

10/8      1,1      168    þ    middle dot                            novelþla

10/9      5,28     169    þ    musical flat                          Bþ

10/10     4,22     170    þ    patent mark                           ABCþ

10/11     6,1      171    ñ    plus or minus                         AñB

10/12     1,230    172    þ    hook O - uppercase                    Bþ

10/13     1,232    173    þ    hook U - uppercase                    XþA

10/14     1,11     174    þ    alif                                  Unþyusho
10/15              175                                               reserved - future

11/0      2,11     176    þ    ayn                                   faþil

11/1      1,153    177    þ    slash l - lowercase                   rozbiþ

11/2      1,81     178    þ    slash o - lowercase                   hþj

11/3      1,79     179    þ    slash d - lowercase                   þavola

11/4      1,89     180    þ    thorn - lowercase                     þannANSEL Spacing graphic characters (cont.)
C/R   wpcode    Dec  Graphic    Name              example of use

11/5      1,37     181    ‘    ligature ae - lowercase               sk‘g

11/6      1,167    182    þ    ligature oe - lowercase               þuvre

11/7      1,16     183    þ    tverdyi znak                          obþiavlenie

11/8      1,24     184    þ    dotless i - lowercase                 masalþ

11/9      4,11     185    œ    British pound                         œ5.00

11/10              186         eth
11/11              187         reserved - future

11/12     1,231    188    þ    hook o - lowercase                    Sþ

11/13     1,233    189    þ    hook u - lowercase                    Tþ Dþc

11/14              190         empty box (LDS-extension)

11/15              191         black box (LDS-extension)

12/0      6,33     192    þ    degree sign                           10þ C

12/1      6,49     193    þ    script l                              25þ.

12/2      4,71     194    þ    phonograph cpyright mrk               Deccaþ

12/3      4,23     195    þ    copyright mark                        þ1993

12/4      5,27     196    þ    musical sharp                         Dþ

12/5      4,8      197    ¨    inverted question mark                ¨Que

12/6      4,7      198    ­    inverted exclamtn mrk                 ­Esta

12/13              205         e in middle of line (LDS extension)

12/14              206         o in middle of line (LDS extension)

12/15     1,23     207    á    Es Zet                                Preuáen