omission ellipsis

Encoding of ellipsis and other representations of omitted text in the source

To represent ellipsis in the source text, use an ellipsis character rather than a series of periods. For one thing, this prevents line breaks within the sequence, along with other misinterpretations (such as treating the first period as a sentence-ending period) that could interfere with retrieval and processing. For another, it makes it possible to search for ellipsis and for what it signifies (omitted passages, suspension of dialogue, etc.).

We recommend representing all ordinary ellipsis using the entity reference &hellip; or the Unicode character U+2026 (numeric character reference &#x2026; or &#8230;). This character should not be used, however, where a series of dots serves some other purpose: for instance, as the leader in a table of contents. We recommend encoding leaders of this sort as renditional information rather than as character content.
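
The recommendation above might look as follows in a TEI-style XML transcription. This is only a sketch: the sample text, the element choices, and in particular the rend value "leader(dots)" are illustrative assumptions, not values prescribed here. The numeric character reference is used because it needs no entity declaration; a named entity reference works only if the entity is declared for the document.

```xml
<!-- Hypothetical fragment; the text, elements, and rend value are illustrative assumptions. -->
<div>
  <!-- Ellipsis present in the source: encoded as U+2026 via a numeric character reference. -->
  <p>I meant to tell you&#x2026; but it no longer matters.</p>

  <!-- Table-of-contents leader: the dots are not transcribed as characters.
       The leader is recorded as renditional information on the item instead. -->
  <list type="toc">
    <item rend="leader(dots)">Chapter I <ref target="#ch1">1</ref></item>
  </list>
</div>
```

Because the leader dots never enter the character content, a search for U+2026 finds only genuine ellipsis.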

It is also important to distinguish between ellipsis in the original (which represents an omission in the original’s representation of some other source text) and omissions you make from the original you are transcribing. The latter should be represented with the gap element. The gap element should not be used for ellipsis in the source.
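
The distinction can be sketched in TEI-style XML as follows. The sample text and the attribute values on the gap element (reason and extent) are hypothetical and shown for illustration only; a project's own conventions for describing omissions may differ.

```xml
<!-- Hypothetical fragment; the text and attribute values are illustrative assumptions. -->
<div>
  <!-- Ellipsis printed in the source itself: transcribed as the ellipsis character U+2026. -->
  <p>She wrote that the journey was long&#x2026; and the roads poor.</p>

  <!-- Material the transcriber has left out of the transcription: marked with gap,
       with no ellipsis character standing in for it. -->
  <p>The second letter begins here. <gap reason="omitted" extent="2 paragraphs"/> It closes with a postscript.</p>
</div>
```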