TEI P3 calls these characters lexical punctuation (p. 897).
In any case, it is important to distinguish these textual diacritics, whatever they may or may not be in a given interpretation. The following codes are likely candidates for diacritic characters:
Tag Function - compounding hyphen {\-} end-of-line hyphen {--} dash {-} censorship hyphen ' possessive apostrophe {'} eliding apostrophe {`} single closing quotation markIf non-ASCII characters and abbreviated forms are to be recognized on the same basis as other letter-numbers, then both braces and the vertical bar, which delimit them, should be declared as diacritics. Should this be done, then every character employed in these character codes--as well as every character described in these guidelines as a letter-number--should be explicitly declared as a letter-number. Such code characters, if not found in the alphabet specified for a text-analysis program, may otherwise be taken as word-separator characters like the space. As a result, word-fragments will be sorted as separate word-integers.