TEI P3 calls these characters lexical punctuation (p. 897).
In any case, it is important to distinguish these textual diacritics, whatever they may or may not be in a given interpretation. The following codes are likely candidates for diacritic characters:
      Tag          Function
      -            compounding hyphen
      {\-}         end-of-line hyphen
      {--}         dash
      {-}          censorship hyphen
      '            possessive apostrophe
      {'}          eliding apostrophe
      {`}          single closing quotation mark
If non-ASCII characters and abbreviated forms are to be recognized on the same basis as other letter-numbers, then both braces and the vertical bar, which delimit them, should be declared as diacritics.  Should this be done, then every character employed in these character codes--as well as every character described in these guidelines as a letter-number--should be explicitly declared as a letter-number.  Such code characters, if not found in the alphabet specified for a text-analysis program, may otherwise be taken as word-separator characters like the space.  As a result, word-fragments will be sorted as separate word-integers.