Expand description
The dnm
can be used for easier switching between the DOM
(Document Object Model) representation and the plain text representation,
which is needed for most NLP tools.
Modules
Node auxiliaries for DNMs
Structs
The
DNM
is essentially a wrapper around the plain text representation
of the document, which facilitates mapping plaintext pieces to the DOM.
This breaks, if the DOM is changed after the DNM generation!Parameters for the DNM generation
Very often we’ll talk about substrings of the plaintext - words, sentences,
etc. A
DNMRange
stores start and end point of such a substring and has
a reference to the DNM
.Some temporary data for the parser
Enums
Specifies how to deal with a certain tag