pub struct Corpus {
pub path: String,
pub xml_parser: Parser,
pub html_parser: Parser,
pub tokenizer: Tokenizer,
pub dnm_parameters: DNMParameters,
pub extension: Option<String>,
}
Expand description
A parallel iterable Corpus of HTML5 documents
Fields
path: String
root directory
xml_parser: Parser
document XHTML5 parser
html_parser: Parser
document HTML5 parser
tokenizer: Tokenizer
DNM
-aware sentence and word tokenizer
dnm_parameters: DNMParameters
Default setting for DNM
generation
extension: Option<String>
Extension of corpus files (for specially tailored resources such as DLMF’s .html5) defaults to selecting .html AND .xhtml files
Implementations
Trait Implementations
Auto Trait Implementations
impl RefUnwindSafe for Corpus
impl Send for Corpus
impl Sync for Corpus
impl Unpin for Corpus
impl UnwindSafe for Corpus
Blanket Implementations
sourceimpl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
const: unstable · sourcefn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more