htmlNodeProcessing Class |
[This is preliminary documentation and is subject to change.]
Namespace: imbNLP.Core.contentStructureHtml.tokenizator
public static class htmlNodeProcessing
The htmlNodeProcessing type exposes the following members.
| Name | Description | |
|---|---|---|
| checkTextHtmlConsistensy | ||
| createSentencesFromNode |
Pravi rečenice na osnovu HtmlNode-a i vraća kolekciju -- koristi se za glavne rečenice kao i za pod rečenice
| |
| extractContent(IListIContentParagraph) | ||
| extractContent(IListIContentSentence) | ||
| extractContent(IListIContentToken) |
Extracts the content.
|
| Name | Description | |
|---|---|---|
| _select_sentenceSpliter |
Regex select sentenceSpliter : (?=[\.;!\?])\s*(?=[A-ZČŠĆŽĐ\d])
| |
| _select_sentenceTerminator |
Regex select sentenceTerminator : ([\.;!\?]+)$
|