templateDetectionAlgorithms Enumeration |
[This is preliminary documentation and is subject to change.]
[Missing <summary> documentation for "T:imbWEM.Mining.pageTemplate.templateDetectionAlgorithms"]
Namespace: imbWEM.Mining.pageTemplate
public enum templateDetectionAlgorithms
| Member name | Value | Description | |
|---|---|---|---|
| imbBasic | 0 | Za zadate stranice ide redom i pronalazi zajedničku strukturu na kraju uzme sadržaj iz prve stranice u nizu - | |
| RTDM_TD | 1 | Restricted top-down mapping for Template Detection ("A fast and robust method for web page template detection and removal", ACM conference 2006) Prvo se slučajnim uzorkom izaberu dve stranice, onda krene detekcija | |
| RBM_TD | 2 | RBM_TD: Restricted Bottom-Up Mapping for Template Detection ("On Finding Templates on Web Collections", WWW 2009) Oslanja se na "xPath tree" mapiranje | |
| MTD | 3 | Multiple Template Detection ("On Finding Templates on Web Collections", WWW 2009) Vrši klasterizaciju stranica prema njihovom template-u i simultano izdvaja templates | |
| imbGeneric | 4 | Koristi .NET alatke za prepoznavanje razlike u XML fajlu |