imbMCRepository Class

[This is preliminary documentation and is subject to change.]

Repository holding the Mining Context corpus for a set of web sites
Inheritance Hierarchy
System.Object
  fileDataStructure
    imbMiningContext.MCRepository.imbMCRepository

Namespace:  imbMiningContext.MCRepository
Assembly:  imbMiningContext (in imbMiningContext.dll) Version: 0.1.0.0 (0.1.0.0)
Syntax
C#
[fileStructureAttribute("name", fileStructureMode.subdirectory, fileDataFilenameMode.propertyValue, 
	fileDataPropertyOptions.textDescription)]
public class imbMCRepository : fileDataStructure, 
	IFileDataStructure, IObjectWithNameAndDescription, IObjectWithDescription, IObjectWithName

The imbMCRepository type exposes the following members.

Constructors
  Name  Description
Public method imbMCRepository()
Initializes a new instance of the imbMCRepository class.
Public method imbMCRepository(String, String, folderNode)
Initializes a new instance of the imbMCRepository class.
Properties
  Name  Description
Public property description
Information on the subset used, content, date of creation and similar details
Public property doExcludeDuplicatePages
If true, a crawled page found to have duplicate content is skipped during MCWebSite repository construction
Public property doExcludeIrrelevantPages
If true, a crawled page evaluated as irrelevant is skipped during MCWebSite repository construction
Public property doThrowOnDomainMismatch
Throws if the page URL is not in the same domain as the web site
Public property filesNotFound
Messages about pages that were not found
Public property folder
Parent folder or its own folder
(Inherited from fileDataStructure.)
Public property loger
Persistent log builder
Public property name
Mining Context repository name
Public property siteTable
Index of site page repositories
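
The three do* flags above describe how crawled pages are filtered while a site repository is built. The following is a minimal sketch of that kind of filtering, not the library's implementation: PageInfo, FilterSettings and PageFilter are hypothetical stand-ins, and skipping a mismatched page when doThrowOnDomainMismatch is false is an assumption, since this page does not document that case.

```csharp
using System;

// Hypothetical stand-ins; none of these types come from imbMiningContext.
public class PageInfo
{
    public string Url;
    public bool IsDuplicate;   // crawler found duplicate content
    public bool IsRelevant;    // crawler evaluated the page as relevant
}

public class FilterSettings
{
    // Names mirror the documented properties; the default values are assumptions.
    public bool doExcludeDuplicatePages = true;
    public bool doExcludeIrrelevantPages = true;
    public bool doThrowOnDomainMismatch = false;
}

public static class PageFilter
{
    // Returns true when the page should be stored in the site repository.
    public static bool ShouldInclude(PageInfo page, string siteDomain, FilterSettings settings)
    {
        string host = new Uri(page.Url).Host;
        bool sameDomain = host.Equals(siteDomain, StringComparison.OrdinalIgnoreCase)
            || host.EndsWith("." + siteDomain, StringComparison.OrdinalIgnoreCase);

        if (!sameDomain)
        {
            if (settings.doThrowOnDomainMismatch)
                throw new ArgumentException("Page " + page.Url + " is outside domain " + siteDomain);
            return false; // assumption: a mismatched page is skipped when not throwing
        }
        if (settings.doExcludeDuplicatePages && page.IsDuplicate) return false;
        if (settings.doExcludeIrrelevantPages && !page.IsRelevant) return false;
        return true;
    }
}

public static class FilterDemo
{
    public static void Main()
    {
        var settings = new FilterSettings();
        var page = new PageInfo { Url = "http://www.koplas.co.rs/about", IsDuplicate = true, IsRelevant = true };
        Console.WriteLine(PageFilter.ShouldInclude(page, "koplas.co.rs", settings)); // False
    }
}
```

In this sketch the domain check runs first, so a page from another site never reaches the duplicate and relevance checks.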
Methods
  Name  Description
Public method BuildWebPage
Builds the web page repository using ISpiderTarget crawl information
Public method BuildWebSite
Builds or updates the web site repository using crawl information
Public method CheckSiteTable
Checks the site table and recovers it if broken
Public method debugReport
Prints a short report on the content of the repository (if any)
Public method DeleteAll
Public method DeleteWebSite
Deletes the web site repository entry and its directory
Public method doContentReports
Creates extra files describing the repository content
Public method Equals (Inherited from Object.)
Public method GetAllWebPages
Gets all web pages registered in the loaded pageTable; if takeSettings is used, returns sampleTake
Public method GetAllWebSites
Returns repository instances for all web sites registered in siteTable; if takeSettings is used, returns sampleTake
Public method GetHashCode (Inherited from Object.)
Public method GetType (Inherited from Object.)
Public method GetWebPage
Gets the web page repository by resolved URL
Public method GetWebSite
Gets the web site repository by clean domain name, e.g. "koplas.co.rs" for http://www.koplas.co.rs
Public method isTargetProper
Determines whether the target is proper according to the repository settings
Public method OnBeforeSave (Overrides fileDataStructure.OnBeforeSave.)
Public method OnLoaded (Overrides fileDataStructure.OnLoaded.)
Public method Save
Public method SetFolderDescription
Sets the folder description
(Inherited from fileDataStructure.)
Public method ToString (Inherited from Object.)
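
GetWebSite looks a site up by its clean domain name. This page gives only one example of that form ("koplas.co.rs" for http://www.koplas.co.rs), so the sketch below assumes the clean name is the URL host with a leading "www." removed; DomainKey is a hypothetical helper, not part of imbMiningContext, and the library's own normalization may differ.

```csharp
using System;

// Hypothetical helper; not part of imbMiningContext.
public static class DomainKey
{
    // Assumption: the "clean" domain is the URL host without a leading "www.".
    public static string FromUrl(string url)
    {
        string host = new Uri(url).Host.ToLowerInvariant();
        return host.StartsWith("www.", StringComparison.Ordinal) ? host.Substring(4) : host;
    }
}

public static class DomainKeyDemo
{
    public static void Main()
    {
        Console.WriteLine(DomainKey.FromUrl("http://www.koplas.co.rs")); // koplas.co.rs
    }
}
```

A key derived this way could then be passed to GetWebSite, provided the repository was built using the same convention.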
See Also

Reference

aceCommonTypes.files.fileDataStructure.IFileDataStructure