Click or drag to resize

spiderTarget Class

Home | imbSCI | imbACE | imbNLP | imbWEM | imbWBI

[This is preliminary documentation and is subject to change.]

Target is designated by the pointed absolute location
Inheritance Hierarchy
SystemObject
  imbWEM.Core.crawler.targetsspiderTarget

Namespace:  imbWEM.Core.crawler.targets
Assembly:  imbWEM.Core (in imbWEM.Core.dll) Version: 0.1.0.0 (0.1.0.0)
Syntax
C#
public class spiderTarget : ISpiderTarget

The spiderTarget type exposes the following members.

Constructors
  NameDescription
Public methodspiderTarget(String, spiderTargetCollection)
Initializes a new instance of the spiderTarget class
Public methodspiderTarget(spiderLink, spiderTargetCollection)
Initializes a new instance of the spiderTarget class
Top
Properties
  NameDescription
Public propertycontent
Public propertycontentBlocks
Public propertycontentTree
Public propertyduplicateOf
Reference to the first crawled target, having the same HTML source code hash fingerprint
Public propertyevaluatedLanguage
Language that was found during evaluation
Public propertyevaluation
Public propertyisDuplicate
True if this target is content duplicate (confirmed by HTML source code hash) of another, already crawled target. Target that was loaded first has False, any other duplicate has True.
Public propertyisLoaded
Da li je target ucitan?
Public propertyIsRelevant
Gets a value indicating whether this target is relevant. (shortcut for testing evaluation result language)
Public propertyiterationDiscovery
Discovery iteration
Public propertyiterationLoaded
Public propertykey
Public propertylinkVectors
Public propertymarks
Public propertypage
Attached page
Public propertypageHash
Public propertypageText
Public propertyparent
Public propertytargetHash
Public propertytokens
Token table describing this target - tokens that are extracted from URL
Public propertyurl
Top
Methods
  NameDescription
Public methodAddVector
Adds the new vector to the target. originPage has to be specified otherwise exception will be thrown. Returns true if it is new vector for this target
Public methodAttachPage
Attaches the page - if the page was already attached returns false
Public methodDispose
Releases all resources used by the spiderTarget
Public methodEquals (Inherited from Object.)
Public methodGetHashCode (Inherited from Object.)
Public methodGetHtmlDocument
Gets the HTML document from loaded page (HtmlDocument)
Public methodGetIndexPage
Gets indexPage entry from current Index database instance
Public methodgetQuery
Public methodGetType (Inherited from Object.)
Public methodGetVectors
Gets the all vectors, including those that are coming from the same page
Public methodToString (Inherited from Object.)
Top
See Also