HTML: Seltz analyzer
Seltzlab (contact me)
This class can be used to extract important words from HTML documents.
It can process a well-formed XHTML document and extract the words contained in the document.
The class gives scores to each word depending on conditions like, whether the first letter is upper case, whether the word is inside strong or bold tags, etc..
It returns an associative array of words sorted by importance score.
Click here for detailed information about this class on phpclasses.org