Download the PHP package kbariotis/documer without Composer
On this page you can find all versions of the php package kbariotis/documer. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.
Package documer
Short Description Bayes algorithm implementation in PHP for auto document classification.
License MIT
Homepage http://github.com/kbariotis/documer
Information about the package documer
Documer
Bayes algorithm implementation in PHP for auto document classification.
Concept
Every document has key words, e.g. Margaret Thatcher.
Every document has a label, e.g. Politics.
Suppose that the key words in every document all start with an uppercase letter. We store these words in our DB, and every time we need to score a document against a particular label, we use the Bayes algorithm.
Let's make that concrete:
Training:
First, we tokenize the document and keep only its key words (all words starting with an uppercase letter) in an array. We store that array in our DB under the document's label.
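The tokenizing step can be sketched as follows. This is a minimal illustration, not Documer's actual tokenizer: the function name extractKeywords is hypothetical, and the sketch assumes that a "key word" is any single word whose first letter is uppercase.

```php
<?php
// Hypothetical helper, NOT part of the Documer API.
// Keeps only the words that start with an uppercase letter.
function extractKeywords(string $text): array
{
    // \b      word boundary, so "iPhone" is not matched from its "P"
    // \p{Lu}  one uppercase letter
    // \p{L}*  the remaining letters of the word
    preg_match_all('/\b\p{Lu}\p{L}*/u', $text, $matches);

    return $matches[0];
}

$keywords = extractKeywords('Yesterday Margaret Thatcher spoke about the economy.');
// $keywords is ['Yesterday', 'Margaret', 'Thatcher']
```

Note that a rule this simple also captures sentence-initial words such as "Yesterday", and it splits a name like Margaret Thatcher into two separate key words.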
Guessing:
This is very simple. Again, we parse the document we want to classify and build an array of its key words. Here is the pseudocode:
for every label in DB
    for every key word in document
        P(label|word) = P(word|label) P(label) / ( P(word|label) P(label) + (1 - P(word|label)) (1 - P(label)) )
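The loop above can be sketched in plain PHP as follows. This is an illustration under stated assumptions, not Documer's implementation: the function names posterior and guessPerWord and the $trained array layout are hypothetical, P(label) is estimated as the share of training documents carrying that label, and P(word given label) as the share of that label's documents containing the word. The text does not say how the per-word values are combined into one score per label, so the sketch stops at the per-word values.

```php
<?php
// Hypothetical sketch, NOT the Documer API.

// Direct transcription of the formula from the pseudocode.
function posterior(float $pWordGivenLabel, float $pLabel): float
{
    $numerator   = $pWordGivenLabel * $pLabel;
    $denominator = $numerator + (1 - $pWordGivenLabel) * (1 - $pLabel);

    // Guard against division by zero (e.g. P(word|label) = 0 and P(label) = 1).
    return $denominator > 0 ? $numerator / $denominator : 0.0;
}

// $trained maps label => list of key word arrays, one array per training document.
// Assumes every label has at least one document.
function guessPerWord(array $trained, array $keywords): array
{
    $totalDocs = array_sum(array_map('count', $trained));
    $result = [];

    foreach ($trained as $label => $docs) {
        $pLabel = count($docs) / $totalDocs;

        foreach ($keywords as $word) {
            // Number of this label's documents that contain the word.
            $docsWithWord = count(array_filter(
                $docs,
                fn(array $doc): bool => in_array($word, $doc, true)
            ));
            $pWordGivenLabel = $docsWithWord / count($docs);

            $result[$label][$word] = posterior($pWordGivenLabel, $pLabel);
        }
    }

    return $result;
}

// Made-up training data for illustration only.
$trained = [
    'Politics' => [['Margaret', 'Thatcher'], ['Thatcher', 'Parliament']],
    'Sports'   => [['Wimbledon'], ['Olympics', 'London']],
];

$perWord = guessPerWord($trained, ['Thatcher', 'London']);
// $perWord['Politics']['Thatcher'] is 1.0
// $perWord['Sports']['London'] is 0.5
```

With this toy data, "Thatcher" appears in every Politics document and in no Sports document, so it points entirely to Politics, while "London" appears in only half of the Sports documents and gets a weaker value.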
Usage
Install through Composer
Instantiate
Pass a Storage Adapter object to the Documer constructor.
Train
Guess
$scores will hold an array with all labels of your system and the probability that the document belongs to each label.
Storage Adapters
Implement Documer\Storage\Adapter to create your own Storage Adapter.