Naive Bayes
Naive Bayes is a classification algorithm and one of the most efficient and effective inductive learning algorithms in machine learning.
In classification, the goal of a learning algorithm is to construct a classifier given a set of training examples with class labels, where an example $E$ is represented by a tuple of attribute values $(x_1, x_2, \dots, x_n)$ and $C$ represents the classification variable.
Although naive Bayes assumes that all attributes are independent given the class, it has been observed that its classification accuracy does not depend directly on the dependencies among attributes; i.e., naive Bayes may still have high accuracy on datasets in which strong dependencies exist among attributes.
It is the distribution of dependencies among all attributes over classes that affects the classification of naive Bayes, not merely the dependencies themselves [Zhang, 2004].
Naive Bayes and augmented naive Bayes
Classifier: a function that assigns a class label to an example.
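Bayesian classifier
By Bayes' rule, the probability of an example $E = (x_1, x_2, \dots, x_n)$ being class $c$ is
$$p(c \mid E) = \frac{p(E \mid c)\, p(c)}{p(E)}.$$
Assuming two classes, $+$ and $-$, $E$ is classified as the class $C = +$ iff
$$f_b(E) = \frac{p(C = + \mid E)}{p(C = - \mid E)} \ge 1,$$
where $f_b(E)$ is called a Bayesian classifier.
Naive Bayesian (NB) classifier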
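Assume that all attributes are independent given the value of the class variable; that is,
$$p(E \mid c) = p(x_1, x_2, \dots, x_n \mid c) = \prod_{i=1}^{n} p(x_i \mid c).$$
The resulting classifier is then
$$f_{nb}(E) = \frac{p(C = +)}{p(C = -)} \prod_{i=1}^{n} \frac{p(x_i \mid C = +)}{p(x_i \mid C = -)}.$$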
Naive Bayes is the simplest form of Bayesian network, in which all attributes are independent given the value of the class variable.
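The naive Bayes decision rule can be illustrated with a small sketch. It assumes two classes, `"+"` and `"-"`, and two binary attributes. The probability tables and the function names `f_nb` and `classify` are hypothetical, chosen only for illustration.

```python
def f_nb(example, prior, likelihood):
    """Naive Bayes ratio for two classes '+' and '-'.

    prior[c] is p(C = c); likelihood[c][i][x] is p(x_i = x | C = c).
    """
    ratio = prior["+"] / prior["-"]
    for i, x in enumerate(example):
        # Conditional independence: multiply per-attribute likelihood ratios.
        ratio *= likelihood["+"][i][x] / likelihood["-"][i][x]
    return ratio


def classify(example, prior, likelihood):
    """Assign '+' when the ratio is at least 1, otherwise '-'."""
    return "+" if f_nb(example, prior, likelihood) >= 1 else "-"


# Hypothetical probabilities for two binary attributes (illustrative only).
prior = {"+": 0.5, "-": 0.5}
likelihood = {
    "+": [{0: 0.2, 1: 0.8}, {0: 0.4, 1: 0.6}],
    "-": [{0: 0.7, 1: 0.3}, {0: 0.5, 1: 0.5}],
}

print(classify((1, 1), prior, likelihood))
print(classify((0, 0), prior, likelihood))
```

In practice the products are usually computed as sums of logarithms to avoid numerical underflow when there are many attributes.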
Augmented naive Bayes (ANB) is an extension of naive Bayes in which the class node directly points to all attribute nodes and there exist links among attribute nodes. From the view of probability, an ANB $G$ represents the joint probability distribution
$$p_G(x_1, \dots, x_n, c) = p(c) \prod_{i=1}^{n} p(x_i \mid pa(x_i), c),$$
where $pa(x_i)$ denotes an assignment to values of the parents of $X_i$.
Local dependence
For each node, the influence of its parents is quantified by the corresponding conditional probabilities. We call the dependence between a node and its parents the local dependence of this node.
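As a concrete illustration, the sketch below builds a tiny ANB with edges C → X1, C → X2 and X1 → X2, so that X1 is the only attribute parent of X2. The structure and all probability values are hypothetical. The joint probability is the class prior times each node's conditional probability given its parents and the class.

```python
from itertools import product

# Hypothetical conditional probability tables (illustrative values only).
p_c = {"+": 0.5, "-": 0.5}                               # p(c)
p_x1 = {"+": {0: 0.2, 1: 0.8}, "-": {0: 0.7, 1: 0.3}}    # p(x1 | c)
# p(x2 | x1, c), indexed as p_x2[c][x1][x2]; this table holds the
# local dependence of X2 on its parent X1.
p_x2 = {
    "+": {0: {0: 0.9, 1: 0.1}, 1: {0: 0.3, 1: 0.7}},
    "-": {0: {0: 0.6, 1: 0.4}, 1: {0: 0.5, 1: 0.5}},
}


def anb_joint(x1, x2, c):
    """Joint probability p(c) * p(x1 | c) * p(x2 | x1, c)."""
    return p_c[c] * p_x1[c][x1] * p_x2[c][x1][x2]


# Sanity check: the joint distribution sums to 1 over all assignments.
total = sum(
    anb_joint(x1, x2, c)
    for x1, x2, c in product((0, 1), (0, 1), ("+", "-"))
)
print(anb_joint(1, 1, "+"), total)
```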
Local dependence derivative
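For a node $X$ on ANB $G$, the local dependence derivatives of $X$ in classes $+$ and $-$ are defined as
$$dd_G^{+}(x \mid pa(x)) = \frac{p(x \mid pa(x), +)}{p(x \mid +)}, \qquad dd_G^{-}(x \mid pa(x)) = \frac{p(x \mid pa(x), -)}{p(x \mid -)}.$$
For example, $dd_G^{+}(x \mid pa(x))$ reflects the strength of the local dependence of node $X$ in class $+$, which measures the influence of the local dependence of $X$ on the classification in class $+$.
Local dependence derivative ratio at node $X$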
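For a node $X$ on ANB $G$, the local dependence derivative ratio at node $X$, denoted by $ddr_G(x)$, is defined as
$$ddr_G(x) = \frac{dd_G^{+}(x \mid pa(x))}{dd_G^{-}(x \mid pa(x))} = \frac{p(x \mid pa(x), +) \,/\, p(x \mid +)}{p(x \mid pa(x), -) \,/\, p(x \mid -)}.$$
$ddr_G(x)$ quantifies the influence of the local dependence of $X$ on the classification.
Dependence distribution factor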
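As a numeric illustration of the local dependence derivative and its ratio defined above, the sketch below uses a hypothetical ANB with edges C → X1, C → X2 and X1 → X2, and evaluates both quantities at node X2. The probability tables and the helper names `dd` and `ddr` are assumptions made for this example.

```python
# Hypothetical conditional probability tables (illustrative values only).
p_x1 = {"+": {0: 0.2, 1: 0.8}, "-": {0: 0.7, 1: 0.3}}    # p(x1 | c)
# p(x2 | x1, c), indexed as p_x2[c][x1][x2]
p_x2 = {
    "+": {0: {0: 0.9, 1: 0.1}, 1: {0: 0.3, 1: 0.7}},
    "-": {0: {0: 0.6, 1: 0.4}, 1: {0: 0.5, 1: 0.5}},
}


def p_x2_given_c(x2, c):
    """Marginal p(x2 | c), obtained by summing out the parent X1."""
    return sum(p_x2[c][x1][x2] * p_x1[c][x1] for x1 in (0, 1))


def dd(x2, x1, c):
    """Local dependence derivative of X2 in class c:
    p(x2 | x1, c) / p(x2 | c)."""
    return p_x2[c][x1][x2] / p_x2_given_c(x2, c)


def ddr(x2, x1):
    """Local dependence derivative ratio at X2: dd in '+' over dd in '-'."""
    return dd(x2, x1, "+") / dd(x2, x1, "-")


print(dd(1, 1, "+"), dd(1, 1, "-"), ddr(1, 1))
```

A ratio close to 1 indicates that the local dependence is distributed almost evenly across the two classes, so under these assumed numbers it would have little influence on the classification.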
Harry Zhang. The optimality of naive Bayes. AA, 1(2):3, 2004.