Term frequency by inverse document frequency


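A term-weighting function that combines the frequency of a term in a given document (term frequency, TF) with the inverse of its relative frequency across the collection (inverse document frequency, IDF). The weight is calculated as follows [1]. Let $N$ be the total number of documents in the collection and $d_j$ the number of documents that contain term $j$. Assuming that term $j$ occurs in at least one document ($d_j \neq 0$), the inverse document frequency (IDF) is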
$$ {\log}_2\left(N/{d}_j\right)+1={\log}_2N-{\log}_2{d}_j+1 $$
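
As a rough illustration, the IDF formula above can be sketched in Python. The function names `idf` and `tf_idf` and their parameters are hypothetical, chosen only for this example. Multiplying the raw term frequency by the IDF is the common reading of "TF by IDF" and is assumed here rather than taken from the entry.

```python
import math


def idf(num_docs: int, doc_freq: int) -> float:
    """Inverse document frequency: log2(N / d_j) + 1.

    num_docs is N, the collection size; doc_freq is d_j, the number
    of documents containing the term. Both names are illustrative.
    """
    if doc_freq <= 0 or doc_freq > num_docs:
        # The formula assumes the term occurs in at least one document.
        raise ValueError("doc_freq must satisfy 0 < doc_freq <= num_docs")
    return math.log2(num_docs / doc_freq) + 1


def tf_idf(term_freq: int, num_docs: int, doc_freq: int) -> float:
    """Assumed combination: raw term frequency multiplied by the IDF."""
    return term_freq * idf(num_docs, doc_freq)


# A term found in 2 of 8 documents, occurring 3 times in one of them.
print(idf(8, 2))        # log2(4) + 1 = 3.0
print(tf_idf(3, 8, 2))  # 3 * 3.0 = 9.0
```

With this form, a term that appears in every document still receives an IDF of 1 rather than 0, so its weight reduces to its term frequency.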
