Keyword Density Analysis (KDA)

This section reports the keyword density of each page section, broken down by match type.

 

Although KDA has long been a standard feature of optimizer tools, keyword frequency and keyword density are not how search engines evaluate the words on website pages.  Search engines use “Term Vector Theory”, a form of linear algebra, to analyze pages.  The science is well known among search engine designers and is still in use today.

But to employ Term Vector Theory properly, one must have access to the search engine corpus (the body of data used to build a search engine index).  This information is not public, so the public cannot use Term Vector Theory to determine the weighting of every word.  In any case, the public needs only to determine the relative difference between one website page and another.  For these reasons, keyword frequency and density do have value to search engine optimizers.

 

Field

Dataline Description

Keyword Density

Body

B-Words

Number of words found in the Top Page <BODY> tag, less STOP words and words found in the Ignore tags.  Words found in <A href> tag attributes are not included in the body count.

The method used to calculate density treats each keyword phrase as a single logical word or term.  Although this is not the traditional way to calculate density, it helps normalize the differences when calculating multi-term keyword density values.

Formula Used:

% Density = Kw Found / (Total count - (KwTerms/Kw x Kw Found) + Kw Found) x 100

KDA calculation examples:

| Example (words or terms on page) | Keywords: Terms/kw | Keywords: Found | Total count | % Density |
|---|---|---|---|---|
| dogs love turf dog food | 2 | 1 | 5 | 25% |
| dog and cat food … +10 more terms [1] | 2 | 1 | 13 | 8% |
| dog food … +98 more terms | 2 | 1 | 100 | 1.01% |
| dog cat food … +97 more terms | 3 | 1 | 100 | 1.02% |
| Dog food … +96 more terms + dog food | 2 | 2 | 100 | 2.04% |
| Food is good for a dog [2] | 1 (part) | 2 | 3 | 66% |

Notes:

Keyword (Kw) in bold.  “More terms” exclude stop words.

[1] “and” is a stop word.  Density shown is for broad match.

[2] Density shown is for part match.  “is”, “for” and “a” are stop words.
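
The formula and the examples above can be checked with a short sketch.  This is an illustration in Python, not the tool's own code; the function name kda_density and its parameter names are assumptions chosen to mirror the table columns.

```python
def kda_density(kw_found: int, terms_per_kw: int, total_count: int) -> float:
    """Return % density, counting each found keyword as one logical term."""
    if kw_found == 0:
        return 0.0
    # Remove the individual terms of every found keyword from the word count,
    # then add each found keyword back as a single logical term.
    logical_total = total_count - terms_per_kw * kw_found + kw_found
    if logical_total <= 0:
        raise ValueError("total_count is too small for the matches reported")
    return 100.0 * kw_found / logical_total


# Rows of the examples table: (Terms/kw, Found, Total count)
rows = [(2, 1, 5), (2, 1, 13), (2, 1, 100), (3, 1, 100), (2, 2, 100), (1, 2, 3)]
for terms_per_kw, found, total in rows:
    print(f"{kda_density(found, terms_per_kw, total):.2f}%")
```

Printed to two decimal places, the results agree with the examples, which show some values shortened to whole percentages (8.33% as 8%, 66.67% as 66%).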

 

Special Notes:

  1. All punctuation marks (including dashes) are considered the same as ‘white space’ when adjoining a word.
  2. Phrase, Broad and Part matches are handled according to the rules below.
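
Special Note 1 and the stop-word handling might be sketched as follows.  The stop-word set contains only the four words named in the example notes; the tool's actual STOP word list is not given in this section, so STOP_WORDS, the function name, and the lowercasing step are all assumptions.

```python
import re

# Illustrative only: just the stop words mentioned in the example notes.
STOP_WORDS = {"and", "is", "for", "a"}


def countable_words(text: str) -> list[str]:
    """Split text into lowercase words, treating punctuation as white space."""
    # [^\W_]+ matches runs of letters and digits, so every punctuation mark
    # (dashes and underscores included) acts as a separator.
    words = re.findall(r"[^\W_]+", text.lower())
    return [w for w in words if w not in STOP_WORDS]


print(countable_words("Food is good for a dog"))  # ['food', 'good', 'dog']
print(countable_words("dog-and-cat food!"))       # ['dog', 'cat', 'food']
```

The first call leaves three countable words, which matches the Total count of 3 in the part-match example.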

B-Phrase

% density of keyword phrase matches found in the Top Page <BODY> tag (less STOP words)

B-Broad

% density of keyword broad matches found in the Top Page <BODY> tag (less STOP words)

B-Parts

% density of keyword part matches found in the Top Page <BODY> tag (less STOP words)

Title tag

T-Words

Number of words found in the Top Page <TITLE> tag (less STOP words)

T-Phrase

% density of keyword phrase matches found in the Top Page <TITLE> tag (less STOP words)

T-Broad

% density of keyword broad matches found in the Top Page <TITLE> tag (less STOP words)

T-Parts

% density of keyword part matches found in the Top Page <TITLE> tag (less STOP words)

Description tag

D-Words

Number of words found in the Top Page description meta tag, <META name="description"> (less STOP words)

D-Phrase

% density of keyword phrase matches found in the Top Page description meta tag (less STOP words)

D-Broad

% density of keyword broad matches found in the Top Page description meta tag (less STOP words)

D-Parts

% density of keyword part matches found in the Top Page description meta tag (less STOP words)

Keyword tag

K-Words

Number of words found in the Top Page keywords meta tag, <META name="keywords"> (less STOP words)

K-Phrase

% density of keyword phrase matches found in the Top Page keywords meta tag (less STOP words)

K-Broad

% density of keyword broad matches found in the Top Page keywords meta tag (less STOP words)

K-Parts

% density of keyword part matches found in the Top Page keywords meta tag (less STOP words)

Number of Phrase conditions found in body

H1

Number of keyword phrase matches found in the Top Page <H1> tag (less STOP words)

H2

Number of keyword phrase matches found in the Top Page <H2> tag (less STOP words)

H3

Number of keyword phrase matches found in the Top Page <H3> tag (less STOP words)

B

Number of keyword phrase matches found in the Top Page <B> tag (less STOP words)

OL

Number of keyword phrase matches found in the Top Page <OL> tag (less STOP words)

UL

Number of keyword phrase matches found in the Top Page <UL> tag (less STOP words)

ALT

Number of keyword phrase matches found in Top Page ALT attributes, such as those on <IMG> tags (less STOP words)

A href

Number of keyword phrase matches found in the href attributes of Top Page <A> tags (less STOP words).  %20, dashes and underscores are counted as spaces.  These matches are not included in Body Phrase, Broad or Part matches.
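
The separator handling for link addresses could look like the sketch below.  The function name is hypothetical and the example address is made up for illustration.

```python
import re


def href_as_words(href: str) -> str:
    """Replace %20, dashes and underscores in an href with single spaces."""
    return re.sub(r"(?:%20|[-_])+", " ", href).strip()


print(href_as_words("dog-food_reviews%20page"))  # dog food reviews page
```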

 

Density computations for Phrase, Broad and Part matches are handled according to the following rules.

Phrase Match:

All found conditions of Phrase Match are included.

Broad Match:

All found conditions of Broad Match are included except those conditions that are also Phrase Matches. 

Part Match:

All found conditions of Part Match are included except those conditions that are also Phrase or Broad Matches.
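
The precedence among the three rules can be illustrated with a small sketch.  How the tool detects each kind of match is not described here, so the sketch assumes a hypothetical representation in which every found condition is given as the set of match types it satisfies; it then counts each condition once, under its highest-priority type.

```python
PRIORITY = ("phrase", "broad", "part")


def exclusive_counts(conditions):
    """Count each condition once: Phrase first, then Broad, then Part."""
    counts = {name: 0 for name in PRIORITY}
    for kinds in conditions:
        for name in PRIORITY:
            if name in kinds:
                counts[name] += 1
                break  # lower-priority types are not counted again
    return counts


found = [
    {"phrase", "broad", "part"},  # counted as Phrase only
    {"broad", "part"},            # counted as Broad only
    {"part"},                     # counted as Part
]
print(exclusive_counts(found))  # {'phrase': 1, 'broad': 1, 'part': 1}
```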