Breaking up a Web page into its components to identify worthy words/terms and indexing them using a set of rules is called *
a. preprocessing the documents.
b. document analysis.
c creating the term-by-document matrix.
d. parsing the documents.