Information Retrieval: A NOVEL INDEXING TECHNIQUE FOR WEB DOCUMENTS USING HIERARCHICAL CLUSTERING - Deepti Gupta, Prof. A., Dr. Komal


Authors: Deepti Gupta, Prof. A., Dr. Komal, 128 pp., ISBN: 363917657X

The information on the WWW is growing at an exponential rate; therefore, search engines are required to index downloaded Web documents more efficiently. A typical search engine comprises three main components. (1) Crawler: Given a URL, it combs through the pages on the web and gathers the information required by the search engine. (2) Indexer: It optimizes speed and performance in finding relevant documents for a search query; while an index of 100,000 documents can be queried within milliseconds, a sequential scan may take hours. (3) Page Repository: The information retrieved by the web crawler is stored in a database called the page repository. Web mining techniques like clustering can be used for this purpose. The performance of a search engine is limited by two problems: (1) low precision and (2) low recall. Thus, there is a need to develop an efficient indexing technique. In this book, a novel technique is discussed that not only indexes the...
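To make the index-versus-sequential-scan contrast and the precision/recall terms above concrete, here is a minimal sketch in Python. It uses a plain inverted index, not the hierarchical clustering technique the book describes, and every name in it (tokenize, build_inverted_index, search, sequential_scan, precision_recall, the sample documents) is a hypothetical chosen for illustration.

```python
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and strip simple punctuation (illustrative only)."""
    words = (w.strip(".,;:()").lower() for w in text.split())
    return [w for w in words if w]


def build_inverted_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every query term (index lookup)."""
    terms = tokenize(query)
    if not terms:
        return set()
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings)


def sequential_scan(docs, query):
    """Same result as search(), but re-reads every document on each query."""
    terms = set(tokenize(query))
    if not terms:
        return set()
    return {doc_id for doc_id, text in docs.items()
            if terms <= set(tokenize(text))}


def precision_recall(retrieved, relevant):
    """Precision = hits / retrieved; recall = hits / relevant."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical sample collection, standing in for the page repository.
docs = {
    1: "A crawler gathers pages from the web",
    2: "The indexer builds an index over crawled pages",
    3: "The page repository stores pages retrieved by the crawler",
}
index = build_inverted_index(docs)
```

Calling search(index, "crawler pages") consults only the two posting sets for the query terms, whereas sequential_scan(docs, "crawler pages") tokenizes every document again; both return the same ids. If a user judged all three sample documents relevant, precision_recall would report perfect precision but recall below 1, since document 2 is missed.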

Available to order:
	OZON.ru - 5152 RUB

Book rating:

4 out of 5 (5 votes).
