|
Navigation: Appendixes > Appendix A Reference > Data Gathered from the Search Engines > What is extracted for news reports |
![]() ![]()
|
Where does the data for News reports come from? This is probably best described in an example. Figure 1 is a search engine news result page using the keyword phrase 'web site management tools' in the query.
Total Search Results
Highlighted in green is the total number of news results the search engine claims to have for the query. Some search engines don't display this information, and is therefore not collected on those engines.
Spelling Suggestion
Not collected for news search engines. (yellow highlight)
News Results
Highlighted in red are the items extracted from the news search listings. The title for the news result is always extracted. The URL that points to the news article is extracted from the HREF in the title. The author or origination of the news article and the publish date or age is also extracted.
For reporting purposes, Website-Manager will also extract out the domain name from the URL and store it in a separate field. This allows for faster report queries when filtering by domains.
Sponsored Results
Not collected for news search engines. (none highlighted)
Figure 1. Search Engine News Result Page.

Page url: http://www.helpandmanual.com/help/index.html?what_is_extracted_from_news_li.htm