Can SearchBlox ignore certain sections of the page during indexing?

Yes, SearchBlox can ignore sections of a page. Use the following tags and the indexer will ignore non-search content like navigation, menus, etc.:

<noindex> </noindex>
or
<!--stopindex--> <!--startindex-->
or 

<!--googleoff: all--><!--googleon: all-->

For example:

<noindex> 
Content to exclude 
</noindex>

<!--stopindex--> 
Content to exclude 
<!--startindex-->

<!--googleoff: all-->
Content to Exclude
<!--googleon: all-->

The content to exclude will be ignored when a search is performed.

All these above-mentioned tags accomplish the same end result. Please note that content will be excluded in index however the links will be crawled i.e., the links within the tags will not be ignored.

 



To learn more about SearchBlox please visit our developer document site https://developer.searchblox.com/docs

Have more questions? Submit a request

Comments

  • Avatar
    David Hoernig

    Can you clarify what is meant by "content will be ignored". Does it mean that searchblox will not crawl into any links found in this section?

    For example, we make a practice of putting stop-start tags around our headers and footers that appear on every page so that common/shared text doesn't contribute to the index for every page. But if that also makes it so that searchblox won't crawl the menu links, then we need a different solution.

     

    Edited by David Hoernig
  • Avatar
    Rossen Angelov

    We are running into the same issue and would be really nice to have this article written in a more clear way with some examples that include text and links.

    Our tests show that even if links were surrounded with , they were still indexed. I have to guess that "content to exclude" applies only to text and not hyper links.