Can SearchBlox ignore certain sections of the page during indexing?

Yes, SearchBlox can ignore sections of a page. Use the following tags and the indexer will ignore non-search content like navigation, menus, etc.:

<noindex> </noindex>
or
<!--stopindex--> <!--startindex-->

For example:

<noindex> 
Content to exclude 
</noindex>

<!--stopindex--> 
Content to exclude 
<!--startindex-->

The content to exclude will be ignored when a search is performed.

Both sets of tags accomplish the same end result. We included two different sets since Google supports one standard, and couple of other search providers support another tag format. We decided to support both sets for those of you who are migrating older content. Having both means you don't need to go through every page and update those tags.

 

Have more questions? Submit a request

Comments

  • Avatar
    David Hoernig

    Can you clarify what is meant by "content will be ignored". Does it mean that searchblox will not crawl into any links found in this section?

    For example, we make a practice of putting stop-start tags around our headers and footers that appear on every page so that common/shared text doesn't contribute to the index for every page. But if that also makes it so that searchblox won't crawl the menu links, then we need a different solution.

     

    Edited by David Hoernig
  • Avatar
    Rossen Angelov

    We are running into the same issue and would be really nice to have this article written in a more clear way with some examples that include text and links.

    Our tests show that even if links were surrounded with , they were still indexed. I have to guess that "content to exclude" applies only to text and not hyper links.