How can I limit the crawler to index within my website or domain?

Permanently deleted user

November 16, 2021 14:53 Updated

To keep the crawler within a single domain or website, enter a matching pattern in the Allow Paths box, found under Collections > Paths > Allow Paths.

For example, to index all URLs within edition.cnn.com and keep the spider from crawling outside that website, enter edition.cnn.com in the Allow Paths box.

[Image: screenshot, 2021-11-16 at 8.23.31 PM]
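
To illustrate the idea behind an allow pattern, here is a minimal Python sketch of how such a filter could decide which discovered URLs to follow. It is not the crawler's actual implementation: the names `ALLOW_PATHS` and `is_allowed` are hypothetical, and the sketch assumes a pattern matches when it appears as a substring of the URL's host plus path. The product's real matching rules may differ.

```python
from urllib.parse import urlparse

# Hypothetical allow patterns, as they might be entered in the Allow Paths box.
ALLOW_PATHS = ["edition.cnn.com"]


def is_allowed(url, allow_paths=ALLOW_PATHS):
    """Return True if the URL's host + path contains any allow pattern.

    Assumption: plain substring matching against host and path only.
    The query string is ignored so that a pattern appearing in a
    parameter (e.g. ?ref=edition.cnn.com) does not count as a match.
    """
    parsed = urlparse(url)
    target = parsed.netloc + parsed.path
    return any(pattern in target for pattern in allow_paths)


# Example links a crawler might discover on a page.
candidates = [
    "https://edition.cnn.com/world",
    "https://www.cnn.com/business",
    "https://example.org/?ref=edition.cnn.com",
]

# Only links matching an allow pattern would be followed.
allowed = [url for url in candidates if is_allowed(url)]
print(allowed)
```

Under these assumptions, only the edition.cnn.com link is kept; the link on www.cnn.com and the external link that merely mentions the domain in its query string are both skipped.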
