wget - Robot names for robots.txt -


suppose have website uses wget crawl other websites. provide website owners chance of not being crawled website. should use robot name wget in robots.txt file, or have create other name?

common practice disallow , allow popular uas this:

user-agent: google disallow:   user-agent: * disallow: / 

so think don't have problems using wget way


Comments

Popular posts from this blog

Perl - how to grep a block of text from a file -

delphi - How to remove all the grips on a coolbar if I have several coolbands? -

javascript - Animating array of divs; only the final element is modified -