wget - Robot names for robots.txt -
suppose have website uses wget crawl other websites. provide website owners chance of not being crawled website. should use robot name wget in robots.txt file, or have create other name?
common practice disallow , allow popular uas this:
user-agent: google disallow: user-agent: * disallow: / so think don't have problems using wget way
Comments
Post a Comment