wget - Robot names for robots.txt -
suppose have website uses wget
crawl other websites. provide website owners chance of not being crawled website. should use robot name wget
in robots.txt
file, or have create other name?
common practice disallow , allow popular uas this:
user-agent: google disallow: user-agent: * disallow: /
so think don't have problems using wget
way
Comments
Post a Comment