A practical look at modern robots.txt use, from allow and disallow logic to wildcards, crawl-rate control and avoiding common pitfalls. The Robots Exclusion Protocol (REP), better known as robots.txt, ...