This crawl was run with a Heritrix setting of "maxHops=0" (seed URLs and their embeds only).
Survey 7 is based on a seed list of 339,249,218 URLs: every URL in the Wayback Machine for which we saw a 200 response code in 2017, according to a query we ran on Feb. 1st, 2018.
The WARC files associated with this crawl are not currently available to the general public.
