First of all I want you to keep in mind that if you're going to to scrape google you'll need proxies. A lot of proxies. Also multithreading, response validation, some kind of background task runner and supervisor in case that you wanna get something stable enough.
You'll need to limit the link crawling depth for websites at most to 3.
I can provide you all of about, even proxies. Needs to be discussed.