What Type Of FootPrint Do You Use To Scrape Up Pligg/Shuffle Sites?
I was wondering what type of footprint do you guys use to scrape up pligg/shuffle sites?I tried using these footrpint to scrape up some pligg/shuffle sites.
inurl:"upcoming" intitle:"pligg"
inurl:"register" intitle:"pligg"
inurl:"cloud.php" intitle:"pligg"
inurl:"live_comments" intitle:"pligg"
inurl:"faq-en.php" intext:"pligg"
inanchor:"Pligg beta 9 Home"
inanchor:"About Pligg"
inurl:"/pligg" inurl:/register.php
inurl:register.php intext:"upcoming" intext:"published" intext:"submit"
inurl:/register intext:"upcoming" intext:"published" intext:"submit" intext:"Tag Cloud" -inurl:.php
inurl:/register intext:"upcoming" intext:"published" intext:"submit" -inurl:.php
inurl:/register intext:"upcoming" intext:"published" intext:"submit" -inurl:.php intitle:"register"
inurl:/register intext:"Powered by Pligg" -inurl:.php
inurl:/register.php intext:"Powered by Pligg"
"Powered by Pligg"
intitle:"Pligg beta"
"What Is Pligg?"
intitle:"Pligg Beta 9"
"http://www.pligg.com"
inurl:register.php intext:"upcoming" intext:"published" intext:"submit"
inurl:/register intext:"upcoming" intext:"published" intext:"submit" intext:"Tag Cloud" -inurl:.php
inurl:/register intext:"upcoming" intext:"published" intext:"submit" -inurl:.php
inurl:/register intext:"upcoming" intext:"published" intext:"submit" -inurl:.php intitle:"register"
inurl:/register intext:"Powered by Pligg" -inurl:.php
inurl:/register.php intext:"Powered by Pligg"
I insert them in scrapebox but didn't really harvest that much URL. I'm not sure if I'm doing it right or not. Anyway if people got some tips or ways to scrape up massive list of pligg/shuffle please let me know thanks
页:
[1]