Can we crawl and index documents in Google Drive using Nutch and Solr?

I've tried indexing the public URL of a Google Doc, but it doesn't seem to work. Is there a way to crawl Google Docs with Nutch and index them using Solr?

+3




1 answer


Use the Google Drive API to read and manage the files instead of crawling them:

https://developers.google.com/drive/web/about-sdk



A Google Drive page with a public URL does not contain direct links to its subdirectories, so a crawler that fetches those pages finds nothing to follow and you won't get anything indexed.
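
A minimal sketch of that approach in Python, using only the standard library: list a folder through the Drive API, export each Google Doc as plain text, and post the result to Solr's JSON update handler. It assumes the Drive v3 REST endpoints (the link above may describe an older version), an OAuth 2.0 access token you have already obtained, a Solr core named `drive` on localhost, and a schema with `id`, `title`, `url` and `content` fields. The core name, field names and helper function names are illustrative, not anything Nutch or Solr prescribes.

```python
import json
import urllib.parse
import urllib.request

DRIVE_FILES_URL = "https://www.googleapis.com/drive/v3/files"
# Assumption: a local Solr core called "drive"; adjust to your setup.
SOLR_UPDATE_URL = "http://localhost:8983/solr/drive/update?commit=true"
GOOGLE_DOC_MIME = "application/vnd.google-apps.document"


def build_list_url(folder_id, page_token=None):
    """Build a files.list URL for the direct children of one folder."""
    params = {
        "q": f"'{folder_id}' in parents and trashed = false",
        "fields": "nextPageToken, files(id, name, mimeType, webViewLink)",
        "pageSize": "100",
    }
    if page_token:
        params["pageToken"] = page_token
    return DRIVE_FILES_URL + "?" + urllib.parse.urlencode(params)


def to_solr_doc(drive_file, text=""):
    """Map Drive file metadata to a Solr document (field names are assumed)."""
    return {
        "id": drive_file["id"],
        "title": drive_file.get("name", ""),
        "url": drive_file.get("webViewLink", ""),
        "content": text,
    }


def fetch(url, token):
    """GET a URL with a bearer token and return the raw response bytes."""
    req = urllib.request.Request(url, headers={"Authorization": "Bearer " + token})
    with urllib.request.urlopen(req) as resp:
        return resp.read()


def list_folder(folder_id, token):
    """Yield every file in a folder, following nextPageToken pagination."""
    page_token = None
    while True:
        page = json.loads(fetch(build_list_url(folder_id, page_token), token))
        yield from page.get("files", [])
        page_token = page.get("nextPageToken")
        if not page_token:
            break


def export_text(file_id, token):
    """Export a Google Docs document as plain text."""
    query = urllib.parse.urlencode({"mimeType": "text/plain"})
    return fetch(f"{DRIVE_FILES_URL}/{file_id}/export?{query}", token).decode("utf-8")


def index_folder(folder_id, token):
    """Send one Solr document per Google Doc found in the folder."""
    docs = [
        to_solr_doc(f, export_text(f["id"], token))
        for f in list_folder(folder_id, token)
        if f.get("mimeType") == GOOGLE_DOC_MIME
    ]
    req = urllib.request.Request(
        SOLR_UPDATE_URL,
        data=json.dumps(docs).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.status
```

Calling `index_folder("<folder id>", "<access token>")` would index one folder. Subfolders come back from the listing with the MIME type `application/vnd.google-apps.folder`, so a recursive walk could call `list_folder` again on each of them. This bypasses Nutch entirely and writes to Solr directly, which is a design choice of this sketch rather than a requirement.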

+2








