Can we crawl and index documents in Google Drive using nutch and solr?
I've tried indexing the google doc public url but it doesn't seem to work. Is there a way to crawl google docs with nutch and index them using solr?
+3
Chaturvedi saurabh
source
to share
1 answer
Use the Google Drive API to read / manage files
https://developers.google.com/drive/web/about-sdk
A disk page with a public URL will not have direct links to subdirectories, so you won't get anything if you crawl those pages.
+2
Ramanan R
source
to share