Home-brewed web crawler issues
I wrote a web crawler in C++. I am using an API called URLDownloadToFile().
- Can any other API be used?
- URLDownloadToFile() works well for some URLs but not for others. Can you suggest some ways to overcome this problem?
Thank you, Dnyaneshwari C.
3 answers
You might want to take a look at libcurl, which lets you fetch content over various protocols. It also supports proxies and similar features, which may be what is causing problems with certain URLs. See also: http://curl.haxx.se/
Unless you have a specific reason to stick with C++, you might be better off switching to Python and using BeautifulSoup. I have used curl, which is fine, but all of my web files are already done in Python.