This post has been de-listed
It is no longer included in search results and normal feeds (front page, hot posts, subreddit posts, etc). It remains visible only via the author's post history.
I know that I always have to check the robots.txt . But how do I actually know what can I do and how can I do It?
I'll give you an example. Recently a client asked me to scrape restaurants from Foodpanda. I've checked the robots.txt and It seems like they wouldn't like that. But is It actually illegal or It may bring just to IP block (which could be bypassed through proxies in that case)? I don't want any problems so I just want to understand if I can actually scrape Foodpanda's restaurant.
(Yes, I'm a beginner)
Subreddit
Post Details
- Posted
- 3 weeks ago
- Reddit URL
- View post on reddit.com
- External URL
- reddit.com/r/webscraping...