facebook pages scraping need login

Question

I am scraping the facebook pages data, but to access all the data I need to log in to my account I am using.

import wget
from bs4 import BeautifulSoup
url = "https://www.facebook.com/hellomeets/events"

down = wget.download(url)

f = open(down, 'r')
htmlText = "\n".join(f.readlines())
f.close()
print htmlText

How do I log in to my account and scrape all the data of pages?

use [requests](http://docs.python-requests.org/en/latest/) OR use their facebook api — taesu, Jun 16 '15 at 18:37

score 1 · Answer 1 · edited May 23 '17 at 12:22

1

After some investigation, I found that Facebook implements some kind of CRSF protection, thus simple urllib3 or requests wouldn't work.

Try something like this : Login to Facebook using python requests which still uses requests, but with session

edited May 23 '17 at 12:22

Community

1
1

answered Jun 16 '15 at 18:47

taesu

4,482
4
23
41

score 0 · Answer 2 · edited May 23 '17 at 12:29

0

For python3, you could use the urllib library.

Here's an example of someone using it to log in to a site.

How to use urllib in python 3?

edited May 23 '17 at 12:29

Community

1
1

answered Jun 16 '15 at 18:42

cameron-f

431
1
3
15

facebook pages scraping need login

2 Answers2