Questions tagged [web-scraping]
29 questions
5
votes
1 answer
Scraping financial web data
I recently started working as a data scientist and I am starting a web scraping and NLP project using Python. The idea is to create a program that searches for public information on the company's clients. These information can come from various…
maxime.b28
- 51
- 1
2
votes
2 answers
looking for public dataset for stock market
I want to do some modelling and data visualization on historical stock data, including price, volume, financials, etc. Is there an public dataset available for stock price history? I looked at a few, but either they have a high cost, or not sure…
Donald S
- 2,079
- 3
- 9
- 28
2
votes
1 answer
getting error while scrapping Amazon using Selenium and bs4
I'm working on a class project using BeautifulSoup and webdriver to scrap
Disposable Diapers on
amazon for the name of the item, price, reviews, rating.
My goal is to have something like this where I will split this info in
different column:
…
cesco
- 29
- 2
- 7
2
votes
0 answers
Is there a way to scrape tweets in realtime from a list of specified users?
I am trying to build a scraper that will run continuously and save the tweets from a list of users instantaneously or within seconds of the user tweeting it. It could save the tweet details to a continuously updated csv file.
niusoski
- 21
- 2
1
vote
1 answer
Beautifulsoup iterating through scraped data
I have this html code that is repeating multiple times:
Steven
- 129
- 2
- 5
- 16
1
vote
1 answer
Python writing to Excel file: writerow() takes no keyword arguments
I have this script:
import requests
from requests import get
from bs4 import BeautifulSoup
import csv
import pandas as pd
f = open('olanda.csv', 'wb')
writer = csv.writer(f)
url = ('https://www......')
response…
Steven
- 129
- 2
- 5
- 16
1
vote
0 answers
Web scraping using Beatiful Soup
I have this code and I wanna extract holidays, petrol and temperature but I don't know where is the problem. I need your help as soon as possible, please.
I want to add this extraction to my dataset that is based on date columns, so comparing the…
IKRAM EL MBARKI
- 11
- 1
1
vote
1 answer
Web Scraping: Multiple small files or one large file?
I plan to scrape some forums (Reddit, 4chan) for a research project. We will scrape the newest posts, every 10 minutes for around 3 months. I am wondering how best to store the JSON data from each scrape, so that pre-processing (via Python) later…
thomasjeeverson
- 13
- 2
1
vote
0 answers
Good database/plug-in to scrape for academic paper info?
I am trying to scrape the web to find information about a set of academic papers.
Unfortunately, for many of the papers, I only have the author's name, a year, and part of a title.
(For example, [BANDURA A, 1997, SELF EFFICACY EXERCI])
I have tried…
Mox
- 19
- 1
1
vote
0 answers
Cyber crime - data set
I'm doing science project on my university to make an app, which uses AI to detect cyber crimes. I'm looking for sites to make my data set. Do you know any with ads like tobacco, alcohol, prescriptions, drugs, forgery? I'm using LASER embedder, so…
Piotr Rarus
- 854
- 1
- 5
- 15
1
vote
1 answer
Scraping mixed elements and passing to SQL
I'm running webscrapes via python which are retrieving data from csv's hosted on the web. I'd like to pass the data into a MSSQL database. An issue I have is the mixed elements/data types in the csv. Here is an example of the data
Item Val1 …
thesimplevoodoo
- 23
- 3
1
vote
1 answer
Turning Histogram values into Numerical format ( Excel-xslx, Pandas-DataFrame, etc.)
I am trying to do a correlation study about personality traits as described in Hofstede's :https://www.hofstede-insights.com/product/compare-countries/ . I would like to have the values described in the bar charts numerically into, say an Excel or…
MSIS
- 113
- 4
1
vote
0 answers
Bootstrap Template for a Dash App
Where Can I find a Bootstrap Template for a Dash app?
Are there implemented examples? I cannot find nothing about this.
I am building a Dashboard with Dash/Plotly but I want to improve the style of the application.
Laura
- 161
- 5
1
vote
0 answers
Web scrapping with beautifulSoup is done slowly
I have developed a web scrapping code in Python which takes data from Hattrick.org's matches and returns them in a table so it can be mined, determined likelihood of goals, etc.
I have the difficult that is really slow, returning 12.000 rows in 5…
Juan Esteban de la Calle
- 2,318
- 9
- 28
1
vote
1 answer
What is the NLP problem I am solving called and how should i go about solving it?
I am working on a POC where I am required to write a NLP code after web scraping. The prompt to my code is
How good is the online Data Science degree offered by MIT?
I am required to do web scrapping and other information resources and generate a…
The White Cloud
- 169
- 7