Questions tagged [learning]

43 questions
4
votes
2 answers

Linear Regression

I'm starting to learn machine learning and one of the first things that is mentioned is the usage of a linear regression method. Basically, we have a bunch of data points and we want to fit a line such that the errors we get from the line and the…
Mozza
  • 41
  • 1
4
votes
4 answers

How to know for sure if we can learn from a given data or not?

I want to know that given a set of data and a target, how we can know for sure whether we can learn from that data to make any inference or not?
3
votes
1 answer

How to explain the connection between the input layer and H1 of this CNN Architecture?

I am currently reading the paper proposed by LeCun et al. for handwritten zip code recognition. There is this figure below visualizing the CNN architecture. But I do not really understand how the connection between Layer H1 and input layer makes…
user77241
3
votes
0 answers

Which machine learning algorithm can be used for traffic prediction using images from Google Maps?

I am doing a traffic prediction project. I am using data from Google Maps. I'm collecting maps of different location and analyzing them. I am converting the maps to images, analyzing the image, and predict traffic. Which machine learning algorithm…
Shruthi Mc
  • 31
  • 1
3
votes
1 answer

Machine Learning/Text Analysis/Algorithm Pointers

I'm looking for a reference or point in the right direction since I'm not too familiar with machine learning or algorithms. I'd prefer to work in R, but I could also do Python. Any reference to a specific package I could use would be great. So I…
DCK
  • 31
  • 1
3
votes
1 answer

Columns with no (or nearly no) differences between rows worth keeping?

I have a dataset which contains multiple columns. On analysis, I found out that there were few columns which contain just a single value. There was 0 in all the rows of these columns. Does it hold any information or should I remove such columns from…
3
votes
3 answers

How to prepare data for LSTM time series prediction

I have a binary classification task for time series data. Every 14 rows in my CSV is relevant to one time slot. How should I prepare this data to be used in LSTM? In other word how to feed the model with this data?
Kaggle
  • 2,977
  • 5
  • 15
  • 8
3
votes
1 answer

Interpret User Interfaces with Machine Learning

I am currently working on a prototype of an application that should be able to interact with user interfaces. Now every user interface has some common elements, like buttons, scrollbars, input fields etc. I would like to use Machine Learning to…
2
votes
1 answer

Response variable at the group level, independent variables at the entity level

I have a dataset of entities which each belong to a particular group (i.e. entity=schools and group=school district). I also have lots of auxiliary variables on each entity. However, for my response variable I only have information at the group…
2
votes
1 answer

Colab can not connect to GPU from a python file

I am trying to run a github deep learning repository in Colab but I can not connect the python files to colab GPU. I can connect to GPU when writing a script in the colab notebook e.g. when I run this cod from a notebook cell : import os,…
Ibraheem
  • 23
  • 4
2
votes
0 answers

How to use learning curve in reality

CONTEXT: I have some simulated data by which I made and trained a model. during my training, I enjoyed having a large number of samples, and therefore my model is leveraging it by being decently complex. Yet since this model is trained on simulated…
arash
  • 185
  • 4
2
votes
0 answers

Learning to rank: how is the label calculated?

I am studying learning to rank and not sure I understand how the train sample and final label (relevance score) is constructed. Lets assume we sell furniture online. We have logged customer's query, product customer bought, clicked. Example: User A…
Alina
  • 143
  • 1
  • 7
2
votes
1 answer

CNN to many outputs

I have a dataset with 100 columns (categorial one-hot encoded) and 1 column with text data (simple sentences) and i want to build a neural network to arround 380.000 outputs labels. I have no idea what can i do. I was thinking about a CNN with…
2
votes
0 answers

Why can't continuous variables be used for the estimator of learning curves, when using StratifiedKFold to split the dataset?

I want to produce learning curves for three regression models run on data containing 200 samples, 10 features and 1 target variable. The target variable contains two clusters/peaks, making it imbalanced between and within the clusters, so I applied…
Chris
  • 21
  • 3
2
votes
2 answers

Which is better for Data Science, a double major in Math & CS or Physics & CS?

My dream occupation is to work as a Data Scientist. I'm soon going to be completing my first semester as a physics student, as part of an incredibly prestigious physics program. We get to run an experiment at CERN in our third year! I'm currently…
Amit Levy
  • 129
  • 6
1
2 3