Questions tagged [online-learning]

Online learning refers to courses, books, forums, tutorials, and videos on data science or machine learning topics/concepts available online.

58 questions
23
votes
4 answers

Incremental Learning with sklearn: warm_start, partial_fit(), fit()

I have built an ML model with the goal of making predictions for targets of the following week. In general, new data will come in and be processed at the end of each week and be in the same data structure as before. In other words, the same number…
Adam
  • 936
  • 1
  • 6
  • 13
15
votes
1 answer

On-line random forests by adding more single Decisions Trees

A Random Forest (RF) is created by an ensemble of Decision Trees's (DT). By using bagging, each DT is trained in a different data subset. Hence, is there any way of implementing an on-line random forest by adding more decision tress on new data? For…
tashuhka
  • 566
  • 5
  • 10
11
votes
3 answers

Is there a difference between on-line learning, incremental learning and sequential learning?

What I mean is the following: Instead of processing all the training data at once and calculating a model, we process one data point at a time and update the model directly afterwards. I have seen the terms "on-line (or online) learning" and…
Suzana
  • 210
  • 2
  • 9
10
votes
4 answers

SGDClassifier: Online Learning/partial_fit with a previously unknown label

My training set contains about 50k entries with which I do an initial learning. On a weekly basis, ~ 5k entries are added; but the same amount "disappears" (as it is user data which has to be deleted after some time). Therefore I use online learning…
swalkner
  • 111
  • 6
10
votes
3 answers

Libraries for Online Machine Learning

I am looking for packages (either in python, R, or a standalone package) to perform online learning to predict stock data. I have found and read about Vowpal Wabbit (https://github.com/JohnLangford/vowpal_wabbit/wiki), which seems to be quite…
mike1886
  • 933
  • 9
  • 17
7
votes
0 answers

differences between LSQR and FTRL when working with very sparse data

I have a 2M instances dataset with millions of very very sparse dummy variables created using the hashing trick = hash(orig_feature_name + orig_feature_value)=1. Note that the data is sparse both on rows (every instance has only a limited <100…
6
votes
1 answer

First order vs Second order online machine learning algorithms

In this git repo the online learning algorithms are classified as first order and second order. I tried searching what it means, yet I'm unable to understand the different between such first oder and second order online learning algorithms. What is…
Nadheesh
  • 61
  • 3
5
votes
2 answers

is neural networks an online algorithm by nature?

I have been doing machine learning for a while, but bits and pieces come together even after some time of practicing. In neural networks, you adjust the weights by doing one pass (forward pass), and then computing the partial derivatives for the…
5
votes
1 answer

What is the difference between continual learning and active learning?

As per my understanding, active learning is a kind of continual learning. Is there any difference between them?
5
votes
2 answers

How to continue incremental learning when a categorical variable has been assigned additional category labels?

Please help answer this question or point me to any resource. There is a model in an environment where training happens with new data and the data is discarded after training is completed. This keeps on happening in cycles. Hence we dealing with…
5
votes
1 answer

Real time noise removal using Savitzky-Golay Method

I would like to ask if Savitzky-Golay can be implemented on real-time data. I have used it on a fixed array size, but would like to extend it to output values for real-time sensor data. Can anyone refer me to appropriate implementation or hint…
5
votes
2 answers

online detection of plateaus in time series

I need to detect plateaus in time series data online. The data I am working with represents the magnitude of acceleration of a tri-axis accelerometer. I want to find a reference time window that I can use for calibration purposes. Because of that,…
R. Doe
  • 251
  • 2
  • 6
4
votes
2 answers

Difference between regret and pseudo-regret definitions in multi-armed bandits

I posted this question Cross Validated, but didn't get any answer. So I am posting it here too, as the question is very relevant to machine learning I am following the book Bandit Algorithms. In page 48, they introduces regret after $n$ rounds…
3
votes
1 answer

Online/incremental unsupervised dimensionality reduction for use with classification for event prediction

Consider the application: We have a set of users and items. Users can perform different action types (think browsing, clicking, upvoting etc.) on different items. Users and items accumulate a "profile" for each action type. For users such profile…
3
votes
1 answer

Understanding experiments in Continual Learning

Via paper Continual Learning Through Synaptic Intelligence, I see this figure for Split MNIST benchmark, but there is a point I can get. Here there are 5 tasks, and finally we summarize the average accuracy over the 5 tasks. Here, how the tasks are…
Giang Nguyen
  • 235
  • 2
  • 8
1
2 3 4