Questions tagged [bayesian]

Bayesian statistics is a statistical paradigm that contrasts with that of frequentist statistics. Bayesian methods rely on prior information to determine the degree of belief in the probability of a value.

139 questions
21
votes
2 answers

What is the correct meaning and interpretation of p-values?

I’m posting this question, and an answer, to help dispel a few misunderstandings about what p-values are. As a hiring manager interviewing mid-level and senior data scientists, I have noticed these misunderstandings many times. I have also noticed…
Robert Long
  • 3,518
  • 12
  • 30
13
votes
2 answers

What makes a Tree-Structured Parzen Estimator "tree-structured?"

From what I understand the Tree-Structured Parzen Estimator (TPE) creates two probability models based on hyperparameters that exceed the performance of some threshold and hyperparameters that don't. What I don't fully understand is why TPE is…
AAC
  • 509
  • 2
  • 6
  • 13
6
votes
1 answer

Does the Bayesian MAP give a probability distribution over unseen data?

I'm working my way through the Bayesian world. So far I've understood that the MLE and the MAP are point estimates, so using such models just outputs one specific value and not a distribution. Moreover, vanilla neural networks do in fact…
6
votes
1 answer

Changing the batch size during training

The choice of batch size is in some sense a measure of stochasticity: on one hand, smaller batch sizes make gradient descent more stochastic, so SGD can deviate significantly from exact GD on the whole data, but this allows for more…
6
votes
2 answers

Effect of outliers on Naive Bayes

Are Naive Bayes algorithms affected by outliers in the data? Given a data set, does one need to remove outliers before applying Naive Bayes?
James Smith
  • 61
  • 1
  • 2
6
votes
1 answer

Modeling uncertainty from Logistic Regression

Logistic regression is one part of a simulation pipeline that I use for some scenario analysis. The dataset it is based on is not small but is relatively noisy, and has only one explanatory variable/feature. Of course I can say something about this…
Jan van der Vegt
  • 9,448
  • 37
  • 52
6
votes
3 answers

Which tribe do Probabilistic Graphical Models fall under?

Pedro Domingos in "The Master Algorithm" listed five tribes of machine learning algorithms: Symbolists, Connectionists, Evolutionaries, Bayesians, and Analogizers. Which category do probabilistic graphical models fall under? From Wikipedia…
alvas
  • 2,510
  • 7
  • 28
  • 40
5
votes
2 answers

How useful is Bayesian Inference?

Over the last few months, I have been exposed to Bayesian inference in an ML course. With further investigation, I came across the MCMC technique for simulating the posterior distribution. It seems interesting. However, I am not sure if it is really…
chris tan
  • 53
  • 4
4
votes
2 answers

High Recall but too low Precision result in imbalanced data

I was training a model using the XGBoost classifier on a heavily imbalanced dataset with a 232:1 binary class ratio. Because my training data contains 750k rows and 320 features (after much feature engineering, feature correlation filtering, and low…
4
votes
1 answer

How is Bayesian risk computed to prune decision trees?

I've been trying to follow this paper on Bayesian Risk Pruning. I'm not very familiar with this type of pruning, but I'm wondering a few things: (1) The paper describes risk-rates to be defined per example. We have $R_k(a_i|x)=\sum\limits_{j=1,j…
nrael
  • 51
  • 8
4
votes
1 answer

When to use Bayesian linear regression instead of linear regression?

When does it make sense to use a Bayesian approach, for example in the context of linear regression? To be more concrete: assume you measure a certain number of devices and you want to check the linear relationship between, e.g., voltage and current. Why…
Ben
  • 570
  • 5
  • 16
4
votes
2 answers

Mean estimation for nested location data

I want to estimate the average income for a location. I have nested data in the following way: A block is inside a neighborhood, which is inside a zipcode, which is inside a district, which is inside a region, which is inside a state. I want to…
David Masip
  • 6,136
  • 2
  • 28
  • 62
4
votes
1 answer

Bayesian regularization vs. dropout for a basic ANN

Does it make sense conceptually to apply dropout to an artificial neural network while also applying Bayesian regularization? On one hand I would think that technically this should work just fine, but on the other hand if Bayesian regularization is…
Bram
  • 143
  • 2
4
votes
3 answers

What is the meaning of likelihood?

I am studying Bayesian probability applied to machine learning, and I have encountered the concept of likelihood, which I don't understand. I have seen that the Bayes rule is: $P(A|B)=\frac{P(B|A)P(A)}{P(B)}$ where $P(B|A)P(A)$ is the conditional…
J.D.
  • 941
  • 6
  • 20
  • 33
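
To make the Bayes rule quoted in the excerpt above concrete, here is a minimal Python sketch that evaluates $P(A|B)=\frac{P(B|A)P(A)}{P(B)}$ for one case. The function name `posterior` and all the numbers (a 1% prior, 95% and 5% likelihoods) are made up for illustration and do not come from the question. $P(B)$ is expanded with the law of total probability.

```python
def posterior(prior_a, lik_b_given_a, lik_b_given_not_a):
    """Return P(A|B) via Bayes' rule for a binary hypothesis A."""
    # Evidence: P(B) = P(B|A) P(A) + P(B|not A) P(not A)
    evidence = lik_b_given_a * prior_a + lik_b_given_not_a * (1.0 - prior_a)
    # Posterior: P(A|B) = P(B|A) P(A) / P(B)
    return lik_b_given_a * prior_a / evidence


# Hypothetical numbers, chosen only for illustration:
# P(A) = 0.01 (prior), P(B|A) = 0.95 (likelihood), P(B|not A) = 0.05
p = posterior(prior_a=0.01, lik_b_given_a=0.95, lik_b_given_not_a=0.05)
print(f"P(A|B) = {p:.3f}")
```

With these assumed inputs the posterior comes out at roughly 0.16. Even a high likelihood $P(B|A)$ gives a modest $P(A|B)$ when the prior $P(A)$ is small, which is why the likelihood and the posterior should not be confused.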
4
votes
2 answers

Bayesian optimisation in deep learning

Has anyone tried using Bayesian optimisation to find the best learning rates and other hyperparameters for deep learning? How do you change the parameters during training? Are there any examples using callbacks? Can I see some code that implements them?
vizakshat
  • 465
  • 2
  • 5
  • 16