Questions tagged [statistical-significance]

13 questions
21
votes
2 answers

What is the correct meaning and interpretation of p-values?

I’m posting this question, and an answer, to help dispel a few misunderstandings about what p-values are. As a hiring manager interviewing mid-level and senior data scientists, I have noticed these misunderstandings many times. I have also noticed…
Robert Long
  • 3,518
  • 12
  • 30
3
votes
1 answer

Best Tests for Correlation Between Categorical and Numeric Variables (Non-Normal Data)

I’m still learning data science and trying to improve my understanding of statistical tests. Right now, I’m working with a dataset where I have a categorical feature (e.g., “School Type” with values like Public, Private…) and a numeric target (e.g.,…
2
votes
0 answers

Multiclassification McNemar

I have a multiclassification problem and I want ro compare two classifiers using McNemar's test to find if there is any statistical significance. Do I have to do it for every class (one vs all approach) or is there another way? Thanks in advance,…
1
vote
0 answers

Match between Regression prediction and human-produced guesses for upper and lower threshold

I have some database, containing numerical data about products. I use different models for predicting the value of a feature, e.g. batttery capacity of a laptop, given other features, such as size, CPU core count, etc. The models, among those also…
1
vote
0 answers

How to calcualte LR_statistic for the below data

I am trying to calculate LR_statistic for two models where my log-likelihood ratio values as follows. model1.llf = -2326.9 and model2.llf = -4649.1. When read online some of the articles showing The LR statistic is calculated by taking the…
1
vote
0 answers

Find out Statistical test for binary data

we are ran an email campaign for users which is called my treatment and there is control group as well. We want to measure the performance using a Metric called A. If he belongs to metric A then we call the user as 1 else 0. My data for metric A:…
1
vote
1 answer

Testing Marketing vehicle effectiveness

I have a set of customers who have been targetted using a marketing vehicle. Now i want to estimate if the marketing vehicle resulted in a better conversion (did they buy after that or not). Currently, im taking a simple approach, where I check the…
0
votes
0 answers

What statistical sets should be used to compare 2 dichotomous variables?

You have an ML model that outputs 100,000 out of a million observations of a dichotomous variable 0 or 1. You want to see whether the distribution of the outputted 100,000 observations of the model is similar to the 1 mil. You apply the model 3…
0
votes
0 answers

What model to use for binary independent variable and count dependent variable

I have two datasets in long formats. Dataset #1: The columns are weeks (202301, 202302, ..., 202352). The indices are item ID's. The values are sales units (count). Dataset #2: The columns are weeks (202301, 202302, ..., 202352). The indices are…
0
votes
0 answers

statistical test of cell type proportions in clusters

I have a compostion dataframe which contains spotwise compositons of cell types and their cluster informations. I am required to compare the mean abundance of cell types in each cluster with other clusters. Which test should I use and how should I…
0
votes
1 answer

Selecting optimal regression model using cross validation

I have a logistic mixed model (lme4 package in R). I want to assess whether participants scores on the measures 'sumspq', 'sumpdi', and 'sumcaps' significantly affect the difference in performance between 2 conditions. I first run the…
0
votes
1 answer

How do I differentiate validity-coefficient from alpha-coefficient for a significance test?

Two terms are in vogue for a longtime.Alpha_coefficient is used for denoting significance. And validity coefficient denotes measurement error by two or more raters or evaluators of an object or human-being. What are the distinctions between them? …