Use for questions about graphing a dataset into a visualisation (e.g. line plot, histogram or pie chart). Visualisation and plotting is an important EDA tool as well as presenting the results of a data science project.
Questions tagged [plotting]
156 questions
28
votes
6 answers
make seaborn heatmap bigger
I create a corr() df out of an original df. The corr() df came out 70 X 70 and it is impossible to visualize the heatmap... sns.heatmap(df). If I try to display the corr = df.corr(), the table doesn't fit the screen and I can see all the…
redeemefy
- 661
- 1
- 6
- 9
7
votes
1 answer
How to interpret Shapley value plot for a model?
I was trying to use Shapley value approach for understanding the model predictions. I am trying this on a Xgboost model. My plot looks like as below
Can someone help me interpret this? Or confirm my understanding is correct?
My interpretation
1)…
The Great
- 2,725
- 3
- 23
- 49
7
votes
3 answers
How to plot clusters in nice a way?
I have a large text dataset clusterized. Each cluster is represented by a centroid of the vectorized texts that belong to it, the number of texts, the created date, and other parameters. I can't plot the clusters in an n-dimensional space. Which…
Federico Caccia
- 760
- 1
- 6
- 18
6
votes
4 answers
How can I observe my CSV files better?
I'm running a lot of experiments that give their output as CSV files. An experiment might be running for hours, with a new line being added to the CSV every 10 seconds.
Right now I'm opening these CSV files in a text editor, which isn't too…
Ram Rachum
- 255
- 1
- 6
5
votes
2 answers
Data Visualization with multiple dimension, and linear separability
I have a dataset of two classes with several features, how can I visualise such data using Matlab to decide if it is linear separable or not?
Atheer
- 341
- 2
- 14
5
votes
3 answers
Scatter plot for binary class dataset with two features in python
I have my dataset that has multiple features and based on that the dependent variable is defined to be 0 or 1.
I want to get a scatter plot such that all my positive examples are marked with 'o' and negative ones with 'x'.
I am using python and here…
Nitish
- 91
- 1
- 1
- 7
4
votes
2 answers
Maximum Likelihood estimation
Given a sample $ X_1,X_2 \dots X_{100}$ and the density function $ f(x;\theta) = \frac{1}{\pi \cdot \left(1+\left(x-\theta \right)^2\right)}$ , find an approximate solution for $\hat{\theta}_{MLE.}$
My attempt:
I have found the joint likelihood…
Mahajna
- 53
- 4
4
votes
1 answer
How can I make seaborn do PairPlot such that it wraps around?
I'm trying to do some PairPlots using seaborn, so I can compare a bunch of features against the label I'm trying to model. There are a bunch of features though (~50). So I'm basically doing:
g =…
GrundleMoof
- 311
- 2
- 4
- 7
4
votes
2 answers
Plots with shaded standard deviation
What tools can I use to make a visualization similar to this one? I want to have the mean be bolded and the standard deviation be shaded.
Jeffrey
- 41
- 1
- 1
- 2
4
votes
1 answer
How to use plot model in keras?
I'm trying to draw the keras model with the plotmodel.
Setup:
I installed graphviz binaries with:
choco install graphviz
added path to the bin folder, and then I did:
pip install pydotplus
pip install graphviz
Code:
Getting this error when I try…
Srihari
- 797
- 4
- 12
- 27
4
votes
1 answer
Why do I need to call np.transpose() on this?
I have the following python script:
import numpy as np
import matplotlib.pyplot as plt
np.random.seed(123)
# initialize and populate all_walks
all_walks = []
for i in range(5) :
random_walk = [0]
for x in range(100) :
step =…
JouJour
- 91
- 4
3
votes
1 answer
Plotting multiple precision-recall curves in one plot
I have an imbalanced dataset and I was reading this article which looks into SMOTE and RUS to address the imbalance. So I have defined the following 3 models:
# AdaBoost
ada = AdaBoostClassifier(n_estimators=100, random_state=42)
…
sums22
- 447
- 5
- 15
3
votes
3 answers
What is this type of plot and how to interpret it?
I was presented to this plot and i've seen it before, however i would like to know what does the countour around the main line mean.
Ps: i couldn't find enough information about it to investigate by myself, so i had to create this question.
heresthebuzz
- 395
- 3
- 11
3
votes
1 answer
Plotting Gradient Descent in 3d - Contour Plots
I have generated 3 parameters along with the cost function. I have the $\theta$ lists and the cost list of 100 values from the 100 iterations. I would like to plot the last 2 parameters against cost in 3d to visualize the level sets on the contour…
m2rik
- 321
- 2
- 11
3
votes
1 answer
Visualizing the difference of a set of strings
I have a distance metric on a collection of strings on the order of tens of thousands. What would be an intuitive way to summarize how 'different' these strings are or when they overlap?
My goal is, to visually ensure high entropy and be able to…
Anisotropic
- 131
- 3