2

My question is similar to this post: What does the long curve-shape t-SNE mean?

but the problem is that my data is not time series, but I also get these long shape line in tSNE plot .

enter image description here

I don't know what that means or how I can get rid of that. I suspect some structure in the data causing that. But I could not find a useful resource explaining what structure in data could cause this.

My data is single cell data, each row is a cell, each column is a biology marker (e.g. CD3, CD4), so the data is the marker expression on each cell.

dalide
  • 21
  • 3

1 Answers1

1

Welcome to the community Dalide!

There is a sequential pattern in your data. As a non-Bioinformatics guy, I simply googled and saw that biomarker are also vastly used for learning the Progression of a disease (e.g. Alzheimer). I hope I am right about example and if I am, then it is not a big surprise that you see those curves which might indicate the progress of something. If you need more precise answer you better give some more info about the origin of data (or a link to your data if it is publicly available).

From ML point of view, about which I fortunately know a bit, those curves in tSNE are caused by sequential patterns.

Fun: As an equally surprising example, once I found sequential pattern in characteristics of different Wines!! (each row a Wine and each column a wine property like Alcohole%, density, etc.)

Kasra Manshaei
  • 6,752
  • 1
  • 23
  • 46