I have a column in a Dask DataFrame that contains comma-separated lists of categories. I want to replicate the functionality of sklearn's MultiLabelBinarizer or the pandas method Series.str.get_dummies(','), exactly as this thread describes: Create dummies from column with multiple values in dask
Is there really no way to do this, as the one answer there states? Could it be implemented if I already had a list of all of the possible values?