How to find outliers for categorical variables in python?
Answers
Answered by
2
Answer:
Using IQR
Arrange the data in increasing order.
Calculate first(q1) and third quartile(q3)
Find interquartile range (q3-q1)
Find lower bound q1*1.5.
Find upper bound q3*1.5.
Anything that lies outside of lower and upper bound is an outlier.
With categorical data you have to explain why choosing an apple is considered an anomaly (that data point does not behave as the rest 99.9% of the population).
hope it helps ☺
Similar questions