Use, pd.cut
带可选参数include_lowest=True
,将total_sale
中的值归入Quartiles
。
df['total_sale_Quartile'] = (
pd.cut(
df['total_sale'], bins=[50, 100, 250, 300, 600],
labels=['Q1', 'Q2', 'Q3', 'Q4'], include_lowest=True)
OR, use pd.qcut
如果你想仅仅根据量化范围对该列进行分类。
df['total_sale_Quartile'] = (
pd.qcut(df['total_sale'], 4, labels=['Q1', 'Q2', 'Q3', 'Q4'])
Result:
# print(df)
product_x year total_price total_sale total_sale_Quartile
0 A 2016 50 200 Q2
1 B 2016 200 100 Q1
2 A 2017 250 250 Q2
3 B 2017 1000 300 Q3