

groupby(by=None, axis=0, level=None, as_index=True, sort=True, group_keys=True, squeeze=False, **kwargs)
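import pandas as pd
df = pd.DataFrame(data={'books':['bk1','bk1','bk1','bk2','bk2','bk3'], 'price': [12,12,12,15,15,17]})
print(df)
# as_index=True (the default): the group key 'books' becomes the index
print(df.groupby('books', as_index=True).sum())
# as_index=False: 'books' stays a regular column
print(df.groupby('books', as_index=False).sum())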


  books  price
0   bk1     12
1   bk1     12
2   bk1     12
3   bk2     15
4   bk2     15
5   bk3     17
       price
books       
bk1       36
bk2       30
bk3       17
  books  price
0   bk1     36
1   bk2     30
2   bk3     17

When as_index=True, the key(s) you pass to groupby become the index of the new DataFrame.
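A minimal sketch of what that means in practice, reusing the example frame from above. The names `indexed` and `flat` are only illustrative; they are not part of the original answer.

```python
import pandas as pd

df = pd.DataFrame({'books': ['bk1', 'bk1', 'bk1', 'bk2', 'bk2', 'bk3'],
                   'price': [12, 12, 12, 15, 15, 17]})

# Illustrative names: 'indexed' keeps the key as the index, 'flat' keeps it as a column
indexed = df.groupby('books', as_index=True).sum()
flat = df.groupby('books', as_index=False).sum()

print(indexed.index.name)     # the group key is now the index name
print(list(indexed.columns))  # only the aggregated column remains
print(list(flat.columns))     # the key is still an ordinary column

# reset_index() turns the indexed result into the flat layout
print(indexed.reset_index().equals(flat))
```

So the two forms carry the same data; as_index only decides whether the key lives in the index or in a column.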

The benefit of as_index=True is that you can pull out the rows you want by key name. For example, if you want 'bk1' you can get it with df.loc['bk1'] (where df is the grouped result), whereas with as_index=False you have to write df.loc[df.books=='bk1'].
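The two lookups side by side, as a rough sketch. Here `indexed` and `flat` are hypothetical names for the two grouped results, so the `.loc` calls run on the grouped frames rather than on the raw input.

```python
import pandas as pd

df = pd.DataFrame({'books': ['bk1', 'bk1', 'bk1', 'bk2', 'bk2', 'bk3'],
                   'price': [12, 12, 12, 15, 15, 17]})

indexed = df.groupby('books', as_index=True).sum()
flat = df.groupby('books', as_index=False).sum()

# as_index=True: select by label; this returns the row as a Series
by_label = indexed.loc['bk1']

# as_index=False: select with a boolean mask; this returns a one-row DataFrame
by_mask = flat.loc[flat['books'] == 'bk1']

print(by_label['price'])
print(by_mask['price'].iloc[0])
```

Note the difference in return type: the label lookup gives a Series for the single matching row, while the mask always gives a DataFrame.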

The other main benefit of as_index=True, raised by @ayhan in the comments: df.loc['bk1'] is faster because, when 'books' is the index, it doesn't have to traverse the entire books column to find 'bk1'. It just computes the hash of 'bk1' and looks it up in one step.
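If you want to check the speed difference yourself, something like the following could work. It is only a sketch: the frame size, the key, and the repeat count are arbitrary assumptions, and the actual timings depend on your machine and pandas version.

```python
import timeit
import pandas as pd

# Assumed test data: 50,000 distinct keys, one row per key
n = 50_000
df = pd.DataFrame({'books': ['bk%d' % i for i in range(n)],
                   'price': range(n)})

indexed = df.groupby('books', as_index=True).sum()
flat = df.groupby('books', as_index=False).sum()

key = 'bk49999'

# Label lookup through the index
t_label = timeit.timeit(lambda: indexed.loc[key], number=100)

# Boolean mask: compares every value in the 'books' column
t_mask = timeit.timeit(lambda: flat.loc[flat['books'] == key], number=100)

print('label lookup: %.4f s' % t_label)
print('boolean mask: %.4f s' % t_mask)
```

With only a handful of groups, as in the toy example above, the difference is unlikely to matter; it becomes relevant when the grouped result has many keys.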

