pyspark.pandas.Series.cat.remove_unused_categories#
- cat.remove_unused_categories()#
Remove categories which are not used.
- Returns
- catSeries
Categorical with unused categories dropped
See also
rename_categoriesRename categories.
reorder_categoriesReorder categories.
add_categoriesAdd new categories.
remove_categoriesRemove the specified categories.
set_categoriesSet the categories to the specified ones.
Examples
>>> s = ps.Series(pd.Categorical(list("abbccc"), categories=['a', 'b', 'c', 'd'])) >>> s 0 a 1 b 2 b 3 c 4 c 5 c dtype: category Categories (4, object): ['a', 'b', 'c', 'd']
>>> s.cat.remove_unused_categories() 0 a 1 b 2 b 3 c 4 c 5 c dtype: category Categories (3, object): ['a', 'b', 'c']