dtoolkit.accessor.series.drop_not_duplicates#

dtoolkit.accessor.series.drop_not_duplicates(s: Series, /, keep: Literal['first', 'last', False] = False) Series[source]#

Return duplicate Series values.

A sugary syntax wraps duplicated():

s[s.duplicated(keep=keep)]
Parameters:
keep{‘first’, ‘last’, False}, default False

Method to handle duplicates:

  • ‘first’ : Keep duplicates except for the first occurrence.

  • ‘last’ : Keep duplicates except for the last occurrence.

  • False : Keep all duplicates.

Returns:
Series

Kept duplicate values.

Examples

>>> import dtoolkit
>>> import pandas as pd
>>> animals = pd.Series(['llama', 'cow', 'llama', 'beetle', 'llama'])
>>> animals
0      llama
1       cow
2      llama
3    beetle
4      llama
dtype: object
>>> animals.drop_not_duplicates()
0      llama
2      llama
4      llama
dtype: object