dtoolkit.geoaccessor.geodataframe.drop_duplicates_geometry

dtoolkit.geoaccessor.geodataframe.drop_duplicates_geometry(df: GeoDataFrame, /, predicate: Literal['intersects', 'crosses', 'overlaps', 'touches', 'covered_by', 'contains_properly', 'contains', 'within', 'covers'] | None = None, keep: Literal['first', 'last', False] = 'first') → GeoDataFrame

Remove duplicate geometry rows.

Parameters:

predicate{‘intersects’, ‘crosses’, ‘overlaps’, ‘touches’, ‘covered_by’, ‘contains_properly’, ‘contains’, ‘within’, ‘covers’}, default None

The binary predicate used to decide whether two geometries are duplicates of each other. If None, the geometries are compared directly by value rather than by spatial relation.

Changed in version 0.0.19: The ‘predicate’ default value is changed from ‘intersects’ to None.

keep{‘first’, ‘last’, False}, default ‘first’

first : Drop duplicates except for the first occurrence.
last : Drop duplicates except for the last occurrence.
False : Drop all duplicates.
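
The three keep options can be illustrated with a minimal pure-Python sketch. It follows the pandas-style duplicate-marking convention and compares plain values (as with predicate=None); the helpers duplicated and drop_duplicates below are hypothetical stand-ins written for this illustration, not dtoolkit's implementation, and WKT strings stand in for geometries.

```python
from collections import Counter


def duplicated(values, keep="first"):
    """Return one boolean per value; True marks a value as a duplicate."""
    if keep not in ("first", "last", False):
        raise ValueError("keep must be 'first', 'last' or False")

    if keep is False:
        # Every value that occurs more than once is marked.
        counts = Counter(values)
        return [counts[v] > 1 for v in values]

    # For 'last', scan from the end so the last occurrence is seen first.
    order = values if keep == "first" else list(reversed(values))
    seen = set()
    flags = []
    for v in order:
        flags.append(v in seen)
        seen.add(v)
    return flags if keep == "first" else flags[::-1]


def drop_duplicates(values, keep="first"):
    """Keep only the values that are not marked as duplicates."""
    return [v for v, dup in zip(values, duplicated(values, keep)) if not dup]


# WKT strings used as illustrative stand-ins for geometries.
wkts = ["POINT (0 0)", "POINT (1 1)", "POINT (0 0)", "POINT (2 2)"]

first_kept = drop_duplicates(wkts, keep="first")
last_kept = drop_duplicates(wkts, keep="last")
none_kept = drop_duplicates(wkts, keep=False)
```

With keep='first' the second "POINT (0 0)" is dropped, with keep='last' the first one is dropped, and with keep=False both are dropped.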

Returns:

GeoDataFrame

See also

geopandas.sjoin
dtoolkit.geoaccessor.geoseries.duplicated_geometry
dtoolkit.geoaccessor.geoseries.drop_duplicates_geometry
dtoolkit.geoaccessor.geodataframe.duplicated_geometry
dtoolkit.geoaccessor.geodataframe.drop_duplicates_geometry

Examples

>>> import dtoolkit.geoaccessor
>>> import geopandas as gpd
>>> from shapely import Polygon
>>> df = gpd.GeoDataFrame(
...     geometry=[
...         Polygon([(0,0), (1,0), (1,1), (0,1)]),
...         Polygon([(1,1), (2,1), (2,2), (1,2)]),
...         Polygon([(2,2), (3,2), (3,3), (2,3)]),
...         Polygon([(2, 0), (3, 0), (3, 1)]),
...     ],
... )
>>> df
                              geometry
0  POLYGON ((0 0, 1 0, 1 1, 0 1, 0 0))
1  POLYGON ((1 1, 2 1, 2 2, 1 2, 1 1))
2  POLYGON ((2 2, 3 2, 3 3, 2 3, 2 2))
3       POLYGON ((2 0, 3 0, 3 1, 2 0))

Works for GeoSeries.

>>> df.geometry.drop_duplicates_geometry('intersects')
0  POLYGON ((0 0, 1 0, 1 1, 0 1, 0 0))
3       POLYGON ((2 0, 3 0, 3 1, 2 0))
Name: geometry, dtype: geometry

Works for GeoDataFrame too.

>>> df.drop_duplicates_geometry('intersects')
                              geometry
0  POLYGON ((0 0, 1 0, 1 1, 0 1, 0 0))
3       POLYGON ((2 0, 3 0, 3 1, 2 0))
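
The difference between the default value comparison (predicate=None) and a spatial predicate such as 'intersects' can be sketched without any geometry library. The sketch below is an illustration under stated assumptions, not dtoolkit's algorithm: geometries are simplified to axis-aligned boxes (minx, miny, maxx, maxy), the helpers boxes_intersect and drop_duplicates_by are hypothetical, and a row is assumed to count as a duplicate when it relates to any earlier row, which is consistent with the output shown above for keep='first'.

```python
def boxes_intersect(a, b):
    """True if two (minx, miny, maxx, maxy) boxes share at least one point."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def drop_duplicates_by(items, same):
    """Keep an item only if `same` is False against all earlier items.

    Assumption: this mirrors keep='first'; earlier items count even if
    they were themselves dropped.
    """
    kept = []
    for i, item in enumerate(items):
        if not any(same(item, earlier) for earlier in items[:i]):
            kept.append(item)
    return kept


# Three squares touching corner to corner, plus one isolated box
# (a stand-in for the triangle in the example above).
boxes = [(0, 0, 1, 1), (1, 1, 2, 2), (2, 2, 3, 3), (4, 0, 5, 1)]

# Value comparison: no two boxes are equal, so nothing is dropped.
by_value = drop_duplicates_by(boxes, lambda a, b: a == b)

# Spatial comparison: boxes sharing a corner intersect, so the second
# and third squares are treated as duplicates.
by_intersects = drop_duplicates_by(boxes, boxes_intersect)
```

Under these assumptions the value comparison keeps all four boxes, while the spatial comparison keeps only the first square and the isolated box, matching the rows 0 and 3 retained in the example.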