depictr.missingness_map#

depictr.missingness_map(data, sort=True, legend_inside=False, title=None)#

Tile map of missing values, one column per variable and one row per record.

Variables are ordered most- to least-missing, so the worst offenders sit on the left. The percentage missing is shown in each axis label.

Parameters:
  • data (pandas.DataFrame)

  • sort (bool) – Order columns by their proportion of missing values.

  • legend_inside (bool) – When True (and sort is on), place the legend in the top-right. Because the columns are sorted, the right-hand ones are the most complete, so a legend there sits over a solid “Present” block and hides no missing marks.

  • title (str, optional)

Return type:

plotnine.ggplot