dependencies Package

Functions

plot_dependency_metadata

Plot a graph of functional dependencies based on metadata contained in the provided dataframe.

The input "metadata" dataframe should contain one row per functional dependency. Each row names the "determinant" column(s) and the "dependent" column of the underlying "data" dataframe. A dependency holds when each determinant value maps to a single dependent value, so one dependent value may correspond to many determinant values (a 1:m mapping of the value pairs).
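To make the determinant/dependent relationship concrete, the following minimal sketch checks whether one column functionally determines another. It uses plain Python dictionaries instead of a dataframe, and the helper name `is_functional_dependency` and the sample columns are hypothetical illustrations, not part of this package.

```python
from collections import defaultdict


def is_functional_dependency(rows, determinant, dependent):
    """Return True if every determinant value maps to exactly one dependent value."""
    seen = defaultdict(set)
    for row in rows:
        seen[row[determinant]].add(row[dependent])
    return all(len(values) == 1 for values in seen.values())


# Hypothetical "data" rows: each zip code belongs to one city,
# but one city can have several zip codes.
data = [
    {"zip": "98052", "city": "Redmond"},
    {"zip": "98052", "city": "Redmond"},
    {"zip": "98004", "city": "Bellevue"},
    {"zip": "98007", "city": "Bellevue"},
]

zip_determines_city = is_functional_dependency(data, "zip", "city")
city_determines_zip = is_functional_dependency(data, "city", "zip")
```

In this sample, "zip" would be listed as the determinant and "city" as the dependent; the reverse direction does not hold because "Bellevue" appears with two zip codes.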

If the "conditional_entropy" column is present, it determines the line style: a solid line for zero entropy, which signifies an exact mapping, and a dashed line for non-zero entropy, which signifies an approximate mapping. The "conditional_entropy" values are generated by find_dependencies.
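As an illustration of why zero entropy corresponds to an exact mapping, the sketch below computes the conditional entropy H(dependent | determinant) from a list of value pairs. It is an assumption-laden sketch, not the computation performed by find_dependencies: the function name `conditional_entropy` is hypothetical, and the base-2 logarithm is an arbitrary choice (the zero versus non-zero distinction does not depend on the base).

```python
import math
from collections import Counter


def conditional_entropy(pairs):
    """Compute H(dependent | determinant) in bits from (determinant, dependent) pairs."""
    pairs = list(pairs)
    n = len(pairs)
    joint = Counter(pairs)                     # counts of each (x, y) pair
    marginal = Counter(x for x, _ in pairs)    # counts of each determinant value x
    entropy = 0.0
    for (x, _y), count in joint.items():
        # p(x, y) * log2(p(y | x)), with p(y | x) = count / marginal[x]
        entropy -= (count / n) * math.log2(count / marginal[x])
    return entropy


# Exact mapping: every determinant value has a single dependent value.
exact = [("a", 1), ("a", 1), ("b", 2)]
# Approximate mapping: "a" maps to two different dependent values.
approximate = [("a", 1), ("a", 2), ("b", 3), ("b", 3)]

exact_entropy = conditional_entropy(exact)
approximate_entropy = conditional_entropy(approximate)
```

Under this definition, the first sample would be drawn with a solid line and the second with a dashed line.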

plot_dependency_metadata(metadata_df: DataFrame) -> graphviz.Digraph

Parameters

Name Description
metadata_df
Required

pandas DataFrame listing functional dependencies. At a minimum, it must contain the columns "determinant" and "dependent".

Returns

Type Description
graphviz.Digraph

Graphviz object containing the dependency graph.
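The sketch below shows one way metadata rows of this shape could translate into a Graphviz graph, using the solid/dashed rule described above. It is not the implementation of plot_dependency_metadata: the helper `metadata_to_dot` is hypothetical, it emits DOT source text rather than a graphviz.Digraph, it handles only single-column determinants, and it assumes a solid line when "conditional_entropy" is missing.

```python
def metadata_to_dot(rows):
    """Build DOT source with one edge per (determinant, dependent) row."""
    lines = ["digraph dependencies {"]
    for row in rows:
        entropy = row.get("conditional_entropy")
        # Assumption: rows without an entropy value are drawn as exact (solid).
        style = "dashed" if entropy is not None and entropy > 0 else "solid"
        lines.append(
            f'    "{row["determinant"]}" -> "{row["dependent"]}" [style={style}];'
        )
    lines.append("}")
    return "\n".join(lines)


# Hypothetical metadata: one row per functional dependency.
metadata = [
    {"determinant": "zip", "dependent": "city", "conditional_entropy": 0.0},
    {"determinant": "city", "dependent": "state", "conditional_entropy": 0.12},
    {"determinant": "order_id", "dependent": "customer_id"},
]

dot_source = metadata_to_dot(metadata)
print(dot_source)
```

The resulting text can be rendered by any Graphviz tool that accepts DOT input; the real function returns a graphviz.Digraph object instead.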