API
- class oagdedupe.api.BaseModel(settings: ~oagdedupe.settings.Settings, cluster: ~oagdedupe.base.BaseCluster = <class 'oagdedupe.cluster.cluster.ConnectedComponents'>)[source]
Abstract base class from which all model classes inherit. All descendent classes must implement predict, train, and candidates methods.
- __init__(settings: ~oagdedupe.settings.Settings, cluster: ~oagdedupe.base.BaseCluster = <class 'oagdedupe.cluster.cluster.ConnectedComponents'>) None
- _abc_impl = <_abc_data object>
- cluster
alias of
ConnectedComponents
- predict() Union[DataFrame, Tuple[DataFrame]][source]
fast-api trains model on latest labels then submits scores to postgres
clusterer loads scores and uses comparison indices and predicted probabilities to generate clusters
- Returns
df (pd.DataFrame) – if dedupe, returns single df
df,df2 (tuple) – if recordlinkage, two dataframes
- class oagdedupe.api.Dedupe(settings: ~oagdedupe.settings.Settings, cluster: ~oagdedupe.base.BaseCluster = <class 'oagdedupe.cluster.cluster.ConnectedComponents'>)[source]
General dedupe block, inherits from BaseModel.
- __init__(settings: ~oagdedupe.settings.Settings, cluster: ~oagdedupe.base.BaseCluster = <class 'oagdedupe.cluster.cluster.ConnectedComponents'>) None
- _abc_impl = <_abc_data object>
- class oagdedupe.api.Fapi(settings: ~oagdedupe.settings.Settings, cluster: ~oagdedupe.base.BaseCluster = <class 'oagdedupe.cluster.cluster.ConnectedComponents'>)[source]
General dedupe block, inherits from BaseModel.
- __init__(settings: ~oagdedupe.settings.Settings, cluster: ~oagdedupe.base.BaseCluster = <class 'oagdedupe.cluster.cluster.ConnectedComponents'>) None
- _abc_impl = <_abc_data object>
- class oagdedupe.api.RecordLinkage(settings: ~oagdedupe.settings.Settings, cluster: ~oagdedupe.base.BaseCluster = <class 'oagdedupe.cluster.cluster.ConnectedComponents'>)[source]
General dedupe block, inherits from BaseModel.
- __init__(settings: ~oagdedupe.settings.Settings, cluster: ~oagdedupe.base.BaseCluster = <class 'oagdedupe.cluster.cluster.ConnectedComponents'>) None
- _abc_impl = <_abc_data object>