Entity Matching by Similarity Join
 
simjoin_entitymatching.feature.feature_base.NewFeatureExtractor Class Reference

Public Member Functions

 __init__ (self)
 

Static Public Member Functions

 extract_feature_vecs (candset, attrs_before=None, feature_table=None, attrs_after=None, group=None, cluster=None, verbose=False, n_jobs=1)
 

Static Protected Member Functions

 _get_num_procs (n_jobs, min_procs)
 
 _apply_feat_fns (tuple1, tuple2, feat_dict, group, cluster)
 
 _get_feature_vals_by_cand_split (pickled_obj, pickled_grp, pickled_clt, fk_ltable_idx, fk_rtable_idx, l_df, r_df, candsplit)
 
 _extract_from (candset, feature_table, group, cluster, n_jobs, verbose=False)
 

Detailed Description

A rewrite of the feature extractor from the py_entitymatching module.
It supports interchangeable values when computing feature values.

Constructor & Destructor Documentation

◆ __init__()

simjoin_entitymatching.feature.feature_base.NewFeatureExtractor.__init__ ( self)

Member Function Documentation

◆ _apply_feat_fns()

simjoin_entitymatching.feature.feature_base.NewFeatureExtractor._apply_feat_fns ( tuple1,
tuple2,
feat_dict,
group,
cluster )
static protected
Applies the feature functions to a pair of tuples, taking interchangeable values into account.
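
The idea of applying feature functions while treating some values as interchangeable can be illustrated with a minimal sketch. This is not the implementation of _apply_feat_fns: the helper names, the shape of feat_dict (feature name mapped to a two-argument callable) and the shape of group (value mapped to a representative of its interchangeable set) are all assumptions made for illustration, and the cluster argument is omitted because this page does not describe it.

```python
# Illustrative sketch only; every name and data shape here is hypothetical.

def canonical(value, group):
    """Return the representative of the value's interchangeable set, if any.

    `group` is assumed to be a dict mapping a hashable value to its
    representative; values without an entry are returned unchanged.
    """
    return group.get(value, value)


def apply_feat_fns_sketch(tuple1, tuple2, feat_dict, group):
    """Apply each feature function to a pair of tuples (plain dicts here).

    Attribute values are mapped to their representatives first, so that
    interchangeable values compare as equal inside the feature functions.
    """
    t1 = {attr: canonical(val, group) for attr, val in tuple1.items()}
    t2 = {attr: canonical(val, group) for attr, val in tuple2.items()}
    return {name: fn(t1, t2) for name, fn in feat_dict.items()}


def exact_name(t1, t2):
    """Toy feature function: 1.0 when the 'name' attributes are equal."""
    return 1.0 if t1["name"] == t2["name"] else 0.0


# "Bob" and "Rob" are declared interchangeable through a shared representative.
group = {"Bob": "Robert", "Rob": "Robert"}
feats = apply_feat_fns_sketch(
    {"name": "Bob"}, {"name": "Rob"}, {"name_exact": exact_name}, group
)
```

With an empty group the same pair would not match under exact_name, which is the difference the interchangeable-value handling is meant to make in this sketch.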

◆ _extract_from()

simjoin_entitymatching.feature.feature_base.NewFeatureExtractor._extract_from ( candset,
feature_table,
group,
cluster,
n_jobs,
verbose = False )
static protected

◆ _get_feature_vals_by_cand_split()

simjoin_entitymatching.feature.feature_base.NewFeatureExtractor._get_feature_vals_by_cand_split ( pickled_obj,
pickled_grp,
pickled_clt,
fk_ltable_idx,
fk_rtable_idx,
l_df,
r_df,
candsplit )
static protected

◆ _get_num_procs()

simjoin_entitymatching.feature.feature_base.NewFeatureExtractor._get_num_procs ( n_jobs,
min_procs )
static protected

◆ extract_feature_vecs()

simjoin_entitymatching.feature.feature_base.NewFeatureExtractor.extract_feature_vecs ( candset,
attrs_before = None,
feature_table = None,
attrs_after = None,
group = None,
cluster = None,
verbose = False,
n_jobs = 1 )
static

The documentation for this class was generated from the following file: