CREW is a cluster-based explainer for Entity Matching that generates faithful and easily interpretable explanations of matching decisions. Given a pair of entity descriptions, classified as "matching" or "not matching" by any model, CREW:
- Clusters the words according to the relatedness heuristics detailed in the paper;
- Measures the importance of each cluster on the model behavior with a local, post-hoc explainer.

An explanation consists of word clusters, each paired with its importance coefficient.
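For instance, an explanation might look like the following sketch (the clusters and coefficients are made up for illustration; the actual output format is defined by the code):

```python
# Hypothetical CREW explanation: each word cluster is paired with an
# importance coefficient, as estimated by the LIME-style explainer.
# The sign indicates whether the cluster supports or opposes the decision.
explanation = [
    (("samsung", "galaxy"), 0.42),
    (("64gb", "64", "gb"), 0.31),
    (("black", "blue"), -0.17),
]
```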
For a detailed description of the work, please read our paper. If you use the code from this repository in your work, please cite the paper:
@article{}
To generate explanations, run:

```
crew.py [-h] [--semantic {True,False}] [--importance {True,False}] [--schema {True,False}]
        [--wordpieced {True,False}] [--max_seq_length MAX]
        [--lime_n_word_samples WS] [--lime_n_group_samples GS]
        [--lime_n_word_features WF]
        [--gpu {True,False}] [-seed SEED]
        data_dir model_dir expls_dir
```
Positional arguments:

`data_dir`
: The directory with the input data.

`model_dir`
: The directory with the model.

`expls_dir`
: The directory in which the explanations and other additional outputs will be saved.
Optional arguments:

`--semantic`
: Include semantic relatedness. By default is `True`.

`--importance`
: Include importance relatedness. By default is `True`.

`--schema`
: Include schema relatedness. By default is `True`.

`--wordpieced`
: If `True`, tokenize the output. If `False`, preserve the original words. By default is `False`.

`--max_seq_length`
: The maximum total length of the tokenized input sequence. Sequences longer than this will be truncated, and sequences shorter will be padded.

`--lime_n_word_samples`
: Number of perturbed samples to learn a word-level explanation [LIME]. By default is `1000`.

`--lime_n_group_samples`
: Number of perturbed samples to learn a group-level explanation [LIME]. By default is `200`.

`--lime_n_word_features`
: Maximum number of features present in an explanation [LIME]. By default is `70`.

`--gpu`
: If `True`, run on GPU. If `False`, run on CPU. By default is `True`.

`-seed`, `--seed`
: The random seed.
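For example, a typical invocation might be (the data, model, and output paths are hypothetical placeholders):

```
python crew.py --wordpieced False --seed 42 data/my_dataset models/my_model expls/my_dataset
```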
To evaluate the degradation score on a particular experiment, run:

```
degra.py [-h] [--degra_step STEP] [--gpu {True,False}] expls_dir model_dir
```
Positional arguments:

`expls_dir`
: The directory with the explanations and additional outputs.

`model_dir`
: The directory with the model.
Optional arguments:

`--degra_step`
: The percentage step between two degradation levels. 100% divided by this step must be an integer, which is the number of degradation levels; e.g., the default step of `0.10` (10%) yields 10 levels.

`--gpu`
: If `True`, run on GPU. If `False`, run on CPU. By default is `True`.
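For example (again with hypothetical paths):

```
python degra.py --degra_step 0.20 expls/my_dataset models/my_model
```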
This command creates a binary file, `degrad_score`, in the explanations directory. This file contains a dictionary with the degradation score, alongside the information needed to plot the LERF and MORF curves. To plot them, run the following cell in a notebook:
```python
import pickle
import matplotlib.pyplot as plt

expls_dir = 'expls/my_dataset'  # the explanations directory passed to degra.py

# Load the dictionary produced by degra.py.
with open(f'{expls_dir}/degrad_score', 'rb') as f:
    degradict = pickle.load(f)

steps = degradict['degrad_steps']
lerf_f1 = degradict['lerf_f1']
morf_f1 = degradict['morf_f1']

# Prepend the starting point at 0% degradation.
steps = [0.] + steps
lerf_f1 = [1.] + lerf_f1
morf_f1 = [1.] + morf_f1

plt.plot(steps, lerf_f1, label='lerf', marker='o')
plt.plot(steps, morf_f1, label='morf', marker='o')
plt.legend()
plt.show()
```
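The same dictionary also stores the degradation score itself. The exact key name is not shown here, so inspect the dictionary first to find it:

```python
print(degradict.keys())  # lists everything degra.py saved, including the score entry
```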
To render an explanation as a .html file, run:

```
render.py [-h] expls_dir data_inx render_dir
```
Positional arguments:

`expls_dir`
: The directory with the explanations and additional outputs.

`data_inx`
: The indexes of the pairs of entity descriptions for which to render the explanations. If `-1`, render all the explanations.

`render_dir`
: The directory where the .html files that render the explanations will be written.
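For example, to render the explanation of the record at index 11 (with hypothetical paths):

```
python render.py expls/my_dataset 11 renders/my_dataset
```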
Inside `render_dir`, the command creates two folders, `wexpls` and `gexpls`, containing the word-level and group-level rendered explanations, respectively. Each rendered explanation is a .html file, such as `expl11.html`, with `11` being the index of the EM record in the original dataset.
*Example renderings: a word-level explanation and a group-level explanation.*