
FactCC Scores - can't replicate scores #4

Open
Lukecn1 opened this issue Jul 28, 2021 · 3 comments

Comments

@Lukecn1

Lukecn1 commented Jul 28, 2021

Hi Again :)

I was checking my own implementation of the FactCC scoring described in your paper against your data, and noticed that for 90 cases we derived different scores.

I suspect this is due to a difference in how we split summaries into individual sentences prior to classification and scoring.

How did you split summary sentences for FactCC scoring?

(I use nltk sent_tokenize function)

@artidoro
Owner

Hello,
I think you are right: this is probably due to differences in how we split sentences. I used spaCy's sentence segmentation: https://spacy.io/usage/linguistic-features#sbd

Let me know if the results are significantly different and we can investigate further; otherwise we can close the issue.
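A minimal sketch of sentence splitting with spaCy; the rule-based `sentencizer` is used here so the snippet runs without downloading a trained model, whereas the segmentation linked above would typically come from a trained pipeline such as `en_core_web_sm` (the exact model is an assumption):

```python
import spacy

# Blank English pipeline with the rule-based sentencizer; a trained
# pipeline (e.g. spacy.load("en_core_web_sm")) would segment via its
# parser instead and may split differently.
nlp = spacy.blank("en")
nlp.add_pipe("sentencizer")

doc = nlp("The cat sat on the mat. It was happy.")
sentences = [sent.text for sent in doc.sents]
print(sentences)  # one string per sentence
```

Differences between nltk's Punkt and spaCy's segmentation around abbreviations, quotes, and unusual punctuation are a plausible source of the score mismatches discussed here.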

@Lukecn1
Author

Lukecn1 commented Jul 28, 2021


The differences I found are only for 90 cases, so it's not a massive difference. But thanks for the reply; I'll test it using spaCy and get back to you :)

@Lukecn1
Author

Lukecn1 commented Jul 30, 2021


Using spaCy I get 30 more differences in scores than using nltk. Do you do any other preprocessing of the data before scoring?
