Skip to content

Literature Review

MOAAS edited this page Apr 7, 2022 · 12 revisions

Literature Review

This page contains a list with all screened articles, as well as links to their wiki pages. Article evaluation is heavily focused on the Research Questions, Inclusion/Exclusion Criteria, as well as Quality Score Questions.

Objective

Examine state of the art of existing features to automatically measure Wikipedia article quality, as well as the usage of machine learning algorithms for the same purpose.

Research Questions

Id Question
R1 What are the most common features or metrics used to evaluate article quality in Wikipedia?
R2 To what extent are those features currently used, and how effective are they?
R3 Which machine learning approaches can be applied to predict article quality?

Inclusion Criteria

Id Criteria
I1 Paper discusses possible features or metrics to assess data quality in Wikipedia.
I2 Paper discusses its findings and evaluates the features' effectiveness.
I3 Paper discusses machine learning approaches to predict article quality.

Exclusion Criteria

Id Criteria
E1 Paper discusses vandalism and quality of user edits instead of the articles themselves.
E2 Paper discusses manual approaches to assess article quality, as opposed to automatic ones
E3 Paper is in a language other than English

Relevance Score Questions

Id Question Score Range
Q1 Does the paper describe and compare possible features or metrics with detail? [0-3]
Q2 Does the paper experiment and compare different ML approaches? [0-3]
Q3 Are the results, benefits and limitations well described? [0-3]
Q4 Does the paper bring focus to an article language that’s not English? [0-1]

Abstract Scanning (Phase 3)

This section lists the publications that were assessed in Phase 3 of the Literature Review process (Abstract scanning). For a more complete explanation follow this link.

Id Year Result Title
1 2017 An end-to-end learning solution for assessing the quality of Wikipedia articles.
2 2009 Automatic quality assessment of content created collaboratively by web communities: A case study of wikipedia
3 2009 Seeking Health Information Online: Does Wikipedia Matter?
4 2010 WikipediaViz: Conveying article quality for casual wikipedia readers
5 2014 Assessing the quality of Thai Wikipedia articles using concept and statistical features
6 2017 A psycho-lexical approach to the assessment of information quality on wikipedia
7 2015 Measuring article quality in Wikipedia using the collaboration network
8 2015 Accuracy and readability of cardiovascular entries on Wikipedia: Are they reliable learning resources for medical students?
9 2017 Measuring quality of collaboratively edited documents: The case of Wikipedia
10 2017 Relative quality and popularity evaluation of multilingual wikipedia articles
11 2018 Readability and quality of wikipedia pages on neurosurgical topics
12 2019 Automatically assessing the quality of Wikipedia contents
13 2019 A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia
14 2019 Quality assessment of peer-produced content in knowledge repositories using big data and social networks: The case of implicit collaboration in wikipedia
15 2020 Assessing the quality of information on wikipedia: A deep-learning approach
16 2021 Assessing the quality of health-related Wikipedia articles with generic and specific metrics
17 2021 Measuring Wikipedia Article Quality in One Dimension by Extending ORES with Ordinal Regression
18 2016 Automating assessment of collaborative writing quality in multiple stages: The case of wiki
19 2016 Quality assessment of Wikipedia articles without feature engineering
20 2008 A "quick and dirty" website data quality indicator
21 2012 Measuring the quality of web content using factual information
22 2016 Web content classification using distributions of subjective quality evaluations
23 2012 On measuring the lexical quality of the web
24 2019 Interactive quality analytics of user-generated content: An integrated toolkit for the case of Wikipedia
25 2014 Measuring the quality of edits to Wikipedia
26 2007 Measuring article quality in wikipedia: Models and evaluation
27 2016 Topic quality metrics based on distributed word representations
28 2020 Proposal and Comparison of Health Specific Features for the Automatic Assessment of Readability
29 2011 Exploring wiki: Measuring the quality of social media using ant colony metaphor
30 2009 So you know you're getting the best possible information: A tool that increases wikipedia credibility
31 2010 Do you know your IQ? A research agenda for information quality in systems
32 2014 Reliability of user-generated data: The case of biographical data in Wikipedia
33 2008 Size matters: Word count as a measure of quality on Wikipedia
34 2005 Measuring Wikipedia
35 2010 Statistical measure of quality in Wikipedia
36 2007 Cooperation and quality in Wikipedia
37 2011 Who does what: Collaboration patterns in the Wikipedia and their impact on article quality
38 2009 A jury of your peers: quality, experience and ownership in Wikipedia
39 2007 Does it matter who contributes - A study on featured articles in the german wikipedia
40 2013 Tell me more: An actionable quality model for wikipedia
41 2019 Assessing the quality of Wikipedia articles with lifecycle based metrics
42 2008 Measuring author contributions to the Wikipedia
43 2008 On ranking controversies in wikipedia: Models and evaluation
44 2011 Don't bite the newbies: How reverts affect the quantity and quality of Wikipedia work
45 2010 Identifying featured articles in Wikipedia: Writing style matters
46 2010 Determinants of wikipedia quality: The roles of global and local contribution inequality
47 2011 Information quality assessment of community generated content: A user study of Wikipedia
48 2010 Learning to predict the quality of contributions to wikipedia
49 2010 Trust in wikipedia: How users trust information from an unknown source
50 2008 Information quality work organization in Wikipedia
51 2008 Assigning trust to Wikipedia content
52 2010 On measuring the quality of wikipedia articles
53 2010 Detecting wikipedia vandalism using wikitrust
54 2006 Measuring qualities of articles contributed by online communities
55 2013 Automatically classifying edit categories in Wikipedia revisions
56 2008 Can you ever trust a Wiki? Impacting perceived trustworthiness in Wikipedia
57 2017 Estimating the quality of articles in Russian wikipedia using the logical-linguistic model of fact extraction
58 2016 Disinformation on the web: Impact, characteristics, and detection of wikipedia hoaxes
59 2010 Detecting wikipedia vandalism with active learning and statistical language models
60 2016 Content and collaboration: An affiliation network approach to information quality in online peer production communities
61 2016 Pagerank on wikipedia: Towards general importance scores for entities
62 2021 Quality of information sources about mental disorders: a comparison of Wikipedia with centrally controlled web and printed sources
63 2011 On the measurability of information quality
64 2005 Information quality in a community-based encyclopedia
65 2007 Computational trust in Web content quality: a comparative evalutation on the Wikipedia project
66 2014 Quality of patient health information on the Internet: reviewing a complex and evolving landscape
67 2009 Reputation and reliability in collective goods: The case of the online encyclopedia Wikipedia
68 2012 Identifying controversial articles in Wikipedia: A comparative study
69 2007 A framework for information quality assessment
70 2009 What's on wikipedia, and what's not... ?: Assessing completeness of information

Assessed Publications (Phase 4)

This section lists the publications that were assessed in Phase 4 of the Literature Review process (Full text assessment). For a more complete explanation follow this link.

Id Year Result Title
1 2017 An end-to-end learning solution for assessing the quality of Wikipedia articles.
2 2009 Automatic quality assessment of content created collaboratively by web communities: A case study of wikipedia
4 2010 WikipediaViz: Conveying article quality for casual wikipedia readers
5 2014 Assessing the quality of Thai Wikipedia articles using concept and statistical features
6 2017 A psycho-lexical approach to the assessment of information quality on wikipedia
7 2015 Measuring article quality in Wikipedia using the collaboration network
9 2017 Measuring quality of collaboratively edited documents: The case of Wikipedia
12 2019 Automatically assessing the quality of Wikipedia contents
13 2019 A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia
14 2019 Quality assessment of peer-produced content in knowledge repositories using big data and social networks: The case of implicit collaboration in wikipedia
15 2020 Assessing the quality of information on wikipedia: A deep-learning approach
16 2021 Assessing the quality of health-related Wikipedia articles with generic and specific metrics
17 2021 Measuring Wikipedia Article Quality in One Dimension by Extending ORES with Ordinal Regression
18 2016 Automating assessment of collaborative writing quality in multiple stages: The case of wiki
19 2016 Quality assessment of Wikipedia articles without feature engineering
21 2012 Measuring the quality of web content using factual information
25 2014 Measuring the quality of edits to Wikipedia
26 2007 Measuring article quality in wikipedia: Models and evaluation
28 2020 Proposal and Comparison of Health Specific Features for the Automatic Assessment of Readability
29 2011 Exploring wiki: Measuring the quality of social media using ant colony metaphor
33 2008 Size matters: Word count as a measure of quality on Wikipedia
36 2007 Cooperation and quality in Wikipedia
38 2009 A jury of your peers: quality, experience and ownership in Wikipedia
39 2007 Does it matter who contributes - A study on featured articles in the german wikipedia
41 2019 Assessing the quality of Wikipedia articles with lifecycle based metrics
42 2008 Measuring author contributions to the Wikipedia
45 2010 Identifying featured articles in Wikipedia: Writing style matters
48 2010 Learning to predict the quality of contributions to wikipedia
49 2010 Trust in wikipedia: How users trust information from an unknown source
51 2008 Assigning trust to Wikipedia content
52 2010 On measuring the quality of wikipedia articles
53 2010 Detecting wikipedia vandalism using wikitrust
56 2008 Can you ever trust a Wiki? Impacting perceived trustworthiness in Wikipedia
57 2017 Estimating the quality of articles in Russian wikipedia using the logical-linguistic model of fact extraction
63 2011 On the measurability of information quality
64 2005 Information quality in a community-based encyclopedia
65 2007 Computational trust in Web content quality: a comparative evalutation on the Wikipedia project
68 2012 Identifying controversial articles in Wikipedia: A comparative study
69 2007 A framework for information quality assessment

Relevance Scores (Phase 5)

This section lists the relevance scores for all of the approved papers. For a more complete explanation follow this link.

Q1 - Discussion of Features [0-3] Q2 - Discussion of ML approaches [0-3] Q3 - Discussion of Results [0-3] Q4 - Article Language other than English [0-1]

Id Year Q1 Q2 Q3 Q4 Total (0 - 10) Title
1 2017 1 2 2 1 6 An end-to-end learning solution for assessing the quality of Wikipedia articles.
2 2009 3 1 3 1 8 Automatic quality assessment of content created collaboratively by web communities: A case study of wikipedia
5 2014 2 2 2 1 7 Assessing the quality of Thai Wikipedia articles using concept and statistical features
6 2017 2 1 3 0 6 A psycho-lexical approach to the assessment of information quality on wikipedia
9 2017 3 3 3 0 9 Measuring quality of collaboratively edited documents: The case of Wikipedia
12 2019 3 3 3 0 9 Automatically assessing the quality of Wikipedia contents
13 2019 3 2 3 0 8 A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia
14 2019 3 2 2 0 7 Quality assessment of peer-produced content in knowledge repositories using big data and social networks: The case of implicit collaboration in wikipedia
16 2021 3 0 2 0 5 Assessing the quality of health-related Wikipedia articles with generic and specific metrics
19 2016 1 1 2 0 4 Quality assessment of Wikipedia articles without feature engineering
33 2008 1 2 2 0 5 Size matters: Word count as a measure of quality on Wikipedia
45 2010 1 1 2 0 4 Identifying featured articles in Wikipedia: Writing style matters
52 2010 2 1 2 0 5 On measuring the quality of wikipedia articles
64 2005 3 1 2 0 6 Information quality in a community-based encyclopedia

PRISMA

Quality Features

Textinho com link para o outro ficheiro pelo menos

ML algorithms

Textinho com link para o outro ficheiro pelo menos