Literature Review

This page contains a list with all screened articles, as well as links to their wiki pages. Article evaluation is heavily focused on the Research Questions, Inclusion/Exclusion Criteria, as well as Quality Score Questions.

Objective

Examine state of the art of existing features to automatically measure Wikipedia article quality, as well as the usage of machine learning algorithms for the same purpose.

Research Questions

Id	Question
R1	What are the most common features or metrics used to evaluate article quality in Wikipedia?
R2	To what extent are those features currently used, and how effective are they?
R3	Which machine learning approaches can be applied to predict article quality?

Inclusion Criteria

Id	Criteria
I1	Paper discusses possible features or metrics to assess data quality in Wikipedia.
I2	Paper discusses its findings and evaluates the features' effectiveness.
I3	Paper discusses machine learning approaches to predict article quality.

Exclusion Criteria

Id	Criteria
E1	Paper discusses vandalism and quality of user edits instead of the articles themselves.
E2	Paper discusses manual approaches to assess article quality, as opposed to automatic ones
E3	Paper is in a language other than English

Relevance Score Questions

Id	Question	Score Range
Q1	Does the paper describe and compare possible features or metrics with detail?	[0-3]
Q2	Does the paper experiment and compare different ML approaches?	[0-3]
Q3	Are the results, benefits and limitations well described?	[0-3]
Q4	Does the paper bring focus to an article language that’s not English?	[0-1]

Abstract Scanning (Phase 3)

This section lists the publications that were assessed in Phase 3 of the Literature Review process (Abstract scanning). For a more complete explanation follow this link.

Id	Year	Result	Title
1	2017	✅	An end-to-end learning solution for assessing the quality of Wikipedia articles.
2	2009	✅	Automatic quality assessment of content created collaboratively by web communities: A case study of wikipedia
3	2009	❌	Seeking Health Information Online: Does Wikipedia Matter?
4	2010	✅	WikipediaViz: Conveying article quality for casual wikipedia readers
5	2014	✅	Assessing the quality of Thai Wikipedia articles using concept and statistical features
6	2017	✅	A psycho-lexical approach to the assessment of information quality on wikipedia
7	2015	✅	Measuring article quality in Wikipedia using the collaboration network
8	2015	❌	Accuracy and readability of cardiovascular entries on Wikipedia: Are they reliable learning resources for medical students?
9	2017	✅	Measuring quality of collaboratively edited documents: The case of Wikipedia
10	2017	❌	Relative quality and popularity evaluation of multilingual wikipedia articles
11	2018	❌	Readability and quality of wikipedia pages on neurosurgical topics
12	2019	✅	Automatically assessing the quality of Wikipedia contents
13	2019	✅	A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia
14	2019	✅	Quality assessment of peer-produced content in knowledge repositories using big data and social networks: The case of implicit collaboration in wikipedia
15	2020	✅	Assessing the quality of information on wikipedia: A deep-learning approach
16	2021	✅	Assessing the quality of health-related Wikipedia articles with generic and specific metrics
17	2021	✅	Measuring Wikipedia Article Quality in One Dimension by Extending ORES with Ordinal Regression
18	2016	✅	Automating assessment of collaborative writing quality in multiple stages: The case of wiki
19	2016	✅	Quality assessment of Wikipedia articles without feature engineering
20	2008	❌	A "quick and dirty" website data quality indicator
21	2012	✅	Measuring the quality of web content using factual information
22	2016	❌	Web content classification using distributions of subjective quality evaluations
23	2012	❌	On measuring the lexical quality of the web
24	2019	❌	Interactive quality analytics of user-generated content: An integrated toolkit for the case of Wikipedia
25	2014	✅	Measuring the quality of edits to Wikipedia
26	2007	✅	Measuring article quality in wikipedia: Models and evaluation
27	2016	❌	Topic quality metrics based on distributed word representations
28	2020	✅	Proposal and Comparison of Health Specific Features for the Automatic Assessment of Readability
29	2011	✅	Exploring wiki: Measuring the quality of social media using ant colony metaphor
30	2009	❌	So you know you're getting the best possible information: A tool that increases wikipedia credibility
31	2010	❌	Do you know your IQ? A research agenda for information quality in systems
32	2014	❌	Reliability of user-generated data: The case of biographical data in Wikipedia
33	2008	✅	Size matters: Word count as a measure of quality on Wikipedia
34	2005	❌	Measuring Wikipedia
35	2010	❌	Statistical measure of quality in Wikipedia
36	2007	✅	Cooperation and quality in Wikipedia
37	2011	❌	Who does what: Collaboration patterns in the Wikipedia and their impact on article quality
38	2009	✅	A jury of your peers: quality, experience and ownership in Wikipedia
39	2007	✅	Does it matter who contributes - A study on featured articles in the german wikipedia
40	2013	❌	Tell me more: An actionable quality model for wikipedia
41	2019	✅	Assessing the quality of Wikipedia articles with lifecycle based metrics
42	2008	✅	Measuring author contributions to the Wikipedia
43	2008	❌	On ranking controversies in wikipedia: Models and evaluation
44	2011	❌	Don't bite the newbies: How reverts affect the quantity and quality of Wikipedia work
45	2010	✅	Identifying featured articles in Wikipedia: Writing style matters
46	2010	❌	Determinants of wikipedia quality: The roles of global and local contribution inequality
47	2011	❌	Information quality assessment of community generated content: A user study of Wikipedia
48	2010	✅	Learning to predict the quality of contributions to wikipedia
49	2010	✅	Trust in wikipedia: How users trust information from an unknown source
50	2008	❌	Information quality work organization in Wikipedia
51	2008	✅	Assigning trust to Wikipedia content
52	2010	✅	On measuring the quality of wikipedia articles
53	2010	✅	Detecting wikipedia vandalism using wikitrust
54	2006	❌	Measuring qualities of articles contributed by online communities
55	2013	❌	Automatically classifying edit categories in Wikipedia revisions
56	2008	✅	Can you ever trust a Wiki? Impacting perceived trustworthiness in Wikipedia
57	2017	✅	Estimating the quality of articles in Russian wikipedia using the logical-linguistic model of fact extraction
58	2016	❌	Disinformation on the web: Impact, characteristics, and detection of wikipedia hoaxes
59	2010	❌	Detecting wikipedia vandalism with active learning and statistical language models
60	2016	❌	Content and collaboration: An affiliation network approach to information quality in online peer production communities
61	2016	❌	Pagerank on wikipedia: Towards general importance scores for entities
62	2021	❌	Quality of information sources about mental disorders: a comparison of Wikipedia with centrally controlled web and printed sources
63	2011	✅	On the measurability of information quality
64	2005	✅	Information quality in a community-based encyclopedia
65	2007	✅	Computational trust in Web content quality: a comparative evalutation on the Wikipedia project
66	2014	❌	Quality of patient health information on the Internet: reviewing a complex and evolving landscape
67	2009	❌	Reputation and reliability in collective goods: The case of the online encyclopedia Wikipedia
68	2012	✅	Identifying controversial articles in Wikipedia: A comparative study
69	2007	✅	A framework for information quality assessment
70	2009	❌	What's on wikipedia, and what's not... ?: Assessing completeness of information

Assessed Publications (Phase 4)

This section lists the publications that were assessed in Phase 4 of the Literature Review process (Full text assessment). For a more complete explanation follow this link.

Id	Year	Result	Title
1	2017	✅	An end-to-end learning solution for assessing the quality of Wikipedia articles.
2	2009	✅	Automatic quality assessment of content created collaboratively by web communities: A case study of wikipedia
4	2010	❌	WikipediaViz: Conveying article quality for casual wikipedia readers
5	2014	✅	Assessing the quality of Thai Wikipedia articles using concept and statistical features
6	2017	✅	A psycho-lexical approach to the assessment of information quality on wikipedia
7	2015	❌	Measuring article quality in Wikipedia using the collaboration network
9	2017	✅	Measuring quality of collaboratively edited documents: The case of Wikipedia
12	2019	✅	Automatically assessing the quality of Wikipedia contents
13	2019	✅	A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia
14	2019	✅	Quality assessment of peer-produced content in knowledge repositories using big data and social networks: The case of implicit collaboration in wikipedia
15	2020	❌	Assessing the quality of information on wikipedia: A deep-learning approach
16	2021	✅	Assessing the quality of health-related Wikipedia articles with generic and specific metrics
17	2021	❌	Measuring Wikipedia Article Quality in One Dimension by Extending ORES with Ordinal Regression
18	2016	❌	Automating assessment of collaborative writing quality in multiple stages: The case of wiki
19	2016	✅	Quality assessment of Wikipedia articles without feature engineering
21	2012	❌	Measuring the quality of web content using factual information
25	2014	❌	Measuring the quality of edits to Wikipedia
26	2007	❌	Measuring article quality in wikipedia: Models and evaluation
28	2020	❌	Proposal and Comparison of Health Specific Features for the Automatic Assessment of Readability
29	2011	❌	Exploring wiki: Measuring the quality of social media using ant colony metaphor
33	2008	✅	Size matters: Word count as a measure of quality on Wikipedia
36	2007	❌	Cooperation and quality in Wikipedia
38	2009	❌	A jury of your peers: quality, experience and ownership in Wikipedia
39	2007	❌	Does it matter who contributes - A study on featured articles in the german wikipedia
41	2019	❌	Assessing the quality of Wikipedia articles with lifecycle based metrics
42	2008	❌	Measuring author contributions to the Wikipedia
45	2010	✅	Identifying featured articles in Wikipedia: Writing style matters
48	2010	❌	Learning to predict the quality of contributions to wikipedia
49	2010	❌	Trust in wikipedia: How users trust information from an unknown source
51	2008	❌	Assigning trust to Wikipedia content
52	2010	✅	On measuring the quality of wikipedia articles
53	2010	❌	Detecting wikipedia vandalism using wikitrust
56	2008	❌	Can you ever trust a Wiki? Impacting perceived trustworthiness in Wikipedia
57	2017	❌	Estimating the quality of articles in Russian wikipedia using the logical-linguistic model of fact extraction
63	2011	❌	On the measurability of information quality
64	2005	✅	Information quality in a community-based encyclopedia
65	2007	❌	Computational trust in Web content quality: a comparative evalutation on the Wikipedia project
68	2012	❌	Identifying controversial articles in Wikipedia: A comparative study
69	2007	❌	A framework for information quality assessment

Relevance Scores (Phase 5)

This section lists the relevance scores for all of the approved papers. For a more complete explanation follow this link.

Q1 - Discussion of Features [0-3] Q2 - Discussion of ML approaches [0-3] Q3 - Discussion of Results [0-3] Q4 - Article Language other than English [0-1]

Id	Year	Q1	Q2	Q3	Q4	Total (0 - 10)	Title
1	2017	1	2	2	1	6	An end-to-end learning solution for assessing the quality of Wikipedia articles.
2	2009	3	1	3	1	8	Automatic quality assessment of content created collaboratively by web communities: A case study of wikipedia
5	2014	2	2	2	1	7	Assessing the quality of Thai Wikipedia articles using concept and statistical features
6	2017	2	1	3	0	6	A psycho-lexical approach to the assessment of information quality on wikipedia
9	2017	3	3	3	0	9	Measuring quality of collaboratively edited documents: The case of Wikipedia
12	2019	3	3	3	0	9	Automatically assessing the quality of Wikipedia contents
13	2019	3	2	3	0	8	A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia
14	2019	3	2	2	0	7	Quality assessment of peer-produced content in knowledge repositories using big data and social networks: The case of implicit collaboration in wikipedia
16	2021	3	0	2	0	5	Assessing the quality of health-related Wikipedia articles with generic and specific metrics
19	2016	1	1	2	0	4	Quality assessment of Wikipedia articles without feature engineering
33	2008	1	2	2	0	5	Size matters: Word count as a measure of quality on Wikipedia
45	2010	1	1	2	0	4	Identifying featured articles in Wikipedia: Writing style matters
52	2010	2	1	2	0	5	On measuring the quality of wikipedia articles
64	2005	3	1	2	0	6	Information quality in a community-based encyclopedia

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Literature Review

Literature Review

Objective

Research Questions

Inclusion Criteria

Exclusion Criteria

Relevance Score Questions

Abstract Scanning (Phase 3)

Assessed Publications (Phase 4)

Relevance Scores (Phase 5)

PRISMA

Quality Features

ML algorithms

Clone this wiki locally