This article presents and classifies features that can be extracted from Wikipedia articles for the purpose of automatic information quality assessment. Based on a state-of-the-art analysis and our own experiments, specific measures for various aspects of quality have been defined.
The use of logistic regression to assess data quality may have a significant impact on data management in the era of big data, where we deal with a large number of variables and a large amount of information describing a phenomenon or behaviour of interest.
A paper on the relative quality and popularity assessment of over 28 million Wikipedia articles in 44 different language versions was published in the special issue “Quality Management in Big Data” of the Informatics journal. The article is available on an open access basis.
The article “Analysis of References Across Wikipedia Languages” received the Best Paper Award at the ICIST 2017 conference (the 23rd International Conference on Information and Software Technologies).
The 23rd International Conference on Information and Software Technologies (ICIST 2017) took place in Druskininkai on October 12-14. The papers “Analysis of References Across Wikipedia Languages” and “Using Morphological and Semantic Features for the Quality Assessment of Russian Wikipedia” were presented at the conference.
On 28-30 June 2017 the 20th edition of the International Conference on Business Information Systems (BIS 2017) was held in Poznań. Four papers on various aspects of quality assessment and enrichment of information in different language versions of Wikipedia were presented during the conference.
Dr Krzysztof Węcel from the Department of Information Systems received a grant for conducting research with the tools available through the Microsoft Azure cloud. The award was presented as part of the Microsoft Azure for Research Award programme after a positive assessment of the project proposal entitled “Data Science for improving the quality of crowdsourced ...
On May 31, 2017 Włodzimierz Lewoniewski gave lectures and workshops for pupils of High School No. 12 in Poznań as part of the Infizmania project. The sessions focused on the quality of information on Wikipedia in various languages.
Since Wikipedia was founded, and as its popularity has grown, more and more scientific publications have addressed the quality of its information. One of the first studies on the automatic assessment of Wikipedia quality – “Assessing information quality of a community-based encyclopedia” by Besiki Stvilia, Michael B. Twidale, Linda C. Smith, ...
The presentation “Improving the quality of DBpedia based on the analysis of the quality of Wikipedia articles in different languages” was given at the DBpedia meetup. The meeting was held on 22 November 2016 at the Poznań University of Economics and Business. The presentation is available on SlideShare.