Measures for Quality Assessment of Articles and Infoboxes in Multilingual Wikipedia

The scientific work about measures for quality assessment of articles and infoboxes in multilingual Wikipedia was published in a volume number 339 in Lecture Notes in Business Information Processing series by Springer Verlag.

Wikipedia is one of the most popular collaborative knowledge bases on the Internet. Articles of this free encyclopedia are created and edited by users from different countries in about 300 languages. Depending on topic and language version, quality of information there may vary. This study presents and classifies measures that can be extracted from Wikipedia articles for the purpose of automatic quality assessment in different languages. Based on a state of the art analysis and own experiments, specific measures for various aspects of quality have been defined. Additional, in this work they were also defined measures for quality assessment of data contained in the structural parts of Wikipedia articles – infoboxes. This study describes also an extraction methods for various sources of measures, that can be used in quality assessment.

The publication is available at
Preprint version is available at