Wikilinks are internal hyperlinks on Wikipedia, a popular Internet encyclopaedia. A unique article identifier is hidden behind so-called surface form, which is a grammatical match of a given term accordingly to the context in which it occurs. Therefore, a given term can have multiple surface forms.
This article presents and classifies features that can be extracted from Wikipedia articles for the purpose of automatic information quality assessment. Based on a state of the art analysis and our own experiments, specific measures for various aspects of quality have been defined.
The use of the logistic regression in the assessment of the quality of data may have a significant impact on data management in the era of big data, where we are all dealing with a number of variables and amount of information describing some interesting phenomenon or behaviour.
The team of the Department of Information Systems wishes You Merry Christmas and a Happy New Year!
In the special issue “Quality Management in Big Data” of the Informatics journal, paper about relative quality and popularity assessment of over 28 million Wikipedia articles in 44 different language versions was published. The article is published on an open access basis.
The article “Analysis of References Across Wikipedia Languages” received the Best Paper Award at ICIST 2017 Conference (The 23rd International Conference on Information and Software Technologies).
As a part of the Researchers’ Night, which took place on September 29, 2017, scientists from the Department of Information Systems organized workshops “WHAT DO YOU THINK? GUIDE FOR SMALL PROGRAMMERS “(MSc. Piotr Kałużny, Dr. Bartosz Perkowski) for children aged 8 to 12 and “WHAT DO YOU THINK? INTERNET WITHOUT SECRETS” (Dr. Agnieszka Figiel, MSc. ...
23rd International Conference on Information and Software Technologies (ICIST 2017) took place in Druskininkai on October 12-14. During the proceedings were presented the papers „Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles” and „Using Morphological and Semantic Features for the Quality Assessment of Russian Wikipedia”.
On 28-30 June 2017 the 20th edition of the International Conference on Business Information Systems (BIS 2017) was held in Poznań. During the conference 4 papers on various aspects of quality assessment and enrichment of information in different language versions of Wikipedia have been presented
Dr Krzysztof Węcel from the Department of Information Systems received a grant for conducting research with the tools available through the Microsoft Azure cloud. The award was presented as part of the Microsoft Azure for Research Award programme after a positive assessment of the project proposal entitled “Data Science for improving the quality of crowdsourced ...