[DL] work on analyzing or improving the Wikidata ontology
Peter F. Patel-Schneider
pfpschneider at gmail.com
Thu Jan 4 16:33:31 CET 2024
Wikidata (https://www.wikidata.org/wiki/Wikidata:Main_Page) is a large (close
to 110 million entities) open-source repository of information. Wikidata uses
a data model that is similar to, but more general than, that of either RDF or
labelled property graphs. Wikidata incorporates a large ontology using regular
properties, as does RDFS.
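
As a rough illustration of what "using regular properties" means, the sketch below treats class membership and subclassing as ordinary statements and derives the classes of an item by following them. P31 ("instance of") and P279 ("subclass of") are the Wikidata properties for these relations. Everything else here is an assumption made for the example: the item names, the toy statements, the placeholder property toy:occupation, and the helper functions are hypothetical and do not reflect actual Wikidata content or any Wikidata API.

```python
from collections import deque

INSTANCE_OF = "P31"   # Wikidata property "instance of"
SUBCLASS_OF = "P279"  # Wikidata property "subclass of"

# Toy (subject, property, value) statements; hypothetical data, not a Wikidata dump.
STATEMENTS = [
    ("Douglas Adams", INSTANCE_OF, "human"),
    ("human", SUBCLASS_OF, "mammal"),
    ("mammal", SUBCLASS_OF, "animal"),
    ("Douglas Adams", "toy:occupation", "writer"),  # a non-ontology statement
]


def values(subject, prop, statements):
    """Return the values of all statements with this subject and property."""
    return {v for s, p, v in statements if s == subject and p == prop}


def superclasses(cls, statements):
    """Return every class reachable from cls via subclass-of statements."""
    seen = set()
    queue = deque(values(cls, SUBCLASS_OF, statements))
    while queue:
        current = queue.popleft()
        if current in seen:
            continue  # also guards against subclass cycles
        seen.add(current)
        queue.extend(values(current, SUBCLASS_OF, statements))
    return seen


def classes_of(item, statements):
    """Return the direct and inherited classes of an item."""
    direct = values(item, INSTANCE_OF, statements)
    inherited = set()
    for cls in direct:
        inherited |= superclasses(cls, statements)
    return direct | inherited


if __name__ == "__main__":
    print(sorted(classes_of("Douglas Adams", STATEMENTS)))
```

Because the ontology statements sit alongside all other statements, nothing in the store itself separates "schema" from "data"; any such distinction has to be recovered by following the properties, as the helpers above do.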
The Wikidata ontology has become very large, with somewhere around 4 million
class-like entities. As with the rest of Wikidata, the Wikidata ontology is the
result of edits from many different sources and by many different agents. As a
result, there are many problems in the ontology.
There have been several investigations and surveys about the Wikidata ontology
and thus an increased awareness of the problems in the ontology. See
https://www.wikidata.org/wiki/Wikidata:Ontology_issues_prioritization,
https://www.wikidata.org/wiki/Wikidata_talk:Ontology_issues_prioritization#Overview_of_potential_solutions,
and
https://commons.wikimedia.org/wiki/File:Wikidata_Challenges_in_Semantic_Web_Community.pdf
for more information. There is a task force starting up to address issues in
the Wikidata ontology. See
https://www.wikidata.org/wiki/Wikidata:WikiProject_Ontology/Cleaning_Task_Force
for more information.
If you are interested in analysis of the Wikidata ontology or in helping to
improve the ontology, please contact me. Any kind of interest is welcome,
ranging from theoretical analyses of the ontology, to techniques for reasoning
in Wikidata, to implementation of tools that help improve the ontology, to
direct editing of the ontology. I can assist you in finding out more about the
ontology, introducing you to others involved with the ontology, forming groups
that can address issues with the ontology, or developing a topic suitable for
academic investigation.
Peter F. Patel-Schneider